Created base structure for adding discovery and other algorithms #68
Conversation
Signed-off-by: Amit Sharma <[email protected]>
Pull Request Overview
This PR establishes the foundational structure for a causal discovery library by creating base interfaces and utility functions. The changes introduce a protocol-based design for datasets and skeleton implementations for evaluation metrics.
- Defines a `Dataset` protocol with methods for graph access, data retrieval, and synthetic data generation
- Creates placeholder functions for standard causal discovery evaluation metrics (accuracy, precision, recall, F1)
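Based on the overview above, the protocol might look roughly like the following sketch. The method names come from the diff in this PR; the return annotations and the `n_samples` parameter are assumptions for illustration, not the PR's actual signatures.

```python
from typing import Any, Protocol, runtime_checkable


@runtime_checkable
class Dataset(Protocol):
    """Sketch of the Dataset protocol described in this PR (signatures assumed)."""

    def graph(self) -> Any:
        """Return the graph associated with this dataset."""
        ...

    def data(self) -> Any:
        """Return the dataset's observations."""
        ...

    def generate_data(self, n_samples: int = 100) -> Any:
        """Generate synthetic observations."""
        ...
```

With `@runtime_checkable`, any class that implements these three methods satisfies the protocol structurally, without inheriting from `Dataset`.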
Reviewed Changes
Copilot reviewed 2 out of 8 changed files in this pull request and generated 4 comments.
File | Description
---|---
pywhyllm/datasets/dataset.py | Defines the `Dataset` protocol interface with methods for graph, data, and synthetic data generation
pywhyllm/datasets/metrics.py | Creates skeleton functions for evaluation metrics used in causal discovery
Comments suppressed due to low confidence (1)
pywhyllm/datasets/metrics.py:21
- Function name 'F1' should follow Python naming conventions. Consider renaming to 'f1_score' or 'f1' (lowercase).
def F1(edges, true_edges):
Co-authored-by: Copilot <[email protected]> Signed-off-by: Amit Sharma <[email protected]>
This is great. I added a few questions about the structure.
Also: do we want this dataset protocol to integrate well with data handling in other pywhyllm libraries? Or is this only for self-contained pywhy-llm benchmarking for example?
```python
class Dataset(Protocol):

    def graph(self):
```
Is this supposed to return the ground truth graph, or is this function intended to execute the PyWhyLLM functions to derive a candidate graph?
```python
        """
        pass

    def generate_data(self):
```
What do you think about merging data() and generate_data() into a single function? Then some Dataset objects might be synthetic Datasets, and some might be grounded datasets (real data)?
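One way that suggestion could play out (a hypothetical sketch, not code from this PR): keep a single `data()` method on the protocol, and let each implementation decide whether it samples synthetically or returns stored records. The class names `SyntheticDataset` and `GroundedDataset` below are invented for illustration.

```python
from typing import Any, Protocol, runtime_checkable
import random


@runtime_checkable
class Dataset(Protocol):
    """Minimal protocol: one accessor for the graph, one for the data."""

    def graph(self) -> Any: ...
    def data(self) -> Any: ...


class SyntheticDataset:
    """Samples fresh data from a known generating process on each call."""

    def __init__(self, n_samples: int = 100, seed: int = 0):
        self.n_samples = n_samples
        self.seed = seed

    def graph(self):
        return [("A", "B")]  # ground-truth edge A -> B

    def data(self):
        rng = random.Random(self.seed)
        a = [rng.gauss(0, 1) for _ in range(self.n_samples)]
        b = [x + rng.gauss(0, 0.1) for x in a]
        return {"A": a, "B": b}


class GroundedDataset:
    """Wraps real, pre-collected records with a known reference graph."""

    def __init__(self, records, true_edges):
        self._records = records
        self._edges = true_edges

    def graph(self):
        return self._edges

    def data(self):
        return self._records
```

Both classes satisfy the same protocol, so downstream benchmarking code would not need to know which kind of dataset it is handling.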
```python
def accuracy(edges, true_edges):
```
Given edges and true_edges, is the calculation of accuracy, precision, and recall specific to the dataset?
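For what it's worth: if `edges` and `true_edges` are both edge sets over the same variables, these metrics can be computed in a dataset-agnostic way. A sketch, assuming edges are `(source, target)` tuples and lowercasing `F1` per the review comment above (this is not the PR's implementation, which is still a skeleton):

```python
def precision(edges, true_edges):
    """Fraction of predicted edges that appear in the true graph."""
    edges, true_edges = set(edges), set(true_edges)
    return len(edges & true_edges) / len(edges) if edges else 0.0


def recall(edges, true_edges):
    """Fraction of true edges that were predicted."""
    edges, true_edges = set(edges), set(true_edges)
    return len(edges & true_edges) / len(true_edges) if true_edges else 0.0


def f1(edges, true_edges):
    """Harmonic mean of precision and recall."""
    p, r = precision(edges, true_edges), recall(edges, true_edges)
    return 2 * p * r / (p + r) if (p + r) else 0.0
```

Whether an edge counts as correct only when its orientation matches (versus matching the undirected skeleton) is a design choice the library would still need to pin down.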
New files added that convey the directory structure.
This PR does not add any functionality, just the protocol class and structure. The goal is to enable collaborators to contribute code.