Feat/add prediction artifact and upload method #292


Open: j279li wants to merge 29 commits into main from feat/predictions-submission

Conversation

@j279li (Contributor) commented Jun 3, 2025

Summary of Changes

This PR introduces a new Prediction artifact and an associated upload method; a rough usage sketch follows the feature list below.

New Features

  • Prediction Artifact (Predictions)

    • Added a new Predictions model to represent prediction artifacts as Zarr archives.
    • Includes validators to ensure Zarr structure consistency.
    • Provides convenient properties (columns, dtypes, n_rows, etc.) for quick data exploration.
    • Methods for setting/getting prediction values, creating Zarr archives from dicts, and uploading to the Polaris Hub.
  • Upload API (upload_prediction)

    • Added a new upload_prediction method in PolarisHubClient for uploading prediction artifacts.
    • Handles metadata, manifest, and archive uploads with progress tracking.
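
A rough usage sketch of the new artifact and upload flow. The import path for Predictions, the from_dict helper name, and its arguments are assumptions for illustration, not the exact signatures added in this PR; PolarisHubClient, upload_prediction, and the columns/dtypes/n_rows properties are the pieces described above.

import numpy as np

from polaris.hub.client import PolarisHubClient
from polaris.prediction import Predictions  # hypothetical import path

# Build a prediction artifact from a dict of column -> values; the PR describes
# a helper for creating the Zarr archive from dicts.
predictions = Predictions.from_dict(  # hypothetical helper name
    {"logP": np.array([1.2, 0.8, 3.4])},
    benchmark="org/some-benchmark",  # hypothetical argument
)

# Quick data exploration via the convenience properties listed above.
print(predictions.columns, predictions.dtypes, predictions.n_rows)

# Upload to the Polaris Hub through the new client method.
with PolarisHubClient() as client:
    client.upload_prediction(predictions)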

j279li self-assigned this on Jun 3, 2025
j279li added the feature label (Annotates any PR that adds new features; used in the release process) on Jun 3, 2025
@jstlaurent (Contributor) left a comment

Left some comments about the model structure. We can chat about it some more if you'd like. 😄

@cwognum (Collaborator) left a comment

Nice work, @j279li !

j279li force-pushed the feat/predictions-submission branch from fe10e34 to db77f00 on June 5, 2025
@j279li (Contributor, Author) commented Jun 5, 2025

@jstlaurent I've encountered circular import issues with BenchmarkV2Specification, since the benchmark package already imports from the evaluate package. To work around this, I moved the Predictions class into its own package for now. Would that be alright, or do you have other ideas?
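
For context, a sketch of the dependency directions described in this comment; the module paths are assumptions for illustration, not the actual polaris layout.

# Existing edge: the benchmark package imports from the evaluate package.
#     polaris.benchmark  ->  polaris.evaluate
#
# If Predictions lived under evaluate and needed BenchmarkV2Specification,
# the reverse edge would close a cycle:
#     polaris.evaluate   ->  polaris.benchmark
#
# Moving Predictions into its own package keeps the import graph acyclic:
#     polaris.prediction ->  polaris.benchmark
from polaris.benchmark import BenchmarkV2Specification  # one-directional import from the new package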

j279li marked this pull request as ready for review on June 11, 2025
@danielpeng1 (Contributor) left a comment

Nice job!

@cwognum (Collaborator) commented Jun 17, 2025

@j279li Ping me once this is ready for another review! Happy to take a look!

@j279li (Contributor, Author) commented Jun 17, 2025

> @j279li Ping me once this is ready for another review! Happy to take a look!

Hey @cwognum, it's basically ready for another review! Predictions upload should be fully functional now.

@Andrewq11 (Contributor) left a comment

Thanks again, @j279li 💪 I've left a few comments we should probably take a look at. It's coming along nicely!

Comment on lines 98 to 133
if len(data) > 0:
    # Extract a sample value to determine the type
    if isinstance(data, (list, tuple)):
        sample_value = data[0]
    elif hasattr(data, "__getitem__"):
        try:
            sample_value = data[0]
        except (IndexError, TypeError):
            sample_value = None
    else:
        sample_value = None

    if sample_value is not None:
        sample_type_name = type(sample_value).__name__
        if sample_type_name == "Mol" and "rdkit" in str(type(sample_value).__module__):
            codec_kwargs["object_codec"] = RDKitMolCodec()
            codec_kwargs["dtype"] = object
        elif sample_type_name == "AtomArray" and "biotite" in str(type(sample_value).__module__):
            codec_kwargs["object_codec"] = AtomArrayCodec()
            codec_kwargs["dtype"] = object
        elif annotation.dtype == np.dtype(object):
            # For other object types, use object dtype
            codec_kwargs["dtype"] = object

# Create the array in the Zarr archive
if "object_codec" in codec_kwargs:
    # For object codecs, we need to create a numpy object array first.
    # Use np.empty to avoid numpy trying to convert AtomArrays to numpy arrays.
    data_array = np.empty(len(data), dtype=object)
    for i, item in enumerate(data):
        data_array[i] = item
    root.array(col, data=data_array, **codec_kwargs)
else:
    root.array(col, data=data, **codec_kwargs)
Contributor


I'll need some more context on this block here. Let's chat about this a little further once @cwognum has had a chance to look at it.

@cwognum (Collaborator) left a comment

Hi @j279li, sorry for not spotting any of this on my first review. I've been out of the loop on recent changes to Polaris, so I may also be missing some context here.

We're making good progress, but I think we should take another look at how we've done things for competitions. Specifically, I would like us to revisit the user-facing API as well as the validation of the Predictions class.

Happy to find some time to chat about this tomorrow!

Comment on lines 94 to 95
if hasattr(self.benchmark.dataset, "annotations") and col in self.benchmark.dataset.annotations:
    annotation = self.benchmark.dataset.annotations[col]
Collaborator


Change requested: Rather than using the dataset annotations as the source of truth, we should use the dataset's Zarr archive to determine the dtype. Each prediction column should match the corresponding dataset column.
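
A minimal sketch of what this could look like, assuming the dataset exposes its Zarr archive as a zarr.Group; the function and parameter names here are placeholders, not the actual polaris API.

import zarr

def matching_dtype(dataset_root: zarr.Group, col: str):
    # Use the dataset's Zarr archive, rather than the annotations, as the
    # source of truth; the prediction column mirrors the dataset column.
    return dataset_root[col].dtype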

def zarr_root_path(self) -> str:
    return self._zarr_root_path

def _create_zarr_from_predictions(self):
@cwognum (Collaborator) commented Jun 18, 2025

Change requested: I think this method can be simplified a lot. It can copy the configuration (dtype, codec, etc.) of each array from the benchmark's dataset's Zarr Root and infer the array size from the test split.
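
A minimal sketch of the suggested simplification, assuming the zarr v2 API; the names for the prediction root, the dataset's Zarr root, and the test-split size are placeholders, not the exact polaris attributes.

import zarr

def create_prediction_arrays(
    pred_root: zarr.Group,
    dataset_root: zarr.Group,
    columns: list[str],
    n_test_rows: int,
) -> None:
    for col in columns:
        src = dataset_root[col]
        # Copy the dataset array's configuration (dtype, codec, chunking) and
        # size the prediction array from the test split.
        pred_root.create_dataset(
            col,
            shape=(n_test_rows, *src.shape[1:]),
            chunks=src.chunks,
            dtype=src.dtype,
            compressor=src.compressor,
            filters=src.filters,
        )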
