[WIP] Add functions for query-target distance calculations #17

schlegelp · 2024-09-11T12:33:28Z

Hi! This PR would add functions to run query->target distance calculations, complementing the current all-by-all functionality.

The use-case for these (at least in my work) would be to find, for each query neuron, the best match(es) among a pool of target neurons.

TODOs:

query->target version of:
- GW distance
- QGW distance
- SLB distance
update docs

Before I continue working on the TODOs above, I was wondering if you think these would be a good fit? And if you do, whether you agree with the way it is implemented in this first pass?

patrick-nicodemus · 2024-09-27T21:59:11Z

src/cajal/run_gw.py

+    gw_coupling_mat_csv: Optional[str] = None,
+    return_coupling_mats: bool = False,
+) -> tuple[
+    DistanceMatrix,  # GW distance matrix (Squareform)


should be

Matrix, # GW distance matrix (not necessarily a square)

patrick-nicodemus · 2024-09-27T22:04:02Z

src/cajal/run_gw.py

+
+    :return: If `return_coupling_mats` is True,
+        returns `( gw_dmat, couplings )`,
+        where gw_dmat is a square matrix whose (i,j) entry is the GW distance


gw_dmat is not necessarily square

patrick-nicodemus · 2024-09-27T22:06:27Z

src/cajal/run_gw.py

+                raise Exception(
+                    "Must supply list of cell identifiers for writing to file."
+                )
+            gw_data = csv_output_writer(


It's nice that you were able to make use of this existing function. When I wrote it I had square matrices in mind but I think this should be correct.

patrick-nicodemus · 2024-09-27T22:07:20Z

src/cajal/run_gw.py

+    return_coupling_mats: bool = False,
+    verbose: Optional[bool] = False,
+) -> tuple[
+    DistanceMatrix,  # Pairwise GW distance matrix (Squareform)


Matrix, # Pairwise GW distance matrix (need not be square)

patrick-nicodemus · 2024-09-27T22:20:54Z

Hi Philipp,
Thanks very much for this contribution. It looks great.
I think the code is simple and clear.
I think the overall architecture / structure is good and fits in with other parts of the library.
Thank you very much for writing type annotations and adding a unit test.

I don't have any comments on the code but there are a few changes to the docstrings and type labels I suggested.

The way the code currently stands, the type DistanceMatrix refers to a square, symmetric distance matrix with zeroes along the diagonal. I have been using the name Matrix = NewType("Matrix", npt.NDArray[np.float64]) to refer to a numpy array of shape 2.

We can rename the type to something else to make it easier to distinguish between square symmetric distance matrices and general rectangular distance matrices, but on the other hand the distance matrices in the query/target code don't have any special properties other than being nonnegative, so perhaps they need a special name.

For future work, I don't think it's necessary (for computational reasons, at least) to do something like this for the SLB distances, which I expect to be sufficiently fast to compute that one can just do all-by-all comparisons and then filter afterwards. Although the SLB is itself a poor estimator of GW, in its own right it seems to give reasonably good performance on our k-nn classifier tests, so it may be of direct use.

On the other hand, I do think that implementing the query version of the QGW is likely to be useful, and the design can be basically identical to the design of this one. It would be nice to unify some of this redundant code but I am not sure that the benefits of a uniform interface would outweigh the increased complexity (in Python, at least).

Again, thank you for this contribution!

add functions for query-target GW distance calculations

15e89ce

schlegelp changed the title ~~Add functions for query-target GW distance calculations~~ [WIP] Add functions for query-target GW distance calculations Sep 11, 2024

schlegelp changed the title ~~[WIP] Add functions for query-target GW distance calculations~~ [WIP] Add functions for query-target distance calculations Sep 11, 2024

drop duplicate import (was for testing only and accidentally committed)

c079426

patrick-nicodemus requested changes Sep 27, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Add functions for query-target distance calculations #17

[WIP] Add functions for query-target distance calculations #17

schlegelp commented Sep 11, 2024 •

edited by ehgunther

Loading

patrick-nicodemus Sep 27, 2024

patrick-nicodemus Sep 27, 2024

patrick-nicodemus Sep 27, 2024

patrick-nicodemus Sep 27, 2024

patrick-nicodemus commented Sep 27, 2024

[WIP] Add functions for query-target distance calculations #17

Are you sure you want to change the base?

[WIP] Add functions for query-target distance calculations #17

Conversation

schlegelp commented Sep 11, 2024 • edited by ehgunther Loading

patrick-nicodemus Sep 27, 2024

Choose a reason for hiding this comment

patrick-nicodemus Sep 27, 2024

Choose a reason for hiding this comment

patrick-nicodemus Sep 27, 2024

Choose a reason for hiding this comment

patrick-nicodemus Sep 27, 2024

Choose a reason for hiding this comment

patrick-nicodemus commented Sep 27, 2024

schlegelp commented Sep 11, 2024 •

edited by ehgunther

Loading