Corr metric overhaul #393

dstorer · 2021-02-01T06:44:34Z

Add correlation metric and cross-polarization metric to use for antenna flagging. The correlation metric is measures how well antennas are correlating with each other, and the cross-polarization metric compares this value between the same-pols and different-pols to identify crossed antennas. See the daily notebooks for visuals of this metric and the cross-polarization metric.

The correlation metric is calculated for each polarization as follows:

When only sum visibilities are provided (e.g. for H1C), then interleaved integrations are used to calculate the evens and odds.

Additionally, we've stripped out the unneeded old functionality for Mean Vij based metrics and the cross-polarization detection based on Mean Vij. The former is superseded by auto_metrics, the latter is superseded by the new cross-correlation based detection of cross-polarized antennas. We've also removed all modified z-score based antenna removal in favor of strict cuts.

…era_qm into corr_metric_overhaul

This function is being eliminated in HERA-Team/hera_qm#393 and since it's only used in hera_cal, it should live there.

dannyjacobs · 2021-02-17T02:44:10Z

Thanks @dstorer for your explanation, they made things clearer. My revised notes (here, now more? readable) try to expand on and and justify the proposed change. Things are close. I see two main questions.

Q1
In your equation 3 you write even_ij / | even_ij|. Do you really mean to divide each visibility spectrum by its abs? Is that maybe a typo? That operation would cancel the entire amplitude of the fringe. It’s hard to predict what that would look like. I was expecting you to divide by the autos. This would cancel out the gain terms to get back the “true” visibility (see my notes).

Q2
In your pasted writeup you describe a process for computing a cross-pol metric based on cross correlations then below you say that cross-polarization detection has been preceded by auto_metrics. Which is being used to ID cross pols? The method using cross poles is has many steps and summations making it difficult to predict what the output ought to be, a method that estimates the polarization fraction in the auto would be far vastly more preferable.

dstorer · 2021-02-17T04:38:35Z

@dannyjacobs in response to your 2 questions:

1: no, that's not a typo, the idea behind that choice is exactly that the amplitude of each baseline is normalized to 1, so if the phases are noise-like this value will average down to zero, but if the antennas are well correlated then the phases should not be noise-like, and this value should average to 1. In the very beginning of developing this plot I was normalizing by the autos, but found that when I normalized the way described above it did a better job at highlighting when nodes weren't correlating and made the metric somewhat less sensitive to baseline length. I agree that this is a fundamentally different metric, but what we're currently doing seems better motivated to me (and we've seen it work for a long time now).

I think there was a small typo in that last sentence that made it confusing - auto_metrics is not identifying cross-polarized antennas, that is being done using the cross pol metric that is based on the correlation metric. The reason for switching away from identifying cross-polarized ants based on relative power in the autos is that we observed that method was not very robust, and was very sensitive to antennas with one polarization that was completely dead - while this is still a problem, it is a distinctly different problem than polarization cables being swapped. The new metric described above is much more reliable in catching antennas that are actually crossed, rather than low power or dead in one or both pols.

I'm not sure I 100% followed your write-up, but it seems like aside from these 2 points we are basically on the same page. Let me know if you have more questions.

jsdillon · 2021-02-17T16:51:06Z

On 1: Dividing by the amplitude has the benefit of making the RFI largely irrelevant without having to use any medians over frequency or time. Part of what I really like about this metric, now that we have auto_metrics as a complement, is that it's really focused on what information the phases of the visibilities can give us about the health of the array, compared to auto_metrics which (by necessity) only looks at amplitudes.

On 2: Regarding "The reason for switching away from identifying cross-polarized ants based on relative power in the autos" we actually used the crosscorrelations, not just the autos, to identify cross-polarized antennas before. But the MeanVij-based metrics were still just looking at visibility amplitudes, including the one for identifying crosses (which looked for larger amplitudes in en/ne than in nn/ee).

This is designed for the new corr-based ant_metrics with absolute cuts (see HERA-Team/hera_qm#393)

jsdillon · 2021-02-18T06:26:15Z

As a confidence-building measure (and as discussed on today's Analysis/QM telecon) I ran a whole day (2459122) through the new ant_metrics (without any a priori antenna flags). Here's a section of the result from the in-development summary notebook:

Looks like we're pretty consistently finding dead and/or crossed antennas. There are other pathologies which auto_metrics picks up that ant_metrics doesn't, but this is very promising.

dannyjacobs

Small documentation requests. Only top level thing is to add a CHANGELOG file to the root hera_qm/ directory with the following.

Changes metric approach for correlations and cross_polarizations. TBD memo in draft see issue #395

This is a breaking change which removes many functions
remove meanvij_metrics, antpol_metric_sum_ratio, per_antenna_modified_z_scores, mean_vij_cross_pol_metrics,
add cross_polMetrics, calc_corr_stats
many changes to AntennaMetrics object and hidden functions
many test changes as necessary

hera_qm/ant_metrics.py

dannyjacobs · 2021-02-25T20:39:58Z

hera_qm/ant_metrics.py



-def mean_Vij_metrics(abs_vis_stats, xants=[], pols=None, rawMetric=False):
-    """Calculate how an antennas's average |Vij| deviates from others.
+def corr_metrics(corr_stats, xants=[], pols=None):


Once the description memo is posted, link to it in the docstring. Tracked in issue #395 .

hera_qm/ant_metrics.py

jsdillon · 2021-03-03T02:50:46Z

We good to go on this @dannyjacobs? I'd prefer to stop running on this branch on site, if possible.

dstorer added 30 commits January 28, 2021 07:42

Replace abs_vis_stats with corr_metric

ad7d6c2

Load diff and sum files

3bf366e

Resolve naming inconsistencies

54673b6

Clean up comments

26b5abc

Minor bug fix

28da442

Make diff files optional parameter

7a48f7f

Update out params

5ecec39

Update write function

33c8251

Make corr_stats a separate function

399431a

Update doc strings

4306067

Redefine cross pol metric

db76147

Rename mean_Vij_cross_pol_metrics to corr_cross_pol_metrics

dea4381

Looking for most negative values

96c0827

Change key name to reflect metric

f4c3493

Remove deprecated function

81151e7

Store max value of 4 pol combinations

f86d0fc

Flag on small corr values

8018e64

Set appropriate cuts for cross and dead

9ad6a15

Rename out_dict keys

1521213

Clean up comments

0ca363f

Update doc strings

6c666ab

Update doc strings

2656851

Update metric name

f548a62

Update metric description

e7fb149

'meanVij' -> 'corr'

d983ce3

Bug fix

7b722e6

Bug fix

46c2ccc

Flag on raw metric values

1234f0f

Add per-ant corr metric function

d0fbe06

Call per-ant corr metric

e15111c

dstorer and others added 8 commits February 4, 2021 10:28

Warn and proceed if different number of sum and diff files

8fe9d22

Raise error for mismatch sum and diff files

dee75e9

Remove old comment

79ab2bf

Merge branch 'corr_metric_overhaul' of https://github.com/HERA-Team/h…

95e99de

…era_qm into corr_metric_overhaul

Remove comment and print statement

43a39ff

Fix dead cut bug

9ade252

Undo previous commit

05d969a

fix bug and improve error message

6e12560

jsdillon added a commit to HERA-Team/hera_cal that referenced this pull request Feb 11, 2021

Bring over per_antenna_modified_z_scores from hera_qm

169be39

This function is being eliminated in HERA-Team/hera_qm#393 and since it's only used in hera_cal, it should live there.

jsdillon mentioned this pull request Feb 11, 2021

Bring over per_antenna_modified_z_scores from hera_qm HERA-Team/hera_cal#682

Merged

jsdillon added a commit to HERA-Team/hera_cal that referenced this pull request Feb 13, 2021

Bring over per_antenna_modified_z_scores from hera_qm

b4fbbac

This function is being eliminated in HERA-Team/hera_qm#393 and since it's only used in hera_cal, it should live there.

fix default

ff2a0e1

jsdillon added a commit to HERA-Team/hera_pipelines that referenced this pull request Feb 17, 2021

create ant_metrics do script that runs on all antennas

74a323d

This is designed for the new corr-based ant_metrics with absolute cuts (see HERA-Team/hera_qm#393)

jsdillon added 2 commits February 17, 2021 19:50

remove unneccesary abs/max

bc20859

explicitly deal with non-finite metrics

2fddde2

Merge branch 'master' into corr_metric_overhaul

55d6f40

dannyjacobs reviewed Feb 26, 2021

View reviewed changes

dstorer added 4 commits February 26, 2021 10:46

Specify meaning of data_sum

67e33b6

Update corr_cross_pol_metrics docs

4b31a2c

Correlation matrix -> corr_metric for clarity

b7604e6

Add changelog

0ab9d14

dannyjacobs approved these changes Mar 3, 2021

View reviewed changes

Merge branch 'master' into corr_metric_overhaul

4947045

jsdillon merged commit b1bbfcc into master Mar 3, 2021

jsdillon deleted the corr_metric_overhaul branch March 3, 2021 20:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Corr metric overhaul #393

Corr metric overhaul #393

dstorer commented Feb 1, 2021 •

edited

Loading

dannyjacobs commented Feb 17, 2021

dstorer commented Feb 17, 2021

jsdillon commented Feb 17, 2021

jsdillon commented Feb 18, 2021

dannyjacobs left a comment

dannyjacobs Feb 25, 2021

jsdillon commented Mar 3, 2021

Corr metric overhaul #393

Corr metric overhaul #393

Conversation

dstorer commented Feb 1, 2021 • edited Loading

dannyjacobs commented Feb 17, 2021

dstorer commented Feb 17, 2021

jsdillon commented Feb 17, 2021

jsdillon commented Feb 18, 2021

dannyjacobs left a comment

Choose a reason for hiding this comment

dannyjacobs Feb 25, 2021

Choose a reason for hiding this comment

jsdillon commented Mar 3, 2021

dstorer commented Feb 1, 2021 •

edited

Loading