chore: removing symbols from QTL study identifiers #971

Open
DSuveges wants to merge 5 commits into base: dev

DSuveges
Contributor

Context

When various fields of a QTL study are concatenated to generate the study identifier, the resulting string can contain special characters that interfere with using study identifiers in URLs. To resolve this, the study ID string is lowercased upon ingestion, and every character that is not alphanumeric, an underscore (_), or a hyphen (-) is replaced with an underscore.
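The normalisation described above can be sketched in plain Python. This is only an illustration, not the code in this PR: the function name sanitize_study_id is hypothetical, and the sketch assumes "alphanumeric" means ASCII letters and digits. The input is the molecular_trait_id value from the sample file, standing in for a full study identifier.

```python
import re


def sanitize_study_id(raw_id: str) -> str:
    """Lowercase the ID, then replace every character outside [a-z0-9_-] with '_'.

    Hypothetical helper for illustration; assumes ASCII alphanumerics only.
    """
    return re.sub(r"[^a-z0-9_-]", "_", raw_id.lower())


# The dots in the sample molecular trait ID are the kind of symbol that gets replaced.
print(sanitize_study_id("RBP7.14208.3.3..1"))  # rbp7_14208_3_3__1
```

Note that consecutive symbols are each replaced individually, so ".." becomes "__" rather than being collapsed into a single underscore.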

@@ -1,17 +1,17 @@
molecular_trait_id gene_id cs_id variant rsid cs_size pip pvalue beta se z cs_min_r2 region
RBP7.14208.3.3..1 ENSG00000162444 RBP7.14208.3.3..1_L1 chr1_10000458_T_C rs3003377 22 0.0170703689724805 1.33564e-10 0.139135 0.0215922 6.45120520807609 0.649351082174683 chr1:8997206-10997206
Contributor Author

The sample files were changed to better represent the issues caused by special characters in the studyId.

@DSuveges DSuveges requested a review from d0choa January 14, 2025 10:35
@DSuveges DSuveges changed the title from "3619 ds eqtl catalog ids are not url safe" to "chore: removing symbols from QTL study identifiers" Jan 14, 2025
@github-actions github-actions bot added the Chore label Jan 14, 2025
Development

Successfully merging this pull request may close these issues.

eQTL catalog IDs are not URL-safe
1 participant