You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
First use case: We are doing some research about how to control FDR when integrating multiple datasets at quantms.org at protein level. We currently have a method mad_decoy that we are trying to improve now with the entrapment approach. For the new method, we need to study the distribution of peptide probabilities for each identification.
Second use case: In addition, we want to explore for AI-dataset generation the impact of rank 1,2,3 PSMs in search engines. Most of the AI-dataset for spectra prediction training are based on rank 1 spectra. We want to release dataset with rank 1, rank 2, and 3 spectra for exploring the impact on AI prediction methods.
Third use case: ID at large-scale for spectra library generation.
This subworkflow will help used to provide a solution for peptide identification outside the quant part. It is actually related with #345. I recommend the following @daichengxin@jpfeuffer@timosachsenberg :
Perform peptide identification with the three search engines (SAGE, MSGF+, COMET)
We should make sure that this work for inmunopeptidomics datasets, when searching with no enzymatic restriction.
Percolator will be optional, this is needed because if you want to study pure search engine results and all the search engine ranks, you may need to skip Percolator who select the first rank for each search engine.
ConsesusID should be applied on Percolator results or other type of search engines that do not remove the ranks.
We should have an ID-filter that filter by PSMs FDR.
Description of feature
First use case: We are doing some research about how to control FDR when integrating multiple datasets at quantms.org at protein level. We currently have a method mad_decoy that we are trying to improve now with the entrapment approach. For the new method, we need to study the distribution of peptide probabilities for each identification.
Second use case: In addition, we want to explore for AI-dataset generation the impact of rank 1,2,3 PSMs in search engines. Most of the AI-dataset for spectra prediction training are based on rank 1 spectra. We want to release dataset with rank 1, rank 2, and 3 spectra for exploring the impact on AI prediction methods.
Third use case: ID at large-scale for spectra library generation.
This subworkflow will help used to provide a solution for peptide identification outside the quant part. It is actually related with #345. I recommend the following @daichengxin @jpfeuffer @timosachsenberg :
Feedback from @jpfeuffer @daichengxin @timosachsenberg would be great.
The text was updated successfully, but these errors were encountered: