-
Notifications
You must be signed in to change notification settings - Fork 38
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MSstats - relaxing default parameters. #437
Conversation
Important Review skippedAuto reviews are disabled on base/target branches other than the default branch. Please check the settings in the CodeRabbit UI or the You can disable this status message by setting the Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media? 🪧 TipsChatThere are 3 ways to chat with CodeRabbit:
Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments. CodeRabbit Commands (Invoked using PR comments)
Other keywords and placeholders
CodeRabbit Configuration File (
|
|
This will heavily inflate and distort quantities for good experiments. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I am against this. See my comment.
I agree with you 100%. However, the aim of the workflow is not to judge the data but to give some results for the data provided. We are defining default parameters in two different ways:
This is why Im advocating to relax the thresholds here, especially because the majority of users use other packages, such as limma and others and not MSstats. I want more people to use MSstats and quantms and not find from the very beginning errors, and I need to take this step quickly because it is too stringent for the data they have. Can you jump here @tonywu1999 and give your opinion? Do you know how many people disable |
Or can we catch this error and throw a warning, then try the relax parameter to run. |
My point is, that once the pipeline finishes, 90% of the users will be happy and take the data as-is. This means, that people which have good data and could heavily benefit from outlier removal, will never try this and publish bad results with our pipeline (which will shed a bad light on it). |
Better, but 90% of the people will ignore the warning. |
The only thing I would agree with, is falling back to less stringent parameters if MSstats is forced to fail with those parameters ( i.e. if the dataset has one replicate only?) |
Also if the dataset has one condition only we should completely disable MSstats if we are not doing that already. But we should not relax settings in general in my opinion. |
I am in favor of the stricter settings as default. I think one already accumulates a lot of wrong features (e.g. through MBR) in large studies, and we should have a mechanism to flag studies were less stringent setting had to be set. |
I will close this PR. I think we should document how when the parameters do not work, what could be happening. |
I guess failing with good error messages would be sufficient. |
I discuss it with them, but it will take some time. @tonywu1999 you can learn from this. |
Description
Since the release of quantms, we have had multiple users complaining that with the current default parameters of MSstats, it always fails this step.
This is because default parameters remove poor features, like proteins with only one feature, features that are not across replicates, etc. The majority of the users and also the public data do not have too many replicates, and in some way, the data do not fill the requirements of MSstats for high-quality results. I have discussed this issue with @tonywu1999, and I think we should, by default, define more relaxed parameters, and users can then play with more stringent thresholds manually if they want to.
PR checklist
nf-core lint
).nextflow run . -profile test,docker --outdir <OUTDIR>
).nextflow run . -profile debug,test,docker --outdir <OUTDIR>
).docs/usage.md
is updated.docs/output.md
is updated.CHANGELOG.md
is updated.README.md
is updated (including new tool citations and authors/contributors).