Generate new sequence by fixing the length #14

shrimonmuke0202 · 2024-12-29T06:55:43Z

Hi,

Your work is fantastic!

I have a question: I want to generate a sequence of proteins by providing the length as input.

For example:
Generate a sequence of length 100.

How can I do this?

Thanks and regards,
Shrimon Mukherjee

Lyu6PosHao · 2024-12-30T08:43:18Z

Hi,
Thanks for your kind words!
For decoder-only autoregressive models such as ProLLaMA, there is no direct way to constrain the output length to a fixed value such as 100.
One possible approach is to let the model generate many sequences and then keep only those with a length of exactly 100.
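
The filtering step described above can be sketched as follows. This is only a minimal illustration, not code from the ProLLaMA repository: the function name `filter_by_length`, the `tolerance` parameter, and the check against the 20 standard amino-acid letters are assumptions added for the example. It expects the generated sequences to be available as plain strings.

```python
from typing import Iterable, List

# The 20 standard amino-acid one-letter codes (assumption: other symbols are rejected).
STANDARD_AMINO_ACIDS = set("ACDEFGHIKLMNPQRSTVWY")


def filter_by_length(
    sequences: Iterable[str], target_length: int, tolerance: int = 0
) -> List[str]:
    """Keep sequences whose length is within `tolerance` of `target_length`.

    `tolerance` is a hypothetical convenience parameter; 0 means an exact match.
    """
    kept = []
    for seq in sequences:
        seq = seq.strip().upper()
        # Skip empty strings and anything containing non-standard symbols.
        if not seq or not set(seq) <= STANDARD_AMINO_ACIDS:
            continue
        if abs(len(seq) - target_length) <= tolerance:
            kept.append(seq)
    return kept


if __name__ == "__main__":
    generated = ["MKV" * 30, "A" * 100, "ACD" * 33 + "E", "G" * 99]
    exact = filter_by_length(generated, target_length=100)
    print(len(exact), "sequences of length 100")
```

Since only a fraction of the samples will match the target length exactly, a nonzero `tolerance` can reduce the number of generations needed, at the cost of a slightly looser length constraint.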

Kind regards

shrimonmuke0202 · 2025-01-05T12:28:23Z

Thanks for your answer. I want to reproduce the results presented in the paper for my work, in particular Table 2. Could you share the code used to calculate metrics such as TM-score and RMSD?

Lyu6PosHao · 2025-01-07T04:27:35Z

Please refer to https://github.com/steineggerlab/foldseek. You can use the tools provided by foldseek to calculate TM-score, RMSD, etc.
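
Once alignment results are available, they still need to be aggregated into a single number per model. The sketch below shows one way to do that, assuming the results were exported as tab-separated text with three columns in the order query, target, TM-score (for example via foldseek's `--format-output` option with the `alntmscore` column). The column layout and the function names are assumptions made for this example, and this is not necessarily the aggregation used for Table 2.

```python
from typing import Dict, Iterable


def best_tm_per_query(lines: Iterable[str]) -> Dict[str, float]:
    """Return the highest TM-score seen for each query.

    Assumed input: tab-separated lines of the form `query<TAB>target<TAB>tmscore`.
    """
    best: Dict[str, float] = {}
    for line in lines:
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        query, _target, tm = line.split("\t")[:3]
        score = float(tm)
        if score > best.get(query, float("-inf")):
            best[query] = score
    return best


def mean_best_tm(lines: Iterable[str]) -> float:
    """Average the best TM-score over all queries that have at least one hit."""
    best = best_tm_per_query(lines)
    if not best:
        raise ValueError("no alignment rows found")
    return sum(best.values()) / len(best)


if __name__ == "__main__":
    rows = ["gen_1\tref_a\t0.52", "gen_1\tref_b\t0.81", "gen_2\tref_a\t0.40"]
    print(best_tm_per_query(rows))
    print(round(mean_best_tm(rows), 3))
```

Queries with no hit at all do not appear in the results file, so they are silently excluded from the mean here; whether to count them as zero is a design choice that should be made explicit when comparing models.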

shrimonmuke0202 · 2025-01-07T04:44:28Z

Thanks. How did you perform the evaluation, i.e., how many new sequences were generated for it?

shrimonmuke0202 · 2025-01-07T06:19:45Z

Also, could you share the test set? It would help me compare ProLLaMA with our proposed model.

Lyu6PosHao · 2025-01-07T16:05:18Z

We used 1000 protein sequences for each model in Table 2; that is, each model in Table 2 was used to generate 1000 proteins.

You could use ProLLaMA_Stage_1 to generate 1000 sequences unconditionally through scripts/infer.py, and then compare the proteins generated by ProLLaMA with those from your proposed model.

Best Regards
