update BUSCO and fix --miniprot parameter (#6153)
* update BUSCO and fix --miniprot parameter
* error version
* update
* fix miniprot parameter
* update test-data
* update
* small modification on busco db
* small modifications
* fix output errors
* small modification
* small modification
* fix test
* add test-data/genome_results_miniprot
* small change in busco.xml
* update test-data
* add assert
* fix test 7
Showing 32 changed files with 1,452 additions and 180 deletions.
213 changes: 213 additions & 0 deletions
tools/busco/test-data/busco_downloads/file_versions.tsv
Large diffs are not rendered by default.
@@ -1,35 +1,36 @@
-# BUSCO version is: 5\.5\.0
-# The lineage dataset is: arthropoda_odb10 \(Creation date: [0-9]{4}-[0-9]{2}-[0-9]{2}, number of genomes: 90, number of BUSCOs: 1013\)
-# Summarized benchmarking in BUSCO notation for file [a-z0-9_\-/\.]+
+# BUSCO version is: 5.7.1
+# The lineage dataset is: arthropoda_odb10 (Creation date: 2024-01-08, number of genomes: 90, number of BUSCOs: 1013)
+# Summarized benchmarking in BUSCO notation for file /tmp/tmpl5l1blpe/files/7/a/3/dataset_7a33f452-1064-4b4a-943f-b0efef6a4a4a.dat
 # BUSCO was run in mode: euk_genome_aug
 # Gene predictor used: augustus

-\*\*\*\*\* Results: \*\*\*\*\*
+***** Results: *****

-C:0\.1%\[S:0\.1%,D:0\.0%\],F:0\.0%,M:99\.9%,n:1013
-1 Complete BUSCOs \(C\)
-1 Complete and single-copy BUSCOs \(S\)
-0 Complete and duplicated BUSCOs \(D\)
-0 Fragmented BUSCOs \(F\)
-1012 Missing BUSCOs \(M\)
+C:0.1%[S:0.1%,D:0.0%],F:0.0%,M:99.9%,n:1013
+1 Complete BUSCOs (C)
+1 Complete and single-copy BUSCOs (S)
+0 Complete and duplicated BUSCOs (D)
+0 Fragmented BUSCOs (F)
+1012 Missing BUSCOs (M)
 1013 Total BUSCO groups searched

 Assembly Statistics:
 1 Number of scaffolds
 1 Number of contigs
 62370 Total length
-0\.000% Percent gaps
+0.000% Percent gaps
 62 KB Scaffold N50
 62 KB Contigs N50


 Dependencies and versions:
-hmmsearch: [0-9\.\+]+
-bbtools: [0-9\.\+]+
-makeblastdb: [0-9\.\+]+
-tblastn: [0-9\.\+]+
-augustus: [0-9\.\+]+
-gff2gbSmallDNA\.pl: None
-new_species\.pl: None
+hmmsearch: 3.1
+bbtools: 39.06
+makeblastdb: 2.15.0+
+tblastn: 2.15.0+
+augustus: 3.5.0
+gff2gbSmallDNA.pl: None
+new_species.pl: None
 etraining: None
-busco: [0-9\.\+]+
+python: sys.version_info(major=3, minor=9, micro=19, releaselevel='final', serial=0)
+busco: 5.7.1
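The one-line score in the summary above (`C:…[S:…,D:…],F:…,M:…,n:…`) packs the per-category percentages and the total number of BUSCO groups into a single string. The following is a minimal sketch of how such a line could be read back into numbers, for example when checking test data by hand. The helper name `parse_busco_score` and the regular expression are illustrative assumptions; they are not part of BUSCO or of the Galaxy wrapper.

```python
import re

# Assumed pattern for the compact score line shown in the short summary.
SCORE_RE = re.compile(
    r"C:(?P<C>[\d.]+)%\[S:(?P<S>[\d.]+)%,D:(?P<D>[\d.]+)%\],"
    r"F:(?P<F>[\d.]+)%,M:(?P<M>[\d.]+)%,n:(?P<n>\d+)"
)


def parse_busco_score(line):
    """Hypothetical helper: return percentages (floats) and n (int) from a score line."""
    match = SCORE_RE.search(line)
    if match is None:
        raise ValueError(f"not a BUSCO score line: {line!r}")
    # Every named group except n is a percentage.
    fields = {
        key: float(value)
        for key, value in match.groupdict().items()
        if key != "n"
    }
    fields["n"] = int(match.group("n"))
    return fields


# Score line copied from the updated short summary above.
score = parse_busco_score("C:0.1%[S:0.1%,D:0.0%],F:0.0%,M:99.9%,n:1013")
```

With the counts listed in the same summary (1 complete out of 1013 groups), the parsed `C` value agrees with `round(1 / 1013 * 100, 1)`.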
4 changes: 2 additions & 2 deletions
tools/busco/test-data/genome_results_metaeuk/missing_buscos_list
@@ -0,0 +1,3 @@
+##gff-version 3
+sample MetaEuk gene 40256 42071 198 + . Target_ID=68987at6656_29053_0:00071c;TCS_ID=68987at6656_29053_0:00071c|sample|+|40255
+sample MetaEuk gene 34846 35694 527 - . Target_ID=94238at6656_7245_0:00200b;TCS_ID=94238at6656_7245_0:00200b|sample|-|34845
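GFF3 records such as the two `gene` lines above consist of nine tab-separated columns, with the last column holding `key=value` attributes separated by semicolons. The sketch below shows one way to split such a record; it assumes the original file uses tabs (the whitespace shown in this diff view may have been collapsed), and `parse_gff3_line` is a hypothetical helper, not something defined in this commit.

```python
def parse_gff3_line(line):
    """Hypothetical helper: split one GFF3 feature line into a dict."""
    columns = line.rstrip("\n").split("\t")
    if len(columns) != 9:
        raise ValueError(f"expected 9 tab-separated columns, got {len(columns)}")
    seqid, source, feature, start, end, score, strand, phase, attributes = columns
    # Attributes are key=value pairs separated by ';'.
    attrs = dict(item.split("=", 1) for item in attributes.split(";") if item)
    return {
        "seqid": seqid,
        "source": source,
        "type": feature,
        "start": int(start),
        "end": int(end),
        "score": score,
        "strand": strand,
        "phase": phase,
        "attributes": attrs,
    }


# First gene record from the diff above, rebuilt with tab separators (assumed).
line = "\t".join([
    "sample", "MetaEuk", "gene", "40256", "42071", "198", "+", ".",
    "Target_ID=68987at6656_29053_0:00071c;"
    "TCS_ID=68987at6656_29053_0:00071c|sample|+|40255",
])
record = parse_gff3_line(line)
```

Here `record["attributes"]["Target_ID"]` holds the identifier of the matched BUSCO target, and `record["start"]` and `record["end"]` are integers.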
36 changes: 19 additions & 17 deletions
tools/busco/test-data/genome_results_metaeuk/short_summary
@@ -1,30 +1,32 @@
-# BUSCO version is: 5\.5\.0
-# The lineage dataset is: arthropoda_odb10 \(Creation date: [0-9]{4}-[0-9]{2}-[0-9]{2}, number of genomes: 90, number of BUSCOs: 1013\)
-# Summarized benchmarking in BUSCO notation for file [a-z0-9_\-/\.]+
-# BUSCO was run in mode: euk_genome_met
-# Gene predictor used: metaeuk
+# BUSCO version is: 5.7.1
+# The lineage dataset is: arthropoda_odb10 (Creation date: 2024-01-08, number of genomes: 90, number of BUSCOs: 1013)
+# Summarized benchmarking in BUSCO notation for file /tmp/tmpl5l1blpe/files/f/3/1/dataset_f31d44e3-c824-4cdf-92ba-99a2c26071d2.dat
+# BUSCO was run in mode: euk_genome_min
+# Gene predictor used: miniprot

-\*\*\*\*\* Results: \*\*\*\*\*
+***** Results: *****

-C:0\.2%\[S:0\.2%,D:0\.0%\],F:0\.0%,M:99\.8%,n:1013
-2 Complete BUSCOs \(C\)
-2 Complete and single-copy BUSCOs \(S\)
-0 Complete and duplicated BUSCOs \(D\)
-0 Fragmented BUSCOs \(F\)
-1011 Missing BUSCOs \(M\)
+C:0.1%[S:0.1%,D:0.0%],F:0.0%,M:99.9%,n:1013
+1 Complete BUSCOs (C)
+1 Complete and single-copy BUSCOs (S)
+0 Complete and duplicated BUSCOs (D)
+0 Fragmented BUSCOs (F)
+1012 Missing BUSCOs (M)
 1013 Total BUSCO groups searched

 Assembly Statistics:
 1 Number of scaffolds
 1 Number of contigs
 62370 Total length
-0\.000% Percent gaps
+0.000% Percent gaps
 62 KB Scaffold N50
 62 KB Contigs N50


 Dependencies and versions:
-hmmsearch: [0-9\.\+]+
-bbtools: [0-9\.\+]+
-metaeuk: [0-9a-z\.\+]+
-busco: [0-9\.\+]+
+hmmsearch: 3.1
+bbtools: 39.06
+miniprot_index: 0.13-r248
+miniprot_align: 0.13-r248
+python: sys.version_info(major=3, minor=9, micro=19, releaselevel='final', serial=0)
+busco: 5.7.1
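The old lines in this summary read like regular expressions (escaped parentheses and dots, character classes such as `[0-9\.\+]+`), while the new lines are literal tool output. As a rough illustration, the sketch below checks a few of the old lines against the corresponding new lines using Python's `re.fullmatch`. It assumes the old expected file was compared line by line as regular expressions; the test definition itself is not shown here, so treat that as an assumption.

```python
import re

# Old (pattern-style) lines and new (literal) lines, copied from the diff above.
old_patterns = [
    r"# BUSCO version is: 5\.5\.0",
    r"# The lineage dataset is: arthropoda_odb10 \(Creation date: "
    r"[0-9]{4}-[0-9]{2}-[0-9]{2}, number of genomes: 90, number of BUSCOs: 1013\)",
    r"hmmsearch: [0-9\.\+]+",
]
new_lines = [
    "# BUSCO version is: 5.7.1",
    "# The lineage dataset is: arthropoda_odb10 (Creation date: 2024-01-08, "
    "number of genomes: 90, number of BUSCOs: 1013)",
    "hmmsearch: 3.1",
]

# True where the old pattern still accepts the new literal line.
results = [
    re.fullmatch(pattern, line) is not None
    for pattern, line in zip(old_patterns, new_lines)
]
```

Under that assumption, the hard-coded version pattern `5\.5\.0` no longer accepts `5.7.1`, whereas the looser date and dependency patterns still accept the new output.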
4 changes: 2 additions & 2 deletions
tools/busco/test-data/genome_results_metaeuk_auto/missing_buscos_list
@@ -1,5 +1,2 @@
 ##gff-version 3
 sample MetaEuk gene 34846 35694 527 - . Target_ID=1053181at2759_7245_0:00200b;TCS_ID=1053181at2759_7245_0:00200b|sample|-|34845
-sample MetaEuk mRNA 34846 35694 527 - . Target_ID=1053181at2759_7245_0:00200b;TCS_ID=1053181at2759_7245_0:00200b|sample|-|34845_mRNA;Parent=1053181at2759_7245_0:00200b|sample|-|34845
-sample MetaEuk exon 34846 35694 527 - . Target_ID=1053181at2759_7245_0:00200b;TCS_ID=1053181at2759_7245_0:00200b|sample|-|34845_exon_0;Parent=1053181at2759_7245_0:00200b|sample|-|34845_mRNA
-sample MetaEuk CDS 34846 35694 527 - . Target_ID=1053181at2759_7245_0:00200b;TCS_ID=1053181at2759_7245_0:00200b|sample|-|34845_CDS_0;Parent=1053181at2759_7245_0:00200b|sample|-|34845_exon_0
Binary file modified: +10.9 KB (110%)
tools/busco/test-data/genome_results_metaeuk_auto/summary.png