OrthologAnalysis

A flexible pipeline for deducing orthology relationships between proteins in genomic (proteomic) datasets.

The pipeline can generate and consider several kinds of evidence: pairwise (BLASTP/DIAMOND) bi-directional best hits, synteny (conserved gene order), and membership in TIGRfam equivalog families (based on embedded trusted cutoffs).
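To illustrate what a bi-directional best hit is, here is a minimal Python sketch. It is not the logic of multi_bbh.pl. It assumes the aligner was run in both directions with the standard 12-column tabular output (BLAST `-outfmt 6` or DIAMOND's default tabular format), that the bit score in column 12 decides the best hit, and that ties go to the first hit seen. All function names are hypothetical.

```python
from typing import Dict, Iterable, List, Tuple

Hit = Tuple[str, str, float]  # (query id, subject id, bit score)


def parse_tabular_hits(lines: Iterable[str]) -> List[Hit]:
    """Parse 12-column tabular alignment output into (query, subject, bitscore)."""
    hits = []
    for line in lines:
        if line.startswith("#"):
            continue  # skip comment lines
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 12:
            continue  # skip blank or malformed lines
        hits.append((fields[0], fields[1], float(fields[11])))
    return hits


def best_hits(hits: Iterable[Hit]) -> Dict[str, str]:
    """Map each query to its highest-scoring subject (first hit wins ties)."""
    best: Dict[str, Tuple[str, float]] = {}
    for query, subject, score in hits:
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}


def bidirectional_best_hits(a_vs_b: Iterable[Hit],
                            b_vs_a: Iterable[Hit]) -> List[Tuple[str, str]]:
    """Return (a, b) pairs where b is a's best hit and a is b's best hit."""
    forward = best_hits(a_vs_b)
    reverse = best_hits(b_vs_a)
    return sorted((a, b) for a, b in forward.items() if reverse.get(b) == a)


if __name__ == "__main__":
    # Made-up hits: A2's best hit is B2, but B2's best hit is A1,
    # so only A1/B1 is reciprocal.
    a_vs_b = [("A1", "B1", 200.0), ("A1", "B2", 150.0), ("A2", "B2", 90.0)]
    b_vs_a = [("B1", "A1", 198.0), ("B2", "A1", 140.0)]
    print(bidirectional_best_hits(a_vs_b, b_vs_a))  # [('A1', 'B1')]
```

Real pipelines usually add e-value or coverage filters before calling a pair orthologous; those are left out here to keep the idea visible.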

Prerequisites:

perl
Bio::FeatureIO (for parsing gff files)
DIAMOND (https://github.com/bbuchfink/diamond) or BLASTP (https://blast.ncbi.nlm.nih.gov/Blast.cgi?CMD=Web&PAGE_TYPE=BlastDocs&DOC_TYPE=Download)
mcl (https://micans.org/mcl/)
hmmer (http://hmmer.org/)
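Before running the pipeline, it may help to confirm that the prerequisites are installed. The Python sketch below is not part of the repository. It assumes the tools are on the PATH under their usual executable names (`perl`, `mcl`, `diamond`, `blastp`, and `hmmsearch` from the hmmer package), and the function names are hypothetical.

```python
import shutil
import subprocess

REQUIRED_TOOLS = ["perl", "mcl", "hmmsearch"]  # hmmsearch ships with hmmer
ALIGNERS = ["diamond", "blastp"]               # either one is enough


def missing_prerequisites(which=shutil.which):
    """Return the names of prerequisites that cannot be found on PATH."""
    missing = [tool for tool in REQUIRED_TOOLS if which(tool) is None]
    if not any(which(tool) for tool in ALIGNERS):
        missing.append("diamond or blastp")
    return missing


def perl_module_available(module="Bio::FeatureIO"):
    """Return True if `perl -M<module> -e 1` succeeds."""
    try:
        result = subprocess.run(
            ["perl", "-M" + module, "-e", "1"],
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
        )
    except FileNotFoundError:
        return False  # perl itself is not installed
    return result.returncode == 0


if __name__ == "__main__":
    for name in missing_prerequisites():
        print("missing:", name)
    if not perl_module_available():
        print("missing: Perl module Bio::FeatureIO")
```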

This process also uses an altered version of MultiParanoid (

Input is protein FASTA files (one per organism) and GFF3 files (required for synteny analysis). The FASTA identifiers must match the ID values in the GFF3 files.
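Because mismatched identifiers are an easy mistake to make, a quick pre-flight check can help. The Python sketch below is not part of the pipeline. It assumes the FASTA identifier is the first whitespace-delimited token of each header line and that the ID attribute of any GFF3 feature type may carry the protein identifier; the pipeline may be stricter about which feature type it reads. Function names are hypothetical.

```python
from urllib.parse import unquote


def fasta_ids(lines):
    """Collect identifiers: the first token after '>' on each header line."""
    ids = set()
    for line in lines:
        if line.startswith(">"):
            tokens = line[1:].split()
            if tokens:
                ids.add(tokens[0])
    return ids


def gff3_ids(lines):
    """Collect the ID attribute (column 9) of every feature line."""
    ids = set()
    for line in lines:
        if line.startswith("##FASTA"):
            break  # embedded sequence section; no more features follow
        if line.startswith("#") or not line.strip():
            continue
        columns = line.rstrip("\n").split("\t")
        if len(columns) != 9:
            continue  # not a well-formed feature line
        for attribute in columns[8].split(";"):
            key, _, value = attribute.partition("=")
            if key.strip() == "ID" and value:
                ids.add(unquote(value.strip()))  # GFF3 values are URL-escaped
    return ids


def ids_missing_from_gff3(fasta_lines, gff3_lines):
    """Return sorted FASTA identifiers that have no matching GFF3 ID."""
    return sorted(fasta_ids(fasta_lines) - gff3_ids(gff3_lines))
```

With real files, open each one and pass the file handles, for example `ids_missing_from_gff3(open("genome.faa"), open("genome.gff3"))`, where the file names are placeholders. An empty list means every FASTA identifier was found.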

Place all input files in one directory. Running the ortholog_pipeline.sh script should be sufficient to run the whole process. The shell script can be edited to customize the analysis.

Output is a tab-delimited table with the following columns:

mcl cluster id
bbh cluster id
synteny family id
TIGRfam id
genome count
proteins count
product name
gene symbol
role category
EC number
TC number
columns 12 to n: one column per organism containing the accession(s) for putative ortholog family members
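The column layout above can be read back with a short script. The following Python sketch is not part of the repository. It assumes the eleven fixed columns appear in the listed order, followed by one column per organism. Whether the file starts with a header row is not stated here, so that is exposed as a parameter. The field names are hypothetical, and cells are kept as raw strings because the separator between multiple accessions is not documented here.

```python
import csv

# Hypothetical field names for the eleven fixed columns, in the listed order.
FIXED_COLUMNS = [
    "mcl_cluster_id", "bbh_cluster_id", "synteny_family_id", "tigrfam_id",
    "genome_count", "protein_count", "product_name", "gene_symbol",
    "role_category", "ec_number", "tc_number",
]


def parse_ortholog_table(lines, has_header=False):
    """Parse the tab-delimited table into one dict per ortholog family.

    The per-organism cells (column 12 onward) are stored under "members".
    """
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    families = []
    for index, row in enumerate(reader):
        if not row or (has_header and index == 0):
            continue  # skip blank lines and, if requested, the header row
        record = dict(zip(FIXED_COLUMNS, row))
        record["members"] = row[len(FIXED_COLUMNS):]
        families.append(record)
    return families
```

For example, `parse_ortholog_table(open("ortholog_table.txt"))` (a placeholder file name) returns a list whose entries can be filtered on `genome_count` to find families present in every genome.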
