Skip to content

Commit

Permalink
Created a new FASTA distribution of the curated portion of Dfam
Browse files Browse the repository at this point in the history
 and placed it in:

   https://www.dfam.org/releases/current/families/Dfam-RepeatMasker.lib.gz

 This can be used as a baseline library for RepeatClassifier in TETools.
 I added this as a download in getsrc and added Dockerfile commands
 to unzip it and run makeblastdb on it.
  • Loading branch information
Robert Hubley committed Sep 10, 2024
1 parent cfed79f commit 795a110
Show file tree
Hide file tree
Showing 2 changed files with 25 additions and 6 deletions.
30 changes: 24 additions & 6 deletions Dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -123,23 +123,41 @@ RUN cd /opt \
&& make

# Configure RepeatMasker
# With Minimal TE Library
# - Also with full Dfam curated RepeatMasker.lib for RepeatClassifier
RUN cd /opt \
&& tar -x -f src/RepeatMasker-4.1.6.tar.gz \
&& tar -x -f src/RepeatMasker-4.1.7.tar.gz \
&& chmod a+w RepeatMasker/Libraries \
&& chmod a+w RepeatMasker/Libraries/famdb \
&& gunzip src/dfam38_full.0.h5.gz \
&& mv src/dfam38_full.0.h5 /opt/RepeatMasker/Libraries/famdb/dfam38_full.0.h5 \
&& cd RepeatMasker \
&& perl configure \
-hmmer_dir=/opt/hmmer/bin \
-rmblast_dir=/opt/rmblast/bin \
-libdir=/opt/RepeatMasker/Libraries \
-trf_prgm=/opt/trf \
-default_search_engine=rmblast \
&& cd .. && rm src/RepeatMasker-4.1.6.tar.gz
&& gunzip -c src/Dfam-RepeatMasker.lib.gz > RepeatMasker/Libraries/RepeatMasker.lib \
&& /opt/rmblast/bin/makeblastdb -dbtype nucl -in RepeatMasker/Libraries/RepeatMasker.lib \
&& cd .. && rm src/RepeatMasker-4.1.7.tar.gz

# With Dfam root partition
#RUN cd /opt \
# && tar -x -f src/RepeatMasker-4.1.6.tar.gz \
# && chmod a+w RepeatMasker/Libraries \
# && chmod a+w RepeatMasker/Libraries/famdb \
# && gunzip src/dfam38_full.0.h5.gz \
# && mv src/dfam38_full.0.h5 /opt/RepeatMasker/Libraries/famdb/dfam38_full.0.h5 \
# && cd RepeatMasker \
# && perl configure \
# -hmmer_dir=/opt/hmmer/bin \
# -rmblast_dir=/opt/rmblast/bin \
# -libdir=/opt/RepeatMasker/Libraries \
# -trf_prgm=/opt/trf \
# -default_search_engine=rmblast \
# && gunzip -c src/Dfam-RepeatMasker.lib.gz > RepeatMasker/Libraries/RepeatMasker.lib \
# && /opt/rmblast/bin/makeblastdb -dbtype nucl -in RepeatMasker/Libraries/RepeatMasker.lib \
# && cd .. && rm src/RepeatMasker-4.1.6.tar.gz

## Get the RepeatMasker.lib from a full installation
## --and run makeblastdb

# Include config update
COPY tetoolsDfamUpdate.pl /opt/RepeatMasker/tetoolsDfamUpdate.pl
Expand Down
1 change: 1 addition & 0 deletions getsrc.sh
Original file line number Diff line number Diff line change
Expand Up @@ -36,6 +36,7 @@ download https://github.com/TravisWheelerLab/NINJA/archive/0.99-cluster_only.tar
# download https://www.repeatmasker.org/coseg-0.2.3.tar.gz
download https://github.com/rmhubley/coseg/archive/refs/tags/coseg-0.2.3.tar.gz
download https://www.dfam.org/releases/Dfam_3.8/families/FamDB/dfam38_full.0.h5.gz
download https://www.dfam.org/releases/Dfam_3.8/families/Dfam-RepeatMasker.lib.gz
download http://www.repeatmasker.org/RepeatMasker/RepeatMasker-4.1.6.tar.gz
download https://github.com/Dfam-consortium/RepeatModeler/archive/2.0.5.tar.gz RepeatModeler-2.0.5.tar.gz
# download https://github.com/zhangrengang/TEsorter/archive/v1.4.6.tar.gz TEsorter-1.4.6.tar.gz
Expand Down

0 comments on commit 795a110

Please sign in to comment.