Abstract

The silkworm, Bombyx mori, is one of the major insect model organisms, and its draft and fine genome sequences became available in 2004 and 2008, respectively. Transposable elements (TEs) constitute ∼40% of the silkworm genome. To better understand the roles of TEs in organization, structure and evolution of the silkworm genome, we used a combination of de novo, structure-based and homology-based approaches for identification of the silkworm TEs and identified 1308 silkworm TE families. These TE families and their classification information were organized into a comprehensive and easy-to-use web-based database, BmTEdb. Users are entitled to browse, search and download the sequences in the database. Sequence analyses such as BLAST, HMMER and EMBOSS GetORF were also provided in BmTEdb. This database will facilitate studies for the silkworm genomics, the TE functions in the silkworm and the comparative analysis of the insect TEs.

Database URL: http://gene.cqu.edu.cn/BmTEdb/.

Introduction

Transposable elements (TEs) are fragments of DNA that can move in a genome and insert themselves into new chromosomal locations (1). They occupy large fractions of higher eukaryotic genomes owing to their ability to increase copy number in the process of transposition. Based on whether the intermediate they use to move is RNA or DNA, eukaryotic TEs have been subdivided into two major classes, class 1 (retrotransposons) and class 2 (transposons) (2). Class 1 TEs use their encoded transcripts (mRNA), not themselves, to form the transposition intermediate to transpose by ‘copy and paste’ mechanism, whereas class 2 TEs transpose via DNA intermediate by the so-called ‘cut-and-paste’ mechanism. Within each class, TEs are further subdivided on the basis of the structural features or enzymatic criteria (3). Class 1 elements are further classified into two subclasses, the elements that are characterized by long terminal repeats (LTR retrotransposons), and the elements that lack long terminal repeats (non-LTR retrotransposons). Autonomous non-LTR retrotransposons (long interspersed nuclear elements, LINEs) are thought to be responsible for transposition of non-autonomous short interspersed nuclear elements (SINEs) (4). Class 2 elements are further classified into three main subclasses, terminal inverted repeats (TIRs), Helitrons and Mavericks (5).

Although TEs were considered as ‘junk DNA’, now there is compelling evidence that TEs play important roles in the evolution of genes and regulatory networks (6). Besides, the repetitive nature of TEs poses great challenges for genome sequencing, assembly and gene annotation (7, 8). Thus, it is of great importance to identify and annotate TEs in sequenced genomes. However, the identification and classification of TEs in higher eukaryotic genomes are complicated and difficult owing to the fact that their structure and classification are complex, diverse and controversial (3, 9, 10). Nevertheless, a number of approaches and tools reviewed in recent articles (1, 7, 11) have been developed in the past decades. These approaches and tools are divided into three main types: de novo, homology-based and structure-based (1, 7). As for de novo methods, two basic approaches have been employed—query vs. query similarity searches and word counting/seed extension (1). Because different types of approaches have both advantages and drawbacks, a combined approach is required to accurately identify, classify and annotate TEs in a given genome.

The silkworm, Bombyx mori, is one of the major insect model organisms, and its draft genome sequence became available in 2004 (12). TEs constitute ∼40% of the silkworm genome (13). Before the release of the silkworm genome, many TEs had been identified, such as Pao, Kamikaze and Yamato, BMC1, L1Bm, R1Bm, R2Bm, SART1, TRAS1, Bm1 and Mariner. A de novo repeat library for silkworm was created using a repeat search program Recovery of Ancestral Sequences (ReAS) (14), which can recover ancestral sequences for TEs from the unassembled whole genome shotgun reads, based on the silkworm whole-genome shotgun sequences (15). And Osanai-Futahashi et al. (2008) modified this library by adding 22 known TEs, named it as TELib and annotated the sequences based on homology (13). This library is incomplete because ReAS, like other de novo methods, tended to miss or split structurally composite repeats (i.e. LTR retrotransposons, non-LTR retrotransposons) (1). Accordingly, in this work, we used a combined approach to identify, classify and annotate TEs in the silkworm genome, and organized the obtained results into a database, BmTEdb. It can be accessed at the address http://gene.cqu.edu.cn/BmTEdb/.

Database construction and content

Data sources

The new assembly of the silkworm genome, the nucleotide sequences of protein-coding genes and the sequences of tRNA and rRNA were downloaded from the Web site of the silkworm genome database, SilkDB v2.0 (16). The silkworm repeat library TELib- and bacterial artificial chromosome (BAC)-assembled sequences were downloaded from the KAIKObase (17). The Repbase Update collection (18), RepBase version 16.04 was downloaded from the Genetic Information Research Institute (http://www.girinst.org/).

Collection and identification of TEs in the silkworm genome

Two libraries are generated in this section: BmTE and BmTEdenovo. BmTE integrated the results obtained from Step1 to Step 3. BmTEdenovo is composed of sequences from library generated by PILER (PILER_Lib) and TELib. Because sequences in the former are highly reliable, BmTE is used to annotate the latter one. A flowchart is shown in Figure 1.

Figure 1.

An outline of the collection and identification of TEs in the silkworm genome.

Step 1: On 5 June 2011, the NCBI nucleotide database was searched using the following search expression (silkworm [Organism] OR Bombyx [Organism]) AND ((((((transposon [All Fields]) OR retrotransposon [All Fields]) OR LTR retrotransposon [All Fields]) OR non-LTR retrotransposon [All Fields]) OR mobile element [All Fields]) OR retro-element [All Fields]). Four hundred eighty sequences were retrieved. Not all these sequences are real silkworm TEs and some of these sequences only include partial regions of TEs. Then the sequences were inspected one by one to check whether it is real TE or to extract TE sequences from the position information provided in the descriptive text. In this step, we excluded 20 sequences. Eighty-seven silkworm TE families were extracted from RepBase version 16.04. Then TE sequences from NCBI were integrated into BmTE after being masked by 87 RepBase-derived TE sequences using RepeatMasker (Smit, AFA, Hubley, R & Green, P. RepeatMasker Open-3.0.1996-2010 <http://www.repeatmasker.org>). Recently, 17 families of miniature inverted transposable elements (MITEs) have been identified in the silkworm genome by a MITE search program named MUST, followed by a strict approach to filter pseudo-MITEs [see (19) for details]. In addition, two families of silkworm SINEs named BmSE and BmSer_SINE have been identified in two separate studies (20, 21). To our knowledge, these TE families have not been submitted to the NCBI nucleotide database, and thus we collected these families into our data set. Besides, 22 families of Tc1/Mariner DNA transposons in the silkworm genome have been identified using the same methods for DNA transposons in Step 3 (Zhang HH, unpublished data), and these consensus sequences were also included. In this step, 228 sequences were generated (Figure 1).

Step 2: Structure-based computational tools LTR_STRUC (22), LTR_FINDER (23) and MGEScan-non-LTR (24) with default parameters were used to search the new assembly of the silkworm genome (15), the silkworm repeat library TELib and BAC assembled sequences to identify the LTR retrotransposons and non-LTR retrotransposons. LTR retrotransposons identified from the draft sequence of the silkworm genome were included in this study (25). For LTR retrotransposons, the obtained candidates with known silkworm transposons in their LTRs were removed. Then LTR retrotransposons were searched for known pfam models online. We considered LTR retrotransposons with similarity to these pfam models [Peptidase_A17 (PF05380), RVT_1 (PF00078), RVT_2 (PF07727), Retrotransposon gag protein (PF03732), Integrase DNA binding domain (PF00552), Integrase core domain (PF00665), Retroviral aspartyl protease (PF00077 and PF08284), zf-H2C2 (PF09337), DUF1759 (PF03564) and DUF1758 (PF05585)] as full-length LTR retrotransposons. The cut-off e-value for this process is 1 e-5. Because these full-length LTR retrotransposons have few full-length copies (<3) in the silkworm genome, we cannot identify consensus sequences. And for each family, we keep a full-length LTR retrotransposon as a representative. For non-LTR retrotransposons, obtained non-LTR retrotransposons were first masked by known non-LTR retrotransposons from RepBase using RepeatMasker. Non-LTR retrotransposons of which >80% were masked were removed, and new candidates were used as queries to search silkworm genomes. The consensus was created if there were more than three copies; otherwise the candidates were retained as a representative. In this step, 92 LTR retrotransposons families and 4 non-LTR retrotransposons families were identified (Figure 1).

Step 3: Two hundred thirty-eight transposase protein sequences from insect DNA transposons were extracted from the RepBase version 16.04. These sequences were used as queries for tBLASTN search against the new assembly of the silkworm genome. The sequences whose coverage was <50% or whose similarity to the query was <30% were removed. The obtained sequences with 5 kb of flanking region were searched by using FastPCR v6.1 (26) to look for TIRs with initial searching word size of 12, filter minimal string length of 17, minimal alignment string length of 18, gap size between strings of 10, local similarity of 75 and alignment output length of 100. In total, 40 DNA families were identified in this step (Figure 1).

The above results were integrated into a silkworm TE library, BmTE, and the redundant sequences were removed from the library based on the standard used in the previous study (27). Specifically, sequences within the library were compared with one another using BLASTN, and any sequences covering ≥90% of the length of, and with ≥90% identity to, any other sequences were discarded. This library was used to annotate the following repetitive sequences identified using de novo approaches because these sequences were highly reliable (7).

The de novo program PILER-DF (28) was used to identify repetitive sequences in the new assembly of the silkworm genome. It relies on the pairwise aligner for long sequences (PALS) algorithm that finds local alignments between DNA sequences by aligning the genome to itself. Because the genome is too large to align the entire sequence to itself, we assigned the scaffolds into chromosomes according to the mapping information indicated in the SilkDB v2.0 (16). After this, PALS was used to align one chromosome to another. The minimum hit length is specified by the -length option (200 in this study), the minimum identity by the -pctid option (94.0 used in this study).

The number of sequences in the PILER_Lib is 2239. PILER_Lib was integrated with TELib [total 1690 sequences, 1668 ReAS de novo repetitive sequences, 17 known TEs and five sequences contributed by Hiroaki Abe (13)], and the resulting library was designated as BmTEdenovo. Sequences in the BmTEdenovo were sorted by length from short to long, and the redundant sequences were removed from BmTEdenovo based on the above standard. The non-redundant 2220 sequences were checked to rule out two potential types of false positives: sequences with close similarity to the annotated genes (protein-coding genes, tRNA and rRNA) in the silkworm, and sequences containing a significant fraction (>25%) of tandem repeats, as determined by Tandem Repeat Finder (TRF) (29) with recommended parameters on the TRF Web site (http://tandem.bu.edu/trf/trf.unix.help.html). After this procedure, 2134 TE sequences were obtained.

Classification and annotation of TEs in the silkworm genome

The resulting 2134 sequences were further analyzed for their TE class. Specifically, TE sequences with ≥90% nucleic acid identity as well as >50% in length to any sequences in the library BmTE were discarded from BmTEdenovo. For remaining sequences, tBLASTX searches were performed by CENSOR (30) in the sensitive mode using BmTE and RepBase16.04 as repeat libraries. The sequences with >50% similarity were removed from BmTEdenovo and added into BmTE. The definition of ‘Chimera’ and ‘Others’ was based on the criteria used in the previous study (13). The rest of the sequences were classified by using TEclass (31), a tool classifying unknown TEs into their functional categories using machine learning support vector machine for classification. These sequences were classified as ‘unknown’.

False-positive rate of this pipeline

We tested LTR_STRUC and MGEScan-non-LTR using reversed silkworm genomic sequence. LTR_STRUC returned 350 LTR retrotransposons candidates. And after a validation of known silkworm TEs in LTRs of LTR retrotransposons candidates, 350 sequences were retained. Then we considered LTR retrotransposons with similarity to these pfam models (see above) as full-length LTR retrotransposons. Only one candidate was retained, and the e-value was 0.035, larger than the cutoff of 1 e-5. MGEScan-non-LTR returned no results using reversed silkworm genomic sequence. So the false discovery rate of LTR_STRUC and MGEScan-non-LTR plus additional process is estimated to be 0.

We also tested PALS (search engine) and PILER using shuffled silkworm genomic sequences. Each silkworm scaffold sequence was shuffled with the entropy source of TRUE_RANDOM and the starting ordinal of zero by shuffle program, which was downloaded from http://eyegene.ophthy.med.umich.edu/shuffle/. PALS did not find any local alignments between these shuffled sequences. Thus, the false discovery rate of PALS and PILER is estimated to be 0.

Results and user interface

Using the approaches above, we identified 1308 TE families in the silkworm genome and organized these families and their classification information into an easy-to-use web-based database, BmTEdb. The composition of BmTEdb can be found in Table 1. The BmTEdb web interface is organized into four functional sections, data browsing section, keywords search section, sequences analysis tools section and help information section.

Table 1.

Description of TEs deposited in BmTEdb

ClassOrderSuperfamilyNumber of entries
Class 1LTRCopia23
Gypsy121
Bel73
Chimera LTR3
Unknown LTR119
LINECR122
CRE4
Daphne18
I27
Jockey14
Kiri1
L17
L212
LOA1
Nimb1
Proto24
R132
R21
R48
RTE41
Vingi2
Chimera LINE11
Unknown LINE264
SINEBM13
BM1-related5
BmSer1
tRNA2
Unknown SINE6
Class 2TIRAcadem4
Chapaev4
EnSpm1
Harbinger9
hAT11
ISL2EU1
Tc1/Mariner32
P6
piggyBac15
Sola21
Unknown TIR269
Transib3
Zator1
MITEMITE19
HelitronHelitron7
OtherOtherOther79
ClassOrderSuperfamilyNumber of entries
Class 1LTRCopia23
Gypsy121
Bel73
Chimera LTR3
Unknown LTR119
LINECR122
CRE4
Daphne18
I27
Jockey14
Kiri1
L17
L212
LOA1
Nimb1
Proto24
R132
R21
R48
RTE41
Vingi2
Chimera LINE11
Unknown LINE264
SINEBM13
BM1-related5
BmSer1
tRNA2
Unknown SINE6
Class 2TIRAcadem4
Chapaev4
EnSpm1
Harbinger9
hAT11
ISL2EU1
Tc1/Mariner32
P6
piggyBac15
Sola21
Unknown TIR269
Transib3
Zator1
MITEMITE19
HelitronHelitron7
OtherOtherOther79
Table 1.

Description of TEs deposited in BmTEdb

ClassOrderSuperfamilyNumber of entries
Class 1LTRCopia23
Gypsy121
Bel73
Chimera LTR3
Unknown LTR119
LINECR122
CRE4
Daphne18
I27
Jockey14
Kiri1
L17
L212
LOA1
Nimb1
Proto24
R132
R21
R48
RTE41
Vingi2
Chimera LINE11
Unknown LINE264
SINEBM13
BM1-related5
BmSer1
tRNA2
Unknown SINE6
Class 2TIRAcadem4
Chapaev4
EnSpm1
Harbinger9
hAT11
ISL2EU1
Tc1/Mariner32
P6
piggyBac15
Sola21
Unknown TIR269
Transib3
Zator1
MITEMITE19
HelitronHelitron7
OtherOtherOther79
ClassOrderSuperfamilyNumber of entries
Class 1LTRCopia23
Gypsy121
Bel73
Chimera LTR3
Unknown LTR119
LINECR122
CRE4
Daphne18
I27
Jockey14
Kiri1
L17
L212
LOA1
Nimb1
Proto24
R132
R21
R48
RTE41
Vingi2
Chimera LINE11
Unknown LINE264
SINEBM13
BM1-related5
BmSer1
tRNA2
Unknown SINE6
Class 2TIRAcadem4
Chapaev4
EnSpm1
Harbinger9
hAT11
ISL2EU1
Tc1/Mariner32
P6
piggyBac15
Sola21
Unknown TIR269
Transib3
Zator1
MITEMITE19
HelitronHelitron7
OtherOtherOther79

In the browsing interface, classification structures of TEs in BmTEdb are shown. Users can browse different superfamilies by clicking them, as shown in Figure 2A. And the detailed information of each family in this superfamily can be retrieved by clicking the corresponding entry, including the classification information, references, nucleotide sequences, copy numbers and coordinates of each copy (Figure 2A).

Figure 2.

Different functional sections of BmTEdb web interface. (A) The browsing interface of database BmTEdb. All TE sequences in BmTEdb were classified into different classes, orders and superfamilies. Users may choose superfamily they are interested to view the families that belong to the chosen TE superfamily. The detailed information of each family can be retrieved by clicking the corresponding entry, including the classification information, references, nucleotide sequences, copy numbers and coordinates of each copy. (B) The searching interface of database BmTEdb. Users can search for TE sequences with keywords through the keyword search interface. (C) The BLAST interface of database BmTEdb. A sample of tBLASTN results was shown.

In the keyword search interface of BmTEdb, users can use a keyword to search the BmTEdb (TE class, TE order, TE superfamily, TE family and references) to find entries interesting to the users, as shown in Figure 2B.

The web interface of sequence analysis tools is provided to facilitate the quick comparison of users’ sequences with the silkworm TEs deposited in BmTEdb. Three types of tools, BLAST (32), GetORF [EMBOSS (33)] and HMMER (34) are included in the BmTEdb infrastructure to assist the annotation of TE elements based on nucleotide sequence and protein sequence. Through this interface, users can submit the query sequences to do BLASTN or tBLASTN against the BmTEdb for homology search. Users can also search the potential open reading frame (ORF) of the query sequence in the GetORF page, and then search protein sequences against TE profile-HMM (profile hidden Morkov model) database collected from previous studies (24, 35). In them, these models were used in the identification of protein domains in LTR retrotransposons and the classification of non-LTR retrotransposons superfamilies. HMMER is provided to facilitate the identification and classification of TEs from evolutionarily distant species that do not show similarity to silkworm TEs at nucleotide level. A sample of tBLASTN results is shown in Figure 2C.

In the help information section, a user’s manual is included in the interface to help the users to learn how to use BmTEdb. Besides the help section, a collection of computational resources for TEs is provided in the links page.

Conclusion

We have generated a comprehensive TE database for B. mori, BmTEdb. This database currently consists of 1308 TE families in the silkworm genome along with classification information. Various web interfaces are provided in support of using BmTEdb by users. One unique feature of BmTEdb is that it allows users to search the potential ORF of the nucleotide sequence, and then search protein sequences against a customized TE profile-HMM database. BmTEdb will be valuable for study of the silkworm TEs. In addition, the availability of the complete set of TEs from Lepidoptera species allows evolutionary and comparative analyses of TEs between Lepidoptera and other insect species at the whole genome level.

Accessibility

The publicly accessible BmTEdb Web site is http://gene.cqu.edu.cn/BmTEdb/. All data deposited in the database except the data that derive directly from RepBase are freely available to all users without any restrictions.

Acknowledgements

We are grateful for the comments and suggestions from three anonymous reviewers.

Funding

This work was supported by the Hi-Tech R&D Program (863) of China (2013AA102507), Natural Science Foundation Project of CQ CSTC (cstc2012jjB80007) and the Doctorial Innovation Fund of Southwest University in China (kb2010016). Funding for open access charge: Hi-Tech R&D Program (863) of China (2013AA102507).

Conflict of interest. None declared.

References

1
Feschotte
C
Pritham
EJ
Stojanovic
N
Computational analysis and paleogenomics of interspersed repeats in eukaryotes
Computational Genomics: Current Methods
2007
London
Taylor & Francis
(pg. 
31
-
54
)
2
Finnegan
DJ
Eukaryotic transposable elements and genome evolution
Trends Genet.
1989
, vol. 
5
 (pg. 
103
-
107
)
3
Wicker
T
Sabot
F
Hua-Van
A
, et al. 
A unified classification system for eukaryotic transposable elements
Nat. Rev. Genet.
2007
, vol. 
8
 (pg. 
973
-
982
)
4
Okada
N
Hamada
M
Ogiwara
I
, et al. 
SINEs and LINEs share common 3′ sequences: a review
Gene
1997
, vol. 
205
 (pg. 
229
-
243
)
5
Feschotte
C
Pritham
EJ
DNA transposons and the evolution of eukaryotic genomes
Annu. Rev. Genet.
2007
, vol. 
41
 (pg. 
331
-
368
)
6
Feschotte
C
Transposable elements and the evolution of regulatory networks
Nat. Rev. Genet.
2008
, vol. 
9
 (pg. 
397
-
405
)
7
Bergman
CM
Quesneville
H
Discovering and detecting transposable elements in genome sequences
Brief. Bioinform.
2007
, vol. 
8
 (pg. 
382
-
392
)
8
Tang
H
Genome assembly, rearrangement, and repeats
Chem. Rev.
2007
, vol. 
38
 (pg. 
3391
-
3406
)
9
Seberg
O
Petersen
G
A unified classification system for eukaryotic transposable elements should reflect their phylogeny
Nat. Rev. Genet.
2009
, vol. 
10
 pg. 
276
 
10
Kapitonov
VV
Jurka
J
A universal classification of eukaryotic transposable elements implemented in Repbase
Nat. Rev. Genet.
2008
, vol. 
9
 (pg. 
411
-
412
)
11
Lerat
E
Identifying repeats and transposable elements in sequenced genomes: how to find your way through the dense forest of programs
Heredity (Edinb)
2009
, vol. 
104
 (pg. 
520
-
533
)
12
Xia
Q
Zhou
Z
Lu
C
, et al. 
A draft sequence for the genome of the domesticated silkworm (Bombyx mori)
Science
2004
, vol. 
306
 (pg. 
1937
-
1940
)
13
Osanai-Futahashi
M
Suetsugu
Y
Mita
K
, et al. 
Genome-wide screening and characterization of transposable elements and their distribution analysis in the silkworm, Bombyx mori
Insect Biochem. Mol. Biol.
2008
, vol. 
38
 (pg. 
1046
-
1057
)
14
Li
R
Ye
J
Li
S
, et al. 
ReAS: Recovery of ancestral sequences for transposable elements from the unassembled reads of a whole genome shotgun
PLoS Comp. Biol.
2005
, vol. 
1
 pg. 
e43
 
15
Xia
QY
Wang
J
Zhou
ZY
, et al. 
The genome of a lepidopteran model insect, the silkworm Bombyx mori
Insect Biochem. Mol. Biol.
2008
, vol. 
38
 (pg. 
1036
-
1045
)
16
Duan
J
Li
R
Cheng
D
, et al. 
SilkDB v2.0: a platform for silkworm (Bombyx mori) genome biology
Nucleic Acids Res.
2010
, vol. 
38
 (pg. 
D453
-
D456
)
17
Shimomura
M
Minami
H
Suetsugu
Y
, et al. 
KAIKObase: An integrated silkworm genome database and data mining tool
BMC Genomics
2009
, vol. 
10
 pg. 
486
 
18
Jurka
J
Kapitonov
VV
Pavlicek
A
, et al. 
Repbase Update, a database of eukaryotic repetitive elements
Cytogenet. Genome Res.
2005
, vol. 
110
 (pg. 
462
-
467
)
19
Han
MJ
Shen
Y-H
Gao
Y-H
, et al. 
Burst expansion, distribution and diversification of MITEs in the silkworm genome
BMC Genomics
2010
, vol. 
11
 pg. 
520
 
20
Xu
J
Liu
T
Li
D
, et al. 
BmSE, a SINE family with 3′ ends of (ATTT) repeats in domesticated silkworm (Bombyx mori)
J Genet Genomics
2010
, vol. 
37
 (pg. 
125
-
135
)
21
Huang
K
Identification and fuction analysis of short interspersed repetitive sequence (BmSer_SINE) of silkworm, Bombyx mori. Ph.D Thesis
2009
 
The Institute of Sericulture and Systems Biology, Southwest University
22
McCarthy
EM
McDonald
JF
LTR_STRUC: a novel search and identification program for LTR retrotransposons
Bioinformatics
2003
, vol. 
19
 (pg. 
362
-
367
)
23
Xu
Z
Wang
H
LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons
Nucleic Acids Res.
2007
, vol. 
35
 (pg. 
W265
-
W268
)
24
Rho
M
Tang
H
MGEScan-non-LTR: computational identification and classification of autonomous non-LTR retrotransposons in eukaryotic genomes
Nucleic Acids Res.
2009
, vol. 
37
 pg. 
e143
 
25
Xu
JS
Xia
QY
Li
J
, et al. 
Survey of long terminal repeat retrotransposons of domesticated silkworm (Bombyx mori)
Insect Biochem. Mol. Biol.
2005
, vol. 
35
 (pg. 
921
-
929
)
26
Kalendar
R
Lee
D
Schulman
AH
FastPCR software for PCR primer and probe design and repeat search
Genes Genomes Genomics
2009
, vol. 
3
 (pg. 
1
-
14
)
27
Smith
CD
Edgar
RC
Yandell
MD
, et al. 
Improved repeat identification and masking in Dipterans
Gene
2007
, vol. 
389
 (pg. 
1
-
9
)
28
Edgar
RC
Myers
EW
PILER: identification and classification of genomic repeats
Bioinformatics
2005
, vol. 
21
 
Suppl 1
(pg. 
i152
-
i158
)
29
Benson
G
Tandem repeats finder: a program to analyze DNA sequences
Nucleic Acids Res.
1999
, vol. 
27
 (pg. 
573
-
580
)
30
Kohany
O
Gentles
AJ
Hankus
L
, et al. 
Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor
BMC Bioinformatics
2006
, vol. 
7
 pg. 
474
 
31
Abrusan
G
Grundmann
N
DeMester
L
, et al. 
TEclass—a tool for automated classification of unknown eukaryotic transposable elements
Bioinformatics
2009
, vol. 
25
 (pg. 
1329
-
1330
)
32
Altschul
SF
Madden
TL
Schaffer
AA
, et al. 
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
Nucleic Acids Res.
1997
, vol. 
25
 (pg. 
3389
-
3402
)
33
Rice
P
Longden
I
Bleasby
A
EMBOSS: the European Molecular Biology Open Software Suite
Trends Genet.
2000
, vol. 
16
 (pg. 
276
-
277
)
34
Finn
RD
Clements
J
Eddy
SR
HMMER web server: interactive sequence similarity searching
Nucleic Acids Res.
2011
, vol. 
39
 (pg. 
W29
-
W37
)
35
Rho
M
Choi
JH
Kim
S
, et al. 
De novo identification of LTR retrotransposons in eukaryotic genomes
BMC Genomics
2007
, vol. 
8
 pg. 
90
 

Author notes

Present address: Hong-En Xu, Department of Genome Oriented Bioinformatics, Wissenschaftszentrum Weihenstephan, TU Muenchen, 85354 Freising, Germany

Citation details: Xu,H.-E., Zhang,H.-H., Xia,T., et al. BmTEdb: a collective database of transposable elements in the silkworm genome. Database (2013) Vol. 2013: article ID bat055; doi:10.1093/database/bat055.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.