From the 647 SSTs, we found that 132 had a significant BLASTN hit (E-score? ?1e-50) to our database of 999 non-redundant cDNA and genomic sequences

From the 647 SSTs, we found that 132 had a significant BLASTN hit (E-score? ?1e-50) to our database of 999 non-redundant cDNA and genomic sequences. GO term annotation of both silk gland-specific transcripts (SSTs) and non-silk gland specific transcripts (nSSTs) using both the full GO and the GO SLIM annotations based on UniProtKB and PFAM homology. (XLSX 399 KB) 12864_2013_6107_MOESM5_ESM.xlsx (399K) GUID:?12E1868B-F1A7-4A6A-B6AC-795E06A2B428 Abstract Background Spiders (Order Araneae) are essential predators in every terrestrial ecosystem largely because they have evolved potent arsenals of silk and venom. Spider silks are high performance materials made almost entirely of proteins, and thus represent an ideal system for investigating genome level evolution of novel protein functions. However, genomic level resources remain limited for spiders. Results We assembled a transcriptome for the Western black widow (assembly, Spidroin, Gene family, Molecular evolution, construction of transcriptomes (e.g. [4, 5]). The transcriptomes can enable identification of functional Norethindrone acetate genes without sequencing and assembling the often Norethindrone acetate repetitive non-coding genomic regions (e.g., [6C10]). Additionally, sequencing mRNAs from specific tissues, developmental time points, or experimental conditions allows for rapid profiling of transcript abundance at a global scale [1] and analyzing phylogenetically restricted adaptations (e.g., social phenotypes in ants [10] and the capsaicinoid pathway in peppers [11]). Spiders (Araneae) are a genome resource poor arthropod order, despite their taxonomic and ecological prominence. Araneae is one of the most species rich metazoan orders, consisting of over 44,000 described species that are found in every terrestrial ecosystem on the globe [12]. Spiders synthesize the most diverse repertoire of functionally differentiated silk fiber types among all the silk producing organisms. They are also by far the largest clade of venomous animals. The most closely related species with fully sequenced and annotated genomes diverged from spiders nearly 500 mya [13C16]. Unlike these fully sequenced arachnids, spiders are characterized by silk and venom production, the mechanics of which are poorly understood, due in part to the paucity of spider genomic resources. Recent studies have begun to use next generation sequencing in spiders to assemble partial transcriptomes for a tarantula, an orb-web weaver [17], two cobweb weavers [18, 19], and three social species in the genus synthesis of silk [24, 25], Rabbit polyclonal to MAP1LC3A but these efforts have been hindered by a lack of understanding of the full molecular processes that create the silk fibers [26, 27]. Past molecular studies of silk have overwhelmingly focused on fiber-forming structural proteins (fibroins). The spider specific fibroins, also called spidroins, are encoded by members of a single gene family [28, 29]. Spidroin genes are known to be very large and internally repetitive (e.g., [30C33]) making them difficult to sequence and assemble assembly of deeply sequenced cDNA fragments to characterize the transcriptome of the Western black widow, (Theridiidae). The Western black widow is an attractive spider with which to investigate the genomics of silk production Norethindrone acetate given their strong dragline silk [41] and the existing molecular characterization of this species’ spidroin encoding genes. has an estimated genome size of 1 1.3 billion bp [42], which while on the lower range of spider genomes, would be challenging to fully sequence. Thus far, spidroin paralogs have been described from five of the six functionally differentiated gland types in the black widow [33C35, 43, 44], including three of the six completely sequenced spidroin genes (builds a three dimensional cobweb that lacks the capture spiral. This collection of spidroin sequences can Norethindrone acetate serve as a benchmark for the quality of the transcriptome and the capacity of the assembly to successfully integrate the large repetitive regions, where transcriptome assembly has traditionally struggled [45, 46]. Our goal was to construct a high quality reference database that could be used to identify tissue specific expression patterns in black widows, and to contribute to ongoing evolutionary genomics of spiders. Here, we identify and analyze genes specifically expressed in silk glands, and thus represent candidates for silk components or involvement with silk synthesis, assembly or regulation. We evaluate the functions of these silk specific genes to generate a transcriptional program for silk glands. We also identify putative gene families to which these silk specific transcripts belong, allowing us to test if silk-restricted expression tends to evolve once, followed by gene expansions, or if, in contrast, silk-restricted expression has been co-opted from gene copies expressed in other tissues. More generally, our multi-tissue transcriptome is important for evolutionary analyses of any adaptive.