R&D CENTER

Genome sequence of Plantago depressa, as the first genome in Plantaginaceae

Jongsun Park*, Hong Xi, Yongsung Kim, Chanho Park
Plantago depressa is a perennial flowering plant of the genus Plantago. It inhabits meadows and roadsides and is distributed in Middle and Northeast Asia. Its seeds have been used as a medicine that boosts immunomodulatory and antioxidant activities. We sequenced P. depressa as the first genome in Plantaginaceae, generating 79.7Gbp (around 100.7x coverage) from four different libraries. The current assembly (version 0.5) has a total length of 765.16Mb, which is similar to the genome size estimated using JellyFish (792.26Mb). The maximum scaffold length is 433kb and the N50 length is 27,288bp. Because no RNA-Seq data of P. depressa are publicly available, AUGUSTUS with the Arabidopsis training set was used for gene prediction. A total of 228,356 genes were predicted, with an N50 length of 456aa, which is 56aa lower than that of the Arabidopsis gene models. Of these 228,356 genes, 79,467 (34.80%) have functional domains detected by InterProScan. Most of the 413 cytochrome P450s identified in the current genome are 461 to 520aa long, the same range as Arabidopsis cytochrome P450s. All these data will soon be accessible in the Plantago Genome Database (PGD; http://www.plantagogenome.info/), which provides web-based bioinformatics tools including BLAST search. GlobalScarpTM was also implemented in PGD, providing a virtual space to collect sequences and run several tools for analyzing them on the web. The P. depressa genome and PGD will be useful genomic resources for understanding plants in Plantaginaceae.
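For readers unfamiliar with the summary statistics quoted above, the following is a minimal sketch of how an N50 length and a sequencing depth are commonly computed. It is not the pipeline used for this assembly; the function names `n50` and `coverage` are hypothetical, and the depth is taken against the JellyFish genome size estimate as an assumption.

```python
def n50(lengths):
    """Return the N50 of a list of sequence lengths.

    N50 is the length L such that sequences of length >= L
    together cover at least half of the total length.
    """
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    raise ValueError("lengths must be a non-empty list")


def coverage(sequenced_bases, genome_size):
    """Return sequencing depth as sequenced bases per genome base."""
    return sequenced_bases / genome_size


# Figures quoted in the abstract; the choice of the JellyFish
# estimate as the denominator is an assumption for illustration.
depth = coverage(79.7e9, 792.26e6)
print(f"approximate depth: {depth:.1f}x")
```

The same `n50` function applies to scaffold lengths in bp and to predicted protein lengths in aa, since it only depends on the list of lengths.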