Thursday, September 15, 2022
HomeBiotechnologyComparative transcriptome profiling and molecular marker growth for oil palm fruit colour

Comparative transcriptome profiling and molecular marker growth for oil palm fruit colour


Transcriptome sequencing and de novo meeting

Excessive-throughput sequencing was used, ensuing within the era of 48,017,160–66,162,228 pairs and 46,896,912–64,521,874 pairs of uncooked reads and clear reads from the nigrescent-type library and the virescent-type library, respectively, with quantities of unpolluted bases starting from 7.0 to 9.7 Gb, respectively. High quality management evaluation of the information revealed a minimal error base price of 0.01% with a median high quality worth and Q20 and Q30 scores larger than 97% and 93%, respectively. The CG content material of the clear reads ranged from 48.61 to 50.16% (Suppl. 1). Primarily based on the high-quality clear reads, a complete of 143,797 transcripts and 143,668 unigenes had been recognized. Of those, 46,664 unigenes with a measurement between 500 and 1000 bp had been the most typical. A complete of 160,423,261 nucleotides had been assembled, which had a imply size of 1117 bp, an N50 of 1664 bp and an N90 of 505 bp (Desk 1).

Desk 1 Statistics of the meeting high quality.

Practical annotations of unigenes

Among the many 143,668 unigenes, 95.048 unigenes (66.15%) had been probably the most important within the NCBI nucleotide sequence (Nt) database, adopted by the NCBI nonredundant protein (NR) (76,996 unigenes, 53.59%), SwissProt (62,143 unigenes, 43.25%), gene ontology (GO) (58,842 unigenes, 40.95%), Protein household (Pfam) (58,677 unigenes, 40.85%), Kyoto Encyclopedia of Genes and Genomes (KEGG) Ortholog (KO) (31,191 unigenes, 21.71%) and euKaryotic Ortholog Teams (KOG) (29,939 unigenes, 20.83%) databases (Desk 2). A complete of 17,374 unigenes (12.09%) matched entries in 5 of the seven databases (Suppl. 2). BLASTX similarity evaluation of the 143,668 unigenes revealed 33.4% (47,985 unigenes) important homology (E-values < 1e−100) with the opposite sequences within the NR database, with the best similarity distribution of unigenes starting from 80 to 90%. The NR annotation species distribution evaluation revealed homology with sequences of E. guineensis (63.9%), Phoenix dactylifera (9.2%) and Pyrus × (7.0%) (Suppl. 3).

Desk 2 Abstract of the annotation data of unigenes associated to grease palm fruit exocarp colour in public databases.

Moreover, the 143,668 unigenes had been annotated into three main classes, i.e., organic processes, molecular features and mobile elements, with 57 subcategories (Suppl. 4). Most unigenes within the organic course of class had been particular to mobile processes (33,805 unigenes), metabolic processes (30,781 unigenes) and single-organism processes (24,691 unigenes). Within the molecular operate class, many of the genes had been implicated in binding (33,217 unigenes), adopted by catalytic exercise (25,917 unigenes) and transporter exercise (4021 unigenes). Within the mobile part class, cells (18,594 unigenes) had been probably the most represented, adopted by cell components (18,592 unigenes) and macromolecular complexes (12,211 unigenes).

As well as, KOG operate classification annotated 29,939 unigenes into 25 classes, amongst which the vast majority of genes (4459 unigenes) had been related to common operate prediction solely, adopted by posttranslational modification, protein turnover, chaperones (4037 unigenes) and sign transduction mechanisms (2933 unigenes) (Suppl. 5).

Subsequent, the unigenes had been mapped to the KEGG pathway database to determine the organic pathways. In complete, 31,191 unigenes had been labeled into 21 secondary pathways of 5 main pathways. Metabolism-related pathways had been the vast majority of these pathways (10 pathways), adopted by genetic data processing (4 pathways). Amongst these unigenes, the interpretation class had the biggest variety of unigenes (3008 unigenes), adopted by the carbohydrate metabolism class (2498 unigenes). The folding, sorting and degradation class (2430 unigenes) was the smallest group (Suppl. 6).

Furthermore, when the unigenes within the transcription issue group had been analyzed, 4805 unigenes had been finally mapped to 80 transcription issue households. Of those, the C2H2 transcription issue was probably the most ample (390 unigenes), adopted by the MYB transcription issue (298 unigenes), C3H transcription issue (232 unigenes), AP2-EREBP transcription issue (225 unigenes), and orphan transcription components (204 unigenes) (Suppl. 7).

Identification and purposeful annotation of DEGs

DEGs had been analyzed after normalization of the learn depend of every unigene. The proportion of complete mapped reads of every pattern ranged from 75 to 80%. G_exo had the best complete mapped reads (Suppl. 8). Many of the DEGs had been downregulated, and there have been no correlations. The G_exo:B_exo comparability group contained probably the most DEGs (6345 genes), with 3359 downregulated genes. Nonetheless, the O_exo:R_exo comparability group contained extra upregulated genes (1917 genes) than downregulated genes (1889 genes) (Fig. 1). To find out the features and purposeful patterns of the unknown genes, correlations of the DEGs had been decided. The evaluation outcomes confirmed 4 completely different gene expression patterns in response to fruit colour. B_exo introduced a DEG sample that was just like that of R_exo, whereas G_exo introduced a DEG sample just like that of Or_exo (Suppl. 9).

Determine 1
figure 1

Volcano plot of nigrescent and virescent oil palm fruit in every fruit growth stage. (G_exo = inexperienced immature virescent kind, Or_exo = orange mature virescent kind, B_exo = black immature nigrescent kind and R_exo = pink mature nigrescent kind).

Moreover, GO enrichment evaluation of the genes differentially expressed between every morphotype (B_exo:R_exo and G_exo:Or_exo) revealed related outcomes at every GO degree. Within the organic course of class, metabolic processes had been related to the best gene expression, adopted by single-organism metabolic processes. Within the molecular operate class, catalytic exercise was related to the best gene expression, adopted by transferase exercise (virescent kind), steel ion binding (nigrescent kind) and cation binding (nigrescent kind) (Fig. 2). In distinction, KEGG pathway enrichment evaluation of the DEGs revealed important variations in every kind. Within the virescent kind, the genes concerned within the carbon metabolism pathway had been probably the most extremely expressed, adopted by these concerned within the biosynthesis of amino acids pathway. Within the nigrescent kind, most DEGs had been related to plant–pathogen interactions adopted by glycolysis/gluconeogenesis (Fig. 3).

Determine 2
figure 2

GO phrases of genes differentially expressed between oil palm fruit varieties and fruit growth phases. (a) Nigrescent immature/mature kind. (b) Virescent immature/mature kind. (BP = organic course of, CC = mobile part, MF = molecular operate).

Determine 3
figure 3

KEGG pathway enrichment evaluation of genes differentially expressed between oil palm fruit varieties and at completely different fruit growth phases. (a) Nigrescent immature/mature kind. (b) Virescent immature/mature kind.

Comparability of genes differentially expressed between nigrescent and virescent varieties

Annotations of the DEGs concerned within the ABP revealed increased transcription on the immature stage than on the mature stage, particularly within the nigrescent kind, leading to upregulated gene expression. The genes concerned within the ABP had been cinnamic acid 4-hydroxylase (C4H), chalcone synthase (CHS), flavanone 3-hydroxylase (F3H), flavonoid 3-monooxygenase (F3’H), flavonoid 3′, 5′-hydroxylase (F3′5′H), dihydroflavonol-4-reductase (DFR), leucoanthocyanidin reductase (LAR), anthocyanidin synthase (ANS), anthocyanidin reductase (ANR), flavonoid 3-O-glucosyltransferase (3GT) and shikimate O-hydroxycinnamoyl transferase (HCT). Essentially the most regulated on this pathway was 3GT. As well as, each fruit varieties had been discovered to comprise two homologs of 3GT, three homologs of C4H, and 4 homologs of CHS and F3′5′H (Fig. 4). In distinction, the annotations of the DEGs concerned in CBP revealed increased transcription of genes on the mature stage than on the immature stage, particularly within the virescent kind, ensuing within the downregulation of genes comparable to phytoene synthase (PSY), phytoene desaturase (PDS), ζ-carotene desaturase (ZDS), β-ring hydroxylase (LUT5) and β-carotene 3-hydroxylase (CRTZ). Lycopene ε-cyclase (LCYE) and β-carotene isomerase (DWARF27) had been extremely upregulated within the virescent kind in contrast with the nigrescent kind (Fig. 5).

Determine 4
figure 4

Simplified scheme and heatmap of the expression of genes associated to flavonoid and anthocyanin biosynthesis within the exocarp of oil palm fruit of nigrescent (Nig) and virescent (Vig) varieties. Every coloured cell represents the typical log2(FPKM) worth of every pathway gene in response to the colour scale. PAL, phenylalanine ammonia-lyase; C4H, cinnamic acid 4-hydroxylase; CHS, chalcone synthase; CHI, chalcone isomerase; HCT, shikimate O-hydroxycinnamoyl transferase; F3H, flavanone 3-hydroxylase; F3′H, flavonoid 3-monooxygenase; F3′5′H, flavonoid 3′, 5′-hydroxylase; DFR, dihydroflavonol-4-reductase; LAR, leucoanthocyanidin reductase; ANS, anthocyanidin synthase; ANR, anthocyanidin reductase; 3GT, flavonoid 3-O-glucosyltransferase; MT, methyl transferase.

Determine 5
figure 5

Simplified scheme and heatmap of the expression of genes associated to carotenoid biosynthesis within the exocarp oil palm fruit of nigrescent (Nig) and virescent (Vig) varieties. Every coloured cell represents the typical log2(FPKM) worth of every pathway gene in response to the colour scale. PSY, phytoene synthase; PDS, phytoene desaturase; ZDS, ζ-carotene desaturase; LCYE, lycopene ε-cyclase; LCYB, lycopene β-cyclase; LUT1, carotene ε-ring hydroxylase; LUT5, β-ring hydroxylase; CRTZ, β-carotene 3-hydroxylase; DWARF27, β-carotene isomerase.

EST-SSR discovery

A complete of 902 genes differentially expressed between immature and mature virescent varieties had been employed for primer design, of which 114 genes (144 positions) (12.64%) had SSR-like positions. As anticipated, trinucleotide repeat motifs had been probably the most frequent kind of microsatellite (55; 38.19%). The opposite repeat motifs occurred at a lot decrease frequencies, i.e., there have been 42 (29.17%), 35 (24.31%), 7 (4.86%) and 5 (3.47%) dinucleotide, hexanucleotide, pentanucleotide and tetranucleotide repeats, respectively. Nonetheless, 82 primers (56.94%) had flanking sequences adequate for designing applicable distinctive primers (Suppl. 10). Fifty out of those 82 primer (60.98%) pairs had been successfully amplified. Of those, six (12%) had been polymorphic, with 2–4 alleles per SSR and ten genotype patterns (Fig. 6). Nonetheless, the allele frequency and genotype frequency evaluations revealed that just one SSR marker might distinguish between the nigrescent and virescent varieties. Marker 9949 revealed alleles A, B and C. Within the nigrescent kind, alleles B and C had the best BC genotype (0.6). Nonetheless, within the virescent kind, alleles A and B had been discovered, and the best BB genotype was 0.8. (Suppl. 11).

Determine 6
figure 6

Polymorphisms of DNA bands after amplification with 6 primer pairs and separation by 5% denaturing polyacrylamide gel electrophoresis. (a) Primer 20,186, 54,322 and 55,957, (b) Primer 76,163, (c) Primer 9949, (d) Primer 8605–2. nir = nigrescent kind of Nigeria Black, vir = virescent kind of Deli × Calabar. 1–5 = replication in every morphotype. The pink arrows point out the completely different alleles.

RELATED ARTICLES

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Most Popular

Recent Comments