Regulatory sequences and transposable elements(TEs)account for a large proportion of the genomic sequences of species;however,their roles in gene transcription,especially tissue-specific expression,remain largely unkn...Regulatory sequences and transposable elements(TEs)account for a large proportion of the genomic sequences of species;however,their roles in gene transcription,especially tissue-specific expression,remain largely unknown.Pigs serve as an excellent animal model for studying genomic sequence biology due to the extensive diversity among their wild and domesticated populations.Here,we conducted an integrated analysis using H3K27ac ChIP-seq,H3K4me3 ChIP-seq,and RNA-seq data from 10 different tissues of seven fetuses and eight closely related adult pigs.We aimed to annotate the regulatory elements and TEs to elucidate their associations with histone modifications and mRNA expression across different tissues and developmental stages.Based on correlation analysis between mRNA expression and H3K27ac and H3K4me3 peak activity,results indicated that H3K27ac exhibited stronger associations with gene expression than H3K4me3.Furthermore,1.45%of TEs overlapped with either the H3K27ac or H3K4me3 peaks,with the majority displaying tissue-specific activity.Notably,a TE subfamily(LTR4C_SS),containing binding motifs for SIX1 and SIX4,showed specific enrichment in the H3K27ac peaks of the adult and fetal ovaries.RNA-seq analysis also revealed widespread expression of TEs in the exons or promoters of genes,including 4688 TE-containing transcripts with distinct development stage-specific and tissue-specific expression.Of note,1967 TE-containing transcripts were enriched in the testes.We identified a long terminal repeat(LTR),MLT1F1,acting as a testis-specific alternative promoter in SRPK2(a cell cycle-related protein kinase)in our pig dataset.This element was also conserved in humans and mice,suggesting either an ancient integration of TEs in genes specifically expressed in the testes or parallel evolutionary patterns.Collectively,our findings demonstrate that TEs are deeply embedded in the genome and exhibit important tissue-specific biological functions,particularly in the reproductive organs.展开更多
Aim: To identify a novel alternative transcript of the novel retinal pigment epithelial cell gene (NORPEG) expressed in the human testis. Methods: A human testis cDNA microarray was established and hybridized with...Aim: To identify a novel alternative transcript of the novel retinal pigment epithelial cell gene (NORPEG) expressed in the human testis. Methods: A human testis cDNA microarray was established and hybridized with cDNA probes from human fetal testes, adult testes and human spermatozoa. Differentially expressed clones were sequenced and analyzed. One of these clones was a short transcript of NORPEG which we proceeded to analyze by RT-PCR. Results: The novel short alternative transcript of NORPEG was isolated and named sNORPEG. It was 3486 bp in length and contained a 2952-bp open reading frame, encoding a 110.4-kDa protein of 983 amino acids. Amino acid sequence analysis showed that the sNORPEG protein contains six ankyrin repeats and two coiled-coil domains. It shares a high homology with the NORPEG and ankycorbin proteins in both its sequence and motifs. Blasting the human genome database localized sNORPEG to human chromosome 5pl 3.2-13.3. Expression profiles showed that sNORPEG was expressed in human fetal testes, adult testes and spermatozoa. Moreover, sNORPEG was found to be ubiquitously expressed in human tissues. Conclusion: sNORPEG is expressed in different developmental stages of the testis and encodes a protein that may have roles in human testis development and spermatogenesis. (Asian J Androl 2005 Sep; 7: 277-288)展开更多
The number and variability of transcripts determine the diversity and complexity of the proteins in a species.To reveal the variations in transcript abundance and the related factors in Chinese fir(Cunninghamia lanceo...The number and variability of transcripts determine the diversity and complexity of the proteins in a species.To reveal the variations in transcript abundance and the related factors in Chinese fir(Cunninghamia lanceolata),we performed second-generation sequencing(RNA-seq)and third-generation sequencing(PacBio)analyses.The forms and rates of alternative splicing in Chinese fir were studied on the basis of the obtained transcripts.Furthermore,the number of full-length transcripts(isoforms)produced by alternative splicing and the variation patterns in‘Long 15'were analyzed.The transcript diversity in Chinese fir was largely caused by six alternative splicing forms,of which intron retention was the most common(47.98%of all alternative splicing events),followed by alternative 3′splice site(24.29%).The third-generation sequencing analysis detected 61613 isoforms in‘Long 15',with each gene producing 1-10 isoforms.Only 0.06%of the genes produced more than 10 isoforms.Transcript abundance was similar among Chinese fir varieties,but 615 more transcripts were detected in‘Long 15'than in clone 1339,implying that‘Long 15'synthesized more protein types in vivo than 1339.This difference may explain why‘Long 15'grows better and is more adaptable to environmental conditions than 1339.An examination of Chinese fir clone Kai6 detected more transcripts after fertilization than following a nutrient stress treatment.Moreover,transcript polymorphism was greater after fertilization than in response to nutrient stress.This finding may be useful for improving fertilizer applications to enhance Chinese fir growth and development.Sequences and alternative splicing forms are critical for research on the Chinese fir transcriptome and the potential biological consequences of alternative splicing.展开更多
Alternative splicing of pre-mRNA transcripts is an important regulatory mechanism that increases the diversity of gene products in eukaryotes.Various studies have linked specific transcript isoforms to altered drug re...Alternative splicing of pre-mRNA transcripts is an important regulatory mechanism that increases the diversity of gene products in eukaryotes.Various studies have linked specific transcript isoforms to altered drug response in cancer;however,few algorithms have incorporated splicing information into drug response prediction.In this study,we evaluated whether basal-level splicing information could be used to predict drug sensitivity by constructing doxorubicin-sensitivity classification models with splicing and expression data.We detailed splicing differences between sensitive and resistant cell lines by implementing quasi-binomial generalized linear modeling(QBGLM)and found altered inclusion of 277 skipped exons.We additionally conducted RNA-binding protein(RBP)binding motif enrichment and differential ex-pression analysis to characterize cis-and trans-acting elements that potentially influence doxorubicin response-mediating splicing alterations.Our results showed that a classification model built with skipped exon data exhibited strong predictive power.We discovered an association between differentially spliced events and epithelial-mesenchymal transition(EMT)and observed motif enrichment,as well as differential expression of RBFOX and ELAVL RBP family members.Our work demonstrates the potential of incorporating splicing data into drug response algorithms and the utility of a QBGLM approach for fast,scalable identification of relevant splicing differences between large groups of samples.展开更多
AIM To investigate the expression characteristics of heterogeneous nuclear ribonucleoprotein H1(HNRNPH1) m RNA and protein in cell lines and tissues of esophageal squamous cell carcinoma(ESCC). METHODS Western blottin...AIM To investigate the expression characteristics of heterogeneous nuclear ribonucleoprotein H1(HNRNPH1) m RNA and protein in cell lines and tissues of esophageal squamous cell carcinoma(ESCC). METHODS Western blotting was used to assess the expression of HNRNPH1 protein in seven ESCC cell lines and 30 paired fresh tissue specimens. The subcellular localization of HNRNPH1 was determined by immunofluorescence in ESCC cells. The RNA sequencing data from 87 patients with ESCC were obtained from the cancer genome atlas(TCGA), and the expression and clinical characteristics analysis of different transcript variants of HNRNPH1 were evaluated in this dataset. In addition, immunohistochemistry was carried out to detect the expression of HNRNPH1 protein in 125 patients.RESULTS The expression of HNRNPH1 protein varied across different ESCC cell lines. It was exclusively restricted to the nucleus of the ESCC cells. There are two transcript variants of the HNRNPH1 gene. Variant 1 was constitutively expressed, and its expression did not change during tumorigenesis. In contrast, levels of variant 2 were low in non-tumorous tissues and were dramatically increased in ESCC(P = 0.0026). The high levels of variant 2 were associated with poorer differentiated tumors(P = 0.0287). Furthermore, in paired fresh tissue specimens, HNRNPH1 protein was overexpressed in 73.3%(22/30) of neoplastic tissues. HNRNPH1 was significantly upregulated in ESCC, with strong staining in 43.2%(54/125) of tumor tissues and 22.4%(28/125) of matched non-cancerous tissues(P = 0.0005). Positive HNRNPH1 expression was significantly associated with poor tumor differentiation degree(P = 0.0337).CONCLUSION The different alternative transcript variants of HNRNPH1 exhibited different expression changes during tumorigenesis. Its m RNA and protein were overexpressed in ESCC and associated with poorer differentiation of tumor cells. These findings highlight the potential of HNRNPH1 in the therapy and diagnosis of ESCC.展开更多
基金supported by the National Natural Science Foundation of China(32160781)。
文摘Regulatory sequences and transposable elements(TEs)account for a large proportion of the genomic sequences of species;however,their roles in gene transcription,especially tissue-specific expression,remain largely unknown.Pigs serve as an excellent animal model for studying genomic sequence biology due to the extensive diversity among their wild and domesticated populations.Here,we conducted an integrated analysis using H3K27ac ChIP-seq,H3K4me3 ChIP-seq,and RNA-seq data from 10 different tissues of seven fetuses and eight closely related adult pigs.We aimed to annotate the regulatory elements and TEs to elucidate their associations with histone modifications and mRNA expression across different tissues and developmental stages.Based on correlation analysis between mRNA expression and H3K27ac and H3K4me3 peak activity,results indicated that H3K27ac exhibited stronger associations with gene expression than H3K4me3.Furthermore,1.45%of TEs overlapped with either the H3K27ac or H3K4me3 peaks,with the majority displaying tissue-specific activity.Notably,a TE subfamily(LTR4C_SS),containing binding motifs for SIX1 and SIX4,showed specific enrichment in the H3K27ac peaks of the adult and fetal ovaries.RNA-seq analysis also revealed widespread expression of TEs in the exons or promoters of genes,including 4688 TE-containing transcripts with distinct development stage-specific and tissue-specific expression.Of note,1967 TE-containing transcripts were enriched in the testes.We identified a long terminal repeat(LTR),MLT1F1,acting as a testis-specific alternative promoter in SRPK2(a cell cycle-related protein kinase)in our pig dataset.This element was also conserved in humans and mice,suggesting either an ancient integration of TEs in genes specifically expressed in the testes or parallel evolutionary patterns.Collectively,our findings demonstrate that TEs are deeply embedded in the genome and exhibit important tissue-specific biological functions,particularly in the reproductive organs.
文摘Aim: To identify a novel alternative transcript of the novel retinal pigment epithelial cell gene (NORPEG) expressed in the human testis. Methods: A human testis cDNA microarray was established and hybridized with cDNA probes from human fetal testes, adult testes and human spermatozoa. Differentially expressed clones were sequenced and analyzed. One of these clones was a short transcript of NORPEG which we proceeded to analyze by RT-PCR. Results: The novel short alternative transcript of NORPEG was isolated and named sNORPEG. It was 3486 bp in length and contained a 2952-bp open reading frame, encoding a 110.4-kDa protein of 983 amino acids. Amino acid sequence analysis showed that the sNORPEG protein contains six ankyrin repeats and two coiled-coil domains. It shares a high homology with the NORPEG and ankycorbin proteins in both its sequence and motifs. Blasting the human genome database localized sNORPEG to human chromosome 5pl 3.2-13.3. Expression profiles showed that sNORPEG was expressed in human fetal testes, adult testes and spermatozoa. Moreover, sNORPEG was found to be ubiquitously expressed in human tissues. Conclusion: sNORPEG is expressed in different developmental stages of the testis and encodes a protein that may have roles in human testis development and spermatogenesis. (Asian J Androl 2005 Sep; 7: 277-288)
基金Supported by the Fundamental Research Funds of the Central Public Welfare Research Institutes of The Chinese Academy of Forestry(CAFYbb2017ZA001-1-2)the"14^(th)Five-year"Forest Variety Breeding Project of Zhejiang Province(2021C010010808)。
文摘The number and variability of transcripts determine the diversity and complexity of the proteins in a species.To reveal the variations in transcript abundance and the related factors in Chinese fir(Cunninghamia lanceolata),we performed second-generation sequencing(RNA-seq)and third-generation sequencing(PacBio)analyses.The forms and rates of alternative splicing in Chinese fir were studied on the basis of the obtained transcripts.Furthermore,the number of full-length transcripts(isoforms)produced by alternative splicing and the variation patterns in‘Long 15'were analyzed.The transcript diversity in Chinese fir was largely caused by six alternative splicing forms,of which intron retention was the most common(47.98%of all alternative splicing events),followed by alternative 3′splice site(24.29%).The third-generation sequencing analysis detected 61613 isoforms in‘Long 15',with each gene producing 1-10 isoforms.Only 0.06%of the genes produced more than 10 isoforms.Transcript abundance was similar among Chinese fir varieties,but 615 more transcripts were detected in‘Long 15'than in clone 1339,implying that‘Long 15'synthesized more protein types in vivo than 1339.This difference may explain why‘Long 15'grows better and is more adaptable to environmental conditions than 1339.An examination of Chinese fir clone Kai6 detected more transcripts after fertilization than following a nutrient stress treatment.Moreover,transcript polymorphism was greater after fertilization than in response to nutrient stress.This finding may be useful for improving fertilizer applications to enhance Chinese fir growth and development.Sequences and alternative splicing forms are critical for research on the Chinese fir transcriptome and the potential biological consequences of alternative splicing.
基金supported by the National Institutes of Health,USA(Grant No.R01CA213466)awarded to YL.the Precision Health Initiative at Indiana University.
文摘Alternative splicing of pre-mRNA transcripts is an important regulatory mechanism that increases the diversity of gene products in eukaryotes.Various studies have linked specific transcript isoforms to altered drug response in cancer;however,few algorithms have incorporated splicing information into drug response prediction.In this study,we evaluated whether basal-level splicing information could be used to predict drug sensitivity by constructing doxorubicin-sensitivity classification models with splicing and expression data.We detailed splicing differences between sensitive and resistant cell lines by implementing quasi-binomial generalized linear modeling(QBGLM)and found altered inclusion of 277 skipped exons.We additionally conducted RNA-binding protein(RBP)binding motif enrichment and differential ex-pression analysis to characterize cis-and trans-acting elements that potentially influence doxorubicin response-mediating splicing alterations.Our results showed that a classification model built with skipped exon data exhibited strong predictive power.We discovered an association between differentially spliced events and epithelial-mesenchymal transition(EMT)and observed motif enrichment,as well as differential expression of RBFOX and ELAVL RBP family members.Our work demonstrates the potential of incorporating splicing data into drug response algorithms and the utility of a QBGLM approach for fast,scalable identification of relevant splicing differences between large groups of samples.
基金Supported by the Natural Science Foundation of China,No.81572840,No.81372591 and No.81572365the State Key Project for Basic Research,No.2014CBA02002National High-tech R and D Program,No.2012AA020206
文摘AIM To investigate the expression characteristics of heterogeneous nuclear ribonucleoprotein H1(HNRNPH1) m RNA and protein in cell lines and tissues of esophageal squamous cell carcinoma(ESCC). METHODS Western blotting was used to assess the expression of HNRNPH1 protein in seven ESCC cell lines and 30 paired fresh tissue specimens. The subcellular localization of HNRNPH1 was determined by immunofluorescence in ESCC cells. The RNA sequencing data from 87 patients with ESCC were obtained from the cancer genome atlas(TCGA), and the expression and clinical characteristics analysis of different transcript variants of HNRNPH1 were evaluated in this dataset. In addition, immunohistochemistry was carried out to detect the expression of HNRNPH1 protein in 125 patients.RESULTS The expression of HNRNPH1 protein varied across different ESCC cell lines. It was exclusively restricted to the nucleus of the ESCC cells. There are two transcript variants of the HNRNPH1 gene. Variant 1 was constitutively expressed, and its expression did not change during tumorigenesis. In contrast, levels of variant 2 were low in non-tumorous tissues and were dramatically increased in ESCC(P = 0.0026). The high levels of variant 2 were associated with poorer differentiated tumors(P = 0.0287). Furthermore, in paired fresh tissue specimens, HNRNPH1 protein was overexpressed in 73.3%(22/30) of neoplastic tissues. HNRNPH1 was significantly upregulated in ESCC, with strong staining in 43.2%(54/125) of tumor tissues and 22.4%(28/125) of matched non-cancerous tissues(P = 0.0005). Positive HNRNPH1 expression was significantly associated with poor tumor differentiation degree(P = 0.0337).CONCLUSION The different alternative transcript variants of HNRNPH1 exhibited different expression changes during tumorigenesis. Its m RNA and protein were overexpressed in ESCC and associated with poorer differentiation of tumor cells. These findings highlight the potential of HNRNPH1 in the therapy and diagnosis of ESCC.