COMPLETE CODING SEQUENCE AND DEDUCED PRIMARY STRUCTURE OF THE HUMAN CARTILAGE LARGE AGGREGATING PROTEOGLYCAN, AGGRECAN - HUMAN-SPECIFIC REPEATS, AND ADDITIONAL ALTERNATIVELY SPLICED FORMS

被引:1
|
作者
DOEGE, KJ
SASAKI, M
KIMURA, T
YAMADA, Y
机构
[1] OREGON HLTH SCI UNIV, DEPT BIOCHEM & MOLEC BIOL, PORTLAND, OR 97201 USA
[2] NATL BEPPU HOSP, BEPPU, OITA 87401, JAPAN
[3] OSAKA UNIV, SCH MED, DEPT ORTHOPED SURG, OSAKA 553, JAPAN
[4] NIDR, DEV BIOL & ANOMALIES LAB, BETHESDA, MD 20892 USA
关键词
D O I
暂无
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We have obtained the complete coding sequence of the large aggregating chondroitin sulfate proteoglycan of human cartilage (aggrecan) from a combination of cDNA and genomic exon sequencing. We screened a human costal chondrocyte cDNA library, using rat aggrecan cDNA probes, and obtained three nonoverlapping clones totaling 6.2 kilobases in length. These clones were sequenced, and the sequence of the gaps between clones was obtained from genomic exon fragments and polymerase chain reaction-amplified cDNA. The composite sequence is 7137 nucleotides long, encoding 2316 amino acids. The human and rat aggrecan amino acid sequences are about 75% identical, with domains ranging from 100% to about 60% of conserved amino acids. The human sequence contains two regions of highly conserved repeats not found in rat aggrecan: 11 repeats of a hexameric sequence in the keratan sulfate attachment domain, E-E-P-(S,F)-P-S; and a 19-amino acid sequence reiterated 19 times, in the CS-1 portion of the serine-glycine-containing region. There are at least three forms of aggrecan transcripts, generated by alternative exon usage, and the form reported here is the shortest and also the most prevalent, lacking both the epidermal growth factor-like domain, and the complement regulatory protein-like sequence.
引用
收藏
页码:894 / 902
页数:9
相关论文
empty
未找到相关数据