Signal Peptide Website
Signal Peptide Database - Mammalia
Entry Details
ID: 159
Source Database: UniProtKB/Swiss-Prot
UniProtKB/Swiss-Prot Accession Number: P12111 (Created: 1989-10-01; Updated: 2008-12-16)
UniProtKB/Swiss-Prot Entry Name: CO6A3_HUMAN
Protein Name: Collagen alpha-3(VI) chain
Gene: COL6A3
Organism Scientific: Homo sapiens
Organism Common: Human
Lineage: Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo
Protein Length [aa]: 3177
Protein Mass [Da]: 343665
Features
Type | Description | Status | Start | End
signal peptide | - | potential | 1 | 25
chain | Collagen alpha-3(VI) chain | - | 26 | 3177
disulfide bond | Interchain | by similarity | 2087 | 2087
disulfide bond | - | by similarity | 3112 | 3162
disulfide bond | - | by similarity | 3121 | 3145
disulfide bond | - | by similarity | 3137 | 3158
domain | VWFA 1 | - | 39 | 213
domain | VWFA 2 | - | 242 | 419
domain | VWFA 3 | - | 445 | 620
domain | VWFA 4 | - | 639 | 816
domain | VWFA 5 | - | 837 | 1009
domain | VWFA 6 | - | 1029 | 1205
domain | VWFA 7 | - | 1233 | 1404
domain | VWFA 8 | - | 1436 | 1609
domain | VWFA 9 | - | 1639 | 1812
domain | VWFA 10 | - | 1838 | 2024
domain | Collagen-like 1 | - | 2038 | 2097
domain | Collagen-like 2 | - | 2104 | 2163
domain | Collagen-like 3 | - | 2174 | 2233
domain | Collagen-like 4 | - | 2249 | 2300
domain | Collagen-like 5 | - | 2314 | 2373
domain | VWFA 11 | - | 2402 | 2581
domain | VWFA 12 | - | 2619 | 2815
domain | Fibronectin type-III | - | 2988 | 3076
domain | BPTI/Kunitz inhibitor | - | 3112 | 3162
region of interest | Nonhelical region | - | 26 | 2038
region of interest | Triple-helical region | - | 2039 | 2375
region of interest | Nonhelical region | - | 2376 | 3177
modified residue | Pyrrolidone carboxylic acid | probable | 26 | 26
modified residue | 4-hydroxyproline | - | 2100 | 2100
modified residue | 5-hydroxylysine | - | 2103 | 2103
modified residue | 4-hydroxyproline | - | 2206 | 2206
modified residue | 5-hydroxylysine | - | 2209 | 2209
modified residue | 5-hydroxylysine | - | 2212 | 2212
modified residue | 4-hydroxyproline | - | 2239 | 2239
modified residue | 4-hydroxyproline | - | 2316 | 2316
modified residue | 4-hydroxyproline | - | 2319 | 2319
modified residue | 5-hydroxylysine | - | 2322 | 2322
modified residue | 5-hydroxylysine | - | 2337 | 2337
glycosylation site | N-linked (GlcNAc...) | potential | 108 | 108
glycosylation site | N-linked (GlcNAc...) | potential | 116 | 116
glycosylation site | N-linked (GlcNAc...) | potential | 202 | 202
glycosylation site | N-linked (GlcNAc...) | potential | 251 | 251
glycosylation site | N-linked (GlcNAc...) | potential | 2079 | 2079
glycosylation site | O-linked (Gal...) | - | 2103 | 2103
glycosylation site | O-linked (Gal...) | - | 2209 | 2209
glycosylation site | O-linked (Gal...) | - | 2212 | 2212
glycosylation site | O-linked (Gal...) | - | 2322 | 2322
glycosylation site | N-linked (GlcNAc...) | potential | 2331 | 2331
glycosylation site | O-linked (Gal...) | - | 2337 | 2337
glycosylation site | N-linked (GlcNAc...) | potential | 2558 | 2558
glycosylation site | N-linked (GlcNAc...) | - | 2677 | 2677
glycosylation site | N-linked (GlcNAc...) | potential | 2861 | 2861
glycosylation site | N-linked (GlcNAc...) | potential | 3037 | 3037
sequence variant | (in dbSNP:rs11690358) | - | 0 | 0
sequence variant | (in dbSNP:rs6728818) | - | 0 | 0
sequence variant | (in dbSNP:rs36104025) | - | 0 | 0
sequence variant | (in dbSNP:rs35848091) | - | 0 | 0
sequence variant | (in dbSNP:rs36117715) | - | 0 | 0
sequence variant | (in BM) | - | 0 | 0
sequence variant | (in dbSNP:rs11896521) | - | 0 | 0
sequence variant | (in dbSNP:rs9630964) | - | 0 | 0
sequence variant | (in dbSNP:rs36092870) | - | 0 | 0
sequence variant | (in dbSNP:rs34741387) | - | 0 | 0
sequence variant | (in dbSNP:rs2270669) | - | 0 | 0
sequence variant | (in dbSNP:rs1131296) | - | 0 | 0
strand | - | - | 3120 | 3122
strand | - | - | 3125 | 3131
strand | - | - | 3136 | 3142
strand | - | - | 3144 | 3146
strand | - | - | 3152 | 3154
helix | - | - | 3110 | 3113
helix | - | - | 3155 | 3162
repeat | LRR 1 | - | 63 | 85
repeat | LRR 2 | - | 152 | 175
repeat | LRR 3 | - | 256 | 279
repeat | LRR 4 | - | 469 | 492
repeat | LRR 5 | - | 589 | 610
repeat | LRR 6 | - | 663 | 690
repeat | LRR 7 | - | 703 | 729
repeat | LRR 8 | - | 792 | 815
repeat | LRR 9 | - | 1052 | 1075
repeat | LRR 10 | - | 1299 | 1326
repeat | LRR 11 | - | 1500 | 1524
repeat | LRR 12 | - | 1583 | 1611
repeat | LRR 13 | - | 1690 | 1713
repeat | LRR 14 | - | 1988 | 2013
repeat | LRR 15 | - | 2523 | 2548
repeat | LRR 16 | - | 2549 | 2576
site | Reactive bond | - | 3122 | 3123
splice variant | (in isoform 2) | - | 31 | 236
compositionally biased region | Thr-rich | - | 2863 | 2898
compositionally biased region | Ala-rich | - | 2908 | 2983
short sequence motif | Cell attachment site | - | 2040 | 2042
short sequence motif | Cell attachment site | - | 2136 | 2138
short sequence motif | Cell attachment site | - | 2148 | 2150
short sequence motif | Cell attachment site | - | 2154 | 2156
short sequence motif | Cell attachment site | - | 2370 | 2372
turn | - | - | 3132 | 3135
SP Length: 25
----+----1----+----2----+----3----+----4----+----5
Signal Peptide
MRKHRHLPLVAVFCLFLSGFPTTHA
Sequence
MRKHRHLPLVAVFCLFLSGFPTTHAQQQQADVKNGAAADIIFLVDSSWTI
GEEHFQLVREFLYDVVKSLAVGENDFHFALVQFNGNPHTEFLLNTYRTKQ
EVLSHISNMSYIGGTNQTGKGLEYIMQSHLTKAAGSRAGDGVPQVIVVLT
DGHSKDGLALPSAELKSADVNVFAIGVEDADEGALKEIASEPLNMHMFNL
ENFTSLHDIVGNLVSCVHSSVSPERAGDTETLKDITAQDSADIIFLIDGS
NNTGSVNFAVILDFLVNLLEKLPIGTQQIRVGVVQFSDEPRTMFSLDTYS
TKAQVLGAVKALGFAGGELANIGLALDFVVENHFTRAGGSRVEEGVPQVL
VLISAGPSSDEIRYGVVALKQASVFSFGLGAQAASRAELQHIATDDNLVF
TVPEFRSFGDLQEKLLPYIVGVAQRHIVLKPPTIVTQVIEVNKRDIVFLV
DGSSALGLANFNAIRDFIAKVIQRLEIGQDLIQVAVAQYADTVRPEFYFN
THPTKREVITAVRKMKPLDGSALYTGSALDFVRNNLFTSSAGYRAAEGIP
KLLVLITGGKSLDEISQPAQELKRSSIMAFAIGNKGADQAELEEIAFDSS
LVFIPAEFRAAPLQGMLPGLLAPLRTLSGTPEVHSNKRDIIFLLDGSANV
GKTNFPYVRDFVMNLVNSLDIGNDNIRVGLVQFSDTPVTEFSLNTYQTKS
DILGHLRQLQLQGGSGLNTGSALSYVYANHFTEAGGSRIREHVPQLLLLL
TAGQSEDSYLQAANALTRAGILTFCVGASQANKAELEQIAFNPSLVYLMD
DFSSLPALPQQLIQPLTTYVSGGVEEVPLAQPESKRDILFLFDGSANLVG
QFPVVRDFLYKIIDELNVKPEGTRIAVAQYSDDVKVESRFDEHQSKPEIL
NLVKRMKIKTGKALNLGYALDYAQRYIFVKSAGSRIEDGVLQFLVLLVAG
RSSDRVDGPASNLKQSGVVPFIFQAKNADPAELEQIVLSPAFILAAESLP
KIGDLHPQIVNLLKSVHNGAPAPVSGEKDVVFLLDGSEGVRSGFPLLKEF
VQRVVESLDVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVNAVRQLTL
LGGPTPNTGAALEFVLRNILVSSAGSRITEGVPQLLIVLTADRSGDDVRN
PSVVVKRGGAVPIGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV
ISERVTQLTREELSRLQPVLQPLPSPGVGGKRDVVFLIDGSQSAGPEFQY
VRTLIERLVDYLDVGFDTTRVAVIQFSDDPKVEFLLNAHSSKDEVQNAVQ
RLRPKGGRQINVGNALEYVSRNIFKRPLGSRIEEGVPQFLVLISSGKSDD
EVDDPAVELKQFGVAPFTIARNADQEELVKISLSPEYVFSVSTFRELPSL
EQKLLTPITTLTSEQIQKLLASTRYPPPAVESDAADIVFLIDSSEGVRPD
GFAHIRDFVSRIVRRLNIGPSKVRVGVVQFSNDVFPEFYLKTYRSQAPVL
DAIRRLRLRGGSPLNTGKALEFVARNLFVKSAGSRIEDGVPQHLVLVLGG
KSQDDVSRFAQVIRSSGIVSLGVGDRNIDRTELQTITNDPRLVFTVREFR
ELPNIEERIMNSFGPSAATPAPPGVDTPPPSRPEKKKADIVFLLDGSINF
RRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKR
QIIDAINKVVYKGGRHANTKVGLEHLRVNHFVPEAGSRLDQRVPQIAFVI
TGGKSVEDAQDVSLALTQRGVKVFAVGVRNIDSEEVGKIASNSATAFRVG
NVQELSELSEQVLETLHDAMHETLCPGVTDAAKACNLDVILGFDGSRDQN
VFVAQKGFESKVDAILNRISQMHRVSCSGGRSPTVRVSVVANTPSGPVEA
FDFDEYQPEMLEKFRNMRSQHPYVLTEDTLKVYLNKFRQSSPDSVKVVIH
FTDGADGDLADLHRASENLRQEGVRALILVGLERVVNLERLMHLEFGRGF
MYDRPLRLNLLDLDYELAEQLDNIAEKACCGVPCKCSGQRGDRGPIGSIG
PKGIPGEDGYRGYPGDEGGPGERGPPGVNGTQGFQGCPGQRGVKGSRGFP
GEKGEVGEIGLDGLDGEDGDKGLPGSSGEKGNPGRRGDKGPRGEKGERGD
VGIRGDPGNPGQDSQERGPKGETGDLGPMGVPGRDGVPGGPGETGKNGGF
GRRGPPGAKGNKGGPGQPGFEGEQGTRGAQGPAGPAGPPGLIGEQGISGP
RGSGGAAGAPGERGRTGPLGRKGEPGEPGPKGGIGNRGPRGETGDDGRDG
VGSEGRRGKKGERGFPGYPGPKGNPGEPGLNGTTGPKGIRGRRGNSGPPG
IVGQKGDPGYPGPAGPKGNRGDSIDQCALIQSIKDKCPCCYGPLECPVFP
TELAFALDTSEGVNQDTFGRMRDVVLSIVNDLTIAESNCPRGARVAVVTY
NNEVTTEIRFADSKRKSVLLDKIKNLQVALTSKQQSLETAMSFVARNTFK
RVRNGFLMRKVAVFFSNTPTRASPQLREAVLKLSDAGITPLFLTRQEDRQ
LINALQINNTAVGHALVLPAGRDLTDFLENVLTCHVCLDICNIDPSCGFG
SWRPSFRDRRAAGSDVDIDMAFILDSAETTTLFQFNEMKKYIAYLVRQLD
MSPDPKASQHFARVAVVQHAPSESVDNASMPPVKVEFSLTDYGSKEKLVD
FLSRGMTQLQGTRALGSAIEYTIENVFESAPNPRDLKIVVLMLTGEVPEQ
QLEEAQRVILQAKCKGYFFVVLGIGRKVNIKEVYTFASEPNDVFFKLVDK
STELNEEPLMRFGRLLPSFVSSENAFYLSPDIRKQCDWFQGDQPTKNLVK
FGHKQVNVPNNVTSSPTSNPVTTTKPVTTTKPVTTTTKPVTTTTKPVTII
NQPSVKPAAAKPAPAKPVAAKPVATKTATVRPPVAVKPATAAKPVAAKPA
AVRPPAAAAAKPVATKPEVPRPQAAKPAATKPATTKPMVKMSREVQVFEI
TENSAKLHWERPEPPGPYFYDLTVTSAHDQSLVLKQNLTVTDRVIGGLLA
GQTYHVAVVCYLRSQVRATYHGSFSTKKSQPPPPQPARSASSSTINLMVS
TEPLALTETDICKLPKDEGTCRDFILKWYYDPNTKSCARFWYGGCGGNEN
KFGSQKECEKVCAPVLAKPGVISVMGT
----+----1----+----2----+----3----+----4----+----5
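The Start and End values in the feature table appear to be 1-based and inclusive, as is usual for UniProtKB annotations; that is an assumption here, though the signal peptide row (1 to 25) and the SP Length of 25 are consistent with it. The following minimal Python sketch shows how such coordinates map onto a string slice. The helper name `feature_slice` is hypothetical and not part of the database, and only the first 50 residues of the entry are used.

```python
def feature_slice(sequence: str, start: int, end: int) -> str:
    """Return residues start..end of sequence (1-based, inclusive coordinates)."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError(f"invalid feature range {start}-{end}")
    # Python slices are 0-based and end-exclusive, so only the start is shifted.
    return sequence[start - 1:end]


# First 50 residues of the entry, copied from the first line of the sequence listing.
first_line = "MRKHRHLPLVAVFCLFLSGFPTTHAQQQQADVKNGAAADIIFLVDSSWTI"

# Signal peptide feature: Start 1, End 25.
signal_peptide = feature_slice(first_line, 1, 25)

# First residue of the mature chain (Start 26), annotated as the
# pyrrolidone carboxylic acid site in the feature table.
chain_first_residue = feature_slice(first_line, 26, 26)

print(signal_peptide, len(signal_peptide), chain_first_residue)
```

Under this assumption the slice for 1 to 25 reproduces the signal peptide shown above, and residue 26 is the glutamine listed as the probable pyrrolidone carboxylic acid site.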
© 2007-2017 Dr. Katja Kapp, Kassel & thpr.net e. K., Dresden, Germany, last update 2010-06-11