Tenascin family sequence alignment -

H. P. Erickson Oct., 1997

Email: H.Erickson@cellbio.duke.edu



Disclaimer: These sequences have been collected over the years and added to the alignment, always by hand. I have not gone back and checked for accuracy, so there may be mistakes. Sorry, I haven't even tabulated all the accession numbers, but I will state that the best reference for human TN-C is X78565 (Ghezi...Zardi JBC 270:3429-3434), which has corrected a few errors in previous entries.


Intron-exon boundaries have been determined for TNChum, TNRhum, TNXhum, and partially
for TNXpig. These are indicated by Red and double underline, but only in a separate WORD file.


                    First 22 aa missing from cell culture hum TN-C (Zardi)
                TNRhum     M G A D GET V V L K N M L I G V N L I L L G
                TNRrat     M G I E GET V V L K N M L I G V N L I L L G
                TNRchk     M G T D SEN P V L R N V L I S F N L L L L G 
                TNCchk     M G L P S Q V L A C A I L G L L Y Q H A S G 
                TNChum     M G A M T Q L L A G V F L A F L A L A T E G
                mouse      M G A V T W L L P G I F L A L F A L T P E G
                pig        M G V V T R L L V G T F L A S L A L P A Q G
                nwt        M E A T GCARV L A C L T L R L L C H H S D G
 
                         These aa also missing from chick brain TN-C (tissue TN-C?; Jones)
                TNRhum                S M I K P S E C Q L EVTTERVQRQSVEE
                TNRrat                S M L K P S E C R L EVTTERVQRQTVEE 
                TNRchk                A V L K P F E C R L EVTTEPAERPAVDE
                TNCchk                G L I K R I I R Q K R                 
                TNChum                G V L K K V I R H K R
                mouse                 G V L K K I I R H K R 
                pig                   G V L K K V I R H K R
                nwt                   G L I K K I I R Q K R
 
 
 
TNRhum   E G G IAN Y N T S S K E Q P V V F N H V Y N I N V P L D N L C S S G L E A S A EQE
TNRr     E G G ASS Y N T S S K E Q P M V F N H V Y N I N V P L E S L C S S G L E A S A EQD  
TNRc     E G G LAN C S P P V K E Q P M V F H H I Y N I N V P V D S C C S S M L R S S A E E
TNCchk   E T G L N V T L P E D N Q P V V F N H V Y N I K L P V G S L C S V D L D T A S G D 
TNChum   Q S G V N A T L P E E N Q P V V F N H V Y N I K L P V G S Q C S V D L E S A S G E
m        E S G L N M T L P E E N Q P V V F N H I Y N I K L P M G S Q C S V D L E S A S G E 
p        Q T G V N V T L P E E S Q P V V F N H V Y N I K L P V G S Q C S V D L E S A S G D
nwt      E S GLTSN V T L P D D N Q P V V F N H V Y N I N L P M G S M C S V D L D P V P G G
 
          Underlined segment in TNRchk is alternatively spliced!                         EPPVLASE\
TNXhum              .....no homology, no Cys....G E K Q V V F T H R I N L PPS T G C G C P P G T ^
TNRhum    V S . . A E D E T L A E Y M G Q T S D H E S Q V T F T H R I N F P K K A C P C A S SAQ V
TNRrat    V S . . A E D D T L A E Y T G Q T S D H E S Q V T F T H K I N L P K K A C P C A S SAQ V
TNRchk    V S . . S E D D R L A E Y T E Q T S D S E S Q V T F T H R I N L P K Q A C K C S T SLP S
TNCchk    A D L K A E I E P V K N Y E E H T V N E G N Q I V F T H R I N I P R R A C G C A A A P D
TNChum    K D L A P P S E P S E S F Q E H T V D G E N Q I V F T H R I N I P R R A C G C A A A P D
mouse     K D L T P T P E S S G S F Q E H T V D G E N Q I V F T H R I N I P R R A C G C A A A P D
pig       K D L A A P S E P S E S V Q E H T V D G E N Q I V F T H R I N I P R R A C G C A A A P D
nwt       A G Q N A L S E P G S Q Q E E H T V D G N N Q I V F T H R I N I R . G A C G C A A A P D
 
  heptad of
 alpha-helix *     *       *     *       *     *      *     
    TNXhum   V Q A L R V R L E I L E E L V K G L K E Q C T . G . G C C 
    TNRhum   L Q E L L S R I E M L E R E V S V L R D Q C . . N A N C C
    TNRrat   L Q E L L S R I E M L E R E V S V L R D Q C . . N T N C C
     TNRchk  L Q E L L S R I E M L E R E V S M L R D Q C . . N S N C C
     TNCchk  I K D L L S R L E E L E G L V S S L R E Q C A S G A G C C           
     TNChum  V K E L L S R L E E L E N L V S S L R E Q C T A G A G C C
     mouse   V K E L L S R L E E L E L L V S S L R E Q C T M G T G C C
     pig     V K E L L S R L E E L E N L V S S L R E Q C T S G A G C C
     nwt     I K Y L L S L L E E L E G L V S S L R E Q C T S G A G C C
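
The asterisks above fall on every third and fourth residue in turn, which is the spacing expected for the two core positions of a heptad repeat (presumably a and d). The Python sketch below pulls out the residues under those marks for three of the rows. It assumes the first mark sits on the first residue shown and that the last mark continues the same 3,4 spacing; the function names are illustrative only and not part of the original alignment.

```python
def heptad_core_indices(length, start=0):
    """Indices spaced 3, 4, 3, 4, ... from `start` (assumed core positions)."""
    indices = []
    pos = start
    step_is_three = True
    while pos < length:
        indices.append(pos)
        pos += 3 if step_is_three else 4
        step_is_three = not step_is_three
    return indices


def residues_at(seq, indices):
    """Return the residues of `seq` found at the given indices."""
    return "".join(seq[i] for i in indices)


# First 22 residues of three rows from the block above, spaces removed.
ROWS = {
    "TNXhum": "VQALRVRLEILEELVKGLKEQC",
    "TNRhum": "LQELLSRIEMLEREVSVLRDQC",
    "TNChum": "VKELLSRLEELENLVSSLREQC",
}

for name, seq in ROWS.items():
    core = residues_at(seq, heptad_core_indices(len(seq)))
    print(name, core)
```

With these assumptions the marked positions are mostly Leu, Ile or Val, ending on the first Cys after the helix.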
 
   TNXhum            PAS A Q A G T G Q T D V R T L C S L H G V F D L S R C T
   TNRhum            . Q E S A A T G Q L D Y I P H C S G H G N F S F E S C G
   TNRrat            . Q E S A A T G Q L D Y V P H C S G H G N F S F E S C G
    TNRchk           . Q E N A A T G R L D Y T L P C S G H G N F S L E S C R
    TNCchk           P N S Q T A E G R L D T A P Y C S G H G N Y S T E I C G 
    TNChum           L . . Q P A T G R L D T R P F C S G R G N F S T E G C G 
    mouse            L . . Q P A E G R L D T R P F C S G R G N F S A E G C G
     pig             L . . Q P A E G R L D T R P F C S G R G N F S T E G C G
     nwt             G G S Q T A E G A V D T K P Y C N G R G N Y S T E T C S
 
EGF-like domains
TNXhum   C S C E P G W G G P T C
TNRhum   C I C N E G W F G K N C
TNRrat   C I C N E G W F G K N C
TNRchk   C I C S E G W A G S N C   (it fits as well here as any)
TNCchk   C V C E P G W K G P N C
TNChum   C V C E P G W K G P N C
m        C V C E P G W K G P N C
p        C V C E P G W K G P N C
n        C I C E P G W T G P N C
 
TNRegf domains put at the end. No attempt was made to align TNXegf domains
 
EGF domain number 1
TNCchk   S E P A C P R N C L N R G L C V R G K C I C E E G F T G E D C
TNChum   S E P E C P G N C H L R G R C I D G Q C I C D D G F T G E D C
m        S E P D C P G N C N L R G Q C L E G Q C I C D E G F T G E D C
p        S E P E C P S N C H L R G Q C V D G Q C V C N E G F T G E D C
n        S E S E C P G H C H N R G K C V N G K C I C D E G F T G M D C
 
EGF #2 G is missing from Zardi seq
TNCchk   S Q A A C P S D C N D Q G K C V D G V C V C F E G Y T G P D C
TNChum   S Q L A C P S D C N D Q G K C V N G V C I C F E G Y A G A D C
m        S Q L A C P N D C N D Q G R C V N G V C V C F E G Y A G P D C
p        S Q L A C P S D C N D Q G K C V N G V C V C F E G Y S G V D C
n        S E V T C P D D C N D Q G R C V N G I C V C F E G Y G G E D C
 
EGF #3
TNCchk   G E E L C P H G C G I H G R C V G G R C V C H E G F T G E D C
TNChum   S R E I C P V P C S EEH G T C V D G L C V C H D G F A G D D C
m        G L E V C P V P C S EEH G M C V D G R C V C K D G F A G E D C
p        S R E T C P V P C S EEH G R C V D G R C V C Q E G F A G E D C
n        G Q E I C Q V E C S E F G K C V N G Q C V C D E G F T G E D C
 
EGF #4 very conserved
TNCchk   N E P L C P N N C H N R G R C V D N E C V C D E G Y T G E D C
TNChum   N K P L C L N N C Y N R G R C V E N E C V C D E G F T G E D C
m        N E P L C L N N C Y N R G R C V E N E C V C D E G F T G E D C
p        N E P L C L H N C H G R G R C V E N E C V C D E G F T G E D C
n        S E P R C P N N C N N R G R C V E D E C V C D E G F T G D D C
 
EGF #5 very conserved
TNCchk   G E L I C P N D C F D R G R C I N G T C F C E E G Y T G E D C
TNChum   S E L I C P N D C F D R G R C I N G T C Y C E E G F T G E D C
m        S E L I C P N S C F D R G R C I N G T C Y C E E G F T G E D C
p        G E L I C P K D C F D R G R C I N G T C Y C D E G F E G E D C
n        S E L I C P N D C F D R G R C I N G V C F C D E G F T G E D C
 
EGF #6 is poorly conserved, especially on left
TNCchk   G E L T C P N N C N G N G R C E N G L C V C H E G F V G D D C
TNChum   G K P T C P H A C H T Q G R C E E G Q C V C D E G F A G L D C
m        G E L T C P N D C Q G R G Q C E E G Q C V C N E G F A G A D C
p        G R L A C P H G C R G R G R C E E G Q C V C D E G F A G A D C
n        G E L T C P N N C N N R G R C V N G L C V C D D G F Q G D D C
 
chicken #7 not very similar
TNCchk   S Q K R C P K D C N N R G H C V D G R C V C H E G Y L G E D C
TNChum   S E K R C P A D C H N R G R C V D G R C E C D D G F T G A D C
m        S E K R C P A D C H H R G R C L N G Q C E C D D G F T G A D C
p        S E R R C P S D C H N R G R C L D G R C E C D D G F E G E D C
n        S E L R C P N D C N D R G R C V N G K C V C K E G F M G E D C
 
chicken #8 is missing
TNChum   G E L K C P N G C S G H G R C V N G Q C V C D E G Y T G E D C
m        G D L Q C P N G C S G H G R C V N G Q C V C D E G Y T G E D C
p        G E L R C P G G C S G H G R C V N G Q C V C D E G R T G E D C
 
EGF #9
TNCchk   G E L R C P N D C H N R G R C I N G Q C V C D E G F I G E D C
TNChum   S Q L R C P N D C H S R G R C V E G K C V C E Q G F K G Y D C
m        S Q R R C P N D C H N R G L C V Q G K C I C E Q G F K G F D C
p        S Q L R C P N D C H G R G R C V Q G R C E C E H G F Q G Y D C
n        A D L R C P N D C N N R G R C V N G Q C V C D E G F M G E D C
 
EGF #10
TNCchk   G E L R C P N D C H N R G R C V N G Q C E C H E G F I G E D C
TNChum   S D M S C P N D C H Q H G R C V N G M C V C D D G Y T G E D C
m        S E M S C P N D C H Q H G R C V N G M C I C D D D Y T G E D C
p        S E M S C P H D C H Q H G R C V N G M C V C D D G Y T G E D C
n        S D L R C P G D C N N R G R C V N G Q C V C D E G F R G E D C
 
EGF #11
TNRhum   S E P Y C P L G C S S R G V C V D G Q C I C D S E Y S G D D C
TNRrat   S E P Y C P L G C S S R G V C V D G Q C I C D S E Y S G D D C
TNRchk   S E P R C P R G C S S R G V C L E G Q C V C D N D Y G G E D C
TNCchk   G E L R C P N D C N S H G R C V N G Q C V C D E G Y T G E D C
TNChum   R D R Q C P R D C S N R G L C V D G Q C V C E D G F T G P D C
m        R D R R C P R D C S Q R G R C V D G Q C I C E D G F T G P D C
p        R E L R C P G D C S Q R G R C V D G R C V C E H G F A G P D C
n        G E L R C P D D C N N R G V C V N G Q C I C D E G F M G E N C
 
EGF #12
TNRhum   S E L R C P T D C S S R G L C V D G E C V C E E P Y T G E D C
TNRrat   S E L R C P T D C S S R G L C V D G E C V C E E P Y T G E D C
TNRchk   S Q L R C P A G C G S R G L C V D G E C I C E E G F G G E D C
TNCchk   G E L R C P N D C H N R G R C V E G R C V C D N G F M G E D C
TNChum   A E L S C P N D C H G Q G R C V N G Q C V C H E G F M G K D C
m        A E L S C P S D C H G H G R C V N G Q C I C H E G F T G K D C
p        A D L A C P S D C H G R G R C V N G Q C V C H E G F T G K D C
n        G E L R C P N D C K N R G R C V N G Q C I C D D G F K G E D C
 
EGF #13
TNRhum   R E L R C P G D C S G K G R C A N G T C L C E E G Y V G E D C
TNRrat   R E L R C P G D C S G K G Q C A N G T C L C Q E G Y A G E D C
TNRchk   S Q P R C P R D C S G R G H C D N G T C V C A E G Y A G E D C
TNCchk   G E L S C P N D C H Q H G R C V D G R C V C H E G F T G E D C
TNChum   K E Q R C P S D C H G Q G R C V D G Q C I C H E G F T G L D C
m        K E Q R C P S D C H G Q G R C E D G Q C I C H E G F T G L D C
p        G Q R R C P G D C H G Q G R C V D G Q C V C H E G F T G L D C
n        S E L R C P D D C N D R G R C I N G Q C V C A E G F T G E N C
 
EGF # 14. this TNR seq is a miserable fit.
TNRhum   G Q R Q C L N A C S G R G Q C E E G L C V C E E G Y Q G P D C
TNRrat   S Q R R C L N A C S G R G H C Q E G L C I C E E G Y Q G P D C
TNRchk   G W L R C P N A C S G R G V C Q D G L C I C E D G Y G G Q D C
TNCchk   R E R S C P N D C N N V G R C V E G R C V C E E G Y M G I D C
TNChum   G Q H S C P S D C N N L G Q C V S G R C I C N E G Y S G E D C
m        G Q R S C P N D C S N Q G Q C V S G R C I C N E G Y T G I D C
p        G Q R S C P N D C S N W G Q C V S G R C I C N E G Y S G E D C
n        D S L A C L N N C N D R G L C V N G Q C V C E E G F L G E D C
 
FN-III domains - HxB
 
HxB FN-III domain #1
TNRhum  SAVAPPEDLRVAGISDRSIELEWDGPMAVTE.YVISYQPTALGGLQLQQRVPG..DWSGVTITELEPGLTYNISVYAVISNILSLPITAKVAT
TNRrat  SAVTPPEDLRVAGISDRSIELEWDGPMAVTE.YVISYQPSL.GGLQLQQRVPG..DWSGVTITELEPGLTYNISVYAVISNILSLPITAKVAT
TNRchk  SAVAPPENLRVTGISDGSIELAWDSLGAATE.YVVSYQPAGPGGSQLQQRVPG..DWSTITITELEPGVAYNVSIYAVISDVLSSPVTTKVTT
TNCchk  SDVSPPTELTVTNVTDKTVNLEWKHENLVNE.YLVTYVPTSSGGLDLQFTVPG..NQTSATIHELEPGVEYFIRVFAILKNKKSIPVSARVAT
TNChum  SEVSPPKDLVVTEVTEETVNLAWDNEMRVTE.YLVVYTPTHEGGLEMQFRVPG..DQTSTIIQELEPGVEYFIRVFAILENKKSIPVSARVAT 
m       SEVSPPKDLIVTEVTEETVNLAWDNEMRVTE.YLIMYTPTHADGLEMQFRVPG..DQTSTTIRELEPGVEYFIRVFAILENKRSIPVSARVAT
p       SQVSPPKDLIVTEVTEETVNLAWDNEMRVTE.YLIVYTPTHEDGLEMQFRVPG..DQTSTTIRELEPGVEYFIRVFAILENKKSIPVSARVAT
n       SEVSPPKDLTVTDVTTQSVNLEWANEMKVTE.YLITYIPTSPGGLELDFRVPG..DQTTATIQELEPGVEYFVRVFAILRNQRSIPVSARVAT
 
HxB FN-III domain #2
TNRhum  .HLSTPQGLQFKTITETTVEVQWEPFSFSFDGWEISFIPKNN.E..GGVIAQVPSDVTSFNQTGLKPGEEYIVNVV.ALKEQARSPPTSASVST
TNRrat   HLSTPQGLQFKTITETTVEVQWEPFSFSFDGWEISFTPKNN.E..GGVIAQLPSDVTSFNQTGLKPGEEYIVNVV.ALKEQARGPPTSASVST 
TNRchk   NLATPQGLKFKTITETTVEVQWEPFSFPFDGWEISFIPKNN.E..GGVIAQLPSTVTTFNQTGLKPGEEYTVTVV.ALKDQARSPPASDSIST
TNCchk  .YLPAPEGLKFKSVRETSVQVEWDPLSISFDGWELVFRNMQKKDDNGDITSSLKRPETSYMQPGLAPGQQYNVSLH.IVKNNTRGPGLSRVITT
TNChum  .YLPAPEGLKFKSIKETSVEVEWDPLDIAFETWEIIFRNMNK.EDEGEITKSLRRPETSYRQTGLAPGQEYEISLH.IVKNNTRGPGLKRVTTT 
m       .YLPAPEGLKFKSIKETSVEVEWDPLDIAFETWEIIFRNMNK.EDEGEITKSLRRPETSYRQTGLAPGQEYEISLH.IVKNNTRGPGLKKVTTT
p       .YLPTPEGLKFKSIKETSVEVEWDPLDIAFETWEIIFRNMNK.EDEGEITKSLRRPETTYRQTGLAPGQEYEISLH.IVKNNTRGPGLKRVTTT
n       .HLPTTDDLRFKSVKETSVEVEWDPLDISFDTWDLIIRNTK..EENGEISTSLQRPVTSYVQTGLAPGETYNFSIH.VVKNSTRGPGLAKVTTT
 
HxB FN-III domain #3
TNRhum  .VIDGPTQILVRDVSDTVAFVEWIPPRAKVDFILLKYGLVGGEGGRTTFRLQP..PLSQYSVQALRPGSRYEVSVSAVRGTNESDSATTQFTT
TNRrat   VIDGPTQILVRDVSDTVAFVEWTPPRAKVDFILLKYGLVGGEGGKTTFRLQP..PLSQYSVQALRPGSRYEVSISAVRGTNESDASSTQFTT
TNRchk  .LIDGPTQILVRDVSDTVAFVEWTPPRARVDAILLKYGLADGEGGRTTFRLQP..PLSQYSLQALRPGARYHLAVSALRGANESQPALAQFTT
TNCchk  .KLDAPSQIEAKDVTDTTALITWSKPLAEIEGIELTYGPKDVPGDRTTIDLSE..DENQYSIGNLRPHTEYEVTLISRRGDMESDPAKEVFVT
TNChum  .RLDAPSQIEVKDVTDTTALITWFKPLAEIDGIELTYGIKDVPGDRTTIDLTE..DENQYSIGNLKPDTEYEVSLISRRGDMSSNPAKETFTT  
m       .RLDAPSHIEVKDVTDTTALITWFKPLAEIDSIELSYGIKDVPGDRTTIDLTH..EDNQYSIGNLRPDTEYEVSLISRRVDMASNPAKETFIT
p       .RLDAPSQIEAKDVTDTTALITWFKPLAEIDGIELTYGIKDVPGDRTTIDLTH..EENQYSIGNLKPDTEYEVSLISRRADMSSNPAKETFTT
n       .RLDAPSQVEVRDVTDSMALVTWFRPLAQIDGVILSYGTGSQP..PTVVELSE..DESQYSLGNLIPDTEYEVTLLSRRGLMTSDPVTETFTT
 
HxB FN-III domain #4
TNRhum  .EIDAPKNLRVGSRTATSLDLEWDNSEAEVQEYKVVYSTLAGEQYHEVLVPKGIGPTTRATLTDLVPGTEYGVGISAVMNSQQSVPATMNART
TNRrat   EIDAPKNLRVGSRTATSLDLEWDNSEAEAQEYKVVYSTLAGEQYHEVLVPKGIGPTTKTTLTDLVPGTEYGVGISAVMNSKQSIPATMNART
TNRchk   EIDAPKNLRVGSRTPASLELTWDNSEAEAHSYRVVYSTLAGEHYHEVLVPRDTGPTTRATLADLVPGTEYGIGISAVMDSQQSVPATMNART
TNCchk  .DLDAPRNLKRVSQTDNSITLEWKNSHANIDNYRIKFAPISGGDHTELTVPKGNQATTRATLTGLRPGTEYGIGVTAVRQDRESAPATINAGT
TNChum  .GLDAPRNLRRVSQTDNSITLEWRNGKAAIDSYRIKYAPISGGDHAEVDVPKSQQATTKTTLTGLRPGTEYGIGVSAVKEDKESNPATINAAT
m       .GLDAPRNLRRVSQTDNSITLEWRNVKADIDSYRIKYAPISGGDHAEIDVPKSQQATTKTTLTGLRPGTEYGIGVSAVKGDKESDPATINAAT
p       .GLDAPRNLRRISQTDNSITLEWRNGKAAADTYRIKYAPISGGDHAEVEVPRSPQTTTKATLTGLRPGTEYGIGVSAVKGDKESDPATINAAT
n       .DLDAPKNLRRVSQTDTTITLEWKNSQANVDLYRIKFAPLSTGDHAEITVPKSNQVTTKVTLTDLKPGTEYGIGVTAVKQDRESGPATINAAT
?? octopus?            DNWITLEWKNSRSSIDGYRIKYGPIKGGAHGEDMFPKRAGDTTWATITGLRPGTEY
?? fish?               DNSITLEWKNSRANVLNYRVKYGPLSGGEHGELVFPSGPQDTTQAKITGLAPGTEY
HxB FN-III domain #5
TNRhum  .ELDSPRDLMVTASSETSISLIWTKASGPIDHYRITFTPSSGIASEVTVPKDR....TSYTLTDLEPGAEYIISVTAERGRQQSLESTVDAFT
TNRrat   ELDSPRDLMVTASSETSISLIWTKASGPIDHYRITFTPSSGISSEVTVPRDR....TSYTLTDLEPGAEYIISITAERGRQQSLESTVDAFT 
TNRchk   ELDSPRDLLVTASTETSISLSWTKAMGPIDHYRVTFTPASGMASEVTVSRNE....SQLTLSELEPGTEYTISIIAERGRQQSLEATVDAFT
TNCchk  .DLDNPKDLEVSDPTETTLSLRWRRPVAKFDRYRLTYVSPSGKKNEMEIPVDS....TSFILRGLDAGTEYTISLVAEKGRHKSKPTTIKGST
TNChum  .ELDTPKDLQVSETAETSLTLLWKTPLAKFDRYRLNYSLPTGQWVGVQLPRNT....TSYVLRGLEPGQEYNVLLTAEKGRHKSKPARVKAST
m       .EIDAPKDLRVSETTQDSLTFFWTTPLAKFDRYRLNSSLPTGHSMEVQLPKDA....TSHVLTDLEPGQEYTVLLIAEKGRHKSKPARVKAST
p       .DLDPPKDFRVSELKESSLTLLWRTPLAKFDRYRLNYGLPSGQPVEVQLPRNA....TSYILRGLEPGQEYTILLTAEKGRHKSKPARVKAST
n       .DLDAPKDLQISGSSESTLSLRWKRPLAKFERYLISYISNTGKKNEIEVPGNV....NSFVLTGLDAGTEYSIAIVAEKGRHKSKPASVIGST
zFish                                                   IPGAA....NTYILTGLNPGMLHTITLTAERGRKMSAPATLSAST
_____________________________________________
Alternatively spliced domains
 
TN-C FN-III domain Q ZebraFish UNIQUE!
zFish  DEEKPQVGNITISDVSWDSFSMSWDLDRGEVEGFLIEVSDPDGLSDGQNHTLSGQEF..SLAVTDLSPSTFYRVTLYGLYKGELLDPVFAEAIT
 
    TNR  alt splice domain R1 doesn't match anything 
TNRhum  .GFRPISHLHFSHVTSSSVNITWSDPSPPADRLILNYSPRDEEEEMMEVSLDATKR..HAVLMGLQPATEYIVNLVAVHGTVTSEPIVGSITT
TNRrat  .GFRPISHLHFSHVTSSSVNITWSDPSPPADRLILNYSPRDEEEEMMEVLLDATKR..HAVLMGLQPATEYIVNLVAVHGTVTSEPIVGSITT 
TNRchk  .GVRPITQLHFSQLTSSSVNITWSDPSPPADRLVLTYSPRDEEAP.QQLALDGTRR..HASLTGLRPSTEYLVSLVAVHGAVSSEPVTGSITT 
 
 
 
HxB FN-III domain A1
TNCchk  EEEPELGNLSVSETGWDGFQLTWTAADGAYENFVIQVQQSDNPEETWNITVPGGQH..SVNVTGLKANTPYNVTLYGVIRGYRTKPLYVETTT
TNChum  EQAPELENLTVTEVGWDGLRLNWTAADQAYEHFIIQVQEANKVEAARNLTVPGSLR..AVDIPGLKAATPYTVSIYGVIQGYRTPVLSAEAST 
m       EEVPSLENLTVTEAGWDGLRLNWTADDLAYEYFVIQVQEANNVETAHNFTVPGNLR..AADIPGLKVATSYRVSIYGVARGYRTPVLSAETST
 
HxB FN-III domain A2
TNChum  GETPNLGEVVVAEVGWDALKLNWTAPEGAYEYFFIQVQEADTVEAAQNLTVPGGLR..STDLPGLKAATHYTITIRGVTQDFSTTPLSVEVLT 
m       GTTPNLGEVTVAEVGWDALTLNWTAPEGAYKNFFIQVLEADTTQTVQNLTVPGGLR..SVDLPGLKAATRYYITLRGVTQDFGTAPLSVEVLT
 
HxB FN-III domain A3
TNChum  EEVPDMGNLTVTEVSWDALRLNWTTPDGTYDQFTIQVQEADQVEEAHNLTVPGSLR..SMEIPGLRAGTPYTVTLHGEVRGHSTRPLAVEVVT 
 
HxB FN-III domain A4
TNChum  EDLPQLGDLAVSEVGWDGLRLNWTAADNAYEHFVIQVQEVNKVEAAQNLTLPGSLR..AVDIPGLEAATPYRVSIYGVIRGYRTPVLSAEAST 
m       EDLPQLGGLSVTEVSWDGLTLNWTTDDLAYKHFVVQVQEANNVEAAQNLTVPSSLR..AVDIPGLKADTPYRVSIYGVIQGYRTPMLSTDVST
r9      SEIPELEGLTVTEVSWDSLTLNWTTDDLAYKHFIIQVQEANNVEAARNLTVSGSLR..VVDIPGLKADTPYTVSIYGVIQGYRTPMLSADVST
Only 12 mismatches between mouse and rat; 27 mismatches (63 identities) with human
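
Hand counts like the one above can be rechecked with a few lines of Python. The sketch below compares the three domain A4 rows column by column, skipping any column where either row has a gap ("."). The function name and the optional start argument are illustrative assumptions, not part of the original alignment. The totals it prints depend on which columns are included (for instance whether the first few residues are counted), so they need not agree exactly with the hand count.

```python
GAP = "."


def compare_aligned(a, b, start=0):
    """Return (identities, mismatches) for two aligned rows of equal length.

    Columns before `start`, and columns with a gap in either row, are skipped.
    """
    if len(a) != len(b):
        raise ValueError("aligned rows must have the same length")
    identities = 0
    mismatches = 0
    for x, y in zip(a[start:], b[start:]):
        if x == GAP or y == GAP:
            continue
        if x == y:
            identities += 1
        else:
            mismatches += 1
    return identities, mismatches


# Domain A4 rows copied from the alignment above.
A4_HUMAN = "EDLPQLGDLAVSEVGWDGLRLNWTAADNAYEHFVIQVQEVNKVEAAQNLTLPGSLR..AVDIPGLEAATPYRVSIYGVIRGYRTPVLSAEAST"
A4_MOUSE = "EDLPQLGGLSVTEVSWDGLTLNWTTDDLAYKHFVVQVQEANNVEAAQNLTVPSSLR..AVDIPGLKADTPYRVSIYGVIQGYRTPMLSTDVST"
A4_RAT = "SEIPELEGLTVTEVSWDSLTLNWTTDDLAYKHFIIQVQEANNVEAARNLTVSGSLR..VVDIPGLKADTPYTVSIYGVIQGYRTPMLSADVST"

for label, other in (("mouse", A4_MOUSE), ("human", A4_HUMAN)):
    ident, mism = compare_aligned(A4_RAT, other)
    print(f"rat vs {label}: {ident} identities, {mism} mismatches")
```

Passing a nonzero start, for example compare_aligned(A4_RAT, A4_MOUSE, start=3), restricts the comparison to the columns after that offset.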
HxB FN-III domain B
TNCchk  GAHPEVGELTVSDITPESFNLSWTTTNGDFDAFTIEIIDSNRLLEPMEFNISGNSR..TAHISGLSPSTDFIVYLYGISHGFRTQAISAAATT
TNChum  AKEPEIGNLNVSDITPESFNLSWMATDGIFETFTIEIIDSNRLLETVEYNISGAER..TAHISGLPPSTDFIVYLSGLAPSIRTKTISATATT 
m       AREPEIGNLNVSDVTPKSFNLSWTATDGIFDMFTIEIIDSNRLLQTAEHNISGAER..TAHISGLPPSTDFIVYLSGIAPSIRTKTISTTATT
r       RKEPEIGNLNISDVTPESFNLSWTATDRIFDMFTIEIIDSNRLLQTAEHNISGAER..TAHISGLPPSTDFIVYLSGIAPSIRTKTISTTATT
p       AGEPEIGNLSVSDITPESFSLSWTATEGAFETFTIEIIDSNRFLETMEYNISGAER..TAHISGLRPGNDFIVYLSGLAPGIQTKPISATATT
 
HxB FN-III domain AD2
TNCchk  VEEPLLSKLTVSNATSDSMLLMWEAQDNAFDHFILEVRNSDSPLDSLVQIVPGASR..HYVVTNLKAATNYTVQLHGVVDGQGGQTLTALATT
 
HxB FN-III domain AD1
TNChum  EPKPQLGMLIFSNITPKSFNMSWTTQAGLFAKIVINVSDAHSLHESQQFTVSGDAK..QAHITGLVENTGYDVSVAGTTLAGDPTRPLTAFVIT
TNCchk  EAEPQLGTLTLTNVTPDSFNLSWTTRDGPFAKFVIHVRDSFAAHEPQELTVSGGAR..SAHISGLLDYTGYDINIKGTTDAGVHTEPLTAFVMT
 
HxB FN-III domain C
TNChum  EALPLLENLTISDINPYGFTVSWMASENAFDSFLVTVVDSGKLLDPQEFTLSGTQR..KLELRGLITGIGYEVMVSGFTQGHQTKPLRAEIVT 
TNCchk  EAMPPLENLTVSDINPYGFTVSWMASENAFDSFLVVVVDSGKLLDPQEFLLTGAQR..QLKLKGLITGIGYEVMLYGFAKGHQTKPLSTVAVT
 
HxB FN-III domain D 
TNCchk  EAEPEVDNLLVSDATPDGFRLSWTADDGVFDSFVLKIRDTKRKSDPLELIVPGHER..THDITGLKEGTEYEIELYGVSSGRRSQPINSVATT
TNChum  EAEPEVDNLLVSDATPDGFRLSWTADEGVFDNFVLKIRDTKKQSEPLEITLLAPER..TRDITGLREATEYEIELYGISKGRRSQTVSAIATT 
m       EAEPEVDNLLVSDATPDGFRLSWTADEGIFDSFVIRIRDTKKQSEPQEISLPSPER..TRDITGLREATEYEIELYGISRGRRSQPVSAIATT
r       EAEPEVDNLLVSDATPDGFCLSWTADEGIFDSFVIRIRDTKKQSEPQEITLPSPDR..TRDITGLREATEYEIELYGISRGRRSQPVSAIATT
p       EAEPEVDNLLVSDATPDGFRLSWTADEGVFDSFVLKIRDTKKQSEPLEITLLASER..TRDITGLREATEYEIELYGISSGKRSQPVSAIATT
 
_____________________________________________________
 
Last three invariant FN-III domains
 
Human HxB FN-III domain #6
TNRhum  .GIDPPKDITISNVTKDSVMVSWSPPVASFDYYRVSYRPTQVGRLDSSVVPNTVTEF...TITRLNPATEYEISLNSVRGREESERICTLVHT
TNRrat  .GIDPPKNITISNVTKDSLTVSWSPPVAPFDYYEYPIDHPS.GRLDSSVVPNTVTEF...TITRLYPASQYEISLNSVRGREESERICTLVHT
TNRchk  .GMDAPKDLRVGNITQDSMVIYWSPPVAPFDHYRISYRAAE.GRTDSTAIGNDATEY...IMRLLQPATKYEIGVKSVRGREESEVASITTYT
TNXhum  PVLESPRDLQFSEIRETSAKVNWMPPPSRADSFKVSYQLADGGEPQSVQVDGQARTQK...LQGLIPGARYEVTVVSVRGFEESEPLTGFLTT
TNXmus  PVLESPRDLQFSDIGETSAKVKWVPPTSRVDSFKISYQLADGGEPQSVQVDGRTQTQI...LQGLIPDTRYEVTVVSVRGFEESEPLTGFLTT
TNXpig                                                                               MRGFEESEPLTGFLTT
TNCchk  .VVGSPKGISFSDITENSATVSWTPPRSRVDSYRVSYVPITGGTPNVVTVDGSK.TRT..KLVKLVPGVDYNVNIISVKGFEESEPISGILKT
TNChum  .AMGSPKEVIFSDITENSATVSWRAPTAQVESFRITYVPITGGTPSMVTVDGTK.TQT..RLVKLIPGVEYLVSIIAMKGFEESEPVSGSFTT
m       .AMGSPKEIMFSDITENAATVSWRAPTAQVESFRITYVPMTGGAPSMVTVDGTD.TET..RLVKLTPGVEYRVSVIAMKGFEESDPVSGTLIT
r       .AMGSPKEIMFSDITENAATVSWRAPTAQVESFRITYVPVTGGPPSMVTVDGTD.TET..RLVRLTPGVEYHVSVIAMKGFEESDPVSGSLIT
p       .AMGSPKEITFSDITENSATVSWMVPTAQVESFRITYVPITGGAPSVVTVDGTK.TQT..RLLRLLPGVEYLVSVIAVKGFEESEPVSGTLTT
n       .AVGAPKGLSFSDITENSATVSWSAPQTRVDSFKVTYVPASGGVPQTVTVDGTK.TRT..TLVKLTPGVEYIVTVVSVKALDESGPISGPLTT
zFish   .GLGAPKGIRFSDVTDTSATVHWTMPHTRVDNYRVIYVPIQGGSPLTLRVDGGE.SQA..MLSNLTPGVTYQVTVIAVKGLEESEPGSERVTT     
 
Human HxB FN-III domain #7
TNRhum  .AMDNPVDLIATNITPTEALLQWKAPVGEVENYVIVLTHFAVAGETILVDGVSEE....FRLVDLLPSTHYTATMYATNGPLTSGTISTNFST
TNRrat   AMDSPMDLIATNITPTEALLQWKAPMGEVENYVIVLTHFAMAGETILVDGVSEE....FQLVDLLPRTHYTVTMYATSGPLVSGTIATNFST
TNRchk  .AMDAPLGVTATNITPTEALLQWNPPLMDVESYVLVLTR..HTGETILVDGINQE....YQLTNLQPSTTYTVAMYATNGPLTSQTISTNFTT
TNXhum  .VPDGPTQLRALNLTEGFAVLHWKPPQNPVDTYDVQVTAPGAPPLQAETPGSAVD....YPLHDLVLHTNYTATVRGLRGPNLTSPASITFTT
TNXmus  .VPDGPTQLRALNLTDGSALLHWKPPHKPVDKYDVEVESPGAPPLQASAPGSAVD....YPLTDLALDTNYTATVRGLRGPNFTSPASITFTT 
TNXpig  .VPDGPTQLRALNLTEGSALLHWKPPQTPVDTYDVKVTASGAPSLQGSAPGSAVD....YPLHGLELHTNYTATLRGLRGPNLTSPASITFTT
TNCchk  .ALDSPSGLVVMNITDSEALATWQPAIAAVDNYIVSYSSEDEPEVTQMVSGNTVE....YDLNGLRPATEYTLRVHAVKDAQKSETLSTQFTT
TNChum  .ALDGPSGLVTANITDSEALARWQPAIATVDSYVISYTGEKVPEITRTVSGNTVE....YALTDLEPATEYTLRIFAEKGPQKSSTITAKFTT 
m       .ALDGPSGLLIANITDSEALAMWQPAIATVDSYVISYTGERVPEVTRTVSGNTVE....YELHDLEPATEYILSIFAEKGQQKSSTIATKFTT
r       .ALDGPSGLLTANITDSEALAMWQPAIATVDSYVISYTGERVPEITRTVSGNTVE....LELHDLEPATEYTLSVFAEKGHQKSSTTATKFTT
p       .ALDGPSGLVTANITDSEALAMWQPAIAPVDHYVISYTGDRVPEITRTVSGNTVE....YALTNLEPATEYTLRIFAEKGPQKSSTITTKFTT
n       .ALDSPSGLKAVNVTETEAIALWQPSIASVDNYVLSYAADNDPETTKTISGNNVE....SDITGLQPSTQYTLTIYAVRGPQKSATMSTKFTT
TNCzfsh .ALDKPRGLTAVNISDTEALLLWQPSIATVDGYVITYSADSVAPVMERVSGNTVE....FEMSSLTPATLYTVKVYAFRDTAKSAATSTDFTT
 
Human HxB FN-III domain #8
TNRhum  .HLDPPANLTASEVTRQSALISWQPPRAEIENYVLTYKSTDGSRKELIVDAED....TWIRLEGLLENTDYTVLLQAAQDTTWSSITSTAFTT
TNRrat  .LLDPPANLTASEVTRQSALISWQPPRAAIENYVLTYKSTDGSRKELIVDAED....TWIRLEGLSENTDYTVLLQAAQEATRSSLTSTIFTT
TNRchk  .LLDPPTNLTASEVTRRSALLSWVPPVGDIENYILTYRSTDGSRKELIVDAED....TWIRLEGLSETTQYTVRLQAAQNAMRSGFISTTFTT
TNXhum  .GLEAPRDLEAKEVTPRTALLTWTEPPVRPAGYLLSFHTPGGQNQEILLPGGI....TSHQLLGLFGSTSYNARLQAMWGQSLLPPVSTSFTT
TNXmus  .GLKPPQDLEAKEVTPRTALLTWTEPEVPPTGYLLSFDTPGGQIQEILLPAGT....TSHRLLRLFPSTFYSAQLRAIWGESLTPPVLTSFTT
TNXpig  .GLEAPQDLEAKEVTPRTVLLTWTAPQVPPTGYLITFNTPGGQTQEILLPGGV....TSHRLQGLFPSTPYSAWLRAMWGESFTPPVSTSFTT
TNYchk   ..GAPGTLWVGTLWPRSAHLHWAPPHVPPEGYNLIYGPPGGPVKTLQLPPEA....TSKELWGLEPSGRYRVQL...WGRGL.EPLETTFDT
TNCchk  .GLDAPKDLSATEVQSETAVITWRPPRAPVTDYLLTYESIDGRVKEVILDPET....TSYTLTELSPSTQYTVKLQALSRSMRSKMIQTVFTT
TNChum  .DLDSPRDLTATEVQSETALLTWRPPRASVTGYLLVYESVDGTVKEVIVGPDT....TSYSLADLSPSTHYTAKIQALNGPLRSNMIQTIFTT
m       .DLDSPREFTATEVQSETALLTWRPPRASVTGYLLVYESVDGTVKEVIVGPDT....TSYSLADLSPSTHYSARIQALSGSLRSKLIQTIFTT
r       .DLDSPRELTATEVQSETAFLTWRPPRASVTGYLLVYESVDGTVKEVIVGPDT....TSYSLADLSPSTHYTARIQALSGSLRSKLIQTIFTT
p       .DLDSPRDLTATEVQSETALLTWRPPRASVTGYLLVYESVDGTLKEVVVGPET....TSYSLSGLSPSTHYTARIQALNGPLRSKMSQTVFTT
n       .ALDAPRDLAASEIQSETALLTWRPPRSIITGFILIYEAVDGTLKEVILGPDM....TSYNLVDLSPSTQYTVRLLAMNGDVRSKTIQTVFTT
TNCzfsh .DVDAPQNLAASNIQTETAMLTWKPPRADISGYILSFESADGVVKEVVLSPTA....TFYSMSQLTASTEYTVKLQAIAGPKRSRVISTVFLT
 
 
Hexabrachion terminal knob - fibrinogen-like domain
Short linker (according to Doolittle this is really part of the fibrinogen domain)
 
                            TNRhum     GGRVFPHP
                            TNRrat     GGRVFSHP 
                            TNRchk     GGRVFANP
                            TNXhum     GGLRIPFP
                            TNXmus     GGQRIPFP
                            TNXpig     GGLRIPFP 
                            TNYchk     PPLPHPHP
                            TNCchk     TGLLYPYP
                            TNChum     IGLLYPFP
                            TNCmus     IGLLYPFP
                            TNCrat     IGLLYPFP  
                            TNCpig     IGLLYPFP
                            TNCnwt     TGLLYPFP
                            TNCzfsh    IGVLYKHP
                        
 
TNRhum   QDCAQHLMNGDTLSGVYPIFLNGELSQKLQVYCDMTTDGGGWIVFQRRQNGQTDFFRKWADYRVGFGNVEDEFW
TNRrat   QDCAQHLMNGDTLSGVYTIFLNGELSHKLQVYCDMTTDGGGWIVFQRRQNGQTDFFRKWADYRVGFGNLEDEFW
TNRchk   QDCAQHLMNGDTLSGVYTISINGDLSQRVQVFCDMSTDGGGWIVFQRRQNGLTDFFRKWADYRVGFGNLEDEFW
TNXhum   RDCGEEMQNGAGASRTSTIFLNGNRERPLNVFCDMETDGGGWLVFQRRMDGQTDFWRDWEDYAHGFGNISGEFW
TNXmus   RDCGEELKNGPSASKTTTIFLNGNRERPLDVFCDMETDGGGWLVFQRRMDGQTDFWRDWEEYAHGFGNISGEFW
TNXpig   RDCGEEMQNGVSTSRTTTIFLNGNRERPLNVFCDMETDGGGWLVFQRRMDGKTDFWRDWEDYAHGFGNISGEFW
TNYchk   RDCAEEQLNGPGPSREVLIFLGGDRQRPLHVFCDMESNGGGWLVFQRRMDGGTDFWRGWEEYVHGFGNVSGEFW
TNCchk   KDCSQALLNGEVTSGLYTIYLNGDRTQPLQVFCDMAEDGGGWIVFLRRQNGKEDFYRNWKNYVAGFGDPKDEFW 
TNChum   KDCSQAMLNGDTTSGLYTIYLNGDKAQALEVFCDMTSDGGGWIVFLRRKNGRENFYQNWKAYAAGFGDRREEFW
TNCmus   RDCSQAMLNGDTTSGLYTIYINGDKTQALEVYCDMTSDGGGWIVFLRRKNGREDFYRNWKAYAAGFGDRREEFW
TNCrat   RDCSQAMLNGDTTSGLYTIYINGDKTQALEVYCDMTSDGGGWIVFLRRKNGREDFYRNWKAYATGFGDRRE
TNCpig   RDCSQAMLNGDTTSGLYTIYVNNDKAQKLEVFCDMTSDSGGWIVFLRRKNGREDFYRNWKAYAAGFGDLKEEFW
TNCnwt   KDCSQALLNGETASGLYTIYLNGDKAKPQEEYCDMSEYGGGWIVFLRRVDGKEDFYRNWKTYTAGFGDPTKEFF
TNCzfish KDCSQALLNGDTTSGLYTIYLRGDESQPLQVYCDMTTDGGGWIVFVRRQSGKVEFFRNWKNYTAGFGDLNDEFW
 
TNRhum   LGLDNIHRITSQGRYELRVDMRDGQEAAFASYDRFSVEDSRNLYKLRIGSYNGTAGDSLSYHQGRPFSTEDRDNDVAV
TNRrat   LGLDNYHRITAQGRYELRVDMRDGQEAVFAYYDKFAVEDSRSLYKLRIGGYNGTAGDSLSYHQGRPFSTEDRDNDVAV
TNRchk   LGLDNIHKITSQGRYELRIDMRDGQEAAYAYYDKFSVGDSRSLYKLRIGDYNGTSGDSLTYHQGRPFSTKDRDNDVAV 
TNXhum   LGNEALHSLTQAGDYSIRVDLRAGDEAVFAQYDSFHVDSAAEYYRLHLEGYHGTAGDSMSYHSGSVFSARDRDPNSLL 
TNXmus   LGNEALHSLTQAGDYSLRVDLRAGKEAVFAQYDFFRVDSAKENYRLHLEGYHGTAGDSMSYHSGSVFSARDRDPNNLL
TNXpig   LGNEALHSLTKAGDYSLRVDLRAGEEAVFAQYESFQVDSAAEHYRLHLEGYHGTAGDSMSYHSGSVFSARDRDPNNLL
TNYchk   LGNAALHTLTASGPTELRVDLRTPSDSAFARYRDFAVSGPEDNFRLHLGAYSGTAGDALSYHAGSPFSTRDHDPRGRP
TNCchk   IGLENLHKISSQGQYELRVDLRDRGETAYAVYDKFSVGDAKTRYRLRVDGYSGTAGDSMTYHNGRSFSTFDKDNDSAI
TNChum   LGLDNLNKITAQGQYELRVDLRDHGETAFAVYDKFSVGDAKTRYKLKVEGYSGTAGDSMAYHNGRSFSTFDKDTDSAI  
TNCmus   LGLDNLSKITAQGQYELRVDLQDHGESAYAVYDRFSVGDAKSRYKLKVEGYSGTAGDSMNYHNGRSFSTYDKDTDSAI
TNCpig   LGLDALSKITAQGQYELRVDLRDHGETAYAVYDRFSVGDARTRYKLKVEGYSGTAGDSMAYHNGRSFSTFDKDTDSAI
TNCnwt   LGLENLHQITSQGQYELRVDLRDGAETAYAVYDKFFVGDSKTAYRLKVDGYSGTAGDSMTYHSGALFSTFDKDNDSAI
TNCzfish LGLSNLHKITSFGQYELRVDLRDKGESAYAQYDKFSISEPRARYKVHVGGYSGTAGDSMTYHHGRPFSTYDNDNDIAV
 
TNRhum   TNCAMSYKGAWWYKNCHRTNLNGKYGESRHSQGINWYHWKGHEFSIPFVEMKMRPYNHRLMAGRKRQSLQF
TNRrat   TNCAMSYKGAWWYKNCHRTNLNGKYGESRHSQGINWYHWKGHEFSIPFVEMKMRPYIHRLTAGRKRRALKF
TNRchk   TNCAMSYKGAWWYKNCHRTNLNGKYGESRHSQGINWYHWKGHEFSIPFVEMKMRPYNHRNISGRKRRSLQL             
TNXhum   ISCAVSYRGAWWYRNCHYANLNGLYGSTVDHQGVSWYHWKGFEFSVPFTEMKLRPRNFRSPAGGG
TNXmus   ISCAVSYRGAWWYRNCHYANLNGLYGSTVDHQGVSWYHWKGFEFSVPFTEMKLRPRNFQVPTRGT
TNYchk   RPCAVAYTGAWWYRNCHYANLNGRYGVPYDHQGINWYPWKGFEYSIPFTEMKLRPQRD
TNXpig   ISCAVSYRGAWWYRNCHYANLNGLYGSTVDHQGVSWYYWKGFEFSVPFTEMKLRPRSFRPPGRGG
TNCchk   TNCALSYKGAFWYKNCHRVNLMGRYGDNNHSQGVNWFHWKGHEYSIQFAEMKLRPSSFRNLEGRRKRA
TNChum   TNCALSYKGAFWYRNCHRVNLMGRYGDNNHSQGVNWFHWKGHEHSIQFAEMKLRPSNFRNLEGRRKRA
TNCpig   TNCALSYKGAFWYKNCHRVNLMGRYGDNSHSQGVNWFHWKGHEYSIQFAEMKLRPSNFRNLEGRRKRA
TNCnwt   TNCALSYKGAFWYKNCHRVNLMGRYGDNSHSQGVNWFHWKGHEYSIQFAEMKVRPVSFRNLEGRRRRA
TNCzfish TNCALSYKGAFWYKNCHRVNIMGRYGDNSHSKGVNWFHWKGHEHSVEFAEMKIRPANFRNFEGRKKRS