Disclaimer: These sequences have been collected over the years and
added to the alignment, always by hand. I have not gone back and
checked for accuracy, so there may be mistakes. Sorry, I haven't even
tabulated all the accession numbers, but I will state that the best
reference for human TN-C is X78565 (Ghezi...Zardi JBC
270:3429-3434), which has corrected a few errors in previous
entries.
Intron-exon boundaries have been determined for TNChum, TNRhum,
TNXhum, and partially for TNXpig. These are indicated by red and
double underline, but only in a separate Word file.
First 22 aa missing from cell culture hum TN-C (Zardi)
TNRhum   M G A D GET V V L K N M L I G V N L I L L G
TNRrat   M G I E GET V V L K N M L I G V N L I L L G
TNRchk   M G T D SEN P V L R N V L I S F N L L L L G
TNCchk   M G L P S Q V L ACA I L G L L Y Q H A S G
TNChum   M G A M T Q L L A G V F L A F L A L A T E G
mouse    M G A V T W L L P G I F L A L F A L T P E G
pig      M G V V T R L L V G T F L A S L A L P A Q G
nwt      M E A T GCARV L A C L T L R L L C H H S D G

These aa also missing from chick brain TN-C (tissue TN-C?; Jones)
TNRhum   S M I K P S E C Q L EVTTERVQRQSVEE
TNRrat   S M L K P S E C R L EVTTERVQRQTVEE
TNRchk   A V L K P F E C R L EVTTEPAERPAVDE
TNCchk   G L I K R I I R Q K R
TNChum   G V L K K V I R H K R
mouse    G V L K K I I R H K R
pig      G V L K K V I R H K R
nwt      G L I K K I I R Q K R

TNRhum   E G G IAN Y N T S S K E Q P V V F N H V Y N I N V P L D N L C S S G L E A S A EQE
TNRr     E G G ASS Y N T S S K E Q P M V F N H V Y N I N V P L E S L C S S G L E A S A EQD
TNRc     E G G LAN C S P P V K E Q P M V F H H I Y N I N V P V D S C C S S M L R S S A E E
TNCchk   E T G L N V T L P E D N Q P V V F N H V Y N I K L P V G S L C S V D L D T A S G D
TNChum   Q S G V N A T L P E E N Q P V V F N H V Y N I K L P V G S Q C S V D L E S A S G E
m        E S G L N M T L P E E N Q P V V F N H I Y N I K L P M G S Q C S V D L E S A S G E
p        Q T G V N V T L P E E S Q P V V F N H V Y N I K L P V G S Q C S V D L E S A S G D
nwt      E S GLTSN V T L P D D N Q P V V F N H V Y N I N L P M G S M C S V D L D P V P G G

Underlined segment in TNRchk is alternatively spliced!
EPPVLASE\
TNXhum   .....no homology, no Cys....G E K Q V V F T H R I N L PPS T G C G C P P G T
^
TNRhum   V S . . A E D E T L A E Y M G Q T S D H E S Q V T F T H R I N F P K K A C P C A S SAQ V
TNRrat   V S . . A E D D T L A E Y T G Q T S D H E S Q V T F T H K I N L P K K A C P C A S SAQ V
TNRchk   V S . . S E D D R L A E Y T E Q T S D S E S Q V T F T H R I N L P K Q A C K C S T SLP S
TNCchk   A D L K A E I E P V K N Y E E H T V N E G N Q I V F T H R I N I P R R A C G C A A A P D
TNChum   K D L A P P S E P S E S F Q E H T V D G E N Q I V F T H R I N I P R R A C G C A A A P D
mouse    K D L T P T P E S S G S F Q E H T V D G E N Q I V F T H R I N I P R R A C G C A A A P D
pig      K D L A A P S E P S E S V Q E H T V D G E N Q I V F T H R I N I P R R A C G C A A A P D
nwt      A G Q N A L S E P G S Q Q E E H T V D G N N Q I V F T H R I N I R . G A C G C A A A P D

heptad of alpha-helix * * * * * * *
TNXhum   V Q A L R V R L E I L E E L V K G L K E Q C T . G . G C C
TNRhum   L Q E L L S R I E M L E R E V S V L R D Q C . . N A N C C
TNRrat   L Q E L L S R I E M L E R E V S V L R D Q C . . N T N C C
TNRch    L Q E L L S R I E M L E R E V S M L R D Q C . . N S N C C
TNCchk   I K D L L S R L E E L E G L V S S L R E Q C A S G A G C C
TNChum   V K E L L S R L E E L E N L V S S L R E Q C T A G A G C C
mouse    V K E L L S R L E E L E L L V S S L R E Q C T M G T G C C
pig      V K E L L S R L E E L E N L V S S L R E Q C T S G A G C C
nwt      I K Y L L S L L E E L E G L V S S L R E Q C T S G A G C C

TNXhum   PAS A Q A G T G Q T D V R T L C S L H G V F D L S R C T
TNRhum   . Q E S A A T G Q L D Y I P H C S G H G N F S F E S C G
TNRrat   . Q E S A A T G Q L D Y V P H C S G H G N F S F E S C G
TNRchk   . Q E N A A T G R L D Y T L P C S G H G N F S L E S C R
TNCchk   P N S Q T A E G R L D T A P Y C S G H G N Y S T E I C G
TNChum   L . . Q P A T G R L D T R P F C S G R G N F S T E G C G
mouse    L . . Q P A E G R L D T R P F C S G R G N F S A E G C G
pig      L . . Q P A E G R L D T R P F C S G R G N F S T E G C G
nwt      G G S Q T A E G A V D T K P Y C N G R G N Y S T E T C S
EGF-like domains
TNXhum   C S C E P G W G G P T C
TNRhum   C I C N E G W F G K N C
TNRrat   C I C N E G W F G K N C
TNRchk   C I C S E G W A G S N C  (it fits as well here as any)
TNCchk   C V C E P G W K G P N C
TNChum   C V C E P G W K G P N C
m        C V C E P G W K G P N C
p        C V C E P G W K G P N C
n        C I C E P G W T G P N C

TNRegf domains put at the end. No attempt was made to align TNXegf domains

EGF domain number 1
TNCchk   S E P A C P R N C L N R G L C V R G K C I C E E G F T G E D C
TNChum   S E P E C P G N C H L R G R C I D G Q C I C D D G F T G E D C
m        S E P D C P G N C N L R G Q C L E G Q C I C D E G F T G E D C
p        S E P E C P S N C H L R G Q C V D G Q C V C N E G F T G E D C
n        S E S E C P G H C H N R G K C V N G K C I C D E G F T G M D C

EGF #2 G is missing from Zardi seq
TNCchk   S Q A A C P S D C N D Q G K C V D G V C V C F E G Y T G P D C
TNChum   S Q L A C P S D C N D Q G K C V N G V C I C F E G Y A G A D C
m        S Q L A C P N D C N D Q G R C V N G V C V C F E G Y A G P D C
p        S Q L A C P S D C N D Q G K C V N G V C V C F E G Y S G V D C
n        S E V T C P D D C N D Q G R C V N G I C V C F E G Y G G E D C

EGF #3
TNCchk   G E E L C P H G C G I H G R C V G G R C V C H E G F T G E D C
TNChum   S R E I C P V P C S EEH G T C V D G L C V C H D G F A G D D C
m        G L E V C P V P C S EEH G M C V D G R C V C K D G F A G E D C
p        S R E T C P V P C S EEH G R C V D G R C V C Q E G F A G E D C
n        G Q E I C Q V E C S E F G K C V N G Q C V C D E G F T G E D C

EGF #4 very conserved
TNCchk   N E P L C P N N C H N R G R C V D N E C V C D E G Y T G E D C
TNChum   N K P L C L N N C Y N R G R C V E N E C V C D E G F T G E D C
m        N E P L C L N N C Y N R G R C V E N E C V C D E G F T G E D C
p        N E P L C L H N C H G R G R C V E N E C V C D E G F T G E D C
n        S E P R C P N N C N N R G R C V E D E C V C D E G F T G D D C

EGF #5 very conserved
TNCchk   G E L I C P N D C F D R G R C I N G T C F C E E G Y T G E D C
TNChum   S E L I C P N D C F D R G R C I N G T C Y C E E G F T G E D C
m        S E L I C P N S C F D R G R C I N G T C Y C E E G F T G E D C
p        G E L I C P K D C F D R G R C I N G T C Y C D E G F E G E D C
n        S E L I C P N D C F D R G R C I N G V C F C D E G F T G E D C

EGF #6 is poorly conserved, especially on left
TNCchk   G E L T C P N N C N G N G R C E N G L C V C H E G F V G D D C
TNChum   G K P T C P H A C H T Q G R C E E G Q C V C D E G F A G L D C
m        G E L T C P N D C Q G R G Q C E E G Q C V C N E G F A G A D C
p        G R L A C P H G C R G R G R C E E G Q C V C D E G F A G A D C
n        G E L T C P N N C N N R G R C V N G L C V C D D G F Q G D D C

chicken #7 not very similar
TNCchk   S Q K R C P K D C N N R G H C V D G R C V C H E G Y L G E D C
TNChum   S E K R C P A D C H N R G R C V D G R C E C D D G F T G A D C
m        S E K R C P A D C H H R G R C L N G Q C E C D D G F T G A D C
p        S E R R C P S D C H N R G R C L D G R C E C D D G F E G E D C
n        S E L R C P N D C N D R G R C V N G K C V C K E G F M G E D C

chicken #8 is missing
TNChum   G E L K C P N G C S G H G R C V N G Q C V C D E G Y T G E D C
m        G D L Q C P N G C S G H G R C V N G Q C V C D E G Y T G E D C
p        G E L R C P G G C S G H G R C V N G Q C V C D E G R T G E D C

EGF #9
TNCchk   G E L R C P N D C H N R G R C I N G Q C V C D E G F I G E D C
TNChum   S Q L R C P N D C H S R G R C V E G K C V C E Q G F K G Y D C
m        S Q R R C P N D C H N R G L C V Q G K C I C E Q G F K G F D C
p        S Q L R C P N D C H G R G R C V Q G R C E C E H G F Q G Y D C
n        A D L R C P N D C N N R G R C V N G Q C V C D E G F M G E D C

EGF #10
TNCchk   G E L R C P N D C H N R G R C V N G Q C E C H E G F I G E D C
TNChum   S D M S C P N D C H Q H G R C V N G M C V C D D G Y T G E D C
m        S E M S C P N D C H Q H G R C V N G M C I C D D D Y T G E D C
p        S E M S C P H D C H Q H G R C V N G M C V C D D G Y T G E D C
n        S D L R C P G D C N N R G R C V N G Q C V C D E G F R G E D C

EGF #11
TNRhum   S E P Y C P L G C S S R G V C V D G Q C I C D S E Y S G D D C
TNRrat   S E P Y C P L G C S S R G V C V D G Q C I C D S E Y S G D D C
TNRchk   S E P R C P R G C S S R G V C L E G Q C V C D N D Y G G E D C
TNCchk   G E L R C P N D C N S H G R C V N G Q C V C D E G Y T G E D C
TNChum   R D R Q C P R D C S N R G L C V D G Q C V C E D G F T G P D C
m        R D R R C P R D C S Q R G R C V D G Q C I C E D G F T G P D C
p        R E L R C P G D C S Q R G R C V D G R C V C E H G F A G P D C
n        G E L R C P D D C N N R G V C V N G Q C I C D E G F M G E N C

EGF #12
TNRhum   S E L R C P T D C S S R G L C V D G E C V C E E P Y T G E D C
TNRrat   S E L R C P T D C S S R G L C V D G E C V C E E P Y T G E D C
TNRchk   S Q L R C P A G C G S R G L C V D G E C I C E E G F G G E D C
TNCchk   G E L R C P N D C H N R G R C V E G R C V C D N G F M G E D C
TNChum   A E L S C P N D C H G Q G R C V N G Q C V C H E G F M G K D C
m        A E L S C P S D C H G H G R C V N G Q C I C H E G F T G K D C
p        A D L A C P S D C H G R G R C V N G Q C V C H E G F T G K D C
n        G E L R C P N D C K N R G R C V N G Q C I C D D G F K G E D C

EGF #13
TNRhum   R E L R C P G D C S G K G R C A N G T C L C E E G Y V G E D C
TNRrat   R E L R C P G D C S G K G Q C A N G T C L C Q E G Y A G E D C
TNRchk   S Q P R C P R D C S G R G H C D N G T C V C A E G Y A G E D C
TNCchk   G E L S C P N D C H Q H G R C V D G R C V C H E G F T G E D C
TNChum   K E Q R C P S D C H G Q G R C V D G Q C I C H E G F T G L D C
m        K E Q R C P S D C H G Q G R C E D G Q C I C H E G F T G L D C
p        G Q R R C P G D C H G Q G R C V D G Q C V C H E G F T G L D C
n        S E L R C P D D C N D R G R C I N G Q C V C A E G F T G E N C

EGF # 14. this TNR seq is a miserable fit.
TNRhum   G Q R Q C L N A C S G R G Q C E E G L C V C E E G Y Q G P D C
TNRrat   S Q R R C L N A C S G R G H C Q E G L C I C E E G Y Q G P D C
TNRchk   G W L R C P N A C S G R G V C Q D G L C I C E D G Y G G Q D C
TNCchk   R E R S C P N D C N N V G R C V E G R C V C E E G Y M G I D C
TNChum   G Q H S C P S D C N N L G Q C V S G R C I C N E G Y S G E D C
m        G Q R S C P N D C S N Q G Q C V S G R C I C N E G Y T G I D C
p        G Q R S C P N D C S N W G Q C V S G R C I C N E G Y S G E D C
n        D S L A C L N N C N D R G L C V N G Q C V C E E G F L G E D C

FN-III domains - HxB
HxB FN-III domain #1
TNRhum   SAVAPPEDLRVAGISDRSIELEWDGPMAVTE.YVISYQPTALGGLQLQQRVPG..DWSGVTITELEPGLTYNISVYAVISNILSLPITAKVAT
TNRrat   SAVTPPEDLRVAGISDRSIELEWDGPMAVTE.YVISYQPSL.GGLQLQQRVPG..DWSGVTITELEPGLTYNISVYAVISNILSLPITAKVAT
TNRchk   SAVAPPENLRVTGISDGSIELAWDSLGAATE.YVVSYQPAGPGGSQLQQRVPG..DWSTITITELEPGVAYNVSIYAVISDVLSSPVTTKVTT
TNCchk   SDVSPPTELTVTNVTDKTVNLEWKHENLVNE.YLVTYVPTSSGGLDLQFTVPG..NQTSATIHELEPGVEYFIRVFAILKNKKSIPVSARVAT
TNChum   SEVSPPKDLVVTEVTEETVNLAWDNEMRVTE.YLVVYTPTHEGGLEMQFRVPG..DQTSTIIQELEPGVEYFIRVFAILENKKSIPVSARVAT
m        SEVSPPKDLIVTEVTEETVNLAWDNEMRVTE.YLIMYTPTHADGLEMQFRVPG..DQTSTTIRELEPGVEYFIRVFAILENKRSIPVSARVAT
p        SQVSPPKDLIVTEVTEETVNLAWDNEMRVTE.YLIVYTPTHEDGLEMQFRVPG..DQTSTTIRELEPGVEYFIRVFAILENKKSIPVSARVAT
n        SEVSPPKDLTVTDVTTQSVNLEWANEMKVTE.YLITYIPTSPGGLELDFRVPG..DQTTATIQELEPGVEYFVRVFAILRNQRSIPVSARVAT

HxB FN-III domain #2
TNRhum   .HLSTPQGLQFKTITETTVEVQWEPFSFSFDGWEISFIPKNN.E..GGVIAQVPSDVTSFNQTGLKPGEEYIVNVV.ALKEQARSPPTSASVST
TNRrat   HLSTPQGLQFKTITETTVEVQWEPFSFSFDGWEISFTPKNN.E..GGVIAQLPSDVTSFNQTGLKPGEEYIVNVV.ALKEQARGPPTSASVST
TNRchk   NLATPQGLKFKTITETTVEVQWEPFSFPFDGWEISFIPKNN.E..GGVIAQLPSTVTTFNQTGLKPGEEYTVTVV.ALKDQARSPPASDSIST
TNCchk   .YLPAPEGLKFKSVRETSVQVEWDPLSISFDGWELVFRNMQKKDDNGDITSSLKRPETSYMQPGLAPGQQYNVSLH.IVKNNTRGPGLSRVITT
TNChum   .YLPAPEGLKFKSIKETSVEVEWDPLDIAFETWEIIFRNMNK.EDEGEITKSLRRPETSYRQTGLAPGQEYEISLH.IVKNNTRGPGLKRVTTT
m        .YLPAPEGLKFKSIKETSVEVEWDPLDIAFETWEIIFRNMNK.EDEGEITKSLRRPETSYRQTGLAPGQEYEISLH.IVKNNTRGPGLKKVTTT
p        .YLPTPEGLKFKSIKETSVEVEWDPLDIAFETWEIIFRNMNK.EDEGEITKSLRRPETTYRQTGLAPGQEYEISLH.IVKNNTRGPGLKRVTTT
n        .HLPTTDDLRFKSVKETSVEVEWDPLDISFDTWDLIIRNTK..EENGEISTSLQRPVTSYVQTGLAPGETYNFSIH.VVKNSTRGPGLAKVTTT

HxB FN-III domain #3
TNRhum   .VIDGPTQILVRDVSDTVAFVEWIPPRAKVDFILLKYGLVGGEGGRTTFRLQP..PLSQYSVQALRPGSRYEVSVSAVRGTNESDSATTQFTT
TNRrat   VIDGPTQILVRDVSDTVAFVEWTPPRAKVDFILLKYGLVGGEGGKTTFRLQP..PLSQYSVQALRPGSRYEVSISAVRGTNESDASSTQFTT
TNRchk   .LIDGPTQILVRDVSDTVAFVEWTPPRARVDAILLKYGLADGEGGRTTFRLQP..PLSQYSLQALRPGARYHLAVSALRGANESQPALAQFTT
TNCchk   .KLDAPSQIEAKDVTDTTALITWSKPLAEIEGIELTYGPKDVPGDRTTIDLSE..DENQYSIGNLRPHTEYEVTLISRRGDMESDPAKEVFVT
TNChum   .RLDAPSQIEVKDVTDTTALITWFKPLAEIDGIELTYGIKDVPGDRTTIDLTE..DENQYSIGNLKPDTEYEVSLISRRGDMSSNPAKETFTT
m        .RLDAPSHIEVKDVTDTTALITWFKPLAEIDSIELSYGIKDVPGDRTTIDLTH..EDNQYSIGNLRPDTEYEVSLISRRVDMASNPAKETFIT
p        .RLDAPSQIEAKDVTDTTALITWFKPLAEIDGIELTYGIKDVPGDRTTIDLTH..EENQYSIGNLKPDTEYEVSLISRRADMSSNPAKETFTT
n        .RLDAPSQVEVRDVTDSMALVTWFRPLAQIDGVILSYGTGSQP..PTVVELSE..DESQYSLGNLIPDTEYEVTLLSRRGLMTSDPVTETFTT

HxB FN-III domain #4
TNRhum   .EIDAPKNLRVGSRTATSLDLEWDNSEAEVQEYKVVYSTLAGEQYHEVLVPKGIGPTTRATLTDLVPGTEYGVGISAVMNSQQSVPATMNART
TNRrat   EIDAPKNLRVGSRTATSLDLEWDNSEAEAQEYKVVYSTLAGEQYHEVLVPKGIGPTTKTTLTDLVPGTEYGVGISAVMNSKQSIPATMNART
TNRchk   EIDAPKNLRVGSRTPASLELTWDNSEAEAHSYRVVYSTLAGEHYHEVLVPRDTGPTTRATLADLVPGTEYGIGISAVMDSQQSVPATMNART
TNCchk   .DLDAPRNLKRVSQTDNSITLEWKNSHANIDNYRIKFAPISGGDHTELTVPKGNQATTRATLTGLRPGTEYGIGVTAVRQDRESAPATINAGT
TNChum   .GLDAPRNLRRVSQTDNSITLEWRNGKAAIDSYRIKYAPISGGDHAEVDVPKSQQATTKTTLTGLRPGTEYGIGVSAVKEDKESNPATINAAT
m        .GLDAPRNLRRVSQTDNSITLEWRNVKADIDSYRIKYAPISGGDHAEIDVPKSQQATTKTTLTGLRPGTEYGIGVSAVKGDKESDPATINAAT
p        .GLDAPRNLRRISQTDNSITLEWRNGKAAADTYRIKYAPISGGDHAEVEVPRSPQTTTKATLTGLRPGTEYGIGVSAVKGDKESDPATINAAT
n        .DLDAPKNLRRVSQTDTTITLEWKNSQANVDLYRIKFAPLSTGDHAEITVPKSNQVTTKVTLTDLKPGTEYGIGVTAVKQDRESGPATINAAT
?? octopus?  DNWITLEWKNSRSSIDGYRIKYGPIKGGAHGEDMFPKRAGDTTWATITGLRPGTEY
?? fish?     DNSITLEWKNSRANVLNYRVKYGPLSGGEHGELVFPSGPQDTTQAKITGLAPGTEY

HxB FN-III domain #5
TNRhum   ,ELDSPRDLMVTASSETSISLIWTKASGPIDHYRITFTPSSGIASEVTVPKDR....TSYTLTDLEPGAEYIISVTAERGRQQSLESTVDAFT
TNRrat   ELDSPRDLMVTASSETSISLIWTKASGPIDHYRITFTPSSGISSEVTVPRDR....TSYTLTDLEPGAEYIISITAERGRQQSLESTVDAFT
TNRchk   ELDSPRDLLVTASTETSISLSWTKAMGPIDHYRVTFTPASGMASEVTVSRNE....SQLTLSELEPGTEYTISIIAERGRQQSLEATVDAFT
TNCchk   .DLDNPKDLEVSDPTETTLSLRWRRPVAKFDRYRLTYVSPSGKKNEMEIPVDS....TSFILRGLDAGTEYTISLVAEKGRHKSKPTTIKGST
TNChum   .ELDTPKDLQVSETAETSLTLLWKTPLAKFDRYRLNYSLPTGQWVGVQLPRNT....TSYVLRGLEPGQEYNVLLTAEKGRHKSKPARVKAST
m        .EIDAPKDLRVSETTQDSLTFFWTTPLAKFDRYRLNSSLPTGHSMEVQLPKDA....TSHVLTDLEPGQEYTVLLIAEKGRHKSKPARVKAST
p        .DLDPPKDFRVSELKESSLTLLWRTPLAKFDRYRLNYGLPSGQPVEVQLPRNA....TSYILRGLEPGQEYTILLTAEKGRHKSKPARVKAST
n        .DLDAPKDLQISGSSESTLSLRWKRPLAKFERYLISYISNTGKKNEIEVPGNV....NSFVLTGLDAGTEYSIAIVAEKGRHKSKPASVIGST
zFish    IPGAA....NTYILTGLNPGMLHTITLTAERGRKMSAPATLSAST
_____________________________________________

Alternatively spliced domains

TN-C FN-III domain Q ZebraFish UNIQUE!
zFish    DEEKPQVGNITISDVSWDSFSMSWDLDRGEVEGFLIEVSDPDGLSDGQNHTLSGQEF..SLAVTDLSPSTFYRVTLYGLYKGELLDPVFAEAIT

TNR alt splice domain R1 doesn't match anything
TNRhum   .GFRPISHLHFSHVTSSSVNITWSDPSPPADRLILNYSPRDEEEEMMEVSLDATKR..HAVLMGLQPATEYIVNLVAVHGTVTSEPIVGSITT
TNRrat   .GFRPISHLHFSHVTSSSVNITWSDPSPPADRLILNYSPRDEEEEMMEVLLDATKR..HAVLMGLQPATEYIVNLVAVHGTVTSEPIVGSITT
TNRchk   .GVRPITQLHFSQLTSSSVNITWSDPSPPADRLVLTYSPRDEEAP.QQLALDGTRR..HASLTGLRPSTEYLVSLVAVHGAVSSEPVTGSITT

HxB FN-III domain A1
TNCchk   EEEPELGNLSVSETGWDGFQLTWTAADGAYENFVIQVQQSDNPEETWNITVPGGQH..SVNVTGLKANTPYNVTLYGVIRGYRTKPLYVETTT
TNChum   EQAPELENLTVTEVGWDGLRLNWTAADQAYEHFIIQVQEANKVEAARNLTVPGSLR..AVDIPGLKAATPYTVSIYGVIQGYRTPVLSAEAST
m        EEVPSLENLTVTEAGWDGLRLNWTADDLAYEYFVIQVQEANNVETAHNFTVPGNLR..AADIPGLKVATSYRVSIYGVARGYRTPVLSAETST

HxB FN-III domain A2
TNChum   GETPNLGEVVVAEVGWDALKLNWTAPEGAYEYFFIQVQEADTVEAAQNLTVPGGLR..STDLPGLKAATHYTITIRGVTQDFSTTPLSVEVLT
m        GTTPNLGEVTVAEVGWDALTLNWTAPEGAYKNFFIQVLEADTTQTVQNLTVPGGLR..SVDLPGLKAATRYYITLRGVTQDFGTAPLSVEVLT

HxB FN-III domain A3
TNChum   EEVPDMGNLTVTEVSWDALRLNWTTPDGTYDQFTIQVQEADQVEEAHNLTVPGSLR..SMEIPGLRAGTPYTVTLHGEVRGHSTRPLAVEVVT

HxB FN-III domain A4
TNChum   EDLPQLGDLAVSEVGWDGLRLNWTAADNAYEHFVIQVQEVNKVEAAQNLTLPGSLR..AVDIPGLEAATPYRVSIYGVIRGYRTPVLSAEAST
m        EDLPQLGGLSVTEVSWDGLTLNWTTDDLAYKHFVVQVQEANNVEAAQNLTVPSSLR..AVDIPGLKADTPYRVSIYGVIQGYRTPMLSTDVST
r9       SEIPELEGLTVTEVSWDSLTLNWTTDDLAYKHFIIQVQEANNVEAARNLTVSGSLR..VVDIPGLKADTPYTVSIYGVIQGYRTPMLSADVST
Only 12 mismatches between mouse and rat; 27 mismatches (63 identities) with human

HxB FN-III domain B
TNCchk   GAHPEVGELTVSDITPESFNLSWTTTNGDFDAFTIEIIDSNRLLEPMEFNISGNSR..TAHISGLSPSTDFIVYLYGISHGFRTQAISAAATT
TNChum   AKEPEIGNLNVSDITPESFNLSWMATDGIFETFTIEIIDSNRLLETVEYNISGAER..TAHISGLPPSTDFIVYLSGLAPSIRTKTISATATT
m        AREPEIGNLNVSDVTPKSFNLSWTATDGIFDMFTIEIIDSNRLLQTAEHNISGAER..TAHISGLPPSTDFIVYLSGIAPSIRTKTISTTATT
r        RKEPEIGNLNISDVTPESFNLSWTATDRIFDMFTIEIIDSNRLLQTAEHNISGAER..TAHISGLPPSTDFIVYLSGIAPSIRTKTISTTATT
p        AGEPEIGNLSVSDITPESFSLSWTATEGAFETFTIEIIDSNRFLETMEYNISGAER..TAHISGLRPGNDFIVYLSGLAPGIQTKPISATATT

HxB FN-III domain AD2
TNCchk   VEEPLLSKLTVSNATSDSMLLMWEAQDNAFDHFILEVRNSDSPLDSLVQIVPGASR..HYVVTNLKAATNYTVQLHGVVDGQGGQTLTALATT

HxB FN-III domain AD1
TNChum   EPKPQLGMLIFSNITPKSFNMSWTTQAGLFAKIVINVSDAHSLHESQQFTVSGDAK..QAHITGLVENTGYDVSVAGTTLAGDPTRPLTAFVIT
TNCchk   EAEPQLGTLTLTNVTPDSFNLSWTTRDGPFAKFVIHVRDSFAAHEPQELTVSGGAR..SAHISGLLDYTGYDINIKGTTDAGVHTEPLTAFVMT

HxB FN-III domain C
TNChum   EALPLLENLTISDINPYGFTVSWMASENAFDSFLVTVVDSGKLLDPQEFTLSGTQR..KLELRGLITGIGYEVMVSGFTQGHQTKPLRAEIVT
TNCchk   EAMPPLENLTVSDINPYGFTVSWMASENAFDSFLVVVVDSGKLLDPQEFLLTGAQR..QLKLKGLITGIGYEVMLYGFAKGHQTKPLSTVAVT

HxB FN-III domain D
TNCchk   EAEPEVDNLLVSDATPDGFRLSWTADDGVFDSFVLKIRDTKRKSDPLELIVPGHER..THDITGLKEGTEYEIELYGVSSGRRSQPINSVATT
TNChum   EAEPEVDNLLVSDATPDGFRLSWTADEGVFDNFVLKIRDTKKQSEPLEITLLAPER..TRDITGLREATEYEIELYGISKGRRSQTVSAIATT
m        EAEPEVDNLLVSDATPDGFRLSWTADEGIFDSFVIRIRDTKKQSEPQEISLPSPER..TRDITGLREATEYEIELYGISRGRRSQPVSAIATT
r        EAEPEVDNLLVSDATPDGFCLSWTADEGIFDSFVIRIRDTKKQSEPQEITLPSPDR..TRDITGLREATEYEIELYGISRGRRSQPVSAIATT
p        EAEPEVDNLLVSDATPDGFRLSWTADEGVFDSFVLKIRDTKKQSEPLEITLLASER..TRDITGLREATEYEIELYGISSGKRSQPVSAIATT
_____________________________________________________

Last three invariant FN-III domains

Human HxB FN-III domain #6
TNRhum   .GIDPPKDITISNVTKDSVMVSWSPPVASFDYYRVSYRPTQVGRLDSSVVPNTVTEF...TITRLNPATEYEISLNSVRGREESERICTLVHT
TNRrat   .GIDPPKNITISNVTKDSLTVSWSPPVAPFDYYEYPIDHPS.GRLDSSVVPNTVTEF...TITRLYPASQYEISLNSVRGREESERICTLVHT
TNRchk   .GMDAPKDLRVGNITQDSMVIYWSPPVAPFDHYRISYRAAE.GRTDSTAIGNDATEY...IMRLLQPATKYEIGVKSVRGREESEVASITTYT
TNXhum   PVLESPRDLQFSEIRETSAKVNWMPPPSRADSFKVSYQLADGGEPQSVQVDGQARTQK...LQGLIPGARYEVTVVSVRGFEESEPLTGFLTT
TNXmus   PVLESPRDLQFSDIGETSAKVKWVPPTSRVDSFKISYQLADGGEPQSVQVDGRTQTQI...LQGLIPDTRYEVTVVSVRGFEESEPLTGFLTT
TNXpig   MRGFEESEPLTGFLTT
TNCchk   .VVGSPKGISFSDITENSATVSWTPPRSRVDSYRVSYVPITGGTPNVVTVDGSK.TRT..KLVKLVPGVDYNVNIISVKGFEESEPISGILKT
TNChum   .AMGSPKEVIFSDITENSATVSWRAPTAQVESFRITYVPITGGTPSMVTVDGTK.TQT..RLVKLIPGVEYLVSIIAMKGFEESEPVSGSFTT
m        .AMGSPKEIMFSDITENAATVSWRAPTAQVESFRITYVPMTGGAPSMVTVDGTD.TET..RLVKLTPGVEYRVSVIAMKGFEESDPVSGTLIT
r        .AMGSPKEIMFSDITENAATVSWRAPTAQVESFRITYVPVTGGPPSMVTVDGTD.TET..RLVRLTPGVEYHVSVIAMKGFEESDPVSGSLIT
p        .AMGSPKEITFSDITENSATVSWMVPTAQVESFRITYVPITGGAPSVVTVDGTK.TQT..RLLRLLPGVEYLVSVIAVKGFEESEPVSGTLTT
n        .AVGAPKGLSFSDITENSATVSWSAPQTRVDSFKVTYVPASGGVPQTVTVDGTK.TRT..TLVKLTPGVEYIVTVVSVKALDESGPISGPLTT
zFish    .GLGAPKGIRFSDVTDTSATVHWTMPHTRVDNYRVIYVPIQGGSPLTLRVDGGE.SQA..MLSNLTPGVTYQVTVIAVKGLEESEPGSERVTT

Human HxB FN-III domain #7
TNRhum   .AMDNPVDLIATNITPTEALLQWKAPVGEVENYVIVLTHFAVAGETILVDGVSEE....FRLVDLLPSTHYTATMYATNGPLTSGTISTNFST
TNRrat   AMDSPMDLIATNITPTEALLQWKAPMGEVENYVIVLTHFAMAGETILVDGVSEE....FQLVDLLPRTHYTVTMYATSGPLVSGTIATNFST
TNRchk   .AMDAPLGVTATNITPTEALLQWNPPLMDVESYVLVLTR..HTGETILVDGINQE....YQLTNLQPSTTYTVAMYATNGPLTSQTISTNFTT
TNXhum   .VPDGPTQLRALNLTEGFAVLHWKPPQNPVDTYDVQVTAPGAPPLQAETPGSAVD....YPLHDLVLHTNYTATVRGLRGPNLTSPASITFTT
TNXmus   .VPDGPTQLRALNLTDGSALLHWKPPHKPVDKYDVEVESPGAPPLQASAPGSAVD....YPLTDLALDTNYTATVRGLRGPNFTSPASITFTT
TNXpig   .VPDGPTQLRALNLTEGSALLHWKPPQTPVDTYDVKVTASGAPSLQGSAPGSAVD....YPLHGLELHTNYTATLRGLRGPNLTSPASITFTT
TNCchk   .ALDSPSGLVVMNITDSEALATWQPAIAAVDNYIVSYSSEDEPEVTQMVSGNTVE....YDLNGLRPATEYTLRVHAVKDAQKSETLSTQFTT
TNChum   .ALDGPSGLVTANITDSEALARWQPAIATVDSYVISYTGEKVPEITRTVSGNTVE....YALTDLEPATEYTLRIFAEKGPQKSSTITAKFTT
m        .ALDGPSGLLIANITDSEALAMWQPAIATVDSYVISYTGERVPEVTRTVSGNTVE....YELHDLEPATEYILSIFAEKGQQKSSTIATKFTT
r        .ALDGPSGLLTANITDSEALAMWQPAIATVDSYVISYTGERVPEITRTVSGNTVE....LELHDLEPATEYTLSVFAEKGHQKSSTTATKFTT
p        .ALDGPSGLVTANITDSEALAMWQPAIAPVDHYVISYTGDRVPEITRTVSGNTVE....YALTNLEPATEYTLRIFAEKGPQKSSTITTKFTT
n        .ALDSPSGLKAVNVTETEAIALWQPSIASVDNYVLSYAADNDPETTKTISGNNVE....SDITGLQPSTQYTLTIYAVRGPQKSATMSTKFTT
TNCzfsh  .ALDKPRGLTAVNISDTEALLLWQPSIATVDGYVITYSADSVAPVMERVSGNTVE....FEMSSLTPATLYTVKVYAFRDTAKSAATSTDFTT

Human HxB FN-III domain #8
TNRhum   .HLDPPANLTASEVTRQSALISWQPPRAEIENYVLTYKSTDGSRKELIVDAED....TWIRLEGLLENTDYTVLLQAAQDTTWSSITSTAFTT
TNRrat   .LLDPPANLTASEVTRQSALISWQPPRAAIENYVLTYKSTDGSRKELIVDAED....TWIRLEGLSENTDYTVLLQAAQEATRSSLTSTIFTT
TNRchk   .LLDPPTNLTASEVTRRSALLSWVPPVGDIENYILTYRSTDGSRKELIVDAED....TWIRLEGLSETTQYTVRLQAAQNAMRSGFISTTFTT
TNXhum   .GLEAPRDLEAKEVTPRTALLTWTEPPVRPAGYLLSFHTPGGQNQEILLPGGI....TSHQLLGLFGSTSYNARLQAMWGQSLLPPVSTSFTT
TNXmus   .GLKPPQDLEAKEVTPRTALLTWTEPEVPPTGYLLSFDTPGGQIQEILLPAGT....TSHRLLRLFPSTFYSAQLRAIWGESLTPPVLTSFTT
TNXpig   .GLEAPQDLEAKEVTPRTVLLTWTAPQVPPTGYLITFNTPGGQTQEILLPGGV....TSHRLQGLFPSTPYSAWLRAMWGESFTPPVSTSFTT
TNYchk   ..GAPGTLWVGTLWPRSAHLHWAPPHVPPEGYNLIYGPPGGPVKTLQLPPEA....TSKELWGLEPSGRYRVQL...WGRGL.EPLETTFDT
TNCchk   .GLDAPKDLSATEVQSETAVITWRPPRAPVTDYLLTYESIDGRVKEVILDPET....TSYTLTELSPSTQYTVKLQALSRSMRSKMIQTVFTT
TNChum   .DLDSPRDLTATEVQSETALLTWRPPRASVTGYLLVYESVDGTVKEVIVGPDT....TSYSLADLSPSTHYTAKIQALNGPLRSNMIQTIFTT
m        .DLDSPREFTATEVQSETALLTWRPPRASVTGYLLVYESVDGTVKEVIVGPDT....TSYSLADLSPSTHYSARIQALSGSLRSKLIQTIFTT
r        .DLDSPRELTATEVQSETAFLTWRPPRASVTGYLLVYESVDGTVKEVIVGPDT....TSYSLADLSPSTHYTARIQALSGSLRSKLIQTIFTT
p        .DLDSPRDLTATEVQSETALLTWRPPRASVTGYLLVYESVDGTLKEVVVGPET....TSYSLSGLSPSTHYTARIQALNGPLRSKMSQTVFTT
n        .ALDAPRDLAASEIQSETALLTWRPPRSIITGFILIYEAVDGTLKEVILGPDM....TSYNLVDLSPSTQYTVRLLAMNGDVRSKTIQTVFTT
TNCzfsh  .DVDAPQNLAASNIQTETAMLTWKPPRADISGYILSFESADGVVKEVVLSPTA....TFYSMSQLTASTEYTVKLQAIAGPKRSRVISTVFLT

Hexabrachion terminal knob - fibrinogen-like domain

Short linker (according to Doolittle this is really part of the fibrinogen domain)
TNRhum   GGRVFPHP
TNRrat   GGRVFSHP
TNRchk   GGRVFANP
TNXhum   GGLRIPFP
TNXmus   GGQRIPFP
TNXpig   GGLRIPFP
TNYchk   PPLPHPHP
TNCchk   TGLLYPYP
TNChum   IGLLYPFP
TNCmus   IGLLYPFP
TNCrat   IGLLYPFP
TNCpig   IGLLYPFP
TNCnwt   TGLLYPFP
TNCzfsh  IGVLYKHP

TNRhum   QDCAQHLMNGDTLSGVYPIFLNGELSQKLQVYCDMTTDGGGWIVFQRRQNGQTDFFRKWADYRVGFGNVEDEFW
TNRrat   QDCAQHLMNGDTLSGVYTIFLNGELSHKLQVYCDMTTDGGGWIVFQRRQNGQTDFFRKWADYRVGFGNLEDEFW
TNRchk   QDCAQHLMNGDTLSGVYTISINGDLSQRVQVFCDMSTDGGGWIVFQRRQNGLTDFFRKWADYRVGFGNLEDEFW
TNXhum   RDCGEEMQNGAGASRTSTIFLNGNRERPLNVFCDMETDGGGWLVFQRRMDGQTDFWRDWEDYAHGFGNISGEFW
TNXmus   RDCGEELKNGPSASKTTTIFLNGNRERPLDVFCDMETDGGGWLVFQRRMDGQTDFWRDWEEYAHGFGNISGEFW
TNXpig   RDCGEEMQNGVSTSRTTTIFLNGNRERPLNVFCDMETDGGGWLVFQRRMDGKTDFWRDWEDYAHGFGNISGEFW
TNYchk   RDCAEEQLNGPGPSREVLIFLGGDRQRPLHVFCDMESNGGGWLVFQRRMDGGTDFWRGWEEYVHGFGNVSGEFW
TNCchk   KDCSQALLNGEVTSGLYTIYLNGDRTQPLQVFCDMAEDGGGWIVFLRRQNGKEDFYRNWKNYVAGFGDPKDEFW
TNChum   KDCSQAMLNGDTTSGLYTIYLNGDKAQALEVFCDMTSDGGGWIVFLRRKNGRENFYQNWKAYAAGFGDRREEFW
TNCmus   RDCSQAMLNGDTTSGLYTIYINGDKTQALEVYCDMTSDGGGWIVFLRRKNGREDFYRNWKAYAAGFGDRREEFW
TNCrat   RDCSQAMLNGDTTSGLYTIYINGDKTQALEVYCDMTSDGGGWIVFLRRKNGREDFYRNWKAYATGFGDRRE
TNCpig   RDCSQAMLNGDTTSGLYTIYVNNDKAQKLEVFCDMTSDSGGWIVFLRRKNGREDFYRNWKAYAAGFGDLKEEFW
TNCnwt   KDCSQALLNGETASGLYTIYLNGDKAKPQEEYCDMSEYGGGWIVFLRRVDGKEDFYRNWKTYTAGFGDPTKEFF
TNCzfish KDCSQALLNGDTTSGLYTIYLRGDESQPLQVYCDMTTDGGGWIVFVRRQSGKVEFFRNWKNYTAGFGDLNDEFW

TNRhum   LGLDNIHRITSQGRYELRVDMRDGQEAAFASYDRFSVEDSRNLYKLRIGSYNGTAGDSLSYHQGRPFSTEDRDNDVAV
TNRrat   LGLDNYHRITAQGRYELRVDMRDGQEAVFAYYDKFAVEDSRSLYKLRIGGYNGTAGDSLSYHQGRPFSTEDRDNDVAV
TNRchk   LGLDNIHKITSQGRYELRIDMRDGQEAAYAYYDKFSVGDSRSLYKLRIGDYNGTSGDSLTYHQGRPFSTKDRDNDVAV
TNXhum   LGNEALHSLTQAGDYSIRVDLRAGDEAVFAQYDSFHVDSAAEYYRLHLEGYHGTAGDSMSYHSGSVFSARDRDPNSLL
TNXmus   LGNEALHSLTQAGDYSLRVDLRAGKEAVFAQYDFFRVDSAKENYRLHLEGYHGTAGDSMSYHSGSVFSARDRDPNNLL
TNXpig   LGNEALHSLTKAGDYSLRVDLRAGEEAVFAQYESFQVDSAAEHYRLHLEGYHGTAGDSMSYHSGSVFSARDRDPNNLL
TNYchk   LGNAALHTLTASGPTELRVDLRTPSDSAFARYRDFAVSGPEDNFRLHLGAYSGTAGDALSYHAGSPFSTRDHDPRGRP
TNCchk   IGLENLHKISSQGQYELRVDLRDRGETAYAVYDKFSVGDAKTRYRLRVDGYSGTAGDSMTYHNGRSFSTFDKDNDSAI
TNChum   LGLDNLNKITAQGQYELRVDLRDHGETAFAVYDKFSVGDAKTRYKLKVEGYSGTAGDSMAYHNGRSFSTFDKDTDSAI
TNCmus   LGLDNLSKITAQGQYELRVDLQDHGESAYAVYDRFSVGDAKSRYKLKVEGYSGTAGDSMNYHNGRSFSTYDKDTDSAI
TNCpig   LGLDALSKITAQGQYELRVDLRDHGETAYAVYDRFSVGDARTRYKLKVEGYSGTAGDSMAYHNGRSFSTFDKDTDSAI
TNCnwt   LGLENLHQITSQGQYELRVDLRDGAETAYAVYDKFFVGDSKTAYRLKVDGYSGTAGDSMTYHSGALFSTFDKDNDSAI
TNCzfish LGLSNLHKITSFGQYELRVDLRDKGESAYAQYDKFSISEPRARYKVHVGGYSGTAGDSMTYHHGRPFSTYDNDNDIAV

TNRhum   TNCAMSYKGAWWYKNCHRTNLNGKYGESRHSQGINWYHWKGHEFSIPFVEMKMRPYNHRLMAGRKRQSLQF
TNRrat   TNCAMSYKGAWWYKNCHRTNLNGKYGESRHSQGINWYHWKGHEFSIPFVEMKMRPYIHRLTAGRKRRALKF
TNRchk   TNCAMSYKGAWWYKNCHRTNLNGKYGESRHSQGINWYHWKGHEFSIPFVEMKMRPYNHRNISGRKRRSLQL
TNXhum   ISCAVSYRGAWWYRNCHYANLNGLYGSTVDHQGVSWYHWKGFEFSVPFTEMKLRPRNFRSPAGGG
TNXmus   ISCAVSYRGAWWYRNCHYANLNGLYGSTVDHQGVSWYHWKGFEFSVPFTEMKLRPRNFQVPTRGT
TNYchk   RPCAVAYTGAWWYRNCHYANLNGRYGVPYDHQGINWYPWKGFEYSIPFTEMKLRPQRD
TNXpig   ISCAVSYRGAWWYRNCHYANLNGLYGSTVDHQGVSWYYWKGFEFSVPFTEMKLRPRSFRPPGRGG
TNCchk   TNCALSYKGAFWYKNCHRVNLMGRYGDNNHSQGVNWFHWKGHEYSIQFAEMKLRPSSFRNLEGRRKRA
TNChum   TNCALSYKGAFWYRNCHRVNLMGRYGDNNHSQGVNWFHWKGHEHSIQFAEMKLRPSNFRNLEGRRKRA
TNCpig   TNCALSYKGAFWYKNCHRVNLMGRYGDNSHSQGVNWFHWKGHEYSIQFAEMKLRPSNFRNLEGRRKRA
TNCnwt   TNCALSYKGAFWYKNCHRVNLMGRYGDNSHSQGVNWFHWKGHEYSIQFAEMKVRPVSFRNLEGRRRRA
TNCzfish TNCALSYKGAFWYKNCHRVNIMGRYGDNSHSKGVNWFHWKGHEHSVEFAEMKIRPANFRNFEGRKKRS
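
Mismatch and identity counts like those quoted for FN-III domain A4 can be recomputed from any two rows of the alignment. The following is only a minimal sketch in Python, not the method used to produce the numbers above. The function name compare_aligned is hypothetical, and it assumes that "." marks a gap, that spaces are layout only, and that a column containing a gap in either row is tallied separately rather than counted as a mismatch. The two example pairs are the short linker rows from the fibrinogen-like domain.

```python
def compare_aligned(row_a, row_b, gap="."):
    """Return (identities, mismatches, gapped) for two aligned rows.

    Hypothetical helper: assumes both rows come from the same alignment
    block, so they have equal length once layout spaces are removed.
    """
    # Spaces only separate residues visually in this alignment; drop them.
    a = row_a.replace(" ", "")
    b = row_b.replace(" ", "")
    if len(a) != len(b):
        raise ValueError("rows must have the same aligned length")

    identities = mismatches = gapped = 0
    for x, y in zip(a, b):
        if x == gap or y == gap:
            # Assumption: a gap in either row is not scored as a mismatch.
            gapped += 1
        elif x == y:
            identities += 1
        else:
            mismatches += 1
    return identities, mismatches, gapped


# Short linker rows copied from the alignment.
linker = {
    "TNRhum": "GGRVFPHP",
    "TNRrat": "GGRVFSHP",
    "TNChum": "IGLLYPFP",
    "TNCchk": "TGLLYPYP",
}

print(compare_aligned(linker["TNRhum"], linker["TNRrat"]))  # (7, 1, 0)
print(compare_aligned(linker["TNChum"], linker["TNCchk"]))  # (6, 2, 0)
```

Rows that were entered with a missing leading "." will differ in length from their neighbours and raise ValueError, which is a useful signal that the row needs re-aligning by hand before it is compared.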