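The present invention provides novel hemagglutinin (HA) protein-based influenza vaccines that are easily manufactured, potent, and elicit broadly neutralizing influenza antibodies against the stem region of the influenza HA protein. In particular, the invention provides modified influenza HA stem-region proteins in the pre-fusion conformation, and portions thereof, that are useful for inducing the production of neutralizing antibodies. The invention also provides novel nanoparticle (np)-based vaccines that express the influenza HA protein on their surface. Such nanoparticles comprise fusion proteins, each of which comprises a monomeric subunit of ferritin joined to an antigenic or immunogenic portion from the stem region of an influenza HA protein. Because such nanoparticles display the influenza HA stem region on their surface, they can be used to vaccinate an individual against influenza virus.
Background of the Invention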
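The protective immune response induced by vaccination against influenza virus is directed primarily at the viral HA protein, a glycoprotein on the surface of the virus that is responsible for interaction of the virus with host cell receptors. The HA protein on the virus surface is a trimer of HA monomers, each of which is enzymatically cleaved to yield an amino-terminal HA1 polypeptide and a carboxy-terminal HA2 polypeptide. The globular head consists exclusively of the major portion of the HA1 polypeptide, whereas the stem that anchors the HA protein in the viral lipid envelope is composed of HA2 and part of HA1. The globular head of the HA protein includes two domains: the receptor binding domain (RBD), a domain of about 148 amino acid residues that includes the sialic acid-binding site, and the vestigial esterase domain, a smaller region of about 75 amino acid residues just below the RBD. The globular head includes several antigenic sites that contain immunodominant epitopes. Examples include the Sa, Sb, Ca1, Ca2 and Cb antigenic sites (see, for example, Caton AJ et al, 1982, Cell 31, 417-427). The RBD-A region includes the Sa antigenic site and part of the Sb antigenic site.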
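Antibodies against influenza often target variable antigenic sites in the globular head of HA that surround a conserved sialic acid-binding site, and therefore neutralize only antigenically closely related viruses. The variability of the HA head is due to the constant antigenic drift of influenza viruses and is responsible for seasonal epidemics of influenza. In contrast, the HA stem is highly conserved and undergoes little antigenic drift. Unfortunately, unlike the immunodominant head, the conserved HA stem is not very immunogenic. Furthermore, gene segments of the viral genome can undergo reassortment in host species (antigenic shift), creating new viruses with altered antigenicity that are capable of becoming pandemic [Salomon, R. et al. Cell 136, 402-410 (2009)]. Until now, influenza vaccines have been updated each year to reflect the predicted HA and neuraminidase (NA) of the upcoming circulating viruses.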
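Recently, an entirely new class of broadly neutralizing antibodies against influenza virus was isolated that recognizes the highly conserved HA stem [Corti, D. et al. J Clin Invest 120, 1663-1673 (2010); Ekiert, D.C. et al. Science 324, 246-251 (2009); Kashyap, A.K. et al. Proc Natl Acad Sci USA 105, 5986-5991 (2008); Okuno, Y. et al. J Virol 67, 2552-2558 (1993); Sui, J. et al. Nat Struct Mol Biol 16, 265-273 (2009); Ekiert, D.C. et al. Science 333, 843-850 (2011); Corti, D. et al. Science 333, 850-856 (2011)]. Unlike strain-specific antibodies, those antibodies are capable of neutralizing multiple antigenically distinct viruses, and inducing such antibodies has therefore become a focus of next-generation universal vaccine development [Nabel, G.J. et al. Nat Med 16, 1389-1391 (2010)]. However, robustly eliciting antibodies with such a heterologous neutralizing profile by vaccination has been difficult [Steel, J. et al. MBio 1, e0018 (2010); Wang, T.T. et al. PLoS Pathog 6, e1000796 (2010); Wei, C.J. et al. Science 329, 1060-1064 (2010)]. Removing the immunodominant head region of HA, which contains competing epitopes, by genetic manipulation and stabilizing the resulting stem domain is one potential way to improve the elicitation of these broadly neutralizing stem antibodies.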
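Current vaccine strategies for influenza use either chemically inactivated or live attenuated influenza viruses. Both vaccines are generally produced in embryonated eggs, which presents major manufacturing limitations owing to the time-consuming process and limited production capacity. Another, more critical limitation of current vaccines is their highly strain-specific efficacy. These challenges became glaring during the emergence of the 2009 H1N1 pandemic, validating the need for new vaccine platforms capable of overcoming these limitations. Virus-like particles (VLPs) represent one such alternative approach and are currently being evaluated in clinical trials [Roldao, A. et al. Expert Rev Vaccines 9, 1149-1176 (2010); Sheridan, C. Nat Biotechnol 27, 489-491 (2009)]. Instead of embryonated eggs, VLPs, which typically comprise HA, NA and matrix protein 1 (M1), can be mass-produced in mammalian or insect cell expression systems [Haynes, J.R. Expert Rev Vaccines 8, 435-445 (2009)]. The advantages of this approach are its particulate, multivalent nature and the authentic display of properly folded, trimeric HA spikes that faithfully mimic the infectious virion. On the other hand, by the nature of their assembly, enveloped VLPs contain a small but finite host cell component that may present potential safety and immunogenicity challenges following repeated use of this platform [Wu, C.Y. et al. PLoS One 5, e9784 (2010)]. Moreover, the immunity induced by VLPs is essentially the same as that induced by current vaccines, and thus is unlikely to significantly improve the potency and breadth of vaccine-induced protective immunity. In addition to VLPs, recombinant HA protein has also been evaluated in humans [Treanor, J.J. et al. Vaccine 19, 1732-1737 (2001); Treanor, J.J. JAMA 297, 1577-1582 (2007)], though its ability to induce protective neutralizing antibody titers was limited. The recombinant HA proteins used in those trials were produced in insect cells and might not preferentially form native trimers [Stevens, J. Science 303, 1866-1870 (2004)].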
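Although several alternatives to conventional influenza vaccines exist, advances in biotechnology over the past decades have allowed the engineering of biological materials to be exploited for the generation of novel vaccine platforms. Ferritin, an iron storage protein found in almost all living organisms, is an example that has been extensively studied and engineered for a number of potential biochemical/biomedical purposes [Iwahori, K. U.S. Patent 2009/0233377 (2009); Meldrum, F.C. et al. Science 257, 522-523 (1992); Naitou, M. et al. U.S. Patent 2011/0038025 (2011); Yamashita, I. Biochim Biophys Acta 1800, 846-857 (2010)], including as a potential vaccine platform for displaying exogenous epitope peptides [Carter, D.C. et al. U.S. Patent 2006/0251679 (2006); Li, C.Q. et al. Industrial Biotechnol 2, 143-147 (2006)]. Its use as a vaccine platform is particularly interesting because of its self-assembly and its multivalent presentation of antigen, which induces stronger B cell responses than monovalent forms and also induces T-cell independent antibody responses [Bachmann, M.F. et al. Annu Rev Immunol 15, 235-270 (1997); Dintzis, H.M. et al. Proc Natl Acad Sci USA 73, 3671-3675 (1976)]. Further, the molecular architecture of ferritin, which consists of 24 subunits assembling into an octahedral cage with 432 symmetry, has the potential to display multimeric antigens on its surface.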
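There remains a need for an efficacious influenza vaccine that provides robust protection against influenza virus. In particular, there remains a need for an influenza vaccine that protects individuals from heterologous strains of influenza virus, including evolving seasonal and pandemic influenza virus strains of the future. The present invention meets this need by providing a novel nanoparticle-based vaccine consisting of a novel HA stabilized stem (SS) that lacks the variable immunodominant head region and is genetically fused to the surface of nanoparticles (gen6 HA-SS np), resulting in an influenza vaccine that is easily manufactured, potent, and elicits broadly heterosubtypic protective antibodies.
Brief Description of the Drawings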
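Figure 1a shows that structure-based removal of the HA head allows the antigenicity of the stem immunogen to be preserved. The ribbon models depict the HA-SS design pathway, starting from a model of the HA ectodomain fused to the T4 foldon trimerization domain (green, below the HA ectodomain). The last three HA-SS designs (Gen4-6) were genetically fused to ferritin nanoparticles (lower panels). One monomer of each HA trimer is shaded. The core-stabilizing mutations used to create Gen6 are shown as spheres. The percent trimerization (including foldon) and the antigenic affinity constants (KD, M) for the indicated mAbs are shown beneath each HA-SS immunogen design. ND, not determined; NA, not applicable. Figure 1b shows surface representations of the HA portions of the H1N1 HA ectodomain without the foldon domain (PDB ID 1GBN), Gen4 HA-SS and Gen6 HA-SS, respectively, shaded by sequence conservation with H5N1 2004 VN (dark gray, variable; white, conserved). The percentage of HA stem in the immunogen, excluding the foldon domain, increases for Gen4 and Gen6 HA-SS, respectively. *This immunogen was evaluated further and is referred to as H1-SS-np in the Examples section of this disclosure. Figure 1c shows ribbon diagrams depicting cross-sectional views of the replacement of the Glu103-Lys51 salt bridge with the Leu103-Met51 hydrophobic pair in Gen6 HA-SS. The dashed line (left) indicates the position of the cross-section. Figure 1d shows the antigenicity of Gen6 HA-SS presented in its soluble and nanoparticle forms. The three panels show ELISA binding of one head-specific antibody (CH65) and three stem-specific antibodies (CR6261, CR9114, FI6v3) to Gen6' HA-SS (left), H1-SS-np (middle) and H1-SS-np' (right). Antibody concentrations ranged from 10 to 6.40×10⁻⁴ μg/mL. Figures 1e and 1f show Octet sensorgrams of the binding of H1-SS-np (Figure 1e) and H1-SS-np' (Figure 1f) to HA stem-directed bNAbs. H1-SS-np was immobilized on the Octet probe and incubated with different concentrations of the antigen-binding fragment (Fab) or scFv of the stem-directed antibody indicated at the top of each sensorgram. Figure 1g shows stimulation of the wild-type, IGHV1-69 v-gene-reverted CR6261 BCR (left) versus the double Ile53Ala/Phe54Ala CDRH2 mutant BCR (right) by anti-IgM (= total receptor activity), empty np, HA-np (HA containing the Y98F mutation to abolish non-specific binding to sialic acid) and H1-SS-np', measured by flow cytometry as the ratio of the Ca2+-bound to Ca2+-unbound states of the Ca2+-sensitive dye FuraRed.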
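Figure 2a shows that trimeric, but not nanoparticle, stem immunogens display splaying of the HA stem. The left panel depicts a ribbon diagram of the crystal structure of the complex between Gen3 HA-SS (black and gray) and mAb C179 (labeled). The middle panel of Figure 2a shows a schematic of the splaying, comparing the crystal structure (light) with the model (dark) in two different views (side and bottom). The right panel of Figure 2a shows an overlay of the Gen3 HA-SS/C179 binding interface with the 1957 H2N2 HA/C179 binding interface (PDB ID 4HLZ). Antibody CDR loops are labeled "H" for the heavy chain and "L" for the light chain. The heavy chain framework 3 loop is labeled FR3. RMSD, root-mean-square deviation. Figure 2b depicts the same panel format as Figure 2a for Gen4 HA-SS and, in the right panel, an overlay of the Gen4 HA-SS/CR6261 heavy chain binding interface with the 1918 H1N1 HA/CR6261 binding interface (PDB ID 3GBN). Figure 2c shows the cryo-electron microscopy analysis of H1-SS-np. The first two panels show the Gen4 HA-SS crystal structure (cropped) and the H1-SS-np model, respectively, each fitted into the cryo-electron microscopy map of one H1-SS-np spike. The next two panels of Figure 2c show two different views of the entire H1-SS-np model fitted into the H1-SS-np cryo-electron microscopy map. Figure 2d shows the characterization of the soluble and nanoparticle forms of influenza HA and HA-SS by size exclusion chromatography of HA, Gen4 HA-SS and H1-SS-np' (left) and of HA np, Gen4 HA-SS-np, H1-SS-np' and H1-SS-np (right), obtained with Superdex 200 10/300 and Superose 6 10/300 columns, respectively. Figure 2e shows negative-stain transmission electron microscopy images of HA-np (left), Gen4 HA-SS-np (middle) and H1-SS-np (right). Images were originally recorded at 67,000× magnification. Figure 2f shows a cryo-EM image of a field of H1-SS-np. Arrows indicate some ring-like nanoparticles; the scale bar is 20 nm. Figure 2g shows a size analysis of H1-SS-np by a 2D radial density profile (curve) of the global circular average of the nanoparticles (inset). The profile shows a two-layer structure with a base peak centered at about [...] from the particle center and a second peak spanning a range of about [...] to [...]. The difference in peak heights is consistent with a more continuous protein layer topped by a layer containing several discrete spikes. Figure 2h shows reference-free 2D class averages of H1-SS-np, with no symmetry imposed. The classes indicate different views of particles having a protein shell and protruding spike densities, and the views are consistent with the expected octahedral symmetry. Figure 2i shows the resolution assessment of the H1-SS-np 3D reconstruction by Fourier shell correlation (FSC) plot. Following the gold-standard procedure as implemented in the RELION software package, FSC (0.143) was used as the cutoff.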
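Figure 3a shows the immune responses of immunized mice and ferrets. The left panel shows antibody endpoint titers against a diverse range of HA proteins, and the right panel shows neutralization titers of sera from mice (n=10 per group) immunized with SAS-adjuvanted H1-SS-np. Figure 3b shows the immune responses of ferrets immunized with SAS-adjuvanted empty np (n=5), H1-SS-np' (n=6), 2006-07 TIV (n=6) or H5 HA (2x DNA/1x MIV; n=6). The left panel of Figure 3b shows antibody endpoint titers of H1-SS-np' immune sera against a diverse range of HA proteins, and the right panel shows the HA stem reactivity of sera from the four immunization regimens. Figure 3c shows neutralization titers of sera from ferrets immunized with the three dosing regimens. Antibody endpoint and IC50 titers are shown for each individual animal two weeks after the boost. Dashed lines indicate the baseline (1:25 dilution) for both ELISA and the pseudotyped lentiviral reporter assay. Error bars represent mean ± s.d. Statistical analysis was performed using a two-tailed Student's t-test.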
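Figure 4a shows the immune protection conferred against lethal H5N1 2004 VN influenza virus challenge in mice and ferrets. BALB/c mice (n=10 per group) were vaccinated three times, at weeks 0, 8 and 11, with SAS-adjuvanted empty np or H1-SS-np, or were left unvaccinated (naive). Four weeks after the final vaccination, the mice were challenged with a high dose (25 LD50) of H5N1 2004 VN virus and monitored for weight loss (left) and survival (right) for 14 days. Figure 4b shows ferrets that were immunized with SAS-adjuvanted empty np (n=5), H1-SS-np' (n=6), 2006-07 TIV (n=6) or H5 HA (DNA/MIV; n=6) and challenged with 1000 TCID50 of H5N1 2004 VN six weeks after the final immunization. Weight loss (left) and survival (right) were monitored for 14 days. Figure 4c shows BALB/c mice (n=10 per group) that were passively immunized (intraperitoneally) with 10 mg of Ig from naive or H1-SS-np-immunized animals 24 hours before challenge with a high dose (25 LD50) of H5N1 2004 VN influenza virus. Weight loss (left) and survival (right) were monitored for 14 days. In each of Figures 4a, 4b and 4c, the black dashed line (right) indicates 50% survival. Statistical analysis was performed using the log-rank (Mantel-Cox) test. Figure 4d shows the characterization of naive and H1-SS-np-immune Ig: binding of naive Ig (left) and H1-SS-np-immune Ig (right) to empty ferritin np and to various HA proteins, as measured by ELISA. Figure 4e shows the estimated concentration of Gen6 HA-SS-specific Ig in mouse serum 24 hours after infusion of polyclonal Ig.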
Figures 5-24 provide plasmid maps and sequences used to produce the peptide constructs of the present invention, as described in detail in Table 2 of this disclosure.
Figure 5 shows a map of Gen6_H1NC99_K394M/E446L/E448Q/R449W/D452L/Y437D/N438L_N19Q comprising SEQ ID NO:266.
Figure 6 shows a map of Gen6_H1CA09_K394M/E446L/E448Q/R449W/D452L/Y437D/N438L_N19Q comprising SEQ ID NO:273.
Figure 7 shows a map of Gen6_H2Sing57_K394M/M445L/E446L/E448Q/R449W/D452L/Y437D/N438L_N19Q comprising SEQ ID NO:280.
Figure 8 shows a map of Gen6_H5Ind05K394M/M445L/E446L/E448Q/R449W/D452L/Y437D/N438L/S49bW_N19Q comprising SEQ ID NO:287.
Figure 9 shows a map of Gen6_H1NC99_K394M/E446L_N19Q comprising SEQ ID NO:294.
Figure 10 shows a map of Gen6_H1NC99_K394M/E446L/Y437D/N438L_N19Q comprising SEQ ID NO:301.
Figure 11 shows a map of Gen6_H1NC99_K394I/E446I/Y437D/N438L_N19Q comprising SEQ ID NO:308.
Figure 12 shows a map of Gen6H1NC99K394L/E446I/Y437D/N438L_N19Q comprising SEQ ID NO:315.
Figure 13 shows a map of Gen6_H1NC99_K394L/E446L/Y437D/N438L_N19Q comprising SEQ ID NO:322.
Figure 14 shows a map of Gen6_H1NC99_K394M/E446M/Y437D/N438L_N19Q comprising SEQ ID NO:329.
Figure 15 shows a map of Gen6H1NC99K394Q/E446Q/Y437D/N438L_N19Q comprising SEQ ID NO:336.
Figure 16 shows a map of Gen6H1NC99K394M/E446L/Y437D/N438L/H45N/V47T_N19Q comprising SEQ ID NO:343.
Figure 17 shows a map of Gen6H1NC99V36I/K394M/L445M/E446L/E448Q/R449F/D452L/Y437D/N438L N19Q comprising SEQ ID NO:350.
Figure 18 shows a map of Gen6_H1NC99_K394M/E446L/E448Q/R449W/D452L/S402aN/G402cT/S402dG/T402fA/Y437D/N438L_N19Q comprising SEQ ID NO:357.
Figure 19 shows a map of Gen6_H1NC99_K394M/E446L/E448Q/R449W/D452L/S402bG/G402cN/S402eT/T402fA/Y437D/N438L_N19Q comprising SEQ ID NO:364.
Figure 20 shows a map of Gen6_H1NC99_K394M/E446L/E448Q/R449W/D452L/S402eN/Y437D/N438L_N19Q comprising SEQ ID NO:371.
Figure 21 shows a map of Gen6H1NC99K394M/E446L/E448Q/R449W/D452L/G402cN/G402eT/T402fA/Q370N/E372T/Y437D/N438L_S21T comprising SEQ ID NO:378.
Figure 22 shows a map of Gen6_H1NC99_K394M/E446L/E448Q/R449W/D452L/G402cN/G402eT/T402fA/Q370N/E372T/Y437D/N438L_S21T/Q69N comprising SEQ ID NO:386.
Figure 23 shows a map of Gen6_H1NC99_K394M/E446L/Y437D/N438L/Δ172-174 comprising SEQ ID NO:392.
Figure 24 shows a map of Gen6_H1NC99_rpk3_Dloop2 comprising SEQ ID NO:399.
Detailed Description of the Invention
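The present invention relates to a novel vaccine for influenza virus. More specifically, the present invention relates to novel influenza HA protein-based vaccines that elicit an immune response against the stem region of the HA protein from a broad range of influenza viruses. It also relates to self-assembling nanoparticles that display on their surface immunogenic portions of the pre-fusion conformation of the stem region from the influenza HA protein. Such nanoparticles are useful for vaccinating individuals against influenza virus. Accordingly, the present invention also relates to protein constructs for producing such nanoparticles and to nucleic acid molecules encoding such proteins. Additionally, the present invention relates to methods of producing the nanoparticles of the present invention, and to methods of using such nanoparticles to vaccinate individuals.
Before the present invention is further described, it is to be understood that this invention is not limited to the particular embodiments described, as such may, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting, since the scope of the present invention will be limited only by the claims.
It must be noted that, as used herein and in the appended claims, the singular forms "a," "an," and "the" include plural referents unless the context clearly dictates otherwise. For example, a nucleic acid molecule refers to one or more nucleic acid molecules. As such, the terms "a," "an," "one or more" and "at least one" can be used interchangeably. Similarly, the terms "comprising," "including" and "having" can be used interchangeably. It is further noted that the claims may be drafted to exclude any optional element. As such, this statement is intended to serve as antecedent basis for the use of exclusive terminology such as "solely," "only" and the like in connection with the recitation of claim elements, or for the use of a "negative" limitation.
In addition to the above, unless specifically defined otherwise, the following terms and phrases, which are common to the various embodiments disclosed herein, are defined as follows: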
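As used herein, a protein construct is a protein made by the hand of man, in which two or more amino acid sequences are covalently joined in a way not found in nature. The joined amino acid sequences may be related or unrelated. As used herein, amino acid sequences are unrelated if they are not normally found joined together via a covalent bond in their natural environment (e.g., inside a cell). For example, the amino acid sequence of a monomeric subunit that makes up ferritin and the amino acid sequence of an influenza HA protein are not normally found joined together via a covalent bond. Thus, such sequences are considered unrelated.
Protein constructs can also comprise related amino acid sequences. For example, the structure of the influenza HA protein is such that the head region amino acid sequence is flanked on both ends by stem region amino acid sequences. Through genetic means, it is possible to create a deleted version of an HA protein by removing amino acid residues from the middle of the head region while retaining the portions of the head region that are flanked by stem region sequences. While the order of the sequences in the final molecule remains the same, the spatial relationship between the amino acids differs from that in the natural protein. Thus, such a molecule would be considered a protein construct. According to the present invention, protein constructs may also be referred to as fusion proteins.
Amino acid sequences in a protein construct can be joined directly to each other, or they can be joined using a linker sequence. A linker sequence, peptide or polypeptide is a short (e.g., 2-20 residue) amino acid sequence used to join two proteins having a desired characteristic (e.g., structure, epitope, immunogenicity, activity, etc.). A linker sequence typically has no activity of its own and is usually used to allow other parts of the protein construct to assume a desired conformation. Linker sequences are typically made from small amino acid residues and/or runs thereof, such as serine, alanine and glycine, although the use of other amino acid residues is not excluded.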
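As used herein, the term immunogenic refers to the ability of a specific protein, or a specific region thereof, to elicit an immune response to the specific protein, or to proteins comprising an amino acid sequence having a high degree of identity with the specific protein. According to the present invention, two proteins having a high degree of identity have amino acid sequences that are at least 80% identical, at least 85% identical, at least 87% identical, at least 90% identical, at least 92% identical, at least 93% identical, at least 94% identical, at least 95% identical, at least 96% identical, at least 97% identical, at least 98% identical or at least 99% identical. Methods of determining the percent identity between two amino acid or nucleic acid sequences are known in the art.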
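As a rough illustration of the identity thresholds recited above, the following minimal sketch computes percent identity between two sequences that have already been aligned to equal length. The function names, the use of "-" as the gap character, and the choice of dividing by the number of aligned columns are assumptions made for this example; established alignment tools may define the denominator differently, and the text does not prescribe any particular method.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences ('-' marks a gap).

    Assumption: identity = matching columns / aligned columns, where columns
    that are gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def has_high_identity(aligned_a: str, aligned_b: str,
                      threshold: float = 80.0) -> bool:
    """True if the two sequences meet the given percent identity threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

With the default threshold, two ten-residue toy sequences that differ at one position would qualify as having a high degree of identity, whereas a four-column alignment containing one gap would not.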
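As used herein, an immune response to a vaccine or nanoparticle of the present invention is the development in a subject of a humoral and/or a cellular immune response to an HA protein present in the vaccine. For purposes of the present invention, a "humoral immune response" refers to an immune response mediated by antibody molecules, including secretory (IgA) or IgG molecules, while a "cellular immune response" is one mediated by T-lymphocytes and/or other white blood cells. One important aspect of cellular immunity involves an antigen-specific response by cytolytic T-cells ("CTLs"). CTLs have specificity for peptide antigens that are presented in association with proteins encoded by the major histocompatibility complex (MHC) and expressed on the surfaces of cells. CTLs help induce and promote the destruction of intracellular microbes, or the lysis of cells infected with such microbes. Another aspect of cellular immunity involves an antigen-specific response by helper T-cells. Helper T-cells act to help stimulate the function, and focus the activity, of nonspecific effector cells against cells displaying peptide antigens in association with MHC molecules on their surface. A cellular immune response also refers to the production of cytokines, chemokines and other such molecules produced by activated T-cells and/or other white blood cells, including those derived from CD4+ and CD8+ T-cells.
Thus, an immune response may be one that stimulates the production or activation of CTLs and/or helper T-cells. The production of chemokines and/or cytokines may also be stimulated. The vaccine may also elicit an antibody-mediated immune response. Hence, an immune response may include one or more of the following effects: the production of antibodies (e.g., IgA or IgG) by B-cells; and/or the activation of suppressor, cytotoxic or helper T-cells and/or T-cells directed specifically to an HA protein present in the vaccine. These responses may serve to neutralize infectivity (e.g., antibody-dependent protection), and/or mediate antibody-complement or antibody-dependent cell cytotoxicity (ADCC) to provide protection to an immunized individual. Such responses can be determined using standard immunoassays and neutralization assays well known in the art.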
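As used herein, the terms antigenic, antigenicity and the like refer to a protein that is bound by an antibody or a group of antibodies. Similarly, an antigenic portion of a protein is any portion that is recognized by an antibody or a group of antibodies. According to the present invention, recognition of a protein by an antibody means that the antibody selectively binds to the protein. As used herein, the phrases selectively binds, selective binding and the like refer to the ability of an antibody to preferentially bind an HA protein as opposed to binding proteins unrelated to HA, or non-protein components in the sample or assay. An antibody that preferentially binds HA is one that binds HA but does not significantly bind other molecules or components that may be present in the sample or assay. Significant binding is considered to be, for example, binding of an anti-HA antibody to a non-HA molecule with an affinity or avidity large enough to interfere with the ability of the assay to detect and/or determine the level of anti-influenza antibodies, or HA protein, in the sample. Examples of other molecules and compounds that may be present in the sample or assay include, but are not limited to, non-HA proteins such as albumin, lipids and carbohydrates. According to the present invention, a non-HA protein is a protein having an amino acid sequence sharing less than 60% identity with the sequence of an influenza HA protein disclosed herein. In some embodiments, the antibody or antibodies provide broad heterosubtypic protection. In some embodiments, the antibody or antibodies are neutralizing.
As used herein, a neutralizing antibody is an antibody that prevents influenza virus from completing one round of replication. As defined herein, one round of replication refers to the life cycle of the virus, starting with attachment of the virus to a host cell and ending with budding of newly formed virus from the host cell. This life cycle includes, but is not limited to, the steps of attaching to a cell, entering a cell, cleavage and rearrangement of the HA protein, fusion of the viral membrane with the endosomal membrane, release of viral ribonucleoproteins into the cytoplasm, formation of new viral particles and budding of viral particles from the host cell membrane. According to the present invention, a neutralizing antibody is one that inhibits one or more of such steps.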
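As used herein, a broadly neutralizing antibody is an antibody that neutralizes more than one type, subtype and/or strain of influenza virus. For example, broadly neutralizing antibodies elicited against an HA protein from a Type A influenza virus may neutralize a Type B or Type C virus. As a further example, broadly neutralizing antibodies elicited against an HA protein from a Group 1 influenza virus may neutralize a Group 2 virus. As an additional example, broadly neutralizing antibodies elicited against an HA protein from one subtype or strain of virus may neutralize another subtype or strain of virus. For example, broadly neutralizing antibodies elicited against an HA protein from an H1 influenza virus may neutralize viruses from one or more subtypes selected from the group consisting of H2, H3, H4, H5, H6, H7, H8, H9, H10, H11, H12, H13, H14, H15, H16, H17 and H18.
According to the present invention, all nomenclature used to classify influenza virus is that commonly used by those skilled in the art. Thus, a Type of influenza virus refers to influenza Type A, influenza Type B or influenza Type C. It is understood by those skilled in the art that the designation of a virus as a specific Type relates to sequence differences in the respective M1 (matrix) protein or NP (nucleoprotein). Type A influenza viruses are further divided into Group 1 and Group 2. These Groups are further divided into subtypes, which refers to classification of a virus based on the sequence of its HA protein. Examples of currently commonly recognized subtypes are H1, H2, H3, H4, H5, H6, H7, H8, H9, H10, H11, H12, H13, H14, H15, H16, H17 and H18. Group 1 influenza subtypes are H1, H2, H5, H6, H8, H9, H11, H12, H13, H16, H17 and H18. Group 2 influenza subtypes are H3, H4, H7, H10, H14 and H15. Finally, the term strain refers to viruses within a subtype that differ from one another in that they have small genetic variations in their genome.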
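The Group 1 and Group 2 subtype assignments listed above can be captured in a small lookup, sketched here for illustration. The set contents are taken from the text; the function names and the idea of testing whether two subtypes fall in different Groups are assumptions introduced for this example only.

```python
# Subtype membership as recited in the text for Type A influenza viruses.
GROUP_1 = {"H1", "H2", "H5", "H6", "H8", "H9",
           "H11", "H12", "H13", "H16", "H17", "H18"}
GROUP_2 = {"H3", "H4", "H7", "H10", "H14", "H15"}


def influenza_a_group(subtype: str) -> int:
    """Return 1 or 2 for a Type A HA subtype such as 'H5'."""
    key = subtype.strip().upper()
    if key in GROUP_1:
        return 1
    if key in GROUP_2:
        return 2
    raise ValueError(f"unrecognized HA subtype: {subtype!r}")


def crosses_groups(immunogen_subtype: str, target_subtype: str) -> bool:
    """True if the two subtypes belong to different Groups (hypothetical helper)."""
    return influenza_a_group(immunogen_subtype) != influenza_a_group(target_subtype)
```

Under this lookup, an H1 immunogen and an H5 target fall within the same Group, whereas an H1 immunogen and an H3 target do not.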
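As used herein, an influenza hemagglutinin protein, or HA protein, refers to a full-length influenza hemagglutinin protein or any portion thereof that is useful for producing protein constructs and nanoparticles of the invention or that is capable of eliciting an immune response. Preferred HA proteins are those that are capable of forming trimers. An epitope of a full-length influenza HA protein refers to a portion of such protein that can elicit an antibody response against the homologous influenza strain, i.e., the strain from which the HA is derived. In some embodiments, such an epitope can also elicit an antibody response against a heterologous influenza strain, i.e., a strain having an HA that is not identical to the HA of the immunogen. In some embodiments, the epitope elicits a broadly heterosubtypic protective response. In some embodiments, the epitope elicits neutralizing antibodies.
As used herein, a variant refers to a protein or nucleic acid molecule whose sequence is similar, but not identical, to a reference sequence, wherein the activity of the variant protein (or of the protein encoded by the variant nucleic acid molecule) is not significantly altered. These variations in sequence can be naturally occurring variations, or they can be engineered through the use of genetic engineering techniques known to those skilled in the art. Examples of such techniques are found in Sambrook J, Fritsch E F, Maniatis T et al., in Molecular Cloning--A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory Press, 1989, pp. 9.31-9.57, or in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6, both of which are incorporated herein by reference in their entirety.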
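With regard to variants, any type of alteration in the amino acid or nucleic acid sequence is permissible so long as the resulting variant protein retains the ability to elicit neutralizing or non-neutralizing antibodies against an influenza virus. Examples of such variations include, but are not limited to, deletions, insertions, substitutions and combinations thereof. For example, with regard to proteins, it is well understood by those skilled in the art that one or more (e.g., 2, 3, 4, 5, 6, 7, 8, 9 or 10) amino acids can often be removed from the amino- and/or carboxy-terminal ends of a protein without significantly affecting the activity of that protein. Similarly, one or more (e.g., 2, 3, 4, 5, 6, 7, 8, 9 or 10) amino acids can often be inserted into a protein without significantly affecting the activity of the protein. In variants in which insertions have been made, the inserted amino acids may be referred to by reference to the amino acid residue after which the insertion was made. For example, an insertion of four amino acid residues after amino acid residue 402 could be referred to as 402a-402d. Moreover, should one of those inserted amino acids later be substituted with another amino acid, such a change may be referred to by reference to the letter position. For example, substitution of an inserted serine (at position d of the insert) with a threonine can be referred to as S402dT.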
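The insertion-lettering convention described above (402a-402d, S402dT) can be illustrated with a short parser. This is only a sketch under stated assumptions: the regular expression, the `Substitution` tuple and both function names are hypothetical and cover only simple single-letter substitution labels, not deletions such as Δ172-174 or full construct names.

```python
import re
import string
from typing import List, NamedTuple


class Substitution(NamedTuple):
    original: str          # one-letter code of the residue being replaced
    position: int          # residue number
    insertion_letter: str  # '' for a native residue, 'a', 'b', ... for an insert
    replacement: str       # one-letter code of the new residue


_LABEL = re.compile(r"^([A-Z])(\d+)([a-z]?)([A-Z])$")


def parse_substitution(label: str) -> Substitution:
    """Parse a label such as 'K394M' or 'S402dT'."""
    match = _LABEL.match(label)
    if match is None:
        raise ValueError(f"not a simple substitution label: {label!r}")
    original, position, letter, replacement = match.groups()
    return Substitution(original, int(position), letter, replacement)


def insertion_labels(after_residue: int, count: int) -> List[str]:
    """Labels for residues inserted after a given residue, e.g. 402a-402d."""
    if not 1 <= count <= 26:
        raise ValueError("count must be between 1 and 26")
    return [f"{after_residue}{string.ascii_lowercase[i]}" for i in range(count)]
```

For instance, `insertion_labels(402, 4)` yields the four labels 402a through 402d, and `parse_substitution("S402dT")` reports a serine at inserted position d after residue 402 being replaced by threonine.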
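As noted, variant proteins of the present invention can contain amino acid substitutions relative to the influenza HA proteins disclosed herein. Any amino acid substitution is permissible so long as the activity of the protein is not significantly affected. In this regard, it is appreciated in the art that amino acids can be classified into groups based on their physical properties. Examples of such groups include, but are not limited to, charged amino acids, uncharged amino acids, polar uncharged amino acids and hydrophobic amino acids. Preferred variants that contain substitutions are those in which an amino acid is substituted with an amino acid from the same group. Such substitutions are referred to as conservative substitutions.
Naturally occurring residues may be divided into classes based on common side chain properties:
1) hydrophobic: Met, Ala, Val, Leu, Ile;
2) neutral hydrophilic: Cys, Ser, Thr;
3) acidic: Asp, Glu;
4) basic: Asn, Gln, His, Lys, Arg;
5) residues that influence chain orientation: Gly, Pro; and
6) aromatic: Trp, Tyr, Phe.
For example, non-conservative substitutions may involve the exchange of a member of one of these classes for a member from another class.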
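The class-based distinction between conservative and non-conservative substitutions can be sketched as a simple lookup. The class membership below is copied from the six classes listed above; the dictionary and function names are assumptions made for illustration, and other classification schemes exist.

```python
# Side-chain classes as listed in the text (three-letter residue codes).
SIDE_CHAIN_CLASSES = {
    "hydrophobic": {"Met", "Ala", "Val", "Leu", "Ile"},
    "neutral hydrophilic": {"Cys", "Ser", "Thr"},
    "acidic": {"Asp", "Glu"},
    "basic": {"Asn", "Gln", "His", "Lys", "Arg"},
    "chain orientation": {"Gly", "Pro"},
    "aromatic": {"Trp", "Tyr", "Phe"},
}


def side_chain_class(residue: str) -> str:
    """Return the name of the class that contains the given residue."""
    for name, members in SIDE_CHAIN_CLASSES.items():
        if residue in members:
            return name
    raise ValueError(f"unknown residue: {residue!r}")


def is_conservative(original: str, replacement: str) -> bool:
    """True if both residues belong to the same side-chain class."""
    return side_chain_class(original) == side_chain_class(replacement)
```

By this scheme a Leu-to-Ile change is conservative, whereas the Glu-to-Leu and Lys-to-Met changes of the kind described for Figure 1c each cross class boundaries and would count as non-conservative.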
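In making amino acid changes, the hydropathic index of amino acids may be considered. Each amino acid has been assigned a hydropathic index on the basis of its hydrophobicity and charge characteristics. The hydropathic indices are: isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); methionine (+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5). The importance of the hydropathic amino acid index in conferring interactive biological function on a protein is generally understood in the art (Kyte et al., 1982, J. Mol. Biol. 157:105-31). It is known that certain amino acids may be substituted for other amino acids having a similar hydropathic index or score and still retain a similar biological activity. In making changes based upon the hydropathic index, the substitution of amino acids whose hydropathic indices are within ±2 is preferred, those within ±1 are particularly preferred, and those within ±0.5 are even more particularly preferred.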
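The hydropathic-index windows above (±2, ±1, ±0.5) lend themselves to a small worked example. The index values are those recited in the text; the function names, the tier labels and the rounding to one decimal place (used only to avoid floating-point noise) are assumptions of this sketch.

```python
# Hydropathic index values as recited in the text (three-letter codes).
HYDROPATHIC_INDEX = {
    "Ile": 4.5, "Val": 4.2, "Leu": 3.8, "Phe": 2.8, "Cys": 2.5,
    "Met": 1.9, "Ala": 1.8, "Gly": -0.4, "Thr": -0.7, "Ser": -0.8,
    "Trp": -0.9, "Tyr": -1.3, "Pro": -1.6, "His": -3.2, "Glu": -3.5,
    "Gln": -3.5, "Asp": -3.5, "Asn": -3.5, "Lys": -3.9, "Arg": -4.5,
}


def hydropathic_difference(original: str, replacement: str) -> float:
    """Absolute difference in hydropathic index, rounded to one decimal."""
    delta = HYDROPATHIC_INDEX[original] - HYDROPATHIC_INDEX[replacement]
    return round(abs(delta), 1)


def preference_tier(original: str, replacement: str) -> str:
    """Classify a substitution by the +/-0.5, +/-1 and +/-2 windows."""
    diff = hydropathic_difference(original, replacement)
    if diff <= 0.5:
        return "even more particularly preferred"
    if diff <= 1.0:
        return "particularly preferred"
    if diff <= 2.0:
        return "preferred"
    return "outside the preferred range"
```

As a usage note, isoleucine and valine differ by 0.3 and so fall in the narrowest window, while glutamate and leucine differ by 7.3 and fall outside all three.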
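It is also understood in the art that the substitution of like amino acids can be made effectively on the basis of hydrophilicity, particularly where the biologically functionally equivalent protein or peptide thereby created is intended for use in immunological inventions, as in the present case. The greatest local average hydrophilicity of a protein, as governed by the hydrophilicity of its adjacent amino acids, correlates with its immunogenicity and antigenicity, i.e., with a biological property of the protein. The following hydrophilicity values have been assigned to these amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0±1); glutamate (+3.0±1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); threonine (-0.4); proline (-0.5±1); alanine (-0.5); histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (-2.3); phenylalanine (-2.5); and tryptophan (-3.4). In making changes based upon similar hydrophilicity values, the substitution of amino acids whose hydrophilicity values are within ±2 is preferred, those within ±1 are particularly preferred, and those within ±0.5 are even more particularly preferred. One may also identify epitopes from primary amino acid sequences on the basis of hydrophilicity.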
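To make the notion of "greatest local average hydrophilicity" concrete, the sketch below slides a fixed-size window along a sequence and reports the most hydrophilic stretch. The per-residue values are those recited in the text, with the stated ±1 uncertainties ignored; the window length of six, the function names and the toy sequence in the usage note are assumptions chosen for illustration and are not taken from the disclosure.

```python
from typing import List, Tuple

# Hydrophilicity values from the text, keyed by one-letter residue code.
# The +/-1 uncertainties given for Asp, Glu and Pro are ignored here.
HYDROPHILICITY = {
    "R": 3.0, "K": 3.0, "D": 3.0, "E": 3.0, "S": 0.3, "N": 0.2, "Q": 0.2,
    "G": 0.0, "T": -0.4, "P": -0.5, "A": -0.5, "H": -0.5, "C": -1.0,
    "M": -1.3, "V": -1.5, "L": -1.8, "I": -1.8, "Y": -2.3, "F": -2.5,
    "W": -3.4,
}


def local_average_hydrophilicity(sequence: str, window: int = 6) -> List[float]:
    """Mean hydrophilicity of every window of the given length."""
    if window < 1 or len(sequence) < window:
        raise ValueError("sequence must be at least as long as the window")
    values = [HYDROPHILICITY[residue] for residue in sequence.upper()]
    return [sum(values[i:i + window]) / window
            for i in range(len(values) - window + 1)]


def most_hydrophilic_window(sequence: str, window: int = 6) -> Tuple[int, str, float]:
    """Return (0-based start, segment, average) of the most hydrophilic window."""
    averages = local_average_hydrophilicity(sequence, window)
    start = max(range(len(averages)), key=averages.__getitem__)
    return start, sequence[start:start + window], averages[start]
```

Applied to a made-up sequence such as `"LLLLLLKDEKRDLLLLLL"`, the function picks out the charged stretch in the middle as the most hydrophilic window; whether such a stretch is actually an epitope would still have to be determined experimentally.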
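Desired amino acid substitutions (whether conservative or non-conservative) can be determined by those skilled in the art at the time such substitutions are desired. For example, amino acid substitutions can be used to identify important residues of the HA protein, or to increase or decrease the immunogenicity, solubility or stability of the HA proteins described herein. Exemplary amino acid substitutions are shown below in Table 1.
Table 1
Amino Acid Substitutions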
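As used herein, the phrase significantly affect a protein's activity refers to a decrease in the activity of a protein by at least 10%, at least 20%, at least 30%, at least 40% or at least 50%. With regard to the present invention, such an activity may be measured, for example, as the ability of a protein to elicit protective antibodies against an influenza virus. Such activity may be measured by measuring the titer of such antibodies against influenza virus, by measuring the ability of such antibodies to protect against influenza infection, or by measuring the number of types, subtypes or strains neutralized by the elicited antibodies. Methods of determining antibody titers, performing protection assays and performing virus neutralization assays are known to those skilled in the art. In addition to the activities described above, other activities that may be measured include the ability to agglutinate red blood cells and the binding affinity of the protein for a cell. Methods of measuring such activities are known to those skilled in the art.
The terms individual, subject and patient are well recognized in the art and are used interchangeably herein to refer to any human or other animal susceptible to influenza infection. Examples include, but are not limited to, humans and other primates, including non-human primates such as chimpanzees and other ape and monkey species; farm animals such as cattle, sheep, pigs, seals, goats and horses; domestic mammals such as dogs and cats; laboratory animals, including rodents such as mice, rats and guinea pigs; and birds, including domestic, wild and game birds such as chickens, turkeys and other gallinaceous birds, ducks, geese and the like. The terms individual, subject and patient by themselves do not denote a particular age, sex, race and the like. Thus, individuals of any age, whether male or female, are intended to be covered by the present disclosure and include, but are not limited to, the elderly, adults, children, babies, infants and toddlers. Likewise, the methods of the present invention can be applied to any race, including, for example, Caucasian (white), African-American (black), Native American, Native Hawaiian, Hispanic, Latino, Asian and European. An infected subject is a subject that is known to have influenza virus in its body.
As used herein, a vaccinated subject is a subject that has been administered a vaccine that is intended to provide a protective effect against an influenza virus.
As used herein, the term exposed refers to a subject that has come into contact with an animal or individual known to be infected with an influenza virus.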
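The publications discussed herein are provided solely for their disclosure prior to the filing date of the present application. Nothing herein is to be construed as an admission that the present invention is not entitled to antedate such publications by virtue of prior invention. Further, the dates of publication provided may be different from the actual publication dates, which may need to be independently confirmed.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although any methods and materials similar or equivalent to those described herein can also be used in the practice or testing of the present invention, the preferred methods and materials are now described. All publications mentioned herein are incorporated herein by reference to disclose and describe the methods and/or materials in connection with which the publications are cited.
It is appreciated that certain features of the invention, which are, for clarity, described in the context of separate embodiments, may also be provided in combination in a single embodiment. Conversely, various features of the invention, which are, for brevity, described in the context of a single embodiment, may also be provided separately or in any suitable sub-combination. All combinations of the embodiments are specifically embraced by the present invention and are disclosed herein just as if each and every combination were individually and explicitly disclosed. In addition, all sub-combinations are also specifically embraced by the present invention and are disclosed herein just as if each and every such sub-combination were individually and explicitly disclosed herein.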
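One embodiment of the present invention is a protein construct comprising an influenza HA protein, wherein the head region of the influenza HA protein has been replaced with an amino acid sequence comprising less than 5 contiguous amino acid residues from the head region of an HA protein. As used herein, an HA protein refers to a full-length influenza HA protein, or any one or more portions and/or variants thereof, that is useful for producing protein constructs and nanoparticles of the present invention. Accordingly, the present invention relates to molecules that are capable of eliciting an immune response to the stem region of an influenza HA protein. In some embodiments, the sequence of the HA protein construct has been further altered (i.e., mutated) to stabilize the stem region of the protein in a form that can be presented to the immune system. Some representative examples of such HA proteins, and of protein constructs made therefrom, are shown below in Table 2.
Table 2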
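The trimeric HA protein on the surface of the virus comprises a globular head region and a stem, or stalk, region, which anchors the HA protein in the viral lipid envelope. The head region of influenza HA is formed exclusively from a major portion of the HA1 polypeptide, whereas the stalk region is made from segments of HA1 and HA2. According to the present invention, the head region consists approximately of the amino acids of an HA protein corresponding to amino acids 59-291 of the full-length HA protein of influenza H1N1 NC (SEQ ID NO:8). Similarly, as used herein, the stem region consists approximately of amino acids 1-58 and the amino acids of an HA protein corresponding to amino acids 328-564 of the full-length HA protein of influenza H1N1 NC (SEQ ID NO:8). As used herein with regard to the head and stem regions, the term approximately means that the sequences cited above may vary in length by several amino acids without affecting the nature of the invention. Thus, for example, the head region may consist of amino acids 50-291, amino acids 59-296 or amino acids 59-285. Generally, the head and stem regions will not vary from the locations recited above by more than ten amino acids; however, in one embodiment the carboxy-terminal end of the head region can extend as far as the amino acid corresponding to amino acid 327 of SEQ ID NO:8. In one embodiment, the head region consists of the amino acid sequence between, and including, the amino acid residues corresponding to Cys59 and Cys291 of influenza A/New Caledonia/20/1999 (SEQ ID NO:8). With regard to HA proteins, it is understood by those skilled in the art that HA proteins from different influenza viruses may have different lengths due to mutations (insertions, deletions) in the protein. Thus, reference to a corresponding region refers to a region of another protein that is identical, or nearly so (e.g., at least 90% identical, at least 95% identical, at least 98% identical or at least 99% identical), in sequence, structure and/or function to the region being compared. For example, with regard to the stem region of an HA protein, the corresponding region in another HA protein may not have the same residue numbers, but will have a nearly identical sequence and will perform the same function. As an example, in the embodiment described above, the head region of the HA protein from A/New Caledonia/20/1999 (SEQ ID NO:8) ends at amino acid C291. The corresponding amino acid at the end of the head region in A/California/4/2009 (H1) (SEQ ID NO:11) is cysteine 292. To better clarify sequence comparisons between viruses, those skilled in the art use numbering systems that relate amino acid positions to a reference sequence. Thus, corresponding amino acid residues in HA proteins from different influenza strains may not have the same residue number with respect to their distance from the N-terminal amino acid of the protein. For example, using the H3 numbering system, reference to residue 100 in A/New Caledonia/20/1999 (1999 NC, H1) does not mean that it is the 100th residue from the N-terminal amino acid. Instead, residue 100 of A/New Caledonia/20/1999 (1999 NC, H1) aligns with residue 100 of an influenza H3N2 strain. The use of such numbering systems is understood by those skilled in the art. While the H3 numbering system can be used to identify the location of amino acids, unless otherwise noted, the location of amino acid residues in HA proteins will be identified by general reference to the location of the corresponding amino acid in a sequence disclosed herein.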
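The residue ranges given above can be turned into a simple slicing exercise. The sketch below splits a sequence into the segment upstream of the head, the head itself, and the segment downstream of the head, using 1-based inclusive positions. The default boundaries of 59 and 291 are the approximate values the text gives for SEQ ID NO:8; the function name is hypothetical, the sequence used in the usage note is a made-up placeholder rather than a real HA sequence, and the positions are counted directly along the supplied sequence rather than in H3 numbering.

```python
from typing import Tuple


def split_head_and_flanks(sequence: str,
                          head_start: int = 59,
                          head_end: int = 291) -> Tuple[str, str, str]:
    """Split a sequence into (upstream, head, downstream) segments.

    Positions are 1-based and inclusive, counted along `sequence` itself.
    """
    if not 1 <= head_start <= head_end <= len(sequence):
        raise ValueError("head boundaries fall outside the sequence")
    upstream = sequence[:head_start - 1]        # residues 1 .. head_start-1
    head = sequence[head_start - 1:head_end]    # residues head_start .. head_end
    downstream = sequence[head_end:]            # residues head_end+1 .. end
    return upstream, head, downstream
```

For a placeholder sequence of 564 residues with cysteines at positions 59 and 291, the call returns 58 upstream residues, a 233-residue head that begins and ends with cysteine, and 273 downstream residues. Shifting `head_start` or `head_end` by a few residues reproduces the "approximately" latitude described in the text.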
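The inventors have also discovered that, by combining specific sequences of the influenza virus HA protein with unrelated molecules capable of presenting the HA protein to the immune system, an immune response to targeted regions of the HA protein can be elicited. One embodiment of the present invention is a protein construct comprising an influenza HA protein joined to at least a portion of a monomeric subunit protein, wherein the head region of the influenza HA protein has been replaced with an amino acid sequence comprising less than 5 contiguous amino acid residues from the head region of an HA protein, and wherein the protein construct is capable of forming a nanoparticle.
By joining at least a portion of an influenza HA protein to a monomeric subunit, protein constructs of the present invention are capable of assembling into nanoparticles that express trimers of HA on their surface. It should be appreciated that the HA proteins making up such trimers are in the pre-fusion form, and that joining to the monomeric subunit and expression on a nanoparticle stabilizes the pre-fusion proteins in their trimeric form. This is significant because the HA protein is presented in a more native form, meaning that certain surfaces of the stem polypeptide are not exposed, thereby reducing the risk that the stem polypeptide will induce an unfavorable antibody response.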
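In one embodiment, the HA protein comprises at least one immunogenic portion from the stem region of an influenza HA protein, wherein the protein elicits protective antibodies against an influenza virus. In one embodiment, the HA protein comprises at least one immunogenic portion from the stem region of an HA protein from a virus selected from the group consisting of influenza A viruses, influenza B viruses and influenza C viruses, wherein the protein elicits protective antibodies against an influenza virus. In one embodiment, the HA protein comprises at least one immunogenic portion from the stem region of an HA protein selected from the group consisting of an H1 influenza virus HA protein, an H2 influenza virus HA protein, an H3 influenza virus HA protein, an H4 influenza virus HA protein, an H5 influenza virus HA protein, an H6 influenza virus HA protein, an H7 influenza virus HA protein, an H8 influenza virus HA protein, an H9 influenza virus HA protein, an H10 influenza virus HA protein, an H11 influenza virus HA protein, an H12 influenza virus HA protein, an H13 influenza virus HA protein, an H14 influenza virus HA protein, an H15 influenza virus HA protein, an H16 influenza virus HA protein, an H17 influenza virus HA protein and an H18 influenza virus HA protein.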
在一個實(shí)施方案中,HA蛋白包含來自蛋白質(zhì)的至少一個免疫原性部分,所述蛋白質(zhì)包含與選自下組的序列至少80%相同的氨基酸序列:SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。在一個實(shí)施方案中,HA蛋白包含來自蛋白質(zhì)的至少一個免疫原性部分,所述蛋白質(zhì)包含選自下組的氨基酸序列:SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。在一個實(shí)施方案中,HA蛋白包含來自蛋白質(zhì)的至少一個免疫原性部分,所述蛋白質(zhì)包含與選自下組的序列至少80%相同的氨基酸序列:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:150,SEQ ID NO:152,SEQ ID NO:154,SEQ ID NO:156,SEQ ID NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400。在一個實(shí)施方案中,HA蛋白包含來自蛋白質(zhì)的至少一個免疫原性部分,所述蛋白質(zhì)包含選自下組的氨基酸序列:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:150,SEQ ID NO:152,SEQ ID NO:154,SEQ ID NO:156,SEQ ID NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ 
ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400。在一個實(shí)施方案中,包含HA蛋白的免疫原性部分的此類蛋白質(zhì)引發(fā)針對流感病毒的廣泛保護(hù)性抗體的產(chǎn)生。
蛋白質(zhì)的免疫原性部分包含表位,其是被免疫系統(tǒng)識別的氨基酸殘基的簇,從而引發(fā)免疫應(yīng)答。此類表位可以由連續(xù)的氨基酸殘基(即,在蛋白質(zhì)中彼此相鄰的氨基酸殘基)組成,或者它們可以由非連續(xù)的氨基酸殘基(即,蛋白質(zhì)中彼此不相鄰的氨基酸殘基),但在最終折疊的蛋白質(zhì)中緊密空間接近。本領(lǐng)域技術(shù)人員完全理解,表位需要最少六個氨基酸殘基,以便被免疫系統(tǒng)識別。因此,在一個實(shí)施方案中,來自流感HA蛋白的免疫原性部分包含至少一個表位。在一個實(shí)施方案中,HA蛋白包含來自流感HA蛋白的莖區(qū)的至少6個氨基酸,至少10個氨基酸,至少25個氨基酸,至少50個氨基酸,至少75個氨基酸或至少100個氨基酸。在一個實(shí)施方案中,HA蛋白包含來自HA蛋白的莖區(qū)的至少6個氨基酸,至少10個氨基酸,至少25個氨基酸,至少50個氨基酸,至少75個氨基酸或至少100個氨基酸,所述HA蛋白來自選自A型流感病毒,B型流感病毒和C型流感病毒的病毒。在一個實(shí)施方案中,HA蛋白包含來自HA蛋白的莖區(qū)的至少6個氨基酸,至少10個氨基酸,至少25個氨基酸,至少50個氨基酸,至少75個氨基酸或至少100個氨基酸,所述HA蛋白來自選自H1流感病毒HA蛋白,H2流感病毒HA蛋白,H3流感病毒HA蛋白,H4流感病毒HA蛋白,H5流感病毒HA蛋白,H6流感病毒HA蛋白,H7流感病毒病毒HA蛋白,H8流感病毒HA蛋白,H9流感病毒HA蛋白,H10流感病毒HA蛋白HA蛋白,H11流感病毒HA蛋白,H12流感病毒HA蛋白,H13流感病毒HA蛋白,H14流感病毒HA蛋白,H15流感病毒HA蛋白,H16流感病毒HA蛋白,H17流感病毒HA蛋白和H18流感病毒HA蛋白。在一個實(shí)施方案中,HA蛋白包含來自HA蛋白的莖區(qū)的至少6個氨基酸,至少10個氨基酸,至少25個氨基酸,至少50個氨基酸,至少75個氨基酸或至少100個氨基酸,所述HA蛋白來自于選自下組的病毒株:流感A/新喀里多尼亞/20/1999(1999NC,H1),A/加利福尼亞/04/2009(2009CA,H1),A/新加坡/1/1957(1957Sing,H2),A/香港/1/1968(1968HK,H3),A/布里斯班/10/2007(2007Bris,H3),A/印度尼西亞/05/2005(2005Indo,H5),B/佛羅里達(dá)/4/2006(2006Flo,B),A/珀斯/16/2009(2009Per,H3),A/布里斯班/59/2007(2007Bris,H1),B/布里斯班/60/2008(2008Bris,B)及其變體。在一個實(shí)施方案中,氨基酸是來自HA蛋白的莖區(qū)的連續(xù)氨基酸。在一個實(shí)施方案中,包含來自HA蛋白的莖區(qū)的至少6個氨基酸,至少10個氨基酸,至少25個氨基酸,至少50個氨基酸,至少75個氨基酸或至少100個氨基酸的此類蛋白質(zhì)引發(fā)針對流感病毒的廣泛保護(hù)性抗體的產(chǎn)生。本發(fā)明的一個實(shí)施方案是包含蛋白質(zhì)構(gòu)建體,其包含來自HA蛋白的莖區(qū)的至少6個氨基酸,至少10個氨基酸,至少25個氨基酸,至少50個氨基酸,至少75個氨基酸或至少100個氨基酸,所述HA蛋白包含選自SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17的氨基酸序列。本發(fā)明的一個實(shí)施方案是蛋白質(zhì)構(gòu)建體,其包含來自HA蛋白的莖區(qū)的至少6個氨基酸,至少10個氨基酸,至少25個氨基酸,至少50個氨基酸,至少75個氨基酸或至少100個氨基酸,所述HA蛋白包含選自下組的氨基酸序列:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:150,SEQ ID NO:152,SEQ ID NO:154,SEQ ID NO:156,SEQ ID NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID 
NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400。在一個實(shí)施方案中,氨基酸是來自HA蛋白的莖區(qū)的連續(xù)氨基酸。在一個實(shí)施方案中,氨基酸是非連續(xù)的,但在最終蛋白質(zhì)中緊密空間接近。
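While the present application exemplifies the use of stem region sequences from several exemplary HA proteins, the invention may also be practiced using stem regions from proteins comprising variations of the disclosed HA sequences. Thus, in one embodiment, the HA protein is from a virus selected from the group consisting of influenza A/New Caledonia/20/1999 (1999 NC, H1), A/California/04/2009 (2009 CA, H1), A/Singapore/1/1957 (1957 Sing, H2), A/Hong Kong/1/1968 (1968 HK, H3), A/Brisbane/10/2007 (2007 Bris, H3), A/Indonesia/05/2005 (2005 Indo, H5), B/Florida/4/2006 (2006 Flo, B), A/Perth/16/2009 (2009 Per, H3), A/Brisbane/59/2007 (2007 Bris, H1), B/Brisbane/60/2008 (2008 Bris, B) and variants thereof. In one embodiment, the HA protein comprises an amino acid sequence at least 80%, at least 85%, at least 90%, at least 92%, at least 94%, at least 96%, at least 98% or at least 99% identical to the stem region of an HA protein comprising an amino acid sequence selected from the group consisting of SEQ ID NO:8, SEQ ID NO:, SEQ ID NO:11, SEQ ID NO:14 and SEQ ID NO:17. In one embodiment, the HA protein comprises an amino acid sequence selected from the group consisting of SEQ ID NO:8, SEQ ID NO:, SEQ ID NO:11, SEQ ID NO:14 and SEQ ID NO:17.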
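In one embodiment, the head region sequence of the HA protein is replaced with a linker sequence. Any linker sequence may be used so long as the stem region sequences are able to form the desired structure. While any amino acids may be used to make the linker sequence, it is preferred to use amino acids lacking large or charged side chains. Preferred amino acids include, but are not limited to, serine, glycine and alanine. In one embodiment, the linker is made from serine and glycine residues. The length of the linker sequence may vary, but preferred embodiments use the shortest possible sequence that allows the stem sequences to form the desired structure. In one embodiment, the linker sequence is less than 10 amino acids in length. In one embodiment, the linker sequence is less than 5 amino acids in length. In preferred embodiments, the linker sequence lacks contiguous amino acid sequences from the head region of an HA protein. In one embodiment, the linker sequence comprises less than 5 contiguous amino acids from the head region of an HA protein.
As noted above, the HA sequence is joined to a portion of a monomeric subunit protein. As used herein, a monomeric subunit protein refers to a protein monomer that is capable of binding to other monomeric subunit proteins such that the monomeric subunit proteins self-assemble into a nanoparticle. Any monomeric subunit protein can be used to produce the protein construct of the present invention, so long as the protein construct is capable of forming a multimeric structure that displays the HA protein on its surface. In one embodiment, the monomeric subunit is ferritin.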
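Ferritin is a globular protein found in all animals, bacteria and plants that acts primarily to control the rate and location of polynuclear Fe(III)2O3 formation through the transport of hydrated iron ions and protons to and from a mineralized core. The globular form of ferritin is made up of monomeric subunit proteins (also referred to as monomeric ferritin subunits), which are polypeptides having a molecular weight of approximately 17-20 kDa. An example of the sequence of one such monomeric ferritin subunit is represented by SEQ ID NO:2. Each monomeric ferritin subunit has the topology of a helix bundle that includes a four antiparallel helix motif, with a fifth, shorter helix (the C-terminal helix) lying roughly perpendicular to the long axis of the 4-helix bundle. By convention, the helices are labeled "A, B, C, and D & E" from the N-terminus, respectively. The N-terminal sequence lies adjacent to the nanoparticle three-fold axis and extends to the surface, while the E helices pack together at the four-fold axis, with the C-terminus extending into the particle core. The consequence of this packing is the creation of two pores on the nanoparticle surface. It is expected that one or both of these pores represent the point by which hydrated iron diffuses into and out of the nanoparticle. Following production, these monomeric ferritin subunit proteins self-assemble into the globular ferritin protein. Thus, the globular form of ferritin comprises 24 monomeric ferritin subunit proteins and has a capsid-like structure having 432 symmetry.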
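According to the present invention, a monomeric ferritin subunit of the present invention is a full-length, single polypeptide of a ferritin protein, or any portion thereof, that is capable of directing self-assembly of monomeric ferritin subunits into the globular form of the protein. Examples of such proteins include, but are not limited to, SEQ ID NO:2 and SEQ ID NO:5. The amino acid sequence of a monomeric ferritin subunit from any known ferritin protein can be used to produce protein constructs of the present invention, so long as the monomeric ferritin subunit is capable of self-assembling into a nanoparticle displaying HA on its surface. In one embodiment, the monomeric subunit is from a ferritin protein selected from the group consisting of a bacterial ferritin protein, a plant ferritin protein, an algal ferritin protein, an insect ferritin protein, a fungal ferritin protein and a mammalian ferritin protein. In one embodiment, the ferritin protein is from Helicobacter pylori.
Protein constructs of the present invention need not comprise the full-length sequence of a monomeric subunit polypeptide of a ferritin protein. Portions, or regions, of the monomeric ferritin subunit protein can be used so long as the portion comprises an amino acid sequence that directs self-assembly of monomeric ferritin subunits into the globular form of the protein. One example of such a region is located between amino acids 5 and 167 of the Helicobacter pylori ferritin protein. More specific regions are described in Zhang, Y. Self-Assembly in the Ferritin Nano-Cage Protein Super Family. 2011, Int. J. Mol. Sci., 12, 5406-5421, which is incorporated herein by reference in its entirety.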
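In one embodiment, the HA protein is joined to at least 50, at least 100 or at least 150 amino acids from ferritin, wherein the protein construct is capable of forming a nanoparticle. In one embodiment, the HA protein is joined to at least 50, at least 100 or at least 150 amino acids from SEQ ID NO:2 or SEQ ID NO:5, wherein the protein construct is capable of forming a nanoparticle. In one embodiment, the HA protein is joined to a protein comprising an amino acid sequence at least 85%, at least 90% or at least 95% identical to the sequence of ferritin, wherein the protein construct is capable of forming a nanoparticle. In one embodiment, the HA protein is joined to a protein comprising an amino acid sequence at least 85%, at least 90% or at least 95% identical to SEQ ID NO:2 or SEQ ID NO:5, wherein the protein construct is capable of forming a nanoparticle.
In one embodiment, the monomeric subunit is lumazine synthase. In one embodiment, the HA protein is joined to at least 50, at least 100 or at least 150 amino acids from lumazine synthase, wherein the protein construct is capable of forming a nanoparticle. Thus, in one embodiment, the HA protein is joined to a protein at least 85%, at least 90% or at least 95% identical to lumazine synthase, wherein the protein construct is capable of forming a nanoparticle.
As used herein, a nanoparticle of the present invention refers to a three-dimensional particle formed by self-assembly of protein constructs (fusion proteins) of the present invention. Nanoparticles of the present invention are generally spheroid in shape, although other shapes are not excluded, and are generally from about 20 nm to about 100 nm in diameter. Nanoparticles of the present invention may, but need not, contain molecules other than the protein constructs from which they are formed, such as proteins, lipids, carbohydrates and the like.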
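Protein constructs of the present invention can be made using recombinant technology to join together portions of HA proteins, linkers and monomeric subunits. In this way, protein constructs can be produced that comprise only those sequences necessary to produce nanoparticle vaccines. Thus, one embodiment of the present invention is a protein construct (also referred to as a fusion protein) comprising a first amino acid sequence from the stem region of an influenza virus HA protein and a second amino acid sequence from the stem region of an influenza virus HA protein, the first and second amino acid sequences being covalently joined by a linker sequence,
wherein the first amino acid sequence comprises at least 20 contiguous amino acid residues from the amino acid sequence upstream of the amino-terminal end of the head region sequence;
wherein the second amino acid sequence comprises at least 20 contiguous amino acid residues from the amino acid sequence downstream of the carboxy-terminal end of the head region sequence; and,
wherein the first or second amino acid sequence is joined to at least a portion of a monomeric subunit domain such that the protein construct is capable of forming a nanoparticle.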
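The elements recited above (an upstream stem sequence, a linker replacing the head, a downstream stem sequence, and a monomeric subunit) can be pictured as a string-assembly step with a few sanity checks. The sketch below is purely illustrative: the function names are hypothetical, placing the subunit at the carboxy-terminal end is an assumption made for this example (the text only requires that the first or second sequence be joined to the subunit), and the check for a shared run of five residues is a simplified reading of the "less than 5 contiguous amino acids from the head region" condition. It does not predict whether a real construct would fold or assemble.

```python
def shares_head_run(linker: str, head: str, run_length: int = 5) -> bool:
    """True if the linker contains `run_length` or more contiguous head residues."""
    return any(linker[i:i + run_length] in head
               for i in range(len(linker) - run_length + 1))


def build_stem_construct(upstream: str, downstream: str, linker: str,
                         subunit: str, head: str, min_flank: int = 20) -> str:
    """Assemble upstream + linker + downstream + subunit (assumed order)."""
    if len(upstream) < min_flank or len(downstream) < min_flank:
        raise ValueError("each stem segment needs at least "
                         f"{min_flank} contiguous residues")
    if shares_head_run(linker, head):
        raise ValueError("linker contains 5 or more contiguous head residues")
    return upstream + linker + downstream + subunit
```

A call with two 20-residue placeholder stem segments, a three-residue glycine/serine linker and a placeholder subunit returns the concatenated sequence, while a linker that reproduces a five-residue stretch of the head is rejected.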
在一個實(shí)施方案中,第一氨基酸序列來自選自下組的病毒的HA蛋白的莖區(qū):A型流感病毒,B型流感病毒和C型流感病毒。在一個實(shí)施方案中,第一氨基酸序列來自選自下組的病毒的HA蛋白的莖區(qū):H1流感病毒,H2流感病毒,流感H3病毒,流感H4病毒,流感H5病毒,流感H6病毒,H7流感病毒,H8流感病毒,H9流感病毒,H10流感病毒,H11流感病毒,H12流感病毒,H13流感病毒,H14流感病毒,H15流感病毒,H16流感病毒,H17流感病毒,和H18流感病毒。在一個實(shí)施方案中,第一氨基酸序列來自選自下組的病毒的HA蛋白的莖區(qū):流感A/新喀里多尼亞/20/1999(1999NC,H1),A/加利福尼亞/04/2009(2009CA,H1),A/新加坡/1/1957(1957Sing,H2),A/香港/1/1968(1968HK,H3),A/布里斯班/10/2007(2007Bris,H3),A/印度尼西亞/05/2005(2005Indo,H5),B/佛羅里達(dá)/4/2006(2006Flo,B),A/珀斯/16/2009(2009Per,H3),A/布里斯班/59/2007(2007Bris,H1),B/布里斯班/60/2008(2008Bris,B)。在一個實(shí)施方案中,第一氨基酸序列來自HA蛋白的莖區(qū),所述HA蛋白具有與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的氨基酸序列:SEQ ID NO:8,SEQ ID NO:,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。在一個實(shí)施方案中,第一氨基酸序列來自HA蛋白的莖區(qū),所述HA蛋白包含選自下組的序列:SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。在一個實(shí)施方案中,HA蛋白包含來自蛋白質(zhì)的至少一個免疫原性部分,所述蛋白質(zhì)包含與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的氨基酸序列:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:150,SEQ ID NO:152,SEQ ID NO:154,SEQ ID NO:156,SEQ ID NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400。在一個實(shí)施方案中,HA蛋白包含來自蛋白質(zhì)的至少一個免疫原性部分,所述蛋白質(zhì)包含選自下組的氨基酸序列:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:150,SEQ ID NO:152,SEQ ID NO:154,SEQ ID 
NO:156,SEQ ID NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400。
在一個實(shí)施方案中,第二氨基酸序列來自選自下組的病毒的HA蛋白的莖區(qū):A型流感病毒,B型流感病毒和C型流感病毒。在一個實(shí)施方案中,第二氨基酸序列來自選自下組的病毒的HA蛋白的莖區(qū):H1流感病毒,H2流感病毒,流感H3病毒,流感H4病毒,流感H5病毒,流感H6病毒,H7流感病毒,H8流感病毒,H9流感病毒,H10流感病毒,H11流感病毒,H12流感病毒,H13流感病毒,H14流感病毒,H15流感病毒,H16流感病毒,H17流感病毒和H18流感病毒。在一個實(shí)施方案中,第二氨基酸序列來自選自下組的病毒的HA蛋白的莖區(qū):流感A/新喀里多尼亞/20/1999(1999NC,H1),A/加利福尼亞/04/2009(2009CA,H1),A/新加坡/1/1957(1957Sing,H2),A/香港/1/1968(1968HK,H3),A/布里斯班/10/2007(2007Bris,H3),A/印度尼西亞/05/2005(2005Indo,H5),B/佛羅里達(dá)/4/2006(2006Flo,B),A/珀斯/16/2009(2009Per,H3),A/布里斯班/59/2007(2007Bris,H1),B/布里斯班/60/2008(2008Bris,B)。在一個實(shí)施方案中,第二氨基酸序列來自HA蛋白的莖區(qū),所述HA蛋白具有與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的氨基酸序列:SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。在一個實(shí)施方案中,第二氨基酸序列來自HA蛋白的莖區(qū),所述HA蛋白包含選自下組的序列:SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。在一個實(shí)施方案中,HA蛋白包含來自蛋白質(zhì)的至少一個免疫原性部分,所述蛋白質(zhì)包含與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的氨基酸序列:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:150,SEQ ID NO:152,SEQ ID NO:154,SEQ ID NO:156,SEQ ID NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400。在一個實(shí)施方案中,HA蛋白包含來自蛋白質(zhì)的至少一個免疫原性部分,所述蛋白質(zhì)包含選自下組的氨基酸序列:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:150,SEQ ID NO:152,SEQ ID NO:154,SEQ ID NO:156,SEQ ID 
NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400。
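As noted above, the first amino acid sequence comprises at least 20 contiguous amino acid residues from the amino acid sequence upstream of the amino-terminal end of the head region sequence. According to the present invention, the term upstream refers to the entirety of the amino acid sequence linked to the amino-terminal end of the first amino acid residue of the head region. In one embodiment, the amino-terminal end of the head region is located at the amino acid residue corresponding to Cys59 of the HA protein (SEQ ID NO:8) of influenza A New Caledonia/20/1999 (H1). Thus, in one embodiment, the first amino acid sequence comprises at least 20 contiguous amino acid residues from the region of an HA protein corresponding to amino acid residues 1-58 of influenza A New Caledonia/20/1999 (H1) (SEQ ID NO:8). In one embodiment, the first amino acid sequence comprises at least 20 contiguous amino acid residues from a sequence at least 85%, at least 90%, at least 95% or at least 97% identical to a sequence selected from the group consisting of SEQ ID NO:20, SEQ ID NO:35, SEQ ID NO:50 and SEQ ID NO:65. In one embodiment, the first amino acid sequence comprises at least 20 contiguous amino acid residues from a sequence selected from SEQ ID NO:20, SEQ ID NO:35, SEQ ID NO:50 and SEQ ID NO:65.
In one embodiment, the first amino acid sequence comprises at least 40 contiguous amino acid residues from the amino acid region of an HA protein corresponding to amino acid residues 1-58 of influenza A New Caledonia/20/1999 (H1) (SEQ ID NO:8). In one embodiment, the first amino acid sequence comprises at least 40 contiguous amino acid residues from a sequence at least 85%, at least 90%, at least 95% or at least 97% identical to a sequence selected from the group consisting of SEQ ID NO:20, SEQ ID NO:35, SEQ ID NO:50 and SEQ ID NO:65. In one embodiment, the first amino acid sequence comprises at least 40 contiguous amino acid residues from a sequence selected from SEQ ID NO:20, SEQ ID NO:35, SEQ ID NO:50 and SEQ ID NO:65.
In one embodiment, the first amino acid sequence comprises a sequence at least 85%, at least 90%, at least 95% or at least 97% identical to a sequence selected from the group consisting of SEQ ID NO:20, SEQ ID NO:35, SEQ ID NO:50 and SEQ ID NO:65. In one embodiment, the first amino acid sequence comprises a sequence selected from SEQ ID NO:20, SEQ ID NO:35, SEQ ID NO:50 and SEQ ID NO:65.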
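The two recurring criteria in these embodiments, a minimum run of contiguous residues taken from a reference sequence and a minimum percent identity to a reference sequence, can be illustrated with a small sketch. This is only an illustration under stated assumptions: the function names are hypothetical, the sequences are toy strings rather than any SEQ ID NO, and identity is computed without gaps over equal-length strings, which is a simplification of the alignment-based identity normally used in practice.

```python
def percent_identity(a: str, b: str) -> float:
    """Ungapped percent identity between two equal-length sequences.

    Assumption: the sequences are already aligned and of equal length;
    real identity calculations usually rely on a pairwise alignment.
    """
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def has_contiguous_fragment(candidate: str, reference: str, min_len: int) -> bool:
    """True if candidate contains at least min_len contiguous residues of reference."""
    if min_len <= 0:
        raise ValueError("min_len must be positive")
    for start in range(len(reference) - min_len + 1):
        if reference[start:start + min_len] in candidate:
            return True
    return False


# Toy reference (hypothetical, 40 residues); NOT a sequence from the disclosure.
TOY_REFERENCE = "ACDEFGHIKLMNPQRSTVWY" * 2
# Toy candidate carrying a 25-residue stretch of the reference.
TOY_CANDIDATE = "GG" + TOY_REFERENCE[5:30] + "SS"
```

With these toy inputs, `has_contiguous_fragment(TOY_CANDIDATE, TOY_REFERENCE, 20)` holds while the same call with `min_len=40` does not, mirroring the "at least 20" and "at least 40" thresholds in the text.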
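In one embodiment, the second amino acid sequence is from the stem region of the HA protein of a virus selected from the group consisting of influenza A virus, influenza B virus and influenza C virus. In one embodiment, the second amino acid sequence is from the stem region of the HA protein of a virus selected from the group consisting of H1 influenza virus, H2 influenza virus, H3 influenza virus, H4 influenza virus, H5 influenza virus, H6 influenza virus, H7 influenza virus, H8 influenza virus, H9 influenza virus, H10 influenza virus, H11 influenza virus, H12 influenza virus, H13 influenza virus, H14 influenza virus, H15 influenza virus, H16 influenza virus, H17 influenza virus and H18 influenza virus. In one embodiment, the second amino acid sequence is from the stem region of the HA protein of a virus selected from the group consisting of influenza A/New Caledonia/20/1999 (1999 NC, H1), A/California/04/2009 (2009 CA, H1), A/Singapore/1/1957 (1957 Sing, H2), A/Hong Kong/1/1968 (1968 HK, H3), A/Brisbane/10/2007 (2007 Bris, H3), A/Indonesia/05/2005 (2005 Indo, H5), B/Florida/4/2006 (2006 Flo, B), A/Perth/16/2009 (2009 Per, H3), A/Brisbane/59/2007 (2007 Bris, H1) and B/Brisbane/60/2008 (2008 Bris, B). In one embodiment, the second amino acid sequence is from the stem region of an HA protein having an amino acid sequence at least 85%, at least 90%, at least 95% or at least 97% identical to a sequence selected from the group consisting of SEQ ID NO:8, SEQ ID NO:11, SEQ ID NO:14 and SEQ ID NO:17. In one embodiment, the second amino acid sequence is from the stem region of an HA protein comprising a sequence selected from SEQ ID NO:8, SEQ ID NO:11, SEQ ID NO:14 and SEQ ID NO:17.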
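As noted above, the second amino acid sequence comprises at least 20 contiguous amino acid residues from the amino acid sequence downstream of the carboxyl-terminal end of the head region sequence. According to the present invention, the term downstream refers to the entire amino acid sequence linked to the carboxyl-terminal amino acid residue of the head region. In one embodiment, the carboxyl-terminal end of the head region is located at the amino acid position corresponding to Cys291 of the HA protein (SEQ ID NO:8) of influenza A New Caledonia/20/1999 (H1). Thus, in one embodiment, the second amino acid sequence comprises at least 20 contiguous amino acids from the amino acid region of an HA protein corresponding to amino acid residues 292-517 of influenza A New Caledonia/20/1999 (H1) (SEQ ID NO:8). In one embodiment, the second amino acid sequence comprises at least 20 contiguous amino acids from the amino acid region of an HA protein corresponding to amino acid residues 328-517 of influenza A New Caledonia/20/1999 (H1) (SEQ ID NO:8). In one embodiment, the second amino acid sequence comprises at least 20 contiguous amino acid residues from a sequence at least 85%, at least 90%, at least 95% or at least 97% identical to a sequence selected from the group consisting of SEQ ID NO:23, SEQ ID NO:26, SEQ ID NO:29, SEQ ID NO:32, SEQ ID NO:38, SEQ ID NO:41, SEQ ID NO:44, SEQ ID NO:47, SEQ ID NO:53, SEQ ID NO:56, SEQ ID NO:59, SEQ ID NO:62, SEQ ID NO:68, SEQ ID NO:71, SEQ ID NO:74 and SEQ ID NO:77. In one embodiment, the second amino acid sequence comprises at least 20 contiguous amino acid residues from a sequence selected from the group consisting of SEQ ID NO:23, SEQ ID NO:26, SEQ ID NO:29, SEQ ID NO:32, SEQ ID NO:38, SEQ ID NO:41, SEQ ID NO:44, SEQ ID NO:47, SEQ ID NO:53, SEQ ID NO:56, SEQ ID NO:59, SEQ ID NO:62, SEQ ID NO:68, SEQ ID NO:71, SEQ ID NO:74 and SEQ ID NO:77.
In one embodiment, the second amino acid sequence comprises at least 40, at least 60, at least 75, at least 100 or at least 150 contiguous amino acids from the amino acid sequence downstream of the carboxyl-terminal end of the head region sequence. In one embodiment, the second amino acid sequence comprises at least 40, at least 60, at least 75, at least 100 or at least 150 contiguous amino acids from the amino acid region of an HA protein corresponding to amino acid residues 292-517 of influenza A New Caledonia/20/1999 (H1) (SEQ ID NO:8). In one embodiment, the second amino acid sequence comprises at least 40, at least 60, at least 75, at least 100 or at least 150 contiguous amino acids from the amino acid region of an HA protein corresponding to amino acid residues 328-517 of influenza A New Caledonia/20/1999 (H1) (SEQ ID NO:8). In one embodiment, the second amino acid sequence comprises at least 40, at least 60, at least 75, at least 100 or at least 150 contiguous amino acids from a sequence at least 85%, at least 90%, at least 95% or at least 97% identical to a sequence selected from the group consisting of SEQ ID NO:23, SEQ ID NO:26, SEQ ID NO:29, SEQ ID NO:32, SEQ ID NO:38, SEQ ID NO:41, SEQ ID NO:44, SEQ ID NO:47, SEQ ID NO:53, SEQ ID NO:56, SEQ ID NO:59, SEQ ID NO:62, SEQ ID NO:68, SEQ ID NO:71, SEQ ID NO:74 and SEQ ID NO:77. In one embodiment, the second amino acid sequence comprises at least 40, at least 60, at least 75, at least 100 or at least 150 contiguous amino acids from a sequence selected from the group consisting of SEQ ID NO:23, SEQ ID NO:26, SEQ ID NO:29, SEQ ID NO:32, SEQ ID NO:38, SEQ ID NO:41, SEQ ID NO:44, SEQ ID NO:47, SEQ ID NO:53, SEQ ID NO:56, SEQ ID NO:59, SEQ ID NO:62, SEQ ID NO:68, SEQ ID NO:71, SEQ ID NO:74 and SEQ ID NO:77. In one embodiment, the second amino acid sequence comprises an amino acid sequence at least 85%, at least 90%, at least 95% or at least 97% identical to a sequence selected from the group consisting of SEQ ID NO:23, SEQ ID NO:26, SEQ ID NO:29, SEQ ID NO:32, SEQ ID NO:38, SEQ ID NO:41, SEQ ID NO:44, SEQ ID NO:47, SEQ ID NO:53, SEQ ID NO:56, SEQ ID NO:59, SEQ ID NO:62, SEQ ID NO:68, SEQ ID NO:71, SEQ ID NO:74 and SEQ ID NO:77. In one embodiment, the second amino acid sequence comprises a sequence selected from the group consisting of SEQ ID NO:23, SEQ ID NO:26, SEQ ID NO:29, SEQ ID NO:32, SEQ ID NO:38, SEQ ID NO:41, SEQ ID NO:44, SEQ ID NO:47, SEQ ID NO:53, SEQ ID NO:56, SEQ ID NO:59, SEQ ID NO:62, SEQ ID NO:68, SEQ ID NO:71, SEQ ID NO:74 and SEQ ID NO:77.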
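As described above, the first and second amino acid sequences of the protein construct may be joined by a linker sequence. Any linker sequence may be used, provided that the linker sequence contains fewer than 5 contiguous amino acid residues from the head region of the HA protein, and provided that the first and second amino acid sequences are able to form the desired conformation. In one embodiment, the linker sequence is less than 10 amino acids, less than 7 amino acids or less than 5 amino acids in length. In one embodiment, the linker sequence comprises glycine and serine. In one embodiment, the linker sequence joins the carboxyl-terminal end of the first amino acid sequence to the amino-terminal end of the second amino acid sequence. In one embodiment, the linker sequence joins the carboxyl-terminal end of the second amino acid sequence to the amino-terminal end of the first amino acid sequence.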
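The linker constraints stated above (a length limit, and fewer than 5 contiguous residues shared with the head region) lend themselves to a simple check. The following is a minimal sketch, assuming one-letter amino acid strings; the function names, default thresholds and the toy head-region string are hypothetical and serve only to illustrate the two conditions.

```python
def longest_shared_run(linker: str, head_region: str) -> int:
    """Length of the longest contiguous stretch of linker that also occurs in head_region."""
    best = 0
    for i in range(len(linker)):
        for j in range(i + 1, len(linker) + 1):
            if linker[i:j] in head_region:
                best = max(best, j - i)
    return best


def linker_ok(linker: str, head_region: str,
              max_len: int = 10, max_head_run: int = 5) -> bool:
    """Check the two conditions from the text: linker length below max_len and
    fewer than max_head_run contiguous head-region residues in the linker."""
    if not linker:
        return False
    if len(linker) >= max_len:
        return False
    return longest_shared_run(linker, head_region) < max_head_run


# Toy head-region string (hypothetical); NOT taken from any SEQ ID NO.
TOY_HEAD = "CKLRGVAPLHLGKCNIAGW"
```

Under these assumptions a short glycine-serine linker such as `GSG` passes, whereas a linker that copies six head-region residues, or one of twelve residues, does not.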
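As described above, the first or second amino acid sequence of the protein construct is joined to at least a portion of a monomeric subunit protein, such that the protein construct is capable of forming a nanoparticle. In one embodiment, the at least a portion of the monomeric subunit protein is joined to the second amino acid sequence. In a preferred embodiment, the at least a portion of the monomeric subunit protein is joined to the carboxyl-terminal end of the second amino acid sequence. In one embodiment, the portion comprises at least 50, at least 100 or at least 150 amino acids from the monomeric subunit. In one embodiment, the monomeric subunit is ferritin. In one embodiment, the monomeric subunit is lumazine synthase. In one embodiment, the portion comprises at least 50, at least 100 or at least 150 amino acids from SEQ ID NO:2, SEQ ID NO:5 or SEQ ID NO:194. In one embodiment, the monomeric subunit comprises a sequence at least 85% identical, at least 90% identical or at least 95% identical to SEQ ID NO:2, SEQ ID NO:5 or SEQ ID NO:194. In one embodiment, the monomeric subunit comprises a sequence selected from SEQ ID NO:2, SEQ ID NO:5 and SEQ ID NO:194.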
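The inventors have found that modifications of the influenza HA sequences of the protein constructs described above result in improved stability of the protein construct. For example, the inventors have found that deletion from the HA protein of the amino acid region corresponding to amino acids N403-W435 of the HA protein (SEQ ID NO:8) of influenza A New Caledonia/20/1999 (H1) results in a more stable protein construct. When this region is deleted, the amino acid sequences flanking the region may be joined together directly, or they may be joined by a linker sequence such as, for example, glycine-serine-glycine. Thus, in one embodiment, the second amino acid sequence comprises a polypeptide sequence at least 85%, at least 90% or at least 95% identical to at least 100 contiguous amino acid residues from the amino acid sequence downstream of the carboxyl-terminal end of the head region sequence, wherein the polypeptide sequence lacks the region corresponding to SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135 or SEQ ID NO:136 of the HA protein (SEQ ID NO:8) from influenza A/New Caledonia 1999. In one embodiment, the second amino acid sequence comprises at least 100 contiguous amino acid residues from the amino acid sequence downstream of the carboxyl-terminal end of the head region sequence, wherein the polypeptide sequence lacks the region corresponding to SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135 or SEQ ID NO:136 of the HA protein (SEQ ID NO:8) of influenza A/New Caledonia 1999.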
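The loop deletion described above, with the flanking sequences either joined directly or bridged by a short linker such as glycine-serine-glycine, amounts to a simple string operation on the residue sequence. The sketch below assumes 1-based, inclusive residue numbering (so the region N403-W435 would be given as start 403 and end 435 in the reference numbering); the function name and the toy sequence are hypothetical, not part of the disclosure.

```python
def delete_region(seq: str, start: int, end: int, linker: str = "") -> str:
    """Remove residues start..end (1-based, inclusive) and join the flanks.

    An empty linker joins the flanks directly; a non-empty linker
    (for example "GSG") is placed between them.
    """
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("region must lie within the sequence")
    return seq[:start - 1] + linker + seq[end:]


# Toy sequence (hypothetical): flanks of A and C around a K-rich "loop".
TOY_SEQ = "AAAAKKKKKCCCC"
DIRECT = delete_region(TOY_SEQ, 5, 9)             # flanks joined directly
BRIDGED = delete_region(TOY_SEQ, 5, 9, "GSG")     # flanks joined by Gly-Ser-Gly
```

In a real construct the positions would first have to be mapped from the New Caledonia/20/1999 reference numbering onto the sequence at hand, which this sketch does not attempt.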
In one embodiment, the second amino acid sequence comprises a polypeptide sequence at least 85%, at least 90% or at least 95% identical to at least 100 contiguous amino acid residues from the amino acid sequence downstream of the carboxyl-terminal end of the head region sequence, wherein the polypeptide sequence lacks the region corresponding to SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139 or SEQ ID NO:140 of the HA protein (SEQ ID NO:10) of influenza A/California/4/2009. In one embodiment, the second amino acid sequence comprises at least 100 contiguous amino acid residues from the amino acid sequence downstream of the carboxyl-terminal end of the head region sequence, wherein the polypeptide sequence lacks the region corresponding to SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139 or SEQ ID NO:140 of the HA protein (SEQ ID NO:10) of influenza A/California/4/2009.
In one embodiment, the second amino acid sequence comprises an amino acid sequence at least 85%, at least 90% or at least 95% identical to at least 100 contiguous amino acid residues from the amino acid sequence downstream of the carboxyl-terminal end of the head region sequence, wherein the polypeptide sequence lacks the region corresponding to SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143 or SEQ ID NO:144 of the HA protein (SEQ ID NO:12) of influenza A/Singapore/1957.
In one embodiment, the second amino acid sequence comprises at least 100 contiguous amino acid residues from the amino acid sequence downstream of the carboxyl-terminal end of the head region sequence, wherein the polypeptide sequence lacks the region corresponding to SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147 or SEQ ID NO:148 of the HA protein (SEQ ID NO:16) of influenza A/Indonesia/05/2005 (H5).
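In one embodiment, the second amino acid sequence comprises a sequence at least 85%, at least 90% or at least 95% identical to 100 contiguous amino acids of SEQ ID NO:23, SEQ ID NO:26 or SEQ ID NO:29, wherein the 100 contiguous amino acids do not comprise a sequence selected from SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135 and SEQ ID NO:136. In one embodiment, the second amino acid sequence comprises 100 contiguous amino acids from SEQ ID NO:23, SEQ ID NO:26 or SEQ ID NO:29, wherein the 100 contiguous amino acids do not comprise a sequence selected from the group consisting of SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135 and SEQ ID NO:136.
In one embodiment, the second amino acid sequence comprises a sequence at least 85%, at least 90% or at least 95% identical to 100 contiguous amino acids of SEQ ID NO:38, SEQ ID NO:41 or SEQ ID NO:44, wherein the 100 contiguous amino acids do not comprise a sequence selected from SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139 and SEQ ID NO:140. In one embodiment, the second amino acid sequence comprises 100 contiguous amino acids from SEQ ID NO:38, SEQ ID NO:41 or SEQ ID NO:44, wherein the 100 contiguous amino acids do not comprise a sequence selected from the group consisting of SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139 and SEQ ID NO:140.
In one embodiment, the second amino acid sequence comprises a sequence at least 85%, at least 90% or at least 95% identical to 100 contiguous amino acids of SEQ ID NO:53, SEQ ID NO:56 or SEQ ID NO:59, wherein the 100 contiguous amino acids do not comprise a sequence selected from SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143 and SEQ ID NO:144. In one embodiment, the second amino acid sequence comprises 100 contiguous amino acids from SEQ ID NO:53, SEQ ID NO:56 or SEQ ID NO:59, wherein the 100 contiguous amino acids do not comprise a sequence selected from the group consisting of SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143 and SEQ ID NO:144.
In one embodiment, the second amino acid sequence comprises a sequence at least 85%, at least 90% or at least 95% identical to 100 contiguous amino acids of SEQ ID NO:68, SEQ ID NO:71 or SEQ ID NO:74, wherein the 100 contiguous amino acids do not comprise a sequence selected from SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147 and SEQ ID NO:148. In one embodiment, the second amino acid sequence comprises 100 contiguous amino acids from SEQ ID NO:68, SEQ ID NO:71 or SEQ ID NO:74, wherein the 100 contiguous amino acids do not comprise a sequence selected from the group consisting of SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147 and SEQ ID NO:148.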
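In one embodiment, the second amino acid sequence comprises a sequence at least 85%, at least 90% or at least 95% identical to 100 contiguous amino acids from a sequence selected from the group consisting of SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:32, SEQ ID NO:41, SEQ ID NO:44, SEQ ID NO:47, SEQ ID NO:56, SEQ ID NO:59, SEQ ID NO:62, SEQ ID NO:71 and SEQ ID NO:77. In one embodiment, the second amino acid sequence comprises at least 100 contiguous amino acids from a sequence selected from the group consisting of SEQ ID NO:26, SEQ ID NO:32, SEQ ID NO:41, SEQ ID NO:47, SEQ ID NO:56, SEQ ID NO:62, SEQ ID NO:71 and SEQ ID NO:77. In one embodiment, the second amino acid sequence comprises a sequence selected from the group consisting of SEQ ID NO:26, SEQ ID NO:32, SEQ ID NO:41, SEQ ID NO:47, SEQ ID NO:56, SEQ ID NO:62, SEQ ID NO:71, SEQ ID NO:74 and SEQ ID NO:77.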
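The inventors have also found that sequence alterations in the HA stem region sequence result in a more stable protein construct. For example, in the folded HA protein, the amino acid residues corresponding to K394 and E446 of influenza A New Caledonia/20/1999 (H1) (corresponding to K1 and E53 of SEQ ID NO:149) form a salt bridge, which helps stabilize the folded protein. The inventors have found that by substituting the lysine and glutamic acid residues with suitable amino acids, the interaction between the two amino acid residues can be strengthened, which improves the stability of the molecule and allows more extensive manipulation of it. Thus, one embodiment of the invention is a protein construct comprising a first amino acid sequence from the stem region of an influenza virus HA protein and a second amino acid sequence from the stem region of an influenza virus HA protein, the first and second amino acid sequences being covalently joined by a linker sequence,
wherein the first amino acid sequence comprises at least 20 contiguous amino acid residues from the amino acid sequence upstream of the amino-terminal end of the head region sequence,
wherein the second amino acid sequence comprises at least 60 contiguous amino acids from the amino acid sequence downstream of the carboxyl-terminal end of the head region sequence,
wherein the 60 contiguous amino acids comprise a polypeptide sequence corresponding to the sequence of SEQ ID NO:149 or SEQ ID NO:150 from A/New Caledonia/20/1999, and
wherein the amino acid residue in the polypeptide sequence corresponding to K1 of SEQ ID NO:149 or K1 of SEQ ID NO:150 is substituted with an amino acid other than lysine,
and the amino acid residue corresponding to E53 of SEQ ID NO:149 or E20 of SEQ ID NO:150 is substituted with an amino acid residue other than glutamic acid, such that the strength of the interaction between the substituted amino acid residues is greater than the strength of the interaction in the wild-type protein.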
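As noted above, the amino acid residues corresponding to K394 and E446 of influenza A New Caledonia/20/1999 (H1) form a salt bridge, which is one type of bond. It is known in the art that other types of bonds exist between amino acids, the strength of which varies with the type of bond. Examples of such bonds include, but are not limited to, hydrophobic bonds and hydrogen bonds, both of which are generally stronger than salt bridges. Thus, in one embodiment, the amino acid residue in the polypeptide corresponding to K1 of SEQ ID NO:149 or K1 of SEQ ID NO:150 and the amino acid residue in the polypeptide corresponding to E53 of SEQ ID NO:149 or E20 of SEQ ID NO:150 are altered such that they form a hydrogen bond in the final folded protein. In one embodiment, the amino acid residue in the polypeptide corresponding to K1 of SEQ ID NO:149 or K1 of SEQ ID NO:150 and the amino acid residue in the polypeptide corresponding to E53 of SEQ ID NO:149 or E20 of SEQ ID NO:150 are altered such that they form a hydrophobic bond in the final folded protein.
The amino acids corresponding to K1 of SEQ ID NO:149, K1 of SEQ ID NO:150, E53 of SEQ ID NO:149 or E20 of SEQ ID NO:150 may be substituted with any amino acid residue, provided that the resulting interaction between the two amino acids is stronger than the salt bridge in the unaltered protein. Examples of substitutions that increase the strength of the interaction between the amino acids corresponding to K394 and E446 of influenza A New Caledonia/20/1999 (H1) (K1 and E53 of SEQ ID NO:149) include, but are not limited to:
wherein the amino acid residue in the polypeptide sequence corresponding to K1 of SEQ ID NO:149 is substituted with methionine, and the amino acid residue corresponding to E53 of SEQ ID NO:149 is substituted with leucine;
wherein the amino acid residue in the polypeptide sequence corresponding to K1 of SEQ ID NO:149 is substituted with methionine, and the amino acid residue corresponding to E53 of SEQ ID NO:149 is substituted with methionine;
wherein the amino acid residue in the polypeptide sequence corresponding to K1 of SEQ ID NO:149 is substituted with leucine, and the amino acid residue corresponding to E53 of SEQ ID NO:149 is substituted with leucine;
wherein the amino acid residue in the polypeptide sequence corresponding to K1 of SEQ ID NO:149 is substituted with isoleucine, and the amino acid residue corresponding to E53 of SEQ ID NO:149 is substituted with isoleucine;
wherein the amino acid residue in the polypeptide sequence corresponding to K1 of SEQ ID NO:149 is substituted with leucine, and the amino acid residue corresponding to E53 of SEQ ID NO:149 is substituted with isoleucine;
wherein the amino acid residue in the polypeptide sequence corresponding to K1 of SEQ ID NO:149 is substituted with glutamine, and the amino acid residue corresponding to E53 of SEQ ID NO:149 is substituted with glutamine.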
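The paired substitutions listed above can be written down as a small lookup table and applied to a sequence by position. The sketch below is illustrative only: the pairs are those named in the list, but the function name, the 1-based position arguments and the 53-residue toy sequence (a lysine at position 1, a glutamic acid at position 53, alanine filler in between) are assumptions and do not reproduce SEQ ID NO:149.

```python
# Substitution pairs from the list above, as (replacement for K, replacement for E),
# in one-letter code: M = methionine, L = leucine, I = isoleucine, Q = glutamine.
SUBSTITUTION_PAIRS = [
    ("M", "L"),
    ("M", "M"),
    ("L", "L"),
    ("I", "I"),
    ("L", "I"),
    ("Q", "Q"),
]


def apply_pair(seq: str, k_pos: int, e_pos: int, new_k: str, new_e: str) -> str:
    """Replace the lysine at k_pos and the glutamic acid at e_pos (1-based)."""
    if seq[k_pos - 1] != "K":
        raise ValueError(f"expected K at position {k_pos}, found {seq[k_pos - 1]}")
    if seq[e_pos - 1] != "E":
        raise ValueError(f"expected E at position {e_pos}, found {seq[e_pos - 1]}")
    residues = list(seq)
    residues[k_pos - 1] = new_k
    residues[e_pos - 1] = new_e
    return "".join(residues)


# Toy 53-residue sequence (hypothetical filler), K at position 1 and E at position 53.
TOY_149 = "K" + "A" * 51 + "E"
VARIANTS = [apply_pair(TOY_149, 1, 53, k, e) for k, e in SUBSTITUTION_PAIRS]
```

The check on the original residues guards against applying a substitution at a position that has not been correctly mapped onto the reference numbering.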
在一個實(shí)施方案中,第一氨基酸序列來自選自下組的病毒的HA蛋白的莖區(qū):A型流感病毒,B型流感病毒和C型流感病毒。在一個實(shí)施方案中,第一氨基酸序列來自選自下組的病毒的HA蛋白的莖區(qū):H1流感病毒,H2流感病毒,流感H3病毒,流感H4病毒,流感H5病毒,流感H6病毒,H7流感病毒,H8流感病毒,H9流感病毒,H10流感病毒,H11流感病毒,H12流感病毒,H13流感病毒,H14流感病毒,H15流感病毒,H16流感病毒,H17流感病毒,和H18流感病毒。在一個實(shí)施方案中,第一氨基酸序列來自選自下組的病毒的HA蛋白的莖區(qū):流感A/新喀里多尼亞/20/1999(1999NC,H1),A/加利福尼亞/04/2009(2009CA,H1),A/新加坡/1/1957(1957Sing,H2),A/香港/1/1968(1968HK,H3),A/布里斯班/10/2007(2007Bris,H3),A/印度尼西亞/05/2005(2005Indo,H5),B/佛羅里達(dá)/4/2006(2006Flo,B),A/珀斯/16/2009(2009Per,H3),A/布里斯班/59/2007(2007Bris,H1),B/布里斯班/60/2008(2008Bris,B)。在一個實(shí)施方案中,第一氨基酸序列來自HA蛋白的莖區(qū),所述HA蛋白具有與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的氨基酸序列:SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。在一個實(shí)施方案中,第一氨基酸序列來自HA蛋白的莖區(qū),所述HA蛋白包含選自下組的序列:SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。
在一個實(shí)施方案中,第二氨基酸序列來自選自下組的病毒的HA蛋白的莖區(qū):A型流感病毒,B型流感病毒和C型流感病毒。在一個實(shí)施方案中,第二氨基酸序列來自選自下組的病毒的HA蛋白的莖區(qū):H1流感病毒,H2流感病毒,流感H3病毒,流感H4病毒,流感H5病毒,流感H6病毒,H7流感病毒,H8流感病毒,H9流感病毒,H10流感病毒,H11流感病毒,H12流感病毒,H13流感病毒,H14流感病毒,H15流感病毒,H16流感病毒,H17流感病毒,和H18流感病毒。在一個實(shí)施方案中,第二氨基酸序列來自來自選自下組的病毒的HA蛋白的莖區(qū):流感A/新喀里多尼亞/20/1999(1999NC,H1),A/加利福尼亞/04/2009(2009CA,H1),A/新加坡/1/1957(1957Sing,H2),A/香港/1/1968(1968HK,H3),A/布里斯班/10/2007(2007Bris,H3),A/印度尼西亞/05/2005(2005Indo,H5),B/佛羅里達(dá)/4/2006(2006Flo,B),A/珀斯/16/2009(2009Per,H3),A/布里斯班/59/2007(2007Bris,H1),B/布里斯班/60/2008(2008Bris,B)。在一個實(shí)施方案中,第二氨基酸序列來自HA蛋白的莖區(qū),所述HA蛋白具有與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的氨基酸序列。在一個實(shí)施方案中,第二氨基酸序列來自HA蛋白的莖區(qū),所述HA蛋白包含選自下組的序列:SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。
在一個實(shí)施方案中,第一氨基酸序列包含來自對應(yīng)于流感A新喀里多尼亞/20/1999(H1)(SEQ ID NO:8)的氨基酸殘基1-58的HA蛋白的區(qū)域的至少20個連續(xù)氨基酸殘基。在一個實(shí)施方案中,第一氨基酸序列包含與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的序列的至少20個連續(xù)氨基酸殘基:SEQ ID NO:20,SEQ ID NO:35,SEQ ID NO:50和SEQ ID NO:65。在一個實(shí)施方案中,第一氨基酸序列包含來自選自SEQ ID NO:20,SEQ ID NO:35,SEQ ID NO:50和SEQ ID NO:65的序列的至少20個連續(xù)氨基酸殘基。
在一個實(shí)施方案中,第一氨基酸序列包含來自HA蛋白的氨基酸區(qū)域的至少40個連續(xù)氨基酸殘基,所述氨基酸區(qū)域?qū)?yīng)于流感A新喀里多尼亞/20/1999(H1)(SEQ ID NO:8)的氨基酸殘基1-58。在一個實(shí)施方案中,第一氨基酸序列包含與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的序列的至少40個連續(xù)氨基酸殘基:SEQ ID NO:20,SEQ ID NO:35,SEQ ID NO:50和SEQ ID NO:65。在一個實(shí)施方案中,第一氨基酸序列包含來自選自SEQ ID NO:20,SEQ ID NO:35,SEQ ID NO:50和SEQ ID NO:65的序列的至少40個連續(xù)氨基酸殘基。在一個實(shí)施方案中,第一氨基酸序列包含與選自下組的序列至少85%,至少90%或至少95%相同的序列:SEQ ID NO:20,SEQ ID NO:35,SEQ ID NO:50和SEQ ID NO:65。在一個實(shí)施方案中,第一氨基酸序列包含選自SEQ ID NO:20,SEQ ID NO:35,SEQ ID NO:50和SEQ ID NO:65的序列。
在一個實(shí)施方案中,第二氨基酸序列來自選自A型流感病毒,B型流感病毒和C型流感病毒的病毒的HA蛋白的莖區(qū)。在一個實(shí)施方案中,第二氨基酸序列來自選自下組的病毒的HA蛋白的莖區(qū):H1流感病毒,H2流感病毒,H3流感病毒,H4流感病毒,H5流感病毒病毒,H6流感病毒,H7流感病毒,H8流感病毒,H9流感病毒,H10流感病毒,H11流感病毒,H12流感病毒,H13流感病毒,H14流感病毒,H15流感病毒病毒,H16流感病毒,H17流感病毒和H18流感病毒。在一個實(shí)施方案中,第二氨基酸序列來自選自下組的病毒的HA蛋白的莖區(qū):流感A/新喀里多尼亞/20/1999(1999NC,H1),A/加利福尼亞/04/2009(2009CA,H1),A/新加坡/1/1957(1957Sing,H2),A/香港/1/1968(1968HK,H3),A/布里斯班/10/2007(2007Bris,H3),A/印度尼西亞/05/2005(2005Indo,H5),B/佛羅里達(dá)/4/2006(2006Flo,B),A/珀斯/16/2009(2009Per,H3),A/布里斯班/59/2007(2007Bris,H1),B/布里斯班/60/2008(2008Bris,B)。在一個實(shí)施方案中,第二氨基酸序列來自HA蛋白的莖區(qū),所述HA蛋白具有與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的氨基酸序列:SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17。在一個實(shí)施方案中,第二氨基酸序列來自HA蛋白的莖區(qū),所述HA蛋白包含選自SEQ ID NO:8,SEQ ID NO:11,SEQ ID NO:14和SEQ ID NO:17的序列。
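In one embodiment, the at least 60 contiguous amino acids of the second amino acid sequence are from the amino acid region of an HA protein corresponding to amino acid residues 292-517 of influenza A New Caledonia/20/1999 (H1) (SEQ ID NO:8). In one embodiment, the at least 60 contiguous amino acids of the second amino acid sequence are from the amino acid region of an HA protein corresponding to amino acid residues 328-517 of influenza A New Caledonia/20/1999 (H1) (SEQ ID NO:8). In one embodiment, the at least 60 contiguous amino acids of the second amino acid sequence are from a sequence at least 85%, at least 90%, at least 95% or at least 97% identical to a sequence selected from the group consisting of SEQ ID NO:23, SEQ ID NO:26, SEQ ID NO:29, SEQ ID NO:32, SEQ ID NO:38, SEQ ID NO:41, SEQ ID NO:44, SEQ ID NO:47, SEQ ID NO:53, SEQ ID NO:56, SEQ ID NO:59, SEQ ID NO:62, SEQ ID NO:68, SEQ ID NO:71, SEQ ID NO:74 and SEQ ID NO:77. In one embodiment, the at least 60 contiguous amino acids of the second amino acid sequence are from a sequence selected from the group consisting of SEQ ID NO:23, SEQ ID NO:26, SEQ ID NO:29, SEQ ID NO:32, SEQ ID NO:38, SEQ ID NO:41, SEQ ID NO:44, SEQ ID NO:47, SEQ ID NO:53, SEQ ID NO:56, SEQ ID NO:59, SEQ ID NO:62, SEQ ID NO:68, SEQ ID NO:71, SEQ ID NO:74 and SEQ ID NO:77.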
在一個實(shí)施方案中,第二氨基酸序列包含來自頭部區(qū)序列的羧基端末端下游的氨基酸序列的至少75個,至少100個,至少150個或至少200個連續(xù)氨基酸,其中至少75個,至少100個,至少150個或至少200個連續(xù)氨基酸包含對應(yīng)于H1N1NC的SEQ ID NO:149或SEQ ID NO:150的序列的多肽序列,并且其中對應(yīng)于SEQ ID NO:149的K1或SEQ ID NO:150的K1的多肽序列中的氨基酸殘基,和多肽序列中對應(yīng)于SEQ ID NO:149的E53或SEQ ID NO:150的E20的氨基酸殘基已經(jīng)分別被除了賴氨酸和谷氨酸外的氨基酸取代,使得取代的氨基酸殘基之間的相互作用的強(qiáng)度大于野生型蛋白質(zhì)中的相互作用的強(qiáng)度。在一個實(shí)施方案中,第二氨基酸序列包含來自HA蛋白的氨基酸區(qū)域的至少75,至少100,至少150或至少200個連續(xù)氨基酸,所述HA蛋白對應(yīng)于流感A新喀里多尼亞/20/1999(H1)(SEQ ID NO:8)的氨基酸殘基292-517,其中所述至少75個,至少100個,至少150個或至少200個連續(xù)氨基酸包含對應(yīng)于H1N1NC的SEQ ID NO:149或SEQ ID NO:150的序列的多肽序列,并且其中對應(yīng)于SEQ ID NO:149的K1或SEQ ID NO:150的K1的多肽序列中的氨基酸殘基,和對應(yīng)于SEQ ID NO:149的E53或SEQ ID NO:150的E20的多肽序列中的氨基酸殘基分別被除了賴氨酸和谷氨酸之外的氨基酸取代,使得取代的氨基酸殘基之間的相互作用的強(qiáng)度大于強(qiáng)度的野生型蛋白中的相互作用。在一個實(shí)施方案中,第二氨基酸序列包含來自HA蛋白的氨基酸區(qū)域的至少75個,至少100個,至少150個或至少200個連續(xù)氨基酸,所述氨基酸區(qū)域?qū)?yīng)于流感A新喀里多尼亞/20/1999(H1)(SEQ ID NO:8)的氨基酸殘基328-517,其中至少75,至少100,至少150或至少200個連續(xù)氨基酸包含對應(yīng)于H1N1NC的SEQ ID NO:149或SEQ ID NO:150的序列的多肽序列,并且其中對應(yīng)于SEQ ID NO:149的K1或SEQ ID NO:150的K1的多肽序列中的氨基酸殘基,和對應(yīng)于SEQ ID NO:149的E53或SEQ ID NO:150的E20的多肽序列中的氨基酸殘基分別被除了賴氨酸和谷氨酸之外的氨基酸取代,使得取代的氨基酸殘基之間的相互作用的強(qiáng)度大于強(qiáng)度的野生型蛋白中的相互作用。在一個實(shí)施方案中,第二氨基酸序列包含來自與選自下組的序列至少85%,至少90%,至少95%或至少97%相同的序列的至少75,至少100,至少150或至少200個連續(xù)氨基酸:SEQ ID NO:23,SEQ ID NO:26,SEQ ID NO:29,SEQ ID NO:32,SEQ ID NO:38,SEQ ID NO:41,SEQ ID NO:44,SEQ ID NO:47,SEQ ID NO:53,SEQ ID NO:56,SEQ ID NO:59,SEQ ID NO:62,SEQ ID NO:68,SEQ ID NO:71,SEQ ID NO:74和SEQ ID NO:77,其中所述至少75個,至少100個,至少150個或至少200個連續(xù)氨基酸包含對應(yīng)于H1N1NC的SEQ ID NO:149或SEQ ID NO:150的序列的多肽序列,和其中對應(yīng)于SEQ ID NO:149的K1或SEQ ID NO:150的K1的多肽序列中的氨基酸殘基和對應(yīng)于SEQ ID NO:149的E53或SEQ ID NO:150的E20的多肽序列中的氨基酸殘基分別被除了賴氨酸和谷氨酸之外的氨基酸取代,使得取代的氨基酸殘基之間的相互作用的強(qiáng)度大于野生型蛋白質(zhì)中相互作用的強(qiáng)度。在一個實(shí)施方案中,第二氨基酸序列包含來自下組的至少75,至少100,至少150或至少200個連續(xù)氨基酸:SEQ ID NO:23,SEQ ID NO:26,SEQ ID NO:29,SEQ ID NO:32,SEQ ID NO:38,SEQ ID NO:41,SEQ ID NO:44,SEQ ID NO:47,SEQ ID NO:53,SEQ ID NO:56,SEQ ID 
NO:59,SEQ ID NO:62,SEQ ID NO:68,SEQ ID NO:71,SEQ ID NO:74和SEQ ID NO:77。其中所述至少75個,至少100個,至少150個或至少200個連續(xù)氨基酸包含對應(yīng)于H1N1NC的SEQ ID NO:149或SEQ ID NO:150的序列的多肽序列,并且其中對應(yīng)于SEQ ID NO:149的K1或SEQ ID NO:150的K1的多肽序列中的氨基酸殘基,并且多肽序列中對應(yīng)于SEQ ID NO:149的E53或SEQ ID NO:150的E20的氨基酸殘基已經(jīng)分別被除了賴氨酸和谷氨酸之外的氨基酸取代,使得取代的氨基酸殘基之間的相互作用的強(qiáng)度大于野生型蛋白質(zhì)中的相互作用的強(qiáng)度。
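Protein constructs containing the specified site-specific mutations can be used to prepare the nanoparticles of the invention by joining them to a monomeric subunit. Thus, in one embodiment, a protein construct containing the disclosed site-specific mutations (e.g., K1 of SEQ ID NO:149 or SEQ ID NO:150 and E53 of SEQ ID NO:149 or E20 of SEQ ID NO:150) is joined to at least a portion of a monomeric subunit protein, wherein the portion of the monomeric subunit protein is capable of directing self-assembly of the protein construct. In one embodiment, the at least a portion of the monomeric subunit protein is joined to the second amino acid sequence. In a preferred embodiment, the at least a portion of the monomeric subunit protein is joined to the carboxyl-terminal end of the second amino acid sequence. In one embodiment, the portion comprises at least 50, at least 100 or at least 150 amino acids from the monomeric subunit. In one embodiment, the monomeric subunit is ferritin. In one embodiment, the monomeric subunit is lumazine synthase. In one embodiment, the monomeric subunit comprises a sequence at least 85% identical, at least 90% identical or at least 95% identical to SEQ ID NO:2, SEQ ID NO:5 or SEQ ID NO:194. In one embodiment, the monomeric subunit comprises a sequence selected from SEQ ID NO:2, SEQ ID NO:5 and SEQ ID NO:194.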
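Although the modifications made to the HA proteins disclosed herein have been described as separate embodiments, it should be understood that all such modifications may be included in a single protein construct. For example, a protein construct may be prepared in which the first amino acid sequence is joined by a linker to the second amino acid sequence, wherein the second amino acid sequence comprises an amino acid sequence from the region downstream of the carboxyl-terminal end of the head region but lacks the internal loop sequence represented by SEQ ID NO:133-148, and wherein the amino acids in the second amino acid sequence corresponding to K1 of SEQ ID NO:149 or K1 of SEQ ID NO:150 and to E53 of SEQ ID NO:149 or E20 of SEQ ID NO:150 are substituted with amino acids other than lysine and glutamic acid, respectively, so as to increase the strength of the interaction between these amino acid residues in the folded protein. Thus, one embodiment of the invention is a protein construct comprising a first amino acid sequence from the stem region of an influenza virus HA protein and a second amino acid sequence from the stem region of an influenza virus HA protein, the first and second amino acid sequences being covalently joined by a linker sequence,
wherein the first amino acid sequence comprises at least 20 contiguous amino acid residues from the amino acid sequence upstream of the amino-terminal end of the head region sequence;
wherein the second amino acid sequence comprises a polypeptide sequence at least 85%, at least 90% or at least 95% identical to at least 100 contiguous amino acid residues from the amino acid sequence downstream of the carboxyl-terminal end of the head region sequence,
wherein the polypeptide sequence comprises a sequence corresponding to the sequence in influenza A New Caledonia/20/1999 (H1) represented by SEQ ID NO:150, the sequence in influenza A California/2009 (H1) represented by SEQ ID NO:152, the sequence in influenza A Singapore/1957 (H2) represented by SEQ ID NO:154 and the sequence in influenza A Indonesia/2005 (H5) represented by SEQ ID NO:156, and
wherein the amino acid residue in the polypeptide sequence corresponding to K1 of SEQ ID NO:150 has been substituted with an amino acid other than lysine, and the amino acid residue corresponding to E20 of SEQ ID NO:150 has been substituted with an amino acid other than glutamic acid.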
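In one embodiment, the polypeptide comprises at least 100 contiguous amino acids from the amino acid sequence downstream of the carboxyl-terminal end of the head region sequence. In one embodiment, the at least 100 contiguous amino acids comprise SEQ ID NO:150. In one embodiment, the at least 100 contiguous amino acids comprise SEQ ID NO:152. In one embodiment, the at least 100 contiguous amino acids comprise SEQ ID NO:154. In one embodiment, the at least 100 contiguous amino acids comprise SEQ ID NO:156. It should be understood that, in the above constructs, when the internal loop region is removed, the respective ends of the remaining HA protein may be joined together directly. In some cases, however, such direct joining may reduce the flexibility of the peptide backbone. Thus, in some cases it may be beneficial to replace the internal loop region with a linker sequence. As an example, if a six amino acid linker sequence is inserted into SEQ ID NO:150, the final sequence may appear as follows: VNSVIEKMGSGGSGTYNAELLVLL.
Thus, in one embodiment, the polypeptide sequence of the protein construct comprises SEQ ID NO:150 into which a short linker sequence is inserted. In one embodiment, the polypeptide sequence of the protein construct comprises SEQ ID NO:152 into which a short linker sequence is inserted. In one embodiment, the polypeptide sequence of the protein construct comprises SEQ ID NO:154 into which a short linker sequence is inserted. In one embodiment, the polypeptide sequence of the protein construct comprises SEQ ID NO:156 into which a short linker sequence is inserted. In one embodiment, the linker is made of serine and glycine residues. In one embodiment, the linker is less than 10 amino acids in length. In one embodiment, the linker is less than 5 amino acids in length. In one embodiment, the linker is less than 3 amino acids in length.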
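The example string VNSVIEKMGSGGSGTYNAELLVLL given in the text can be reproduced by inserting a six-residue glycine-serine linker between two flanking fragments. In the sketch below, the split into the flanks VNSVIEKM and TYNAELLVLL and the linker GSGGSG is our reading of the example string itself, not a statement about the actual content of SEQ ID NO:150; the function name and position argument are likewise hypothetical.

```python
def insert_linker(seq: str, after_pos: int, linker: str) -> str:
    """Insert linker after residue number after_pos (1-based) of seq."""
    if not (0 <= after_pos <= len(seq)):
        raise ValueError("insertion point must lie within the sequence")
    return seq[:after_pos] + linker + seq[after_pos:]


# Flanks obtained by removing the central GSGGSG from the example string
# (an assumption made for illustration only).
LEFT_FLANK = "VNSVIEKM"
RIGHT_FLANK = "TYNAELLVLL"
LINKED = insert_linker(LEFT_FLANK + RIGHT_FLANK, len(LEFT_FLANK), "GSGGSG")
```

Choosing a linker made only of glycine and serine matches the embodiment in which the linker consists of those residues.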
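Although the protein constructs described above can be used to produce nanoparticles capable of generating an immune response against one or more influenza viruses, in some embodiments it may be useful to engineer further mutations into the amino acid sequences of the proteins of the invention. For example, it may be useful to alter sites in the monomeric subunit protein, the trimerization domain or the linker sequence, such as enzyme recognition sites or glycosylation sites, in order to confer beneficial properties on the protein (e.g., solubility, half-life, masking of portions of the protein from immune surveillance). In this regard, it is known that the monomeric subunit of ferritin is not naturally glycosylated. However, it can be glycosylated if it is expressed as a secreted protein in mammalian or yeast cells. Thus, in one embodiment, a potential N-linked glycosylation site in the amino acid sequence from the monomeric ferritin subunit is mutated such that the mutated ferritin subunit sequence is no longer glycosylated at the mutated site. One such sequence of a mutated monomeric ferritin subunit is represented by SEQ ID NO:5.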
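The protein construct sequence may also be altered to include other useful mutations. For example, in some cases it may be desirable to block the generation of an immune response against certain amino acid sequences in the protein construct. This can be accomplished by adding a glycosylation site near the site to be blocked, such that the glycan sterically hinders the ability of the immune system to reach the blocked site. Thus, in one embodiment, the sequence of the protein construct has been altered to include one or more glycosylation sites. Examples of such sites include, but are not limited to, Asn-X-Ser, Asn-X-Thr and Asn-X-Cys. In some cases, a glycosylation site may be introduced into the linker sequence. Other examples of useful sites at which to introduce glycosylation sites include, but are not limited to, the amino acids corresponding to amino acids 45-47 or amino acids 370-372 from influenza A New Caledonia/20/1999 (H1). Methods of introducing glycosylation sites are known to those skilled in the art.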
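The glycosylation motifs named above (Asn-X-Ser, Asn-X-Thr and Asn-X-Cys) can be located in a sequence with a short pattern scan. The sketch below is only an illustration: the function name and the toy sequence are hypothetical, and the exclusion of proline at the X position is a commonly cited rule for N-linked sequons that is added here as an assumption rather than taken from the text.

```python
import re

# Lookahead pattern so that overlapping motifs are all reported.
# Assumption: X may be any residue except proline.
SEQUON = re.compile(r"(?=N[^P][STC])")


def find_sequons(seq: str) -> list[int]:
    """Return 1-based positions of the Asn in each N-X-S/T/C motif."""
    return [m.start() + 1 for m in SEQUON.finditer(seq)]


# Toy sequence (hypothetical): contains NGS, NPT (skipped), NAC and NNT.
TOY = "AANGSAANPTAANACNNT"
POSITIONS = find_sequons(TOY)
```

A scan of this kind can be run before and after an engineered change to confirm that a site was introduced at the intended position, or removed from it.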
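The disclosure herein demonstrates that mutations at specific positions in the HA or monomeric subunit proteins produce useful protein constructs and, consequently, nanoparticles of the invention. Examples of useful positions in the ferritin protein at which to introduce mutations include the amino acids corresponding to an amino acid position selected from the group consisting of amino acid position 18, amino acid position 20 and amino acid position 68 of SEQ ID NO:2. Examples of useful positions at which to introduce mutations include the amino acids in the HA protein corresponding to an amino acid position selected from the group consisting of amino acid position 36, amino acid position 45, amino acid position 47, amino acid position 49, amino acid position 339, amino acid position 340, amino acid position 341, amino acid position 342, amino acid position 361, amino acid position 372, amino acid position 394, amino acid position 402, amino acid position 437, amino acid position 438, amino acid position 445, amino acid position 446, amino acid position 448, amino acid position 449, amino acid position 450 and amino acid position 452 of the HA protein (SEQ ID NO:8) of influenza A New Caledonia/20/1999 (H1). Some examples of such mutations are listed in Table 2. In one embodiment, the HA portion of the protein construct comprises isoleucine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 36 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises asparagine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 45 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises threonine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 47 of the HA protein of influenza A New Caledonia/20/1999. In one embodiment, the HA portion of the protein construct comprises tryptophan, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 49 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises glutamine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 339 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises arginine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 340 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises glutamic acid, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 341 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises threonine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 342 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises threonine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 372 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises methionine, isoleucine, leucine, glutamine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 394 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises asparagine, threonine, glycine, asparagine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 402 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises aspartic acid, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 437 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises leucine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 438 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises leucine, methionine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 445 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises isoleucine, leucine, methionine, glutamine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 446 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises glutamine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 448 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises tryptophan, phenylalanine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 449 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises alanine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 450 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises leucine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 452 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct lacks one or more amino acids corresponding to amino acids 515-517 of the HA protein of influenza A New Caledonia/20/1999 (H1).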
One embodiment of the invention is a protein construct comprising an amino acid sequence at least 85%, at least 90%, at least 95% or at least 97% identical to a sequence selected from the group consisting of SEQ ID NO:80, SEQ ID NO:83, SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:92, SEQ ID NO:95, SEQ ID NO:98, SEQ ID NO:101, SEQ ID NO:104, SEQ ID NO:158, SEQ ID NO:164, SEQ ID NO:170, SEQ ID NO:176, SEQ ID NO:182, SEQ ID NO:188, SEQ ID NO:197, SEQ ID NO:203, SEQ ID NO:209, SEQ ID NO:214, SEQ ID NO:217, SEQ ID NO:222, SEQ ID NO:224, SEQ ID NO:226, SEQ ID NO:228, SEQ ID NO:230, SEQ ID NO:232, SEQ ID NO:235, SEQ ID NO:240, SEQ ID NO:242, SEQ ID NO:244, SEQ ID NO:246, SEQ ID NO:248, SEQ ID NO:250, SEQ ID NO:252, SEQ ID NO:254, SEQ ID NO:256, SEQ ID NO:258, SEQ ID NO:261, SEQ ID NO:268, SEQ ID NO:275, SEQ ID NO:282, SEQ ID NO:289, SEQ ID NO:296, SEQ ID NO:303, SEQ ID NO:310, SEQ ID NO:317, SEQ ID NO:324, SEQ ID NO:331, SEQ ID NO:338, SEQ ID NO:345, SEQ ID NO:352, SEQ ID NO:359, SEQ ID NO:366, SEQ ID NO:373, SEQ ID NO:380, SEQ ID NO:387, SEQ ID NO:394 and SEQ ID NO:400.
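In one embodiment, the amino acid residue corresponding to K1 of SEQ ID NO:149 or K1 of SEQ ID NO:150 is substituted with an amino acid other than lysine, and the amino acid residue corresponding to E53 of SEQ ID NO:149 or E20 of SEQ ID NO:150 is substituted with an amino acid other than glutamic acid, such that the strength of the interaction between the substituted amino acids in the folded protein is increased.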
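One embodiment of the invention is a protein construct comprising a sequence selected from the group consisting of SEQ ID NO:80, SEQ ID NO:83, SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:92, SEQ ID NO:95, SEQ ID NO:98, SEQ ID NO:101, SEQ ID NO:104, SEQ ID NO:158, SEQ ID NO:164, SEQ ID NO:170, SEQ ID NO:176, SEQ ID NO:182, SEQ ID NO:188, SEQ ID NO:197, SEQ ID NO:203, SEQ ID NO:209, SEQ ID NO:214, SEQ ID NO:217, SEQ ID NO:222, SEQ ID NO:224, SEQ ID NO:226, SEQ ID NO:228, SEQ ID NO:230, SEQ ID NO:232, SEQ ID NO:235, SEQ ID NO:240, SEQ ID NO:242, SEQ ID NO:244, SEQ ID NO:246, SEQ ID NO:248, SEQ ID NO:250, SEQ ID NO:252, SEQ ID NO:254, SEQ ID NO:256, SEQ ID NO:258, SEQ ID NO:261, SEQ ID NO:268, SEQ ID NO:275, SEQ ID NO:282, SEQ ID NO:289, SEQ ID NO:296, SEQ ID NO:303, SEQ ID NO:310, SEQ ID NO:317, SEQ ID NO:324, SEQ ID NO:331, SEQ ID NO:338, SEQ ID NO:345, SEQ ID NO:352, SEQ ID NO:359, SEQ ID NO:366, SEQ ID NO:373, SEQ ID NO:380, SEQ ID NO:387, SEQ ID NO:394 and SEQ ID NO:400. In one embodiment, when joined to a monomeric subunit protein, the protein construct is capable of forming a nanoparticle, wherein the nanoparticle is capable of eliciting an immune response against an influenza virus.
As previously described, protein constructs made from influenza HA proteins can be used to prepare the nanoparticles of the invention by joining them to a monomeric subunit. Thus, in one embodiment, the protein construct is joined to at least a portion of a monomeric subunit protein, wherein the portion of the monomeric subunit protein is capable of directing self-assembly of the protein construct. In one embodiment, the at least a portion of the monomeric subunit protein is joined to the second amino acid sequence. In a preferred embodiment, the at least a portion of the monomeric subunit protein is joined to the carboxyl-terminal end of the second amino acid sequence. In one embodiment, the portion comprises at least 50, at least 100 or at least 150 amino acids from the monomeric subunit. In one embodiment, the monomeric subunit is ferritin. In one embodiment, the monomeric subunit is lumazine synthase. In one embodiment, the monomeric subunit comprises a sequence at least 85% identical, at least 90% identical or at least 95% identical to SEQ ID NO:2, SEQ ID NO:5 or SEQ ID NO:194. In one embodiment, the monomeric subunit comprises a sequence selected from SEQ ID NO:2, SEQ ID NO:5 and SEQ ID NO:194.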
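One embodiment of the invention is a protein construct comprising an amino acid sequence at least 85%, at least 90%, at least 95% or at least 97% identical to a sequence selected from the group consisting of SEQ ID NO:107, SEQ ID NO:110, SEQ ID NO:113, SEQ ID NO:116, SEQ ID NO:119, SEQ ID NO:122, SEQ ID NO:125, SEQ ID NO:128, SEQ ID NO:131, SEQ ID NO:161, SEQ ID NO:167, SEQ ID NO:173, SEQ ID NO:179, SEQ ID NO:185, SEQ ID NO:191, SEQ ID NO:200, SEQ ID NO:206, SEQ ID NO:212, SEQ ID NO:215, SEQ ID NO:220, SEQ ID NO:223, SEQ ID NO:225, SEQ ID NO:227, SEQ ID NO:229, SEQ ID NO:231, SEQ ID NO:233, SEQ ID NO:238, SEQ ID NO:241, SEQ ID NO:243, SEQ ID NO:245, SEQ ID NO:247, SEQ ID NO:249, SEQ ID NO:251, SEQ ID NO:253, SEQ ID NO:255, SEQ ID NO:257, SEQ ID NO:259, SEQ ID NO:264, SEQ ID NO:271, SEQ ID NO:278, SEQ ID NO:285, SEQ ID NO:292, SEQ ID NO:299, SEQ ID NO:306, SEQ ID NO:313, SEQ ID NO:320, SEQ ID NO:327, SEQ ID NO:334, SEQ ID NO:341, SEQ ID NO:348, SEQ ID NO:355, SEQ ID NO:362, SEQ ID NO:369, SEQ ID NO:376, SEQ ID NO:383, SEQ ID NO:390 and SEQ ID NO:397. In one embodiment, the amino acid residue corresponding to K1 of SEQ ID NO:149 or K1 of SEQ ID NO:150 is substituted with an amino acid other than lysine, and the amino acid residue corresponding to E53 of SEQ ID NO:149 or E20 of SEQ ID NO:150 is substituted with an amino acid other than glutamic acid, such that the strength of the interaction between the substituted amino acids in the folded protein is increased. In one embodiment, the HA portion of the protein construct comprises isoleucine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 36 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises asparagine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 45 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises threonine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 47 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises tryptophan, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 49 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises glutamine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 339 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises arginine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 340 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises glutamic acid, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 341 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises threonine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 342 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises threonine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 372 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises methionine, isoleucine, leucine, glutamine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 394 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises asparagine, threonine, glycine, asparagine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 402 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises aspartic acid, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 437 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises leucine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 438 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises leucine, methionine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 445 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises isoleucine, leucine, methionine, glutamine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 446 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises glutamine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 448 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises tryptophan, phenylalanine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 449 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises alanine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 450 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises leucine, or an amino acid residue having properties similar thereto, at the position corresponding to amino acid position 452 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct lacks one or more amino acids corresponding to amino acids 515-517 of the HA protein of influenza A New Caledonia/20/1999 (H1).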
One embodiment of the invention is a protein construct comprising a sequence selected from the group consisting of SEQ ID NO:107, SEQ ID NO:110, SEQ ID NO:113, SEQ ID NO:116, SEQ ID NO:119, SEQ ID NO:122, SEQ ID NO:125, SEQ ID NO:128, SEQ ID NO:131, SEQ ID NO:161, SEQ ID NO:167, SEQ ID NO:173, SEQ ID NO:179, SEQ ID NO:185, SEQ ID NO:191, SEQ ID NO:200, SEQ ID NO:206, SEQ ID NO:212, SEQ ID NO:215, SEQ ID NO:220, SEQ ID NO:223, SEQ ID NO:225, SEQ ID NO:227, SEQ ID NO:229, SEQ ID NO:231, SEQ ID NO:233, SEQ ID NO:238, SEQ ID NO:241, SEQ ID NO:243, SEQ ID NO:245, SEQ ID NO:247, SEQ ID NO:249, SEQ ID NO:251, SEQ ID NO:253, SEQ ID NO:255, SEQ ID NO:257, SEQ ID NO:259, SEQ ID NO:264, SEQ ID NO:271, SEQ ID NO:278, SEQ ID NO:285, SEQ ID NO:292, SEQ ID NO:299, SEQ ID NO:306, SEQ ID NO:313, SEQ ID NO:320, SEQ ID NO:327, SEQ ID NO:334, SEQ ID NO:341, SEQ ID NO:348, SEQ ID NO:355, SEQ ID NO:362, SEQ ID NO:369, SEQ ID NO:376, SEQ ID NO:383, SEQ ID NO:390 and SEQ ID NO:397.
One embodiment of the present invention is a protein construct encoded by a nucleic acid molecule comprising a nucleic acid sequence at least 85%, at least 90%, at least 95% or at least 97% identical to a sequence selected from the group consisting of SEQ ID NO:266, SEQ ID NO:273, SEQ ID NO:280, SEQ ID NO:287, SEQ ID NO:294, SEQ ID NO:301, SEQ ID NO:308, SEQ ID NO:315, SEQ ID NO:322, SEQ ID NO:329, SEQ ID NO:336, SEQ ID NO:343, SEQ ID NO:350, SEQ ID NO:357, SEQ ID NO:364, SEQ ID NO:371, SEQ ID NO:378, SEQ ID NO:385, SEQ ID NO:392 and SEQ ID NO:399. One embodiment of the present invention is a protein construct encoded by a nucleic acid molecule comprising a nucleic acid sequence selected from the group consisting of SEQ ID NO:266, SEQ ID NO:273, SEQ ID NO:280, SEQ ID NO:287, SEQ ID NO:294, SEQ ID NO:301, SEQ ID NO:308, SEQ ID NO:315, SEQ ID NO:322, SEQ ID NO:329, SEQ ID NO:336, SEQ ID NO:343, SEQ ID NO:350, SEQ ID NO:357, SEQ ID NO:364, SEQ ID NO:371, SEQ ID NO:378, SEQ ID NO:385, SEQ ID NO:392 and SEQ ID NO:399.
本發(fā)明的蛋白質(zhì)和蛋白質(zhì)構(gòu)建體由本發(fā)明的核酸分子編碼。此外,它們由本發(fā)明的核酸構(gòu)建體表達(dá)。如本文所使用的,核酸構(gòu)建體是重組表達(dá)載體,即連接到編碼蛋白質(zhì)的核酸分子的載體,使得當(dāng)將核酸構(gòu)建體施用于,例如,受試者或器官,組織或細(xì)胞時,核酸分子可以實(shí)現(xiàn)蛋白質(zhì)表達(dá)。載體還能夠?qū)⒑怂岱肿愚D(zhuǎn)運(yùn)到環(huán)境內(nèi)的細(xì)胞,例如但不限于生物體,組織或細(xì)胞培養(yǎng)物。本公開的核酸構(gòu)建體通過人類干預(yù)產(chǎn)生。核酸構(gòu)建體可以是DNA,RNA或其變體。載體可以是DNA質(zhì)粒,病毒載體或其它載體。在一個實(shí)施方案中,載體可以是巨細(xì)胞病毒(CMV),逆轉(zhuǎn)錄病毒,腺病毒,腺伴隨病毒,皰疹病毒,牛痘病毒,脊髓灰質(zhì)炎病毒,辛德畢斯病毒或任何其它DNA或RNA病毒載體。在一個實(shí)施方案中,載體可以是假型化的慢病毒或逆轉(zhuǎn)錄病毒載體。在一個實(shí)施方案中,載體可以是DNA質(zhì)粒。在一個實(shí)施方案中,載體可以是包含能夠進(jìn)行核酸分子遞送和表達(dá)的病毒組分和質(zhì)粒組分的DNA質(zhì)粒。構(gòu)建本公開的核酸構(gòu)建體的方法是公知的。參見,例如,Molecular Cloning:a Laboratory Manual,3rd edition,Sambrook et al.2001Cold Spring Harbor Laboratory Press,以及Current Protocols in Molecular Biology,Ausubel et al.eds.,John Wiley&Sons,1994。在一個實(shí)施方案中,載體是DNA質(zhì)粒,如CMV/R質(zhì)粒,如CMV/R或CMV/R 8KB(本文也稱為CMV/R 8kb)。本文提供了CMV/R和CMV/R 8kb的實(shí)例。CMV/R也在2006年8月22日授權(quán)的US 7,094,598B2中描述。
如本文中使用的,核酸分子包含編碼本發(fā)明的蛋白質(zhì)構(gòu)建體的核酸序列。核酸分子可以重組地,合成地或通過重組和合成程序的組合產(chǎn)生。本公開的核酸分子可以具有野生型核酸序列或密碼子修飾的核酸序列,以例如摻入由人翻譯系統(tǒng)更好識別的密碼子。在一個實(shí)施方案中,核酸分子可以被遺傳工程化以引入或消除編碼不同氨基酸的密碼子,如引入編碼N-連接的糖基化位點(diǎn)的密碼子。產(chǎn)生本公開核酸分子的方法是本領(lǐng)域已知的,特別是一旦知道核酸序列。應(yīng)當(dāng)理解,核酸構(gòu)建體可以包含一個核酸分子或多于一個核酸分子。還應(yīng)當(dāng)理解,核酸分子可以編碼一種蛋白質(zhì)或多于一種蛋白質(zhì)。
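The paragraph above mentions engineering codons so that the encoded protein gains (or loses) an N-linked glycosylation site. As a minimal sketch, and assuming the standard sequon definition (Asn, any residue except Pro, then Ser or Thr), the following hypothetical helper lists where such sites occur in a protein sequence. The function name and the toy peptide are illustrative assumptions and are not taken from this disclosure.

```python
import re

# N-linked glycosylation sequon: Asn, any residue except Pro, then Ser or Thr.
# The zero-width lookahead lets overlapping sequons (e.g. "NNSS") all be found.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(protein: str) -> list[int]:
    """Return the 1-based position of the Asn in every N-X-S/T sequon."""
    return [match.start() + 1 for match in SEQUON.finditer(protein.upper())]


if __name__ == "__main__":
    # Hypothetical toy peptide, not a sequence from the sequence listing.
    toy_peptide = "MKNGTLLNPSAANVSQ"
    print(find_sequons(toy_peptide))
```

Comparing the output for a wild-type sequence and an engineered sequence would show which sites a set of codon changes introduced or removed.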
一個實(shí)施方案是編碼流感HA蛋白的核酸分子,所述流感HA蛋白包含與選自SEQ ID NO:150,SEQ ID NO:152,SEQ ID NO:154和SEQ ID NO:156的序列至少80%,至少85%,至少90%,至少92%,至少94%,至少96%,至少98%或至少99%相同的氨基酸序列。一個實(shí)施方案是編碼流感HA蛋白的核酸分子,所述流感HA蛋白包含選自SEQ ID NO:150,SEQ ID NO:152,SEQ ID NO:154和SEQ ID NO:156的氨基酸序列。
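Many embodiments here are defined by a percent-identity threshold (for example "at least 80%" or "at least 99%" identical to a reference sequence). The sketch below shows one simple way such a figure could be computed from two sequences that have already been aligned. It is an illustration under stated assumptions: the denominator is the number of aligned columns, gaps are written as "-", and the function name is hypothetical. Alignment tools differ in how they define the denominator, and the source does not specify one.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the columns of a pairwise alignment.

    Both inputs must be the same length, with '-' marking gaps.
    Columns where both sequences have a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b) if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("alignment has no usable columns")
    # A column counts as identical only if both residues match and are not gaps.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    # Toy aligned peptides (assumed for illustration): one mismatch in ten columns.
    print(percent_identity("ACDEFGHIKL", "ACDEFGHIKV"))
```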
在一個實(shí)施方案中,核酸分子編碼流感HA蛋白,其包含與選自下組的序列至少80%,至少85%,至少90%,至少92%,至少94%,至少96%,至少98%或至少99%相同的氨基酸序列:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400。在一個實(shí)施方案中,核酸分子編碼流感HA蛋白,其包含選自下組的氨基酸:SEQ ID NO:80,SEQ ID NO:83,SEQ ID NO:86,SEQ ID NO:89,SEQ ID NO:92,SEQ ID NO:95,SEQ ID NO:98,SEQ ID NO:101,SEQ ID NO:104,SEQ ID NO:158,SEQ ID NO:164,SEQ ID NO:170,SEQ ID NO:176,SEQ ID NO:182,SEQ ID NO:188,SEQ ID NO:197,SEQ ID NO:203,SEQ ID NO:209,SEQ ID NO:214,SEQ ID NO:217,SEQ ID NO:222,SEQ ID NO:224,SEQ ID NO:226,SEQ ID NO:228,SEQ ID NO:230,SEQ ID NO:232,SEQ ID NO:235,SEQ ID NO:240,SEQ ID NO:242,SEQ ID NO:244,SEQ ID NO:246,SEQ ID NO:248,SEQ ID NO:250,SEQ ID NO:252,SEQ ID NO:254,SEQ ID NO:256,SEQ ID NO:258,SEQ ID NO:261,SEQ ID NO:268,SEQ ID NO:275,SEQ ID NO:282,SEQ ID NO:289,SEQ ID NO:296,SEQ ID NO:303,SEQ ID NO:310,SEQ ID NO:317,SEQ ID NO:324,SEQ ID NO:331,SEQ ID NO:338,SEQ ID NO:345,SEQ ID NO:352,SEQ ID NO:359,SEQ ID NO:366,SEQ ID NO:373,SEQ ID NO:380,SEQ ID NO:387,SEQ ID NO:394和SEQ ID NO:400。
本發(fā)明的一個實(shí)施方案是核酸分子,所述核酸分子包含與選自下組的序列至少80%,至少85%,至少90%,至少92%,至少94%,至少96%,至少98%或至少99%相同的核酸序列:SEQ ID NO:79,SEQ ID NO:82,SEQ ID NO:85,SEQ ID NO:88,SEQ ID NO:91,SEQ ID NO:94,SEQ ID NO:97,SEQ ID NO:100,SEQ ID NO:103,SEQ ID NO:157,SEQ ID NO:163,SEQ ID NO:169,SEQ ID NO:175,SEQ ID NO:181,SEQ ID NO:187,SEQ ID NO:196,SEQ ID NO:202,SEQ ID NO:208,SEQ ID NO:216,SEQ ID NO:234,SEQ ID NO:260,SEQ ID NO:267,SEQ ID NO:274,SEQ ID NO:281,SEQ ID NO:288,SEQ ID NO:295,SEQ ID NO:302,SEQ ID NO:309,SEQ ID NO:316,SEQ ID NO:323,SEQ ID NO:330,SEQ ID NO:337,SEQ ID NO:344,SEQ ID NO:351,SEQ ID NO:358,SEQ ID NO:365,SEQ ID NO:372,SEQ ID NO:379,SEQ ID NO:386和SEQ ID NO:393。本發(fā)明的一個實(shí)施方案是核酸分子,其包含選自下組的核酸:SEQ ID NO:79,SEQ ID NO:82,SEQ ID NO:85,SEQ ID NO:88,SEQ ID NO:91,SEQ ID NO:94,SEQ ID NO:97,SEQ ID NO:100,SEQ ID NO:103,SEQ ID NO:157,SEQ ID NO:163,SEQ ID NO:169,SEQ ID NO:175,SEQ ID NO:181,SEQ ID NO:187,SEQ ID NO:196,SEQ ID NO:202,SEQ ID NO:208,SEQ ID NO:216,SEQ ID NO:234,SEQ ID NO:260,SEQ ID NO:267,SEQ ID NO:274,SEQ ID NO:281,SEQ ID NO:288,SEQ ID NO:295,SEQ ID NO:302,SEQ ID NO:309,SEQ ID NO:316,SEQ ID NO:323,SEQ ID NO:330,SEQ ID NO:337,SEQ ID NO:344,SEQ ID NO:351,SEQ ID NO:358,SEQ ID NO:365,SEQ ID NO:372,SEQ ID NO:379,SEQ ID NO:386和SEQ ID NO:393。
優(yōu)選的核酸分子是編碼單體亞基,HA蛋白和/或包含與流感HA蛋白連接的單體亞基蛋白的蛋白質(zhì)構(gòu)建體的那些。因此,本發(fā)明的一個實(shí)施方案是包含編碼蛋白質(zhì)的核酸序列的核酸分子,所述蛋白質(zhì)包含與流感HA蛋白連接的鐵蛋白蛋白的單體亞基。在一個實(shí)施方案中,單體亞基包含與選自SEQ ID NO:2和SEQ ID NO:5的序列至少80%,至少85%,至少90%,至少92%,至少94%,至少96%,至少98%或至少99%序列相同的氨基酸序列。在一個實(shí)施方案中,單體亞基包含選自SEQ ID NO:2和SEQ ID NO:5的氨基酸序列。
本發(fā)明的一個實(shí)施方案是包含編碼蛋白質(zhì)的核酸序列的核酸分子,所述蛋白質(zhì)包含與流感HA蛋白連接的2,4-二氧四氫蝶啶合酶單體亞基。在一個實(shí)施方案中,單體亞基包含與SEQ ID NO:194至少80%,至少85%,至少90%,至少92%,至少94%,至少96%,至少98%或至少99%相同的氨基酸序列。在一個實(shí)施方案中,單體亞基包含SEQ ID NO:194。
One embodiment of the present invention is a nucleic acid molecule encoding a protein construct comprising an amino acid sequence at least 80%, at least 85%, at least 90%, at least 92%, at least 94%, at least 96%, at least 98% or at least 99% identical to a sequence selected from the group consisting of SEQ ID NO:107, SEQ ID NO:110, SEQ ID NO:113, SEQ ID NO:116, SEQ ID NO:119, SEQ ID NO:122, SEQ ID NO:125, SEQ ID NO:128, SEQ ID NO:131, SEQ ID NO:161, SEQ ID NO:167, SEQ ID NO:173, SEQ ID NO:179, SEQ ID NO:185, SEQ ID NO:191, SEQ ID NO:200, SEQ ID NO:206, SEQ ID NO:212, SEQ ID NO:215, SEQ ID NO:220, SEQ ID NO:223, SEQ ID NO:225, SEQ ID NO:227, SEQ ID NO:229, SEQ ID NO:231, SEQ ID NO:233, SEQ ID NO:238, SEQ ID NO:241, SEQ ID NO:243, SEQ ID NO:245, SEQ ID NO:247, SEQ ID NO:249, SEQ ID NO:251, SEQ ID NO:253, SEQ ID NO:255, SEQ ID NO:257, SEQ ID NO:259, SEQ ID NO:264, SEQ ID NO:271, SEQ ID NO:278, SEQ ID NO:285, SEQ ID NO:292, SEQ ID NO:299, SEQ ID NO:306, SEQ ID NO:313, SEQ ID NO:320, SEQ ID NO:327, SEQ ID NO:334, SEQ ID NO:341, SEQ ID NO:348, SEQ ID NO:355, SEQ ID NO:362, SEQ ID NO:369, SEQ ID NO:376, SEQ ID NO:383, SEQ ID NO:390 and SEQ ID NO:397. One embodiment of the present invention is a nucleic acid molecule encoding a protein construct comprising a sequence selected from the group consisting of SEQ ID NO:107, SEQ ID NO:110, SEQ ID NO:113, SEQ ID NO:116, SEQ ID NO:119, SEQ ID NO:122, SEQ ID NO:125, SEQ ID NO:128, SEQ ID NO:131, SEQ ID NO:161, SEQ ID NO:167, SEQ ID NO:173, SEQ ID NO:179, SEQ ID NO:185, SEQ ID NO:191, SEQ ID NO:200, SEQ ID NO:206, SEQ ID NO:212, SEQ ID NO:215, SEQ ID NO:220, SEQ ID NO:223, SEQ ID NO:225, SEQ ID NO:227, SEQ ID NO:229, SEQ ID NO:231, SEQ ID NO:233, SEQ ID NO:238, SEQ ID NO:241, SEQ ID NO:243, SEQ ID NO:245, SEQ ID NO:247, SEQ ID NO:249, SEQ ID NO:251, SEQ ID NO:253, SEQ ID NO:255, SEQ ID NO:257, SEQ ID NO:259, SEQ ID NO:264, SEQ ID NO:271, SEQ ID NO:278, SEQ ID NO:285, SEQ ID NO:292, SEQ ID NO:299, SEQ ID NO:306, SEQ ID NO:313, SEQ ID NO:320, SEQ ID NO:327, SEQ ID NO:334, SEQ ID NO:341, SEQ ID NO:348, SEQ ID NO:355, SEQ ID NO:362, SEQ ID NO:369, SEQ ID NO:376, SEQ ID NO:383, SEQ ID NO:390 and SEQ ID NO:397.
本發(fā)明的一個實(shí)施方案是包含核酸序列的核酸分子,所述核酸序列與選自下組的序列至少80%,至少85%,至少90%,至少92%,至少94%,至少96%,至少98%或至少99%相同:SEQ ID NO:106,SEQ ID NO:109,SEQ ID NO:112,SEQ ID NO:115,SEQ ID NO:118,SEQ ID NO:121,SEQ ID NO:124,SEQ ID NO:127,SEQ ID NO:130,SEQ ID NO:160,SEQ ID NO:166,SEQ ID NO:172,SEQ ID NO:178,SEQ ID NO:184,SEQ ID NO:190,SEQ ID NO:199,SEQ ID NO:205,SEQ ID NO:211,SEQ ID NO:219,SEQ ID NO:237,SEQ ID NO:263,SEQ ID NO:270,SEQ ID NO:277,SEQ ID NO:284,SEQ ID NO:291,SEQ ID NO:298,SEQ ID NO:305,SEQ ID NO:312,SEQ ID NO:319,SEQ ID NO:326,SEQ ID NO:333,SEQ ID NO:340,SEQ ID NO:347,SEQ ID NO:354,SEQ ID NO:361,SEQ ID NO:368,SEQ ID NO:375,SEQ ID NO:382,SEQ ID NO:389和SEQ ID NO:396。本發(fā)明的一個實(shí)施方案是包含選自下組的核酸序列的核酸分子:SEQ ID NO:106,SEQ ID NO:109,SEQ ID NO:112,SEQ ID NO:115,SEQ ID NO:118,SEQ ID NO:121,SEQ ID NO:124,SEQ ID NO:127,SEQ ID NO:130,SEQ ID NO:160,SEQ ID NO:166,SEQ ID NO:172,SEQ ID NO:178,SEQ ID NO:184,SEQ ID NO:190,SEQ ID NO:199,SEQ ID NO:205,SEQ ID NO:211,SEQ ID NO:219,SEQ ID NO:237,SEQ ID NO:263,SEQ ID NO:270,SEQ ID NO:277,SEQ ID NO:284,SEQ ID NO:291,SEQ ID NO:298,SEQ ID NO:305,SEQ ID NO:312,SEQ ID NO:319,SEQ ID NO:326,SEQ ID NO:333,SEQ ID NO:340,SEQ ID NO:347,SEQ ID NO:354,SEQ ID NO:361,SEQ ID NO:368,SEQ ID NO:375,SEQ ID NO:382,SEQ ID NO:389和SEQ ID NO:396。
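The present invention also encompasses expression systems for producing the protein constructs of the present invention. In one embodiment, a nucleic acid molecule of the present invention is operably linked to a promoter. As used herein, operably linked means that the protein encoded by the linked nucleic acid molecule can be expressed when the linked promoter is activated. Promoters useful for practicing the present invention are known to those skilled in the art. One embodiment of the present invention is a nucleic acid molecule comprising a nucleic acid sequence at least 85%, at least 90%, at least 95% or at least 97% identical to a sequence selected from the group consisting of SEQ ID NO:266, SEQ ID NO:273, SEQ ID NO:280, SEQ ID NO:287, SEQ ID NO:294, SEQ ID NO:301, SEQ ID NO:308, SEQ ID NO:315, SEQ ID NO:322, SEQ ID NO:329, SEQ ID NO:336, SEQ ID NO:343, SEQ ID NO:350, SEQ ID NO:357, SEQ ID NO:364, SEQ ID NO:371, SEQ ID NO:378, SEQ ID NO:385, SEQ ID NO:392 and SEQ ID NO:399. One embodiment of the present invention is a nucleic acid molecule comprising a nucleic acid sequence selected from the group consisting of SEQ ID NO:266, SEQ ID NO:273, SEQ ID NO:280, SEQ ID NO:287, SEQ ID NO:294, SEQ ID NO:301, SEQ ID NO:308, SEQ ID NO:315, SEQ ID NO:322, SEQ ID NO:329, SEQ ID NO:336, SEQ ID NO:343, SEQ ID NO:350, SEQ ID NO:357, SEQ ID NO:364, SEQ ID NO:371, SEQ ID NO:378, SEQ ID NO:385, SEQ ID NO:392 and SEQ ID NO:399.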
本發(fā)明的一個實(shí)施方案是包含本發(fā)明的核酸分子的重組細(xì)胞。本發(fā)明的一個實(shí)施方案是包含本發(fā)明的核酸分子的重組病毒。
如上指示,本發(fā)明的蛋白質(zhì)構(gòu)建體的重組生產(chǎn)可以使用本領(lǐng)域目前已知的任何合適的常規(guī)重組技術(shù)來完成。例如,可以如下在大腸桿菌(E.coli)中進(jìn)行編碼融合蛋白的核酸分子的產(chǎn)生,即使用編碼合適的單體亞基蛋白(如幽門螺桿菌鐵蛋白單體亞基)的核酸分子,并且將其融合到編碼本文公開的合適的流感蛋白的核酸分子。然后,可以將構(gòu)建體轉(zhuǎn)化成蛋白質(zhì)表達(dá)細(xì)胞,培養(yǎng)至合適的大小,并誘導(dǎo)產(chǎn)生融合蛋白。
如已經(jīng)描述的,因?yàn)楸景l(fā)明的蛋白質(zhì)構(gòu)建體包含單體亞基蛋白質(zhì),所以它們可以自組裝。根據(jù)本發(fā)明,由此類自組裝產(chǎn)生的超分子被稱為HA表達(dá)性、基于單體亞基的納米顆粒。為了便于討論,將HA表達(dá)性、基于單體亞基的納米顆粒簡稱為納米顆粒(np)。本發(fā)明的納米顆粒具有與制備它們的單體蛋白質(zhì)的納米顆粒相似的結(jié)構(gòu)特征。例如,關(guān)于鐵蛋白,基于鐵蛋白的納米顆粒含有24個亞基并且具有432對稱性。在本發(fā)明的納米顆粒的情況下,亞基是包含與流感HA蛋白連接的單體亞基(例如,鐵蛋白,2,4-二氧四氫蝶啶合酶等)的蛋白質(zhì)構(gòu)建體。此類納米顆粒在其表面上以HA三聚體展示HA蛋白的至少一部分。在此類構(gòu)建中,HA三聚體對于免疫系統(tǒng)是可及的,并且因此可以引發(fā)免疫應(yīng)答。因此,本發(fā)明的一個實(shí)施方案是包含本發(fā)明的蛋白構(gòu)建體的納米顆粒,其中所述蛋白構(gòu)建體包含來自與單體亞基蛋白連接的HA蛋白的莖區(qū)的氨基酸。在一個實(shí)施方案中,納米顆粒在其表面上以HA三聚體展示HA蛋白。在一個實(shí)施方案中,流感HA蛋白能夠引發(fā)針對流感病毒的保護(hù)性抗體。
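The paragraph above states that a ferritin-based nanoparticle contains 24 subunits and displays HA as trimers. The arithmetic can be sketched as follows, assuming (as a simplification not stated in the source) that every subunit carries exactly one HA fusion and that every HA chain ends up in a trimer. The function name is hypothetical.

```python
def spikes_displayed(subunits: int, chains_per_spike: int = 3) -> int:
    """Number of HA spikes on a particle if each subunit carries one HA chain."""
    if subunits <= 0 or chains_per_spike <= 0:
        raise ValueError("counts must be positive")
    if subunits % chains_per_spike != 0:
        raise ValueError("subunits do not divide evenly into spikes")
    return subunits // chains_per_spike


if __name__ == "__main__":
    FERRITIN_SUBUNITS = 24  # subunit count stated in the text for ferritin
    print(spikes_displayed(FERRITIN_SUBUNITS))
```

Under these assumptions a 24-subunit ferritin particle would carry 24 / 3 = 8 HA trimers.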
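In one embodiment of the present invention, a nanoparticle comprises a protein construct comprising a first amino acid sequence from the stem region of an influenza virus HA protein and a second amino acid sequence from the stem region of an influenza virus HA protein, the first and second amino acid sequences being covalently linked by a linker sequence,
wherein the first amino acid sequence comprises at least 20 contiguous amino acid residues from the amino acid sequence upstream of the amino-terminal end of the head region sequence;
wherein the second amino acid sequence comprises at least 20 contiguous amino acid residues from the amino acid sequence downstream of the carboxyl-terminal end of the head region sequence; and
wherein the first or second amino acid sequence is joined to at least a portion of a monomeric subunit domain.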
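The construct described above can be pictured as a simple string operation on the HA sequence: keep the stretch upstream of the head region, keep the stretch downstream of it, and join the two with a linker. The sketch below is a minimal illustration of that idea and not the method used to design the disclosed sequences. The function name, the head-region coordinates, the linker and the toy sequence are all assumptions, and the sketch keeps the entire flanking stretches rather than selecting sub-ranges.

```python
def build_stem_construct(
    ha: str, head_start: int, head_end: int, linker: str, min_flank: int = 20
) -> str:
    """Replace the head region of an HA sequence with a linker.

    head_start and head_end are 1-based, inclusive positions of the head region.
    Both flanking stem stretches must contain at least min_flank residues.
    """
    if not 1 <= head_start <= head_end <= len(ha):
        raise ValueError("head region coordinates are out of range")
    first = ha[: head_start - 1]  # stem residues upstream of the head region
    second = ha[head_end:]        # stem residues downstream of the head region
    if len(first) < min_flank or len(second) < min_flank:
        raise ValueError(f"each stem flank needs at least {min_flank} residues")
    return first + linker + second


if __name__ == "__main__":
    # Toy sequence: 25 'A' (upstream stem), 30 'H' (head), 40 'C' (downstream stem).
    toy_ha = "A" * 25 + "H" * 30 + "C" * 40
    print(build_stem_construct(toy_ha, head_start=26, head_end=55, linker="GSG"))
```

Fusion to the monomeric subunit (for example ferritin) would then be a further concatenation at one end of the returned sequence.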
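In one embodiment of the present invention, a nanoparticle comprises a protein construct comprising a first amino acid sequence from the stem region of an influenza virus HA protein and a second amino acid sequence from the stem region of an influenza virus HA protein, the first and second amino acid sequences being covalently linked by a linker sequence,
wherein the first amino acid sequence comprises at least 20 contiguous amino acid residues from the amino acid sequence upstream of the amino-terminal end of the head region sequence;
wherein the second amino acid sequence comprises a polypeptide sequence at least 85%, at least 90% or at least 95% identical to at least 100 contiguous amino acid residues from the amino acid sequence downstream of the carboxyl-terminal end of the head region sequence,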
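wherein the polypeptide sequence comprises a sequence corresponding to the sequence in influenza A New Caledonia/20/1999 (H1) represented by SEQ ID NO:150, the sequence in influenza A California/2009 represented by SEQ ID NO:152, the sequence in influenza A Singapore/1957 (H2) represented by SEQ ID NO:154 and the sequence in influenza A Indonesia/2005 (H5) represented by SEQ ID NO:156; and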
wherein the first or second amino acid sequence is joined to a monomeric subunit protein.
在另一個實(shí)施方案中,多肽序列中對應(yīng)于SEQ ID NO:150的K1的氨基酸殘基已被除賴氨酸以外的氨基酸取代,并且對應(yīng)于SEQ ID NO:150的E20的氨基酸殘基已經(jīng)被除谷氨酸以外的氨基酸取代。
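In one embodiment, additional mutations are made in the monomeric subunit portion and/or in the first and/or second amino acid sequence of the protein construct making up the nanoparticle. Examples of useful positions in the ferritin protein at which to introduce mutations include amino acids corresponding to an amino acid position selected from amino acid position 18, amino acid position 20 and amino acid position 68 of SEQ ID NO:2. In one embodiment, the protein construct comprises a mutation at an amino acid position corresponding to a position selected from amino acid positions 36, 45, 47, 49, 339, 340, 341, 342, 361, 372, 394, 402, 437, 438, 445, 446, 448, 449, 450 and 452 of the HA protein of influenza A New Caledonia/20/1999 (H1) (SEQ ID NO:8). In one embodiment, the HA portion of the protein construct comprises isoleucine, or an amino acid residue with similar properties, at the position corresponding to amino acid position 36 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises asparagine, or an amino acid residue with similar properties, at the position corresponding to amino acid position 45 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises threonine, or an amino acid residue with similar properties, at the position corresponding to amino acid position 47 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises tryptophan, or an amino acid residue with similar properties, at the position corresponding to amino acid position 49 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises glutamine, or an amino acid residue with similar properties, at the position corresponding to amino acid position 339 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises arginine, or an amino acid residue with similar properties, at the position corresponding to amino acid position 340 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises glutamic acid, or an amino acid residue with similar properties, at the position corresponding to amino acid position 341 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises threonine, or an amino acid residue with similar properties, at the position corresponding to amino acid position 342 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises threonine, or an amino acid residue with similar properties, at the position corresponding to amino acid position 372 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises methionine, isoleucine, leucine, glutamine, or an amino acid residue with similar properties, at the position corresponding to amino acid position 394 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises asparagine, threonine, glycine, or an amino acid residue with similar properties, at the position corresponding to amino acid position 402 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises aspartic acid, or an amino acid residue with similar properties, at the position corresponding to amino acid position 437 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises leucine, or an amino acid residue with similar properties, at the position corresponding to amino acid position 438 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises leucine, methionine, or an amino acid residue with similar properties, at the position corresponding to amino acid position 445 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises isoleucine, leucine, methionine, glutamine, or an amino acid residue with similar properties, at the position corresponding to amino acid position 446 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises glutamine, or an amino acid residue with similar properties, at the position corresponding to amino acid position 448 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises tryptophan, phenylalanine, or an amino acid residue with similar properties, at the position corresponding to amino acid position 449 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises alanine, or an amino acid residue with similar properties, at the position corresponding to amino acid position 450 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct comprises leucine, or an amino acid residue with similar properties, at the position corresponding to amino acid position 452 of the HA protein of influenza A New Caledonia/20/1999 (H1). In one embodiment, the HA portion of the protein construct lacks one or more amino acids corresponding to amino acids 515-517 of the HA protein of influenza A New Caledonia/20/1999 (H1).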
在一個實(shí)施方案中,本發(fā)明的納米顆粒包含單體亞基蛋白,其包含來自2,4-二氧四氫蝶啶合酶的至少50個氨基酸,至少100個氨基酸或至少150個氨基酸。在一個實(shí)施方案中,單體亞基蛋白包含來自選自SEQ ID NO:194的氨基酸序列的至少50個氨基酸,至少100個氨基酸或至少150個氨基酸,和/或包含與SEQ ID NO:194至少85%,至少90%,至少95%,至少97%,至少99%相同的氨基酸序列。在一個實(shí)施方案中,單體亞基包含SEQ ID NO:194。
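In one embodiment, the monomeric subunit protein comprises at least 50 amino acids, at least 100 amino acids or at least 150 amino acids from a ferritin protein. In one embodiment, the monomeric subunit protein comprises at least 50 amino acids, at least 100 amino acids or at least 150 amino acids from an amino acid sequence selected from SEQ ID NO:2 and SEQ ID NO:5, and/or comprises an amino acid sequence at least 85%, at least 90%, at least 95%, at least 97% or at least 99% identical to an amino acid sequence selected from SEQ ID NO:2 and SEQ ID NO:5. In one embodiment, the monomeric ferritin subunit comprises SEQ ID NO:2 or SEQ ID NO:5.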
在一個實(shí)施方案中,納米顆粒包含蛋白質(zhì)構(gòu)建體,其包含與來自病毒的HA蛋白的至少一個免疫原性部分連接的本發(fā)明的單體蛋白質(zhì),所述病毒選自A型流感病毒,B型流感病毒和C型流感病毒。在一個實(shí)施方案中,蛋白質(zhì)構(gòu)建體包含與選自下組的HA蛋白的至少一個免疫原性部分連接的本發(fā)明的單體蛋白:H1流感病毒HA蛋白,H2流感病毒HA蛋白,H3流感病毒HA蛋白,H4流感病毒HA蛋白,H5流感病毒HA蛋白,H6流感病毒HA蛋白,H7流感病毒HA蛋白,H8流感病毒HA蛋白,H9流感病毒HA蛋白,H10流感病毒HA蛋白,H11流感病毒HA蛋白,H12流感病毒HA蛋白,H13流感病毒HA蛋白,H14流感病毒HA蛋白,H15流感病毒HA蛋白,H16流感病毒HA蛋白,H17流感病毒HA蛋白,和H18流感病毒HA蛋白。在一個實(shí)施方案中,免疫原性部分包含至少一個表位。
In one embodiment, a nanoparticle comprises a protein construct comprising a monomeric protein of the present invention joined to an amino acid sequence at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 97% or at least about 99% identical to a sequence selected from the group consisting of SEQ ID NO:80, SEQ ID NO:83, SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:92, SEQ ID NO:95, SEQ ID NO:98, SEQ ID NO:101, SEQ ID NO:104, SEQ ID NO:158, SEQ ID NO:164, SEQ ID NO:170, SEQ ID NO:176, SEQ ID NO:182, SEQ ID NO:188, SEQ ID NO:197, SEQ ID NO:203, SEQ ID NO:209, SEQ ID NO:214, SEQ ID NO:217, SEQ ID NO:222, SEQ ID NO:224, SEQ ID NO:226, SEQ ID NO:228, SEQ ID NO:230, SEQ ID NO:232, SEQ ID NO:235, SEQ ID NO:240, SEQ ID NO:242, SEQ ID NO:244, SEQ ID NO:246, SEQ ID NO:248, SEQ ID NO:250, SEQ ID NO:252, SEQ ID NO:254, SEQ ID NO:256, SEQ ID NO:258, SEQ ID NO:261, SEQ ID NO:268, SEQ ID NO:275, SEQ ID NO:282, SEQ ID NO:289, SEQ ID NO:296, SEQ ID NO:303, SEQ ID NO:310, SEQ ID NO:317, SEQ ID NO:324, SEQ ID NO:331, SEQ ID NO:338, SEQ ID NO:345, SEQ ID NO:352, SEQ ID NO:359, SEQ ID NO:366, SEQ ID NO:373, SEQ ID NO:380, SEQ ID NO:387, SEQ ID NO:394 and SEQ ID NO:400, wherein the protein construct is capable of selectively binding an anti-influenza antibody. In one embodiment, a nanoparticle comprises a protein construct comprising a monomeric protein of the present invention joined to an amino acid sequence selected from the group consisting of SEQ ID NO:80, SEQ ID NO:83, SEQ ID NO:86, SEQ ID NO:89, SEQ ID NO:92, SEQ ID NO:95, SEQ ID NO:98, SEQ ID NO:101, SEQ ID NO:104, SEQ ID NO:158, SEQ ID NO:164, SEQ ID NO:170, SEQ ID NO:176, SEQ ID NO:182, SEQ ID NO:188, SEQ ID NO:197, SEQ ID NO:203, SEQ ID NO:209, SEQ ID NO:214, SEQ ID NO:217, SEQ ID NO:222, SEQ ID NO:224, SEQ ID NO:226, SEQ ID NO:228, SEQ ID NO:230, SEQ ID NO:232, SEQ ID NO:235, SEQ ID NO:240, SEQ ID NO:242, SEQ ID NO:244, SEQ ID NO:246, SEQ ID NO:248, SEQ ID NO:250, SEQ ID NO:252, SEQ ID NO:254, SEQ ID NO:256, SEQ ID NO:258, SEQ ID NO:261, SEQ ID NO:268, SEQ ID NO:275, SEQ ID NO:282, SEQ ID NO:289, SEQ ID NO:296, SEQ ID NO:303, SEQ ID NO:310, SEQ ID NO:317, SEQ ID NO:324, SEQ ID NO:331, SEQ ID NO:338, SEQ ID NO:345, SEQ ID NO:352, SEQ ID NO:359, SEQ ID NO:366, SEQ ID NO:373, SEQ ID NO:380, SEQ ID NO:387, SEQ ID NO:394 and SEQ ID NO:400, wherein the protein construct is capable of selectively binding an anti-influenza antibody.
In one embodiment of the present invention, a nanoparticle comprises a protein construct comprising an amino acid sequence at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 97% or at least about 99% identical to a sequence selected from the group consisting of SEQ ID NO:107, SEQ ID NO:110, SEQ ID NO:113, SEQ ID NO:116, SEQ ID NO:119, SEQ ID NO:122, SEQ ID NO:125, SEQ ID NO:128, SEQ ID NO:131, SEQ ID NO:161, SEQ ID NO:167, SEQ ID NO:173, SEQ ID NO:179, SEQ ID NO:185, SEQ ID NO:191, SEQ ID NO:200, SEQ ID NO:206, SEQ ID NO:212, SEQ ID NO:215, SEQ ID NO:220, SEQ ID NO:223, SEQ ID NO:225, SEQ ID NO:227, SEQ ID NO:229, SEQ ID NO:231, SEQ ID NO:233, SEQ ID NO:238, SEQ ID NO:241, SEQ ID NO:243, SEQ ID NO:245, SEQ ID NO:247, SEQ ID NO:249, SEQ ID NO:251, SEQ ID NO:253, SEQ ID NO:255, SEQ ID NO:257, SEQ ID NO:259, SEQ ID NO:264, SEQ ID NO:271, SEQ ID NO:278, SEQ ID NO:285, SEQ ID NO:292, SEQ ID NO:299, SEQ ID NO:306, SEQ ID NO:313, SEQ ID NO:320, SEQ ID NO:327, SEQ ID NO:334, SEQ ID NO:341, SEQ ID NO:348, SEQ ID NO:355, SEQ ID NO:362, SEQ ID NO:369, SEQ ID NO:376, SEQ ID NO:383, SEQ ID NO:390 and SEQ ID NO:397, wherein the protein construct is capable of selectively binding an anti-influenza antibody. In one embodiment of the present invention, a nanoparticle comprises a protein construct comprising an amino acid sequence selected from the group consisting of SEQ ID NO:107, SEQ ID NO:110, SEQ ID NO:113, SEQ ID NO:116, SEQ ID NO:119, SEQ ID NO:122, SEQ ID NO:125, SEQ ID NO:128, SEQ ID NO:131, SEQ ID NO:161, SEQ ID NO:167, SEQ ID NO:173, SEQ ID NO:179, SEQ ID NO:185, SEQ ID NO:191, SEQ ID NO:200, SEQ ID NO:206, SEQ ID NO:212, SEQ ID NO:215, SEQ ID NO:220, SEQ ID NO:223, SEQ ID NO:225, SEQ ID NO:227, SEQ ID NO:229, SEQ ID NO:231, SEQ ID NO:233, SEQ ID NO:238, SEQ ID NO:241, SEQ ID NO:243, SEQ ID NO:245, SEQ ID NO:247, SEQ ID NO:249, SEQ ID NO:251, SEQ ID NO:253, SEQ ID NO:255, SEQ ID NO:257, SEQ ID NO:259, SEQ ID NO:264, SEQ ID NO:271, SEQ ID NO:278, SEQ ID NO:285, SEQ ID NO:292, SEQ ID NO:299, SEQ ID NO:306, SEQ ID NO:313, SEQ ID NO:320, SEQ ID NO:327, SEQ ID NO:334, SEQ ID NO:341, SEQ ID NO:348, SEQ ID NO:355, SEQ ID NO:362, SEQ ID NO:369, SEQ ID NO:376, SEQ ID NO:383, SEQ ID NO:390 and SEQ ID NO:397.
In one embodiment, a nanoparticle of the present invention comprises a protein construct encoded by a nucleic acid molecule comprising a nucleic acid sequence at least 85%, at least 90%, at least 95% or at least 97% identical to a sequence selected from the group consisting of SEQ ID NO:266, SEQ ID NO:273, SEQ ID NO:280, SEQ ID NO:287, SEQ ID NO:294, SEQ ID NO:301, SEQ ID NO:308, SEQ ID NO:315, SEQ ID NO:322, SEQ ID NO:329, SEQ ID NO:336, SEQ ID NO:343, SEQ ID NO:350, SEQ ID NO:357, SEQ ID NO:364, SEQ ID NO:371, SEQ ID NO:378, SEQ ID NO:385, SEQ ID NO:392 and SEQ ID NO:399. In one embodiment, a nanoparticle of the present invention comprises a protein construct encoded by a nucleic acid molecule comprising a nucleic acid sequence selected from the group consisting of SEQ ID NO:266, SEQ ID NO:273, SEQ ID NO:280, SEQ ID NO:287, SEQ ID NO:294, SEQ ID NO:301, SEQ ID NO:308, SEQ ID NO:315, SEQ ID NO:322, SEQ ID NO:329, SEQ ID NO:336, SEQ ID NO:343, SEQ ID NO:350, SEQ ID NO:357, SEQ ID NO:364, SEQ ID NO:371, SEQ ID NO:378, SEQ ID NO:385, SEQ ID NO:392 and SEQ ID NO:399.
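The nanoparticles of the present invention can be used to elicit an immune response to influenza virus. One type of immune response is a B-cell response, which results in the production of antibodies against the antigen that elicited the immune response. Thus, in one embodiment, the nanoparticle elicits antibodies that bind the stem region of an influenza HA protein from a virus selected from influenza A viruses, influenza B viruses and influenza C viruses. One embodiment of the present invention is a nanoparticle that elicits antibodies that bind the stem region of an influenza HA protein selected from an H1 influenza virus HA protein, an H2 influenza virus HA protein, an H3 influenza virus HA protein, an H4 influenza virus HA protein, an H5 influenza virus HA protein, an H6 influenza virus HA protein, an H7 influenza virus HA protein, an H8 influenza virus HA protein, an H9 influenza virus HA protein, an H10 influenza virus HA protein, an H11 influenza virus HA protein, an H12 influenza virus HA protein, an H13 influenza virus HA protein, an H14 influenza virus HA protein, an H15 influenza virus HA protein, an H16 influenza virus HA protein, an H17 influenza virus HA protein and an H18 influenza virus HA protein. One embodiment of the present invention is a nanoparticle that elicits antibodies that bind the stem region of an influenza HA protein from a virus strain selected from influenza A/New Caledonia/20/1999 (1999 NC, H1), A/California/04/2009 (2009 CA, H1), A/Singapore/1/1957 (1957 Sing, H2), A/Hong Kong/1/1968 (1968 HK, H3), A/Brisbane/10/2007 (2007 Bris, H3), A/Indonesia/05/2005 (2005 Indo, H5), B/Florida/4/2006 (2006 Flo, B), A/Perth/16/2009 (2009 Per, H3), A/Brisbane/59/2007 (2007 Bris, H1), B/Brisbane/60/2008 (2008 Bris, B) and variants thereof.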
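Although all antibodies are capable of binding the antigen that elicited the immune response leading to their production, preferred antibodies are those that provide broad heterosubtypic protection against influenza virus. Thus, one embodiment of the present invention is a nanoparticle that elicits protective antibodies that bind the stem region of an influenza HA protein from a virus selected from influenza A viruses, influenza B viruses and influenza C viruses. One embodiment of the present invention is a protein that elicits protective antibodies that bind the stem region of an influenza HA protein selected from an H1 influenza virus HA protein, an H2 influenza virus HA protein, an H3 influenza virus HA protein, an H4 influenza virus HA protein, an H5 influenza virus HA protein, an H6 influenza virus HA protein, an H7 influenza virus HA protein, an H8 influenza virus HA protein, an H9 influenza virus HA protein, an H10 influenza virus HA protein, an H11 influenza virus HA protein, an H12 influenza virus HA protein, an H13 influenza virus HA protein, an H14 influenza virus HA protein, an H15 influenza virus HA protein, an H16 influenza virus HA protein, an H17 influenza virus HA protein and an H18 influenza virus HA protein. One embodiment of the present invention is a nanoparticle that elicits antibodies against a virus selected from the group consisting of influenza A/New Caledonia/20/1999 (1999 NC, H1), A/California/04/2009 (2009 CA, H1), A/Singapore/1/1957 (1957 Sing, H2), A/Hong Kong/1/1968 (1968 HK, H3), A/Brisbane/10/2007 (2007 Bris, H3), A/Indonesia/05/2005 (2005 Indo, H5), B/Florida/4/2006 (2006 Flo, B), A/Perth/16/2009 (2009 Per, H3), A/Brisbane/59/2007 (2007 Bris, H1) and B/Brisbane/60/2008 (2008 Bris, B). One embodiment of the present invention is a nanoparticle that elicits antibodies that bind a protein comprising an amino acid sequence at least 80% identical to a sequence selected from the group consisting of SEQ ID NO:8, SEQ ID NO:11, SEQ ID NO:14 and SEQ ID NO:17. One embodiment of the present invention is a nanoparticle that elicits antibodies that bind a protein comprising an amino acid sequence selected from SEQ ID NO:8, SEQ ID NO:11, SEQ ID NO:14 and SEQ ID NO:17.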
由本發(fā)明的蛋白質(zhì)引發(fā)的保護(hù)性抗體可通過影響病毒生命周期中的任何步驟來提供保護(hù)免受病毒感染。例如,保護(hù)性抗體可以防止流感病毒附著于細(xì)胞,進(jìn)入細(xì)胞,將病毒核糖核蛋白釋放到細(xì)胞質(zhì)中,在受感染的細(xì)胞中形成新的病毒顆粒并從受感染的宿主細(xì)胞膜出芽新的病毒顆粒。在一個實(shí)施方案中,由本發(fā)明的蛋白質(zhì)引發(fā)的保護(hù)性抗體防止流感病毒進(jìn)入宿主細(xì)胞。在一個實(shí)施方案中,由本發(fā)明的蛋白質(zhì)引發(fā)的保護(hù)性抗體防止病毒膜與內(nèi)體膜的融合。在一個實(shí)施方案中,由本發(fā)明的蛋白質(zhì)引發(fā)的保護(hù)性抗體防止核糖核蛋白釋放到宿主細(xì)胞的細(xì)胞質(zhì)中。在一個實(shí)施方案中,由本發(fā)明的蛋白質(zhì)引發(fā)的保護(hù)性抗體防止新病毒在感染的宿主細(xì)胞中的裝配。在一個實(shí)施方案中,由本發(fā)明的蛋白質(zhì)引發(fā)的保護(hù)性抗體防止新形成的病毒從感染的宿主細(xì)胞釋放。
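Because the amino acid sequence of the HA stem region of influenza viruses is highly conserved, the protective antibodies elicited by the nanoparticles of the present invention can be broadly protective. That is, protective antibodies elicited by the nanoparticles of the present invention can protect against more than one type, subtype and/or strain of influenza virus. Thus, one embodiment of the present invention is a protein that elicits broadly protective antibodies that bind the stem region of an influenza HA protein. One embodiment is a nanoparticle that elicits antibodies that bind the stem region of HA proteins from more than one type of influenza virus selected from influenza A viruses, influenza B viruses and influenza C viruses. One embodiment is a nanoparticle that elicits antibodies that bind the stem region of HA proteins from more than one subtype of influenza virus selected from H1 influenza virus, H2 influenza virus, H3 influenza virus, H4 influenza virus, H5 influenza virus, H6 influenza virus, H7 influenza virus, H8 influenza virus, H9 influenza virus, H10 influenza virus, H11 influenza virus, H12 influenza virus, H13 influenza virus, H14 influenza virus, H15 influenza virus, H16 influenza virus, H17 influenza virus and H18 influenza virus. One embodiment is a nanoparticle that elicits antibodies that bind the stem region of HA proteins from more than one influenza virus strain. One embodiment of the present invention is a nanoparticle that elicits antibodies that bind more than one protein comprising an amino acid sequence at least 80% identical to a sequence selected from the group consisting of SEQ ID NO:8, SEQ ID NO:11, SEQ ID NO:14 and SEQ ID NO:17. One embodiment of the present invention is a nanoparticle that elicits antibodies that bind more than one protein comprising an amino acid sequence selected from SEQ ID NO:8, SEQ ID NO:11, SEQ ID NO:14 and SEQ ID NO:17.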
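Because the nanoparticles of the present invention can elicit an immune response to influenza virus, they are useful as vaccines to protect individuals against infection by influenza virus. Thus, one embodiment of the present invention is a vaccine comprising a nanoparticle of the present invention. Vaccines of the present invention can also contain other components, such as adjuvants, buffers and the like. Although any adjuvant can be used, preferred embodiments can contain: chemical adjuvants such as aluminum phosphate, benzalkonium chloride, ubenimex and QS21; genetic adjuvants such as the IL-2 gene or fragments thereof, the granulocyte macrophage colony-stimulating factor (GM-CSF) gene or fragments thereof, the IL-18 gene or fragments thereof, the chemokine (C-C motif) ligand 21 (CCL21) gene or fragments thereof, the IL-6 gene or fragments thereof, CpG, LPS, TLR agonists and other immune stimulatory genes; protein adjuvants such as IL-2 or fragments thereof, granulocyte macrophage colony-stimulating factor (GM-CSF) or fragments thereof, IL-18 or fragments thereof, chemokine (C-C motif) ligand 21 (CCL21) or fragments thereof, IL-6 or fragments thereof, CpG, LPS, TLR agonists and other immune stimulatory cytokines or fragments thereof; lipid adjuvants such as cationic liposomes, N3 (cationic lipid) and monophosphoryl lipid A (MPL1); and other adjuvants including cholera toxin, enterotoxin, Fms-like tyrosine kinase-3 ligand (Flt-3L), bupivacaine, marcaine and levamisole.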
本發(fā)明的一個實(shí)施方案是包含多于一種流感HA蛋白的納米顆粒疫苗。此類疫苗可以包括在單個納米顆粒上或作為納米顆?;旌衔锏牟煌鞲蠬A蛋白的組合,其中至少兩種具有獨(dú)特的流感HA蛋白。多價(jià)疫苗可包含與必要一樣多的流感HA蛋白,以便導(dǎo)致提供保護(hù)免于期望的病毒毒株寬度必需的免疫應(yīng)答的產(chǎn)生。在一個實(shí)施方案中,疫苗包含來自至少兩種不同流感株(二價(jià))的HA蛋白。在一個實(shí)施方案中,疫苗包含來自至少三種不同流感株(三價(jià))的HA蛋白。在一個實(shí)施方案中,疫苗包含來自至少四種不同流感株(四價(jià))的HA蛋白。在一個實(shí)施方案中,疫苗包含來自至少五種不同流感株(五價(jià))的HA蛋白。在一個實(shí)施方案中,疫苗包含來自至少六種不同流感病毒株(六價(jià))的HA蛋白。在各種實(shí)施方案中,疫苗包含來自7、8、9或10種不同流感病毒株之每種的HA蛋白。此類組合的實(shí)例是包含流感A組1HA蛋白,流感A組2HA蛋白,和流感B HA蛋白的納米顆粒疫苗。在一個實(shí)施方案中,流感HA蛋白是H1HA,H3HA和B HA。在一個實(shí)施方案中,流感HA蛋白是包括在2011-2012流感疫苗中的那些。多價(jià)疫苗的另一個實(shí)例是包含來自四種不同流感病毒的HA蛋白的納米顆粒疫苗。在一個實(shí)施方案中,多價(jià)疫苗包含來自流感A/新喀里多尼亞/20/1999(1999NC,H1),A/加利福尼亞/04/2009(2009CA,H1),A/新加坡/1/1957(1957Sing,H2),A/香港/1/1968(1968HK,H3),A/布里斯班/10/2007(2007Bris,H3),A/印度尼西亞/05/2005(2005Indo,H5),B/佛羅里達(dá)/4/2006(2006Flo,B),A/珀斯/16/2009(2009Per,H3),A/布里斯班/59/2007(2007Bris,H1)和B/布里斯班/60/2008(2008Bris,B)的HA蛋白。
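The valency terms used above (bivalent through hexavalent) simply count the distinct influenza strains whose HA proteins are present in the vaccine. A minimal sketch of that bookkeeping follows. The function name and the example composition are illustrative assumptions, and strains are treated as distinct when their names differ.

```python
VALENCY_NAMES = {
    1: "monovalent",
    2: "bivalent",
    3: "trivalent",
    4: "tetravalent",
    5: "pentavalent",
    6: "hexavalent",
}


def vaccine_valency(ha_source_strains: list[str]) -> str:
    """Name the valency of a vaccine from the strains supplying its HA proteins."""
    distinct = len(set(ha_source_strains))  # repeated strains count once
    if distinct == 0:
        raise ValueError("a vaccine needs at least one HA protein")
    return VALENCY_NAMES.get(distinct, f"{distinct}-valent")


if __name__ == "__main__":
    # Example composition assumed for illustration: one H1, one H3 and one B strain.
    strains = ["A/New Caledonia/20/1999", "A/Perth/16/2009", "B/Florida/4/2006"]
    print(vaccine_valency(strains))
```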
本發(fā)明的一個實(shí)施方案是針對流感病毒對個體接種疫苗的方法,所述方法包括向個體施用納米顆粒,使得在個體中產(chǎn)生針對流感病毒的免疫應(yīng)答,其中所述納米顆粒包含連接到流感HA蛋白的單體亞基蛋白,并且其中所述納米顆粒在其表面上展示所述流感HA。在一個實(shí)施方案中,納米顆粒是單價(jià)納米顆粒。在一個實(shí)施方案中,納米顆粒是多價(jià)納米顆粒。本發(fā)明的另一個實(shí)施方案是針對流感病毒感染對個體接種疫苗的方法,所述方法包括:
a)獲得包含單體亞基的納米顆粒,其中所述單體亞基與流感血凝素蛋白連接,并且其中所述納米顆粒在其表面上展示流感HA;并且,
b)將納米顆粒施用于個體,使得產(chǎn)生針對流感病毒的免疫應(yīng)答。
本發(fā)明的一個實(shí)施方案是針對流感病毒對個體接種疫苗的方法,所述方法包括向個體施用實(shí)施方案的疫苗,使得在個體中產(chǎn)生針對流感病毒的免疫應(yīng)答,其中所述疫苗包含至少一種納米顆粒,其包含與流感HA蛋白連接的單體亞基,并且其中所述納米顆粒在其表面上展示流感HA。在一個實(shí)施方案中,疫苗是單價(jià)疫苗。在一個實(shí)施方案中,疫苗是多價(jià)疫苗。本發(fā)明的另一個實(shí)施方案是針對流感病毒感染對個體接種疫苗的方法,所述方法包括:
a)獲得包含至少一種包含本發(fā)明的蛋白質(zhì)構(gòu)建體的納米顆粒的疫苗,其中所述蛋白質(zhì)構(gòu)建體包含與流感HA蛋白連接的單體亞基蛋白,并且其中所述納米顆粒在其表面上展示流感HA;并且,
b)將所述疫苗施用于個體,使得產(chǎn)生針對流感病毒的免疫應(yīng)答。
在一個實(shí)施方案中,納米顆粒是單價(jià)納米顆粒。在一個實(shí)施方案中,納米顆粒是多價(jià)納米顆粒。
在一個實(shí)施方案中,納米顆粒具有八面體對稱。在一個實(shí)施方案中,流感HA蛋白能夠引發(fā)針對流感病毒的抗體。在一個實(shí)施方案中,流感HA蛋白能夠廣泛引發(fā)針對流感病毒的抗體。在優(yōu)選的實(shí)施方案中,引發(fā)的抗體是保護(hù)性抗體。在優(yōu)選的實(shí)施方案中,引發(fā)的抗體是廣泛異亞型保護(hù)性的。
本發(fā)明的疫苗可用于使用初免/加強(qiáng)方案對個體接種疫苗。此類方案在美國專利公開號20110177122中描述,其通過引用整體并入本文。在此類方案中,可以向個體施用第一疫苗組合物(初次),然后在一段時間后,可以向個體施用第二疫苗組合物(加強(qiáng))。施用加強(qiáng)組合物通常是在施用引發(fā)組合物后數(shù)周或數(shù)月,優(yōu)選約2-3周或4周,或8周,或16周,或20周,或24周,或28周,或32周。在一個實(shí)施方案中,配制加強(qiáng)組合物,用于在施用引發(fā)組合物后約1周,或2周,或3周,或4周,或5周,或6周,或7周,或8周,或9周,或16周,或20周,或24周,或28周,或32周施用。
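The prime/boost intervals listed above translate directly into calendar dates. The sketch below computes a candidate boost date for each listed interval from a given priming date. It is an illustration only: the function name is hypothetical, and the intervals are treated as exact numbers of weeks even though the text describes them as approximate.

```python
from datetime import date, timedelta

# Boost intervals, in weeks after the priming dose, as listed in the text.
BOOST_INTERVAL_WEEKS = (1, 2, 3, 4, 5, 6, 7, 8, 9, 16, 20, 24, 28, 32)


def candidate_boost_dates(prime_date: date) -> dict[int, date]:
    """Map each listed interval (in weeks) to the corresponding boost date."""
    return {
        weeks: prime_date + timedelta(weeks=weeks) for weeks in BOOST_INTERVAL_WEEKS
    }


if __name__ == "__main__":
    schedule = candidate_boost_dates(date(2024, 1, 1))  # example priming date
    for weeks, boost_day in schedule.items():
        print(f"{weeks:>2} weeks after prime: {boost_day.isoformat()}")
```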
第一和第二疫苗組合物可以是,但不需要是相同的組合物。因此,在本發(fā)明的一個實(shí)施方案中,施用疫苗的步驟包括施用第一疫苗組合物,然后在稍后時間施用第二疫苗組合物。在一個實(shí)施方案中,第一疫苗組合物包含本發(fā)明的納米顆粒。在一個實(shí)施方案中,第一疫苗組合物包含納米顆粒,其包含來自流感病毒的HA蛋白的氨基酸序列,所述流感病毒選自A/新喀里多尼亞/20/1999(1999NC,H1),A/加利福尼亞/04/2009(2009CA,H1),A/新加坡/1/1957(1957Sing,H2),A/香港/1/1968(1968HK,H3),A/布里斯班/10/2007(2007Bris,H3),A/印度尼西亞/05/2005(2005Indo,H5),B/佛羅里達(dá)/4/2006(2006Flo,B),A/珀斯/16/2009(2009Per,H3),A/布里斯班/59/2007(2007Bris,H1),B/布里斯班/60/2008(2008Bris,B)。
在一個實(shí)施方案中,接種疫苗的個體已暴露于流感病毒。如本文所使用的,術(shù)語暴露的,暴露等指示受試者已經(jīng)與已知感染流感病毒的動物對象接觸。可以使用本領(lǐng)域技術(shù)人員熟知的技術(shù)來施用本發(fā)明的疫苗。用于配制和施用的技術(shù)可以在例如“Remington’s Pharmaceutical Sciences”,18th ed.,1990,Mack Publishing Co.,Easton,PA中找到。疫苗可通過包括但不限于傳統(tǒng)注射器,無針注射裝置或微粒轟擊基因槍的手段施用。合適的施用途徑包括但不限于腸胃外遞送,如肌內(nèi),皮內(nèi),皮下,髓內(nèi)注射以及鞘內(nèi),直接心室內(nèi),靜脈內(nèi),腹膜內(nèi),鼻內(nèi)或眼內(nèi)注射,僅舉幾個例子。對于注射,本發(fā)明的一個實(shí)施方案的化合物可以配制在水溶液中,優(yōu)選在生理上相容的緩沖液如Hanks溶液,林格氏溶液或生理鹽水緩沖液中配制。
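In one embodiment, a vaccine or nanoparticle of the present invention can be used to protect an individual against infection by a heterologous influenza virus. That is, a vaccine made using the HA protein from one strain of influenza virus is capable of protecting an individual against infection by a different strain of influenza virus. For example, a vaccine made using the HA protein from influenza A/New Caledonia/20/1999 (1999 NC, H1) can be used to protect an individual against infection by influenza viruses including, but not limited to, A/New Caledonia/20/1999 (1999 NC, H1), A/California/04/2009 (2009 CA, H1), A/Singapore/1/1957 (1957 Sing, H2), A/Hong Kong/1/1968 (1968 HK, H3), A/Brisbane/10/2007 (2007 Bris, H3), A/Indonesia/05/2005 (2005 Indo, H5), A/Perth/16/2009 (2009 Per, H3) and/or A/Brisbane/59/2007 (2007 Bris, H1).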
在一個實(shí)施方案中,本發(fā)明的疫苗或納米顆??捎糜诒Wo(hù)個體免于抗原性趨異的流感病毒的感染??乖在叜惖氖侵噶鞲胁《局觌S時間突變的趨勢,從而改變展示給免疫系統(tǒng)的氨基酸。此類隨時間的突變也稱為抗原漂移。因此,例如,使用來自A/新喀里多尼亞/20/1999(1999NC,H1)流感病毒株的HA蛋白制備的疫苗能夠保護(hù)個體免受早期的,抗原性趨異的新喀里多尼亞流感毒株的感染,和未來的進(jìn)化(或趨異)流感毒株。
因?yàn)楸景l(fā)明的納米顆粒展示與完整HA在抗原性上相似的HA蛋白,所以它們可用于檢測針對流感病毒的抗體(抗流感抗體)的測定法中。
因此,本發(fā)明的一個實(shí)施方案是使用本發(fā)明的納米顆粒檢測抗流感病毒抗體的方法。本發(fā)明的檢測方法通常可以通過以下步驟實(shí)現(xiàn):
a.使測試抗流感抗體的存在的樣品的至少部分與本發(fā)明的納米顆粒接觸;并且,
b.檢測納米顆粒/抗體復(fù)合物的存在;
其中納米顆粒/抗體復(fù)合物的存在指示樣品含有抗流感抗體。
在本發(fā)明的一個實(shí)施方案中,從待測試抗流感病毒抗體的存在的個體獲得或收集樣品。個體可以是或不是懷疑具有抗流感抗體或已經(jīng)暴露于流感病毒的。樣品是從個體獲得的任何樣本,其可用于測試抗流感病毒抗體的存在。優(yōu)選的樣品是可用于檢測抗流感病毒抗體的存在的體液??捎糜趯?shí)施本發(fā)明方法的體液的實(shí)例包括但不限于血液,血漿,血清,淚液和唾液。本領(lǐng)域技術(shù)人員可以容易地鑒定適于實(shí)施所公開的方法的樣品。
血液或血液衍生的液體如血漿,血清等特別適合作為樣品??梢允褂帽绢I(lǐng)域已知的方法從個體收集和制備此類樣品。樣品可以在測定前冷藏或冷凍。
本發(fā)明的任何納米顆粒可用于實(shí)施所公開的方法,只要納米顆粒結(jié)合抗流感病毒抗體。有用的納米顆粒及其制備方法已在本文中詳細(xì)描述。在優(yōu)選的實(shí)施方案中,納米顆粒包含蛋白質(zhì)構(gòu)建體,其中蛋白質(zhì)構(gòu)建體包含連接到(融合到)來自流感HA蛋白的至少一個表位的來自單體亞基蛋白的至少25個,至少50個,至少75個,至少100個或至少150個連續(xù)氨基酸,使得納米顆粒在其表面上包含流感病毒HA蛋白表位的三聚體,并且其中蛋白質(zhì)構(gòu)建體能夠自組裝成納米顆粒。
如本文中使用的,術(shù)語接觸是指將測試抗流感抗體存在的樣品引入本發(fā)明的納米顆粒,例如通過組合或混合樣品和本發(fā)明的納米顆粒,使得納米顆粒能夠與樣品中的抗體(如果存在的話)物理接觸。當(dāng)抗流感病毒抗體存在于樣品中時,然后形成抗體/納米顆粒復(fù)合物。此類復(fù)合物形成是指抗流感病毒抗體選擇性結(jié)合納米顆粒中蛋白質(zhì)構(gòu)建體的HA部分以形成可檢測的穩(wěn)定復(fù)合物的能力。樣品中抗流感病毒抗體與納米顆粒的結(jié)合在適合形成復(fù)合物的條件下完成。此類條件(例如,適當(dāng)?shù)臐舛龋彌_液,溫度,反應(yīng)時間)以及優(yōu)化此類條件的方法是本領(lǐng)域技術(shù)人員已知的。結(jié)合可以使用本領(lǐng)域標(biāo)準(zhǔn)的多種方法測量,包括但不限于凝集測定法,沉淀測定法,酶免疫測定法(例如ELISA),免疫沉淀測定法,免疫印跡測定法和其它免疫測定法,如例如記載于Sambrook et al.,Molecular Cloning:A Laboratory Manual,(Cold Spring Harbor Labs Press,1989)和Harlow et al.,Antibodies,a Laboratory Manual(Cold Spring Harbor Labs Press,1988),兩者均通過引用整體并入本文。這些參考文獻(xiàn)還提供了復(fù)合物形成條件的實(shí)例。
如本文所使用的,短語選擇性地結(jié)合HA,選擇性結(jié)合HA等,是指與結(jié)合與HA無關(guān)的蛋白質(zhì),或樣品或測定法中的非蛋白質(zhì)組分形成對比,抗體優(yōu)先結(jié)合HA蛋白的能力。選擇性結(jié)合HA的抗體是結(jié)合HA但不顯著結(jié)合可能存在于樣品或測定法中的其它分子或組分的抗體。顯著的結(jié)合被認(rèn)為是例如抗HA抗體與非HA分子的結(jié)合,以大到足以干擾測定法檢測和/或測定樣品中的抗流感抗體的水平的能力的親和力或親合力??纱嬖谟跇悠坊驕y定法中的其它分子和化合物的實(shí)例包括但不限于非HA蛋白,例如白蛋白,脂質(zhì)和碳水化合物。
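In one embodiment, an anti-influenza virus antibody/nanoparticle complex (also referred to herein as an antibody/nanoparticle complex) can be formed in solution. In one embodiment, an antibody/nanoparticle complex can be formed in which the nanoparticle is immobilized on (e.g., coated onto) a substrate. Immobilization techniques are known to those skilled in the art. Suitable substrate materials include, but are not limited to, plastic, glass, gel, celluloid, fabric, paper and particulate materials. Examples of substrate materials include, but are not limited to, latex, polystyrene, nylon, nitrocellulose, agarose, cotton, PVDF (polyvinylidene fluoride) and magnetic resin. Suitable shapes for substrate material include, but are not limited to, a well (e.g., a microtiter dish well), a microtiter plate, a dipstick, a strip, a bead, a lateral flow apparatus, a membrane, a filter, a tube, a dish, a celluloid-type matrix, a magnetic particle and other particulates. Particularly preferred substrates include, for example, an ELISA plate, a dipstick, an immunodot strip, a radioimmunoassay plate, an agarose bead, a plastic bead, a latex bead, a cotton thread, a plastic chip, an immunoblot membrane, an immunoblot paper and a flow-through membrane. In one embodiment, a substrate, such as a particulate, can include a detectable marker. For descriptions of examples of substrate materials, see, for example, Kemeny, D.M. (1991) A Practical Guide to ELISA, Pergamon Press, Elmsford, NY, pp 33-44, and Price, C. and Newman, D. eds. Principles and Practice of Immunoassay, 2nd edition (1997) Stockton Press, NY, NY, both of which are incorporated herein by reference in their entirety.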
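In accordance with the present invention, once formed, an anti-influenza virus antibody/nanoparticle complex is detected. Detection can be qualitative, quantitative or semi-quantitative. As used herein, the phrases detecting complex formation, detecting the complex and the like refer to identifying the presence of anti-influenza virus antibody complexed to the nanoparticle. If complexes are formed, the amount of complexes formed can, but need not, be quantified. Complex formation, or selective binding, between a putative anti-influenza virus antibody and a nanoparticle can be measured (i.e., detected, determined) using a variety of methods standard in the art (see, for example, Sambrook et al., supra), examples of which are disclosed herein. A complex can be detected in a variety of ways including, but not limited to, use of one or more of the following assays: a hemagglutination inhibition assay, a radial diffusion assay, an enzyme-linked immunoassay, a competitive enzyme-linked immunoassay, a radioimmunoassay, a fluorescence immunoassay, a chemiluminescent assay, a lateral flow assay, a flow-through assay, a particulate-based assay (e.g., using particulates such as, but not limited to, magnetic particles or plastic polymers, such as latex or polystyrene beads), an immunoprecipitation assay, a BioCore™ assay (e.g., using colloidal gold), an immunodot assay (e.g., CMG's Immunodot System, Fribourg, Switzerland), an immunoblot assay (e.g., a western blot), a phosphorescence assay, a chromatography assay, a PAGE-based assay, a surface plasmon resonance assay, a spectrophotometric assay and an electronic sensory assay. Such assays are well known to those skilled in the art.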
測定法可用于根據(jù)其使用方式給出定性或定量結(jié)果??梢酝ㄟ^目視(例如,通過眼或通過機(jī)器,如密度計(jì)或分光光度計(jì))觀察到一些測定法,如凝集,顆粒分離和沉淀測定法,而不需要可檢測的標(biāo)記物。
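In other assays, conjugation (i.e., attachment) of a detectable marker to the nanoparticle, or to a reagent that selectively binds the nanoparticle, aids in detecting complex formation. A detectable marker can be conjugated to the nanoparticle or nanoparticle-binding reagent at a site that does not interfere with the ability of the nanoparticle to bind an anti-influenza virus antibody. Methods of conjugation are known to those skilled in the art. Examples of detectable markers include, but are not limited to, a radioactive label, a fluorescent label, a chemiluminescent label, a chromophoric label, an enzyme label, a phosphorescent label, an electronic label, a metal sol label, a colored bead, a physical label or a ligand. A ligand refers to a molecule that binds selectively to another molecule. Preferred detectable markers include, but are not limited to, fluorescein, a radioisotope, a phosphatase (e.g., alkaline phosphatase), biotin, avidin, a peroxidase (e.g., horseradish peroxidase), beta-galactosidase and biotin-related compounds or avidin-related compounds (e.g., streptavidin or ImmunoPure® NeutrAvidin).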
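In one embodiment, an antibody/nanoparticle complex can be detected by contacting the sample with a specific compound, such as an antibody, that binds to an anti-influenza antibody, to ferritin or to the antibody/nanoparticle complex, the specific compound being conjugated to a detectable marker. A detectable marker can be conjugated to the specific compound in such a manner as not to block the ability of the compound to bind to the complex being detected. Preferred detectable markers include, but are not limited to, fluorescein, a radioisotope, a phosphatase (e.g., alkaline phosphatase), biotin, avidin, a peroxidase (e.g., horseradish peroxidase), beta-galactosidase and biotin-related compounds or avidin-related compounds (e.g., streptavidin or ImmunoPure® NeutrAvidin).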
在另一個實(shí)施方案中,通過使復(fù)合物與指示劑分子接觸來檢測復(fù)合物。合適的指示劑分子包括可以結(jié)合抗流感病毒抗體/納米顆粒復(fù)合物,抗流感病毒抗體或納米顆粒的分子。因此,指示劑分子可以包括例如結(jié)合抗流感病毒抗體的試劑,如識別免疫球蛋白的抗體。作為抗體的優(yōu)選指示劑分子包括例如與來自其中產(chǎn)生抗流感病毒抗體的個體物種的抗體反應(yīng)的抗體。指示劑分子本身可以附著到本發(fā)明的可檢測標(biāo)志物。例如,抗體可以與生物素,辣根過氧化物酶,堿性磷酸酶或熒光素綴合。
本發(fā)明還可以包含能夠檢測指示劑分子存在的二級分子或其它結(jié)合分子的一個或多個層和/或類型。例如,選擇性結(jié)合指示劑分子的無標(biāo)簽的(即,不與可檢測標(biāo)志物綴合的)二抗可以與選擇性結(jié)合二抗的有標(biāo)簽的(即,與可檢測標(biāo)志物綴合的)三抗結(jié)合。合適的二抗,三抗和其它二級或三級分子可以容易地由本領(lǐng)域技術(shù)人員選擇。優(yōu)選的三級分子也可以由本領(lǐng)域技術(shù)人員基于第二分子的特性來選擇。相同的策略可以應(yīng)用于后續(xù)層。
優(yōu)選地,指示劑分子與可檢測標(biāo)志物綴合。如果需要的話,加入顯影劑,并將底物送到檢測裝置進(jìn)行分析。在一些方案中,在一個或兩個復(fù)合物形成步驟之后加入清洗步驟以除去過量的試劑。如果使用這些步驟,則它們牽涉本領(lǐng)域技術(shù)人員已知的條件,使得除去過量的試劑,但保留復(fù)合物。
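Because assays of the present invention can detect anti-influenza virus antibodies in a sample, including a blood sample, such assays can be used to identify individuals having anti-influenza antibodies. Thus, one embodiment of the present invention is a method to identify an individual having anti-influenza virus antibodies, the method comprising:
a. contacting a sample from an individual being tested for anti-influenza antibodies with a nanoparticle of the present invention; and,
b. analyzing the contacted sample for the presence of a nanoparticle/antibody complex,
wherein the presence of a nanoparticle/antibody complex indicates that the individual has anti-influenza antibodies.
Any of the disclosed assay formats can be used to conduct the disclosed method. Examples of useful assay formats include, but are not limited to, a radial diffusion assay, an enzyme-linked immunoassay, a competitive enzyme-linked immunoassay, a radioimmunoassay, a fluorescence immunoassay, a chemiluminescent assay, a lateral flow assay, a flow-through assay, a particulate-based assay (e.g., using particulates such as, but not limited to, magnetic particles or plastic polymers, such as latex or polystyrene beads), an immunoprecipitation assay, a BioCore assay (e.g., using colloidal gold), an immunodot assay (e.g., CMG's Immunodot System, Fribourg, Switzerland), an immunoblot assay (e.g., a western blot), a phosphorescence assay, a chromatography assay, a PAGE-based assay, a surface plasmon resonance assay, a bio-layer interferometry assay, a spectrophotometric assay, and an electronic sensory assay.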
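If no anti-influenza antibodies are detected in the sample, such a result indicates that the individual does not have anti-influenza virus antibodies. The individual being tested may or may not be suspected of having antibodies to influenza virus. The disclosed methods can also be used to determine whether an individual has been exposed to one or more specific types, groups, sub-groups, or strains of influenza virus. To make such a determination, a sample is obtained from an individual who tested negative for antibodies (i.e., lacked antibodies) to one or more specific types, groups, sub-groups, or strains of influenza virus at some time in their past (e.g., greater than about 1 year, greater than about 2 years, greater than about 3 years, greater than about 4 years, greater than about 5 years, etc.). The sample is then tested for the presence of anti-influenza virus antibodies to the one or more types, groups, sub-groups, or strains of influenza virus using a nanoparticle-based assay of the present invention. If the assay indicates the presence of such antibodies, the individual is identified as having been exposed to the one or more types, groups, sub-groups, or strains of influenza virus at some time after the test that identified the individual as negative for anti-influenza antibodies. Thus, one embodiment of the present invention is a method to identify an individual that has been exposed to influenza virus, the method comprising:
a. contacting at least a portion of a sample from an individual being tested for anti-influenza antibodies with a nanoparticle of the present invention;
b. analyzing the contacted sample for the presence or level of an antibody/nanoparticle complex, wherein the presence or level of the antibody/nanoparticle complex indicates the presence or level of recent anti-influenza antibodies; and,
c. comparing the recent anti-influenza antibody level with a past anti-influenza antibody level;
wherein an increase in the recent anti-influenza antibody level over the past anti-influenza antibody level indicates that the individual has been exposed to influenza virus subsequent to determination of the past anti-influenza antibody level.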
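The comparison in step c can be illustrated with a minimal Python sketch. The function names and the use of arbitrary numeric units are assumptions made for illustration; the specification does not prescribe any particular scale, threshold, or software.

```python
def fold_change(past_level: float, recent_level: float) -> float:
    """Ratio of the recent antibody level to the past level (hypothetical units)."""
    if past_level <= 0:
        raise ValueError("fold change is undefined for a non-positive past level")
    return recent_level / past_level


def indicates_exposure(past_level: float, recent_level: float) -> bool:
    """Return True when the recent level is higher than the past level.

    This mirrors the wherein clause only: any increase counts. A real assay
    would also need to account for measurement variability, which is not
    modeled here.
    """
    if past_level < 0 or recent_level < 0:
        raise ValueError("antibody levels must be non-negative")
    return recent_level > past_level
```

For example, `indicates_exposure(0.0, 1.5)` returns `True` for an individual who previously tested negative, while `indicates_exposure(2.0, 2.0)` returns `False`.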
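Methods of the present invention can also be used to determine the response of an individual to a vaccine. Thus, one embodiment is a method for measuring the response of an individual to an influenza vaccine, the method comprising:
a. administering an influenza virus vaccine to the individual;
b. contacting at least a portion of a sample from the individual with a nanoparticle of the present invention;
c. analyzing the contacted sample for the presence or level of an antibody/nanoparticle complex, wherein the presence or level of the antibody/nanoparticle complex indicates the presence or level of recent anti-influenza antibodies;
wherein an increase in the level of antibody in the sample over the pre-vaccination level of antibody in the individual indicates that the vaccine induced an immune response in the individual.
The influenza vaccine administered to the individual may, but need not, comprise a vaccine of the present invention, so long as the nanoparticle comprises an HA protein that can bind anti-influenza antibodies induced by the administered vaccine. Methods of administering influenza vaccines are known to those of skill in the art.
Analysis of the sample obtained from the individual can be performed using any of the disclosed assay formats. In one embodiment, analysis of the sample is performed using an assay format selected from: a radial diffusion assay, an enzyme-linked immunoassay, a competitive enzyme-linked immunoassay, a radioimmunoassay, a fluorescence immunoassay, a chemiluminescent assay, a lateral flow assay, a flow-through assay, a particulate-based assay (e.g., using particulates such as, but not limited to, magnetic particles or plastic polymers, such as latex or polystyrene beads), an immunoprecipitation assay, a BioCore assay (e.g., using colloidal gold), an immunodot assay (e.g., CMG's Immunodot System, Fribourg, Switzerland), an immunoblot assay (e.g., a western blot), a phosphorescence assay, a chromatography assay, a PAGE-based assay, a surface plasmon resonance assay, a bio-layer interferometry assay, a spectrophotometric assay, and an electronic sensory assay.
In one embodiment, the method includes a step of determining the level of anti-influenza antibody present in the individual prior to administering the vaccine. However, the level of anti-influenza antibody present in the individual can also be determined from prior medical records, if such information is available.
While not necessary to perform the disclosed method, it is preferable to wait some period of time between the step of administering the vaccine and the step of determining the level of anti-influenza antibody in the individual. In one embodiment, determination of the level of anti-influenza antibody present in the individual is performed at least 1 day, at least 2 days, at least 3 days, at least 4 days, at least 5 days, at least 6 days, at least 1 week, at least 2 weeks, at least 3 weeks, at least 4 weeks, at least 2 months, at least 3 months, or at least 6 months after administration of the vaccine.
The present invention also includes kits suitable for detecting anti-influenza antibodies. Suitable means of detection include the techniques disclosed herein that utilize nanoparticles of the present invention. Kits can also comprise a detectable marker, such as an antibody that selectively binds to the nanoparticle, or other indicator molecules. Kits can also contain associated components, such as, but not limited to, buffers, labels, containers, inserts, tubing, vials, syringes, and the like.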
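EXAMPLES
The following examples are put forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how to make and use the embodiments, and are not intended to limit the scope of what the inventors regard as their invention, nor are they intended to represent that the experiments below are all or the only experiments performed. Efforts have been made to ensure accuracy with respect to numbers used (e.g., amounts, temperature, etc.), but some experimental errors and deviations should be accounted for. Unless indicated otherwise, parts are parts by weight, molecular weight is weight average molecular weight, and temperature is in degrees Celsius. Standard abbreviations are used.
Example 1: Iterative structure-based design of HA stabilized-stem (HA-SS) constructs
This example shows six iterative cycles (Gen1-Gen6) of structure-based design used to produce HA stabilized-stem (HA-SS) immunogens lacking the immunodominant head domain.
Influenza A viruses comprise 18 HA subtypes, two of which, H1 and H3, currently cause the majority of human infections. Seasonal influenza vaccines provide some protection against circulating H1 and H3 strains but little protection against the divergent H5, H7, and H9 subtypes, which cause occasional outbreaks of human infection as zoonoses from avian and/or swine reservoirs. The inventors hypothesized that an immune response focused on the conserved hemagglutinin (HA) stem could potentially elicit broad heterosubtypic influenza protection against diverse strains. The inventors therefore used iterative structure-based design to develop HA stabilized-stem (HA-SS) glycoproteins lacking the immunodominant HA head region (Figure 1).
The ectodomain sequence of A/New Caledonia/20/1999 (1999NC) HA and the crystal structure of A/South Carolina/1/1918 (1918SC) (PDB ID 1GBN) were used as design templates. Each generation of HA-SS variants was evaluated for expression as a soluble trimer, and for antigenicity based on reactivity with stem-specific monoclonal antibodies (mAbs) similar to that of the wild-type (wt) HA trimer.
Plasmids encoding full-length HA and neuraminidase (NA) from 1999NC, 1986SG, 2009CA, H2 2005CAN, H5 2005IND, and H5 2004VN were synthesized using human-preferred codons. Different versions of HA-SS were generated by overlapping PCR and site-directed mutagenesis. All HA and HA-SS proteins and mAbs were expressed in FreeStyle 293 (293F; Life Technologies) cells or 293 GnTI-/- cells (for Gen4 HA-SS crystallization) and purified as described previously (Wei, C.J., et al. Elicitation of broadly neutralizing influenza antibodies in animals with previous influenza exposure. Sci. Transl. Med. 4, 147ra114 (2012)). Construction, purification, and characterization of HA-np, Gen1-Gen6 HA-SS, and Gen4-6 HA-SS-np were performed as described (Kanekiyo, M., et al. Nature 499, 102-106 (2013)).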
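The first-generation design (Gen1 HA-SS) replaced the receptor-binding domain (residues HA1 51-277, H3 numbering) with a GSG linker (Figure 1). The HA ectodomain trimer and all trimeric HA-SS designs were each produced with the C-terminal transmembrane and cytoplasmic residues, HA2 175-220 (H3 numbering), replaced by a short linker, the T4 foldon, a thrombin cleavage site, and a His tag. The HA1/HA2 cleavage site was mutated to prevent cleavage. To model the structures of the HA-SS designs, 1918SC HA (PDB ID 1GBN) and the bacteriophage T4 foldon trimer (PDB ID 1RFO) were used as templates; loops and junctions were designed with LOOPY (Xiang, et al. Proc. Natl. Acad. Sci. U.S.A. 99, 7432-7437 (2002)), side chains were mutated with SCAP (Xiang, et al., J. Mol. Biol. 311, 421-430 (2001)), and structural superpositions were performed with LSQMAN (Kleywegt, et al., in International Tables for Crystallography, Vol. F, 353-367 (Kluwer Academic Publishers, Dordrecht, The Netherlands, 2001)). The energetics of specific mutations were evaluated computationally with the Rosetta program DDG_MONOMER (Kellogg, et al., Proteins 79, 830-838 (2011)). Surface area calculations were performed with Chimera (Pettersen, E.F., et al. Journal of Computational Chemistry 25, 1605-1612 (2004)). Approximately 700 trimeric structures in the Protein Data Bank (PDB) were examined to find a suitable trimerization domain to further stabilize the HA-SS immunogens. This search revealed HIV-1 gp41 (PDB ID 1SZT) to be optimal with respect to (i) its size (less than 70 amino acids per monomer), (ii) its thermostability (Tm = 70 °C), (iii) its ease of grafting, with the N and C termini located at the same end of the trimer, and (iv) the structural complementarity between the C-terminal end of the inner heptad repeat 1 (HR1) helix of gp41 and the inner C helix of the HA-SS trimer. Gen1 HA-SS failed to express as a trimer despite the presence of the C-terminal foldon trimerization domain.
To increase trimer stability in the second generation, the inventors replaced HA2 residues 66-85 at the membrane-distal region of HA-SS with the thermostable HIV-1 gp41 trimerization domain (see Tan, et al., Proc. Natl. Acad. Sci. U.S.A. 94, 12303-12308 (1997)), in which the inner heptad repeat 1 (HR1) helix is structurally complementary to the inner C helix of the HA stem. Joining gp41 to HA-SS necessitated circular permutation of the gp41 helices HR1 and HR2: their order was reversed and they were reconnected with a glycine-rich linker (Figure 1). To insert the six-helix bundle of the post-fusion form of HIV-1 gp41 into Gen2 HA-SS, residues 28-32 (residues 573-577, HXBc2 numbering) of the three inner helices from gp41 were superimposed onto HA inner-helix residues HA2 81-85 (from PDB ID 1RU7), with a root-mean-square deviation (RMSD) of [value not given] for 15 Cα atoms. HA2 residues 66-85 were replaced by the gp41 heptad repeat 2 (HR2) helix (residues 628-654, HXBc2 numbering), followed by a six-residue glycine-rich linker (NGTGGG) containing the sequon for an N-linked glycosylation site, and the gp41 HR1 helix (residues 548-577). HR1 was designed to be in frame with helix C of HA2 so as to produce a long central chimeric helix. Efforts to stabilize the membrane-distal portion of the F' region by adding salt bridges, shortening loops, and reducing its hydrophobicity did not improve the trimerization or antigenicity of the Gen2 HA-SS design. Expression of Gen2 HA-SS resulted in 29% trimerization.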
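The RMSD quoted for the superposition of 15 Cα atoms (five residues from each of three helices) follows the usual definition, sketched below in Python. This is only an illustration of the metric: it assumes the two coordinate sets are already superimposed and paired atom by atom, it is not the LSQMAN procedure itself, and the function name is hypothetical.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two paired lists of (x, y, z) points.

    Assumes both sets are already superimposed; no fitting is performed here.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared_sum = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        squared_sum += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared_sum / len(coords_a))
```

As a sanity check, 15 atoms that are each displaced by exactly 1 unit along one axis give an RMSD of 1.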
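To improve trimerization in the third generation, a 44-residue portion of the HA1 F' region with irregular secondary structure was removed, and the inner helix C of HA-SS was truncated by 6 residues to give better complementarity between gp41 and HA2. This resulted in a soluble Gen3 HA-SS with 77% trimerization that was recognized by HA stem broadly neutralizing mAbs (bNAbs) with affinities generally similar to those for the soluble HA trimer (Figure 1). In Gen3, HA-SS HA2 residues 43-50 and 278-313 of the F' region were replaced with a GWG linker, and HA2 residues 60-65 and 86-92 were removed. To realign gp41 with the lower region of the HA stem, residues 30-34 (575-579, HXBc2 numbering) of the three inner helices from gp41 were superimposed onto HA inner-helix residues HA2 90-94, with an RMSD of [value not given] for 15 Cα atoms. Faster off-rates were observed for CR6261 and 70-5B03, which may be due in part to loss of the HA F' region, which can make limited contact with the CR6261 heavy chain.
To characterize Gen3 HA-SS at the atomic level, the inventors determined the crystal structure of Gen3 HA-SS in complex with the antigen-binding fragment (Fab) of the murine bNAb C179 at a resolution of [value not given] (see Okuno, Y., et al. J. Virol. 67, 2552-2558 (1993)) (Figure 2a, left panel); C179 was the first broadly neutralizing HA stem-directed antibody found to have heterosubtypic neutralization.
C179 harvested from hybridoma cells was cleaved into Fab as described previously (Ofek, G., et al. J. Virol. 78, 10724-10737 (2004)), with the following modifications: LysC (Roche) was used at a 1:20,000 (w/w) ratio to C179, the crystallizable fragment (Fc) was removed from the digestion solution by passage over a mercapto-ethyl-pyridine column (Pall Life Sciences) in 50 mM Tris pH 8.0, and the C179 Fab was eluted with 50 mM NaAc pH 5.0.
The complex of Gen3 HA-SS (expressed in 293 GnTI-/- cells) with C179 Fab was obtained by passing a 1:1.25 (Gen3 HA-SS/C179 molar ratio) mixture over a Superdex 200 26/60 (GE Healthcare) gel filtration column and collecting the peak eluting at 152.0 mL. The complex was concentrated to 10 mg/ml in 150 mM NaCl, 10 mM Tris HCl pH 7.5 and crystallized at 20 °C by hanging-drop vapor diffusion in 15% (w/v) polyethylene glycol 1500, 5% (v/v) 2-methyl-2,4-pentanediol, 200 mM NH4Cl, and 100 mM Tris HCl pH 8.5, a condition derived from the precipitant synergy crystallization screen (Majeed, S., et al. Structure 11, 1061-1070 (2003)). Crystals were frozen without any additional cryoprotectant and stored in liquid nitrogen prior to data collection.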
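X-ray data were collected to a resolution of [value not given] at a wavelength of [value not given] and a temperature of 100 K on the Southeast Regional Collaborative Access Team (SER-CAT) 22-BM beamline at the Advanced Photon Source (APS), Argonne National Laboratory. The X-ray data were processed with HKL2000 in the trigonal space group H3, and the structure of the complex was determined by molecular replacement using five separate search models. PHASER (McCoy, A.J., et al. J. Appl. Crystallogr. 40, 658-674 (2007)) was used to search with an HA stem monomer from the 1934PR8 structure (PDB ID 1RU7; residues 5-36 and 315-323 of HA1 chain A and residues 514-559 and 590-660 of HA2 chain B), an HIV-1 gp41 monomer (PDB ID 1SZT, residues 3-29 and 42-67), the heavy chain variable domain of murine antibody S25-2 (PDB ID 1Q9K, residues 1-111), and the light chain variable domain of murine antibody MN16C13F4 (PDB ID 1UWX, residues 3-108). MOLREP (Collaborative Computational Project. Acta Crystallogr. D Biol. Crystallogr. 50, 760-763 (1994)) was used to locate the T4 foldon monomer (PDB ID 1RFO, chain A), which confirmed an independent fit performed by hand. The C179 Fab constant domains were fit by eye into Fo-Fc density and then refined using the constant domains of the above antibodies (PDB IDs 1Q9K and 1UWX) as templates. Model building and refinement were performed with COOT (Emsley, P. & Cowtan, K. Acta Crystallogr. D Biol. Crystallogr. 60, 2126-2132 (2004)) and PHENIX (Adams, P.D., et al. Acta Crystallogr. D Biol. Crystallogr. 58, 1948-1954 (2002)) with riding hydrogens. All residues of Gen3 HA-SS were modeled into electron density except for the HA cleavage loop (residues 48-52), the glycine-rich loop connecting the gp41 helices (residues 139-144), the linker connecting HA-SS to the foldon (residues 256-259), and the thrombin cleavage site and His tag C-terminal to the foldon domain (residues 286-302). Sugars were observed and built at Asn residues 23, 119, and 236. The C179 structure includes heavy chain residues 1-213 and light chain residues 1-214. Ramachandran statistics as determined by PHENIX revealed 91.64% of residues in favored regions, 7.49% in allowed regions, and 0.86% as outliers.
The co-crystal structure revealed that C179 recognition of Gen3 HA-SS was similar to its recognition of the H2N2 trimeric HA in the recently published co-crystal structure of C179 with A/Japan/305/1957 (1957JP) HA (see Dreyfus, et al., J. Virol. 87, 7149-7154 (2013)) (Figure 2a, right panel). While these findings confirmed preservation of the stem epitope on Gen3 HA-SS, the overall structure revealed several unexpected differences (Figure 2a, left and middle panels). First, the stem trimer subunits were splayed apart at their C termini by approximately [value not given] relative to HA (Figure 2a, middle panel). Second, the C-terminal foldon trimerization domain was inverted and tucked into the splayed region inside the stem trimer (Figure 2a, left panel). Finally, the outer helix A of the HA stem formed a continuous helix with the outer HR2 helix of the gp41 six-helix bundle, rather than forming two separate helices separated by the glycine linker.
To address these issues, a fourth-generation HA-SS containing three mutations (outlined in Figure 1) was created in an effort to remove potential side-chain clashes and to disrupt the continuous helix between helix B of HA2 and gp41 HR2 (Figure 2b).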
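To crystallize the Gen4 HA-SS/CR6261 complex, Gen4 HA-SS (expressed in 293 GnTI-/- cells) was deglycosylated by incubation with endoglycosidase H (77 U/μg Gen4 HA-SS) for 4 hours, followed by passage over a concanavalin A column (Sigma) to remove protein with uncleaved N-linked glycans. The complex with CR6261 Fab was obtained by passing a 1:1.25 (Gen4 HA-SS/CR6261 molar ratio) mixture over a Superdex 200 10/300 (GE Healthcare) gel filtration column and collecting the peak eluting at 12.5 mL. The complex was concentrated to 11 mg/ml in 150 mM NaCl, 10 mM Tris HCl pH 7.5 and crystallized at 20 °C by hanging-drop vapor diffusion in 7% (w/v) polyethylene glycol 4000, 4.5% (v/v) isopropanol, and 100 mM imidazole pH 6.5. Crystals were soaked for 6 hours at room temperature in reservoir solution containing an additional 5% (v/v) 2R,3R-butanediol (Sigma), then transferred briefly (30 seconds) to reservoir solution containing 15% 2R,3R-butanediol before flash cooling.
X-ray data were collected to a resolution of [value not given] at a wavelength of [value not given] and a temperature of 100 K on the SER-CAT 22-BM beamline at APS. Data were processed with HKL2000 (reference 37) in space group H3, and the structure of the complex was determined by molecular replacement using three separate search models. PHASER was used to search with the HA stem monomer from the 1934PR8 structure, the HIV-1 gp41 monomer (same models as above), and the variable and constant domains of CR6261 (PDB ID 3GBM). Model building and refinement were performed with COOT and PHENIX, respectively. All residues of Gen4 HA-SS were modeled into electron density except for the HA cleavage loop (residues 48-52), the glycine-rich loop connecting the gp41 helices (residues 137-145), the C-terminal foldon (residues 256-259), and the thrombin cleavage site and His tag C-terminal to the foldon domain (residues 286-302). Although density was visible inside the HA stem in the same region observed in the Gen3 HA-SS structure, it was not sufficient to uniquely place or stably refine the foldon domain. The CR6261 Fab structure includes heavy chain residues 1-213 and light chain residues 3-107 and 113-215. Ramachandran statistics as determined by PHENIX revealed 93.19% of residues in favored regions, 6.09% in allowed regions, and 1.06% as outliers.
For cryo-electron microscopy analysis, particles were vitrified on holey carbon films (Quantifoil, Germany) using a Vitrobot Mark IV (FEI Company, Hillsboro, OR). Cryo images of the particles were collected on a Titan Krios electron microscope (FEI Company, Hillsboro, OR) operated at liquid nitrogen temperature and at 300 kV. Images were collected on a 4,096 × 4,096 charge-coupled device (CCD) camera (Gatan Inc., Warrendale, PA) at a pixel size of [value not given], with defocus values ranging from about 2.8 to about 6 μm and doses ranging from about 10 to [value not given]. Observed defocus values were fit using ctffind3 (Mindell, J.A. & Grigorieff, N. J Struct Biol 142, 334-347 (2003)), and images exhibiting drift or astigmatism were excluded from further analysis. Particles (13,464) were picked manually from the images. Reference-free 2D classification indicated octahedral symmetry, which was imposed during 3D refinement. A smooth, spikeless, low-pass-filtered ferritin (PDB ID 2JD6) was used as the starting model. After removal of overlapping particles during refinement, the reconstruction (3D map) was calculated from 6,540 particles. All image analysis (2D and 3D) was performed with the Relion package (Scheres, S.H.W. J. Mol. Biol. 415, 406-418 (2012)). Visualization and molecular docking of model coordinates were performed with Chimera.
Atomic coordinates and structure factors for Gen3 HA-SS in complex with C179 and for Gen4 HA-SS in complex with CR6261 have been deposited under PDB codes 4MKD and 4MKE, respectively. The cryo-electron microscopy map of H1-SS-np has been deposited under EMDB code EMD-6332.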
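The co-crystal structure of Gen4 HA-SS in complex with the Fab of bNAb CR6261, determined at a resolution of [value not given] (see Ekiert, D.C., et al. Science 324, 246-251 (2009)), revealed that the splaying relative to gp41 persisted, with an additional rotation of about 19° (Figure 2b, middle panel). Nevertheless, in Gen4 HA-SS the level of trimerization (83%), the preservation of the stem epitope conformation, and HA stem bNAb binding ([value not given] nM for four bNAbs) were near optimal (Figures 1a and 2b).
The inventors were concerned about the implications of including the immunogenic HIV-1 gp41 region and therefore sought to replace gp41 with a short glycine-rich linker (Figure 1a), as this would also increase the percentage of the immunogen surface occupied by HA stem (Figure 1b). The gp41 replacement was carried out in two contexts: Gen5 HA-SS, which retained the Gen4 stabilized stem region, and Gen6 HA-SS, in which an internal salt bridge comprising Lys51-Glu103 (HA2, H3 numbering) was replaced with a nearly isosteric Met-Leu hydrophobic pair (Gen6 HA-SS, Figure 1c).
Gen5 HA-SS was created by completely removing the gp41 trimerization domain, connecting HA2 residues 58-93 with a GSGGSG loop, and introducing the HA2 mutations Y94D and N95L.
To design Gen6 HA-SS, five mutations were initially created to stabilize the inner core of the HA stem HA2: K51M, E103L, E105Q, R106W, and D109L (termed Gen6' HA-SS). Trimerization and recognition by HA stem antibodies were retained for all three immunogens (Figure 1a). This intermediate form of Gen6 HA-SS, comprising three additional internal stabilizing mutations (Gen6' HA-SS), exhibited similar antigenicity (Figure 1d), but it was ultimately observed that mutations E105Q, R106W, and D109L were not required for stabilization of Gen6 HA-SS or for fusion to ferritin, and they were not used in the final H1-SS-np construct (Figure 1c).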
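Point mutations written in the form K51M (wild-type residue, position, replacement residue) can be applied to a sequence programmatically. The Python sketch below is illustrative only: the function name, the `first_residue_number` parameter, and the toy sequence are assumptions, and mapping H3 numbering onto an index in a real HA2 sequence requires an alignment that is not shown here.

```python
import re


def apply_mutations(sequence, mutations, first_residue_number=1):
    """Apply substitutions such as 'K51M' to a one-letter amino acid sequence.

    Positions are interpreted relative to first_residue_number (hypothetical
    numbering). The wild-type residue is checked before it is replaced.
    """
    residues = list(sequence)
    for mutation in mutations:
        match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
        if match is None:
            raise ValueError(f"unrecognized mutation format: {mutation}")
        wild_type, position, replacement = match.group(1), int(match.group(2)), match.group(3)
        index = position - first_residue_number
        if not 0 <= index < len(residues):
            raise ValueError(f"position {position} is outside the sequence")
        if residues[index] != wild_type:
            raise ValueError(
                f"expected {wild_type} at position {position}, found {residues[index]}"
            )
        residues[index] = replacement
    return "".join(residues)
```

With a toy four-residue sequence, `apply_mutations("AKAE", ["K2M", "E4L"])` returns `"AMAL"`, mimicking the replacement of a Lys/Glu salt-bridge pair by a Met/Leu hydrophobic pair.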
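Example 2: Creation of self-assembling ferritin nanoparticles
This example describes the fusion of Gen4, Gen5, Gen6', and Gen6 HA-SS, through their respective HA C termini, to self-assembling ferritin nanoparticles.
The immunogenicity of HA is markedly increased in the context of a self-assembling nanoparticle (HA-np) (see Kanekiyo, M., et al., Nature 499, 102-106 (2013)). In addition, the inventors reasoned that C-terminal fusion to a nanoparticle could reduce splaying of the membrane-proximal region of the stem. The inventors therefore genetically fused Gen4, Gen5, Gen6', and Gen6 HA-SS, through their respective HA C termini (replacing the foldon), to the self-assembling ferritin nanoparticle of Helicobacter pylori to create HA-SS-nanoparticles (HA-SS-np).
Gen4-6 HA-SS were fused to the N terminus of H. pylori ferritin (residues 5-167) with an SGG linker to produce HA-SS ferritin nanoparticles (Gen4 HA-SS-np, H1-SS-np, and H1-SS-np'), as described (Kanekiyo, M., et al. Nature 499, 102-106 (2013)).
A fortéBio Octet Red384 instrument was used to measure the binding kinetics of HA and HA-SS molecules to mAb CR6261, CR9114, F10 scFv, and 70-5B03. All assays were performed at 30 °C with agitation set to 1,000 rpm in PBS supplemented with 1% BSA to minimize nonspecific interactions. The final volume of all solutions was 100 μl/well. Assays were performed at 30 °C in solid black 96-well plates (Greiner Bio-One). Streptavidin and amine-reactive biosensor probes were loaded for 300 s with, respectively, HA or HA-SS bearing a C-terminal biotinylated Avi-Tag (25 μg/ml) and HA-np or HA-SS-np in 10 mM acetate pH 5.0 buffer. Typical capture levels were between 0.8 and 1 nm, and variability within a row of eight tips did not exceed 0.1 nm. Biosensor tips were equilibrated for 300 s in PBS/1% BSA buffer before binding measurements with Fab or F10 scFv in solution (0.01 to 0.5 μM). After addition of antibody, binding was allowed to proceed for 300 s; binding was then allowed to dissociate for 300 s. Dissociation wells were used only once to prevent contamination. Parallel correction to subtract systematic baseline drift was carried out by subtracting the measurements recorded for a sensor loaded with HA or HA-SS molecules and incubated in PBS/1% BSA. To remove nonspecific binding responses, a biotinylated gp120 resurfaced core molecule was loaded onto a streptavidin probe and incubated with the anti-stem antibodies, and the nonspecific responses were subtracted from the HA and HA-SS response data. Data analysis and curve fitting were carried out using Octet software, version 7.0. Experimental data were fitted with the binding equations describing a 1:1 interaction. Assuming that binding was reversible (full dissociation), global analysis of the complete data sets was carried out using nonlinear least-squares fitting, which allows a single set of binding parameters to be obtained simultaneously for all concentrations used in each experiment.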
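The 1:1 interaction model referred to above is the standard Langmuir binding model, which can be written out as a short Python sketch. This is not the Octet software's implementation, and the function and parameter names are illustrative assumptions; a global fit would minimize the squared residuals between these model curves and the measured responses across all analyte concentrations, with a shared k_on, k_off, and R_max.

```python
import math


def association_response(t, conc, k_on, k_off, r_max):
    """Response at time t (s) after analyte at concentration conc (M) is added.

    k_on is in 1/(M*s), k_off in 1/s, and r_max is the saturating response.
    """
    k_obs = k_on * conc + k_off                # observed rate constant
    r_eq = r_max * k_on * conc / k_obs         # plateau at this concentration
    return r_eq * (1.0 - math.exp(-k_obs * t))


def dissociation_response(t, r_start, k_off):
    """Response at time t (s) after transfer to buffer, decaying from r_start."""
    return r_start * math.exp(-k_off * t)


def dissociation_constant(k_on, k_off):
    """Equilibrium dissociation constant K_D = k_off / k_on, in M."""
    return k_off / k_on
```

When the analyte concentration equals K_D, the association plateau is half of R_max, which is a convenient check on any fitted parameter set.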
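ELISA, hemagglutination inhibition (HAI) assays, and pseudotype neutralization assays were performed as described previously (Wei, C.J., et al. Science 329:1060-1064 (2010)). Recombinant HA/NA lentiviral vectors expressing a luciferase reporter gene were produced as described (Wei, C.J., et al. Sci. Transl. Med. 2, 24ra21 (2010)). All influenza viruses were obtained from the Centers for Disease Control and Prevention (CDC; Atlanta, GA).
Gen4, Gen6, and Gen6' HA-SS-np each expressed as nanoparticles, as confirmed by transmission electron microscopy analysis and gel filtration (Figure 2). Gen5 HA-SS-np, however, failed to express. Gen6 and Gen6' HA-SS-np were selected for further evaluation and are referred to hereafter in these Examples as H1-SS-np and H1-SS-np', respectively. Cryo-electron microscopy (EM) analysis of H1-SS-np, performed at a resolution of [value not given], revealed symmetrical spherical particles, each with eight spikes protruding from the surface (Figure 2c). Notably, the membrane-proximal region of the Gen6 HA-SS stem fit the electron density better than that of Gen4 HA-SS, suggesting that the splaying was alleviated or no longer present (Figure 2c, left panel). Moreover, both H1-SS-np and H1-SS-np' had the desired antigenicity, being recognized by CR6261, CR9114, F10, and 70-5B03 in ELISA and bio-layer interferometry measurements (see Ekiert, D.C., et al. Science 324, 246-251 (2009); Sui, J., et al. Nat. Struct. Mol. Biol. 16, 265-273 (2009); Dreyfus, C., et al. Science 337, 1343-1348 (2012); Wrammert, J., et al. J. Exp. Med. 208, 181-193 (2011)), indicating that the authentic HA-SS structure was preserved after fusion to ferritin (Figures 1a, 1e, and 1f).
Example 3: Evaluation of vaccine efficacy
This example demonstrates the characterization of various measures of vaccine efficacy for ferritin nanoparticles fused to HA constructs.
The inventors used a calcium flux assay to evaluate the ability of H1-SS-np, compared with full-length HA-np, to trigger signaling through a membrane-anchored, germline-reverted CR6261 B cell receptor (BCR) (Novak, et al. Cytometry 17, 135-141 (1994)).
For the BCR activation assay, germline CR6261 BCRs (wild type and the double I53A/F54A mutant) were stably expressed by lentiviral transfection (FEEKW vector; Luo, X.M., et al. Blood 113, 1422-1431 (2009)) of a surface IgM-negative clone of the Ramos B cell line with the light chain and the membrane-anchored IgM heavy chain. Germline CR6261 BCR-positive cells were then sorted by flow cytometry (BD FACSAria; BD Biosciences) and expanded. Cells that were >95% positive for germline CR6261 BCR (wild type or I53A/F54A mutant) expression were evaluated for surface expression and correct HA antigenicity. For signaling, 1 × 10^6 Ramos B cells expressing germline CR6261 BCR were presented with 2500 nM of H1-SS-np, HA-np (the HA contained a Y98F mutation to abolish nonspecific binding to sialic acid), or empty np. The kinetics of calcium flux in response to BCR stimulation was measured by flow cytometry as the ratio of the Ca2+-bound to Ca2+-unbound states of the dye Fura Red. This ratio for Ca2+ flux is presented at 10 s after exposure to ligand. A 30-s baseline was acquired before stimulation. Ratiometric measurements for individual cells were averaged and smoothed by kinetic analysis in FlowJo software. The functionality of the germline CR6261 BCR versus the germline CR6261 BCR carrying the I53A/F54A mutations was compared by Ca2+ flux following exposure to 0.5 μg/μl anti-human IgM F(ab')2 (Southern Biotech).
In contrast to empty ferritin particles, H1-SS-np induced efficient signaling through the wild-type BCR, as did full-length HA-np to a lesser extent, and no signaling was observed through a BCR mutated at two key contact residues in the second heavy chain complementarity-determining region (CDR H2) (Figure 1g). This finding confirmed the ability of H1-SS-np to engage the IGHV1-69 germline precursor of CR6261 and to stimulate naive B cells through CDR H2-dependent recognition, a characteristic of the broadly neutralizing stem-directed antibodies found in humans.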
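To evaluate H1-SS-np vaccine efficacy, the inventors immunized mice and ferrets using the Sigma Adjuvant System (SAS), because SAS has been reported to induce HA responses similar to those induced by MF59, another squalene-based adjuvant approved for use in humans.
For the immunization studies, a total of three animal experiments were performed, two in mice and one in ferrets. In the first mouse experiment, female BALB/c mice (6-8 weeks old, Jackson Laboratories) were immunized intramuscularly at weeks 0 and 2 with 2 μg of H1-SS-np, 2 μg of empty ferritin np, 0.2 μg of H5 2005IND HA-np, or TIV (HA molar equivalent). Blood was collected 14 days after each immunization, and serum was isolated. For the second mouse immunization experiment, female BALB/c mice were immunized three times, at weeks 0, 8, and 12, with 3 μg of H1-SS-np or empty ferritin np. For ferret immunizations, 6-month-old male Fitch ferrets (Triple F Farms, Sayre, PA), seronegative for exposure to currently circulating pandemic H1N1, seasonal H1N1, H3N2, and B influenza strains, were housed and cared for at BIOQUAL, Inc. (Rockville, MD). These facilities are accredited by the American Association for the Accreditation of Laboratory Animal Care International and meet NIH standards as set forth in the "Guide for the Care and Use of Laboratory Animals". Ferrets were immunized intramuscularly at weeks 0 and 4 with 20 μg of H1-SS-np' or empty ferritin np, or with TIV (equivalent to 2.5 μg of H1 HA), in 500 μl of PBS. Ferrets in the positive control group were immunized with 250 μg of plasmid DNA expressing H5 2005IND, followed by 2.5 μg HA of H5N1 2005IND MIV, at weeks 0 and 4. Vaccines were administered by intramuscular injection into the upper thigh muscle. Sigma Adjuvant System (SAS, Sigma) was used for all protein- or np-based immunizations. Blood was collected 14 days after each immunization, and serum was isolated. Animal experiments were conducted in full compliance with all relevant federal regulations and NIH guidelines.
For the passive transfer study, 150 mice were first vaccinated with H1-SS-np protein (2 μg/dose, with SAS) at weeks 0 and 4 to generate HA-SS immune Ig, and sera were collected at week 1, week 2, and week 3 (terminal) after the boost. Ig was purified from the immune sera with protein G (Life Technologies) using the manufacturer's protocol. Twenty-four hours before challenge, two groups of BALB/c mice (n = 10/group, Taconic Inc.) received naive (Molecular Innovations) or immune Ig by the intraperitoneal route. Sera were collected from the infused animals 24 hours after passive transfer for serological analysis.
For virus challenge studies, H5N1 strain A/Vietnam/1203/04 was obtained from the Centers for Disease Control and Prevention (Atlanta, GA) (CDC #2004706280, E1/E3 (1/19/07)) and amplified in 10-day-old embryonated hen's eggs (Charles River, North Franklin, CT) at BIOQUAL Inc. The challenge stock had an infectious titer of 10^10 TCID50/ml. For blood collection, bleeding, and challenge procedures, animals were anesthetized with a ketamine/dexmedetomidine solution formulated to provide doses of 25 mg/kg ketamine and 0.001 mg/kg dexmedetomidine to each animal. Mice were inoculated intranasally with 50 μl of virus, approximately 25 μl per nostril, and ferrets were inoculated intranasally with 500 μl of virus, approximately 250 μl per nostril. The challenge doses were 25 LD50 in mice and 1000 TCID50 in ferrets. Based on previous studies, these challenge doses were expected to result in 100% lethality in naive control mice and ferrets, respectively. For ferrets, clinical signs of infection, weight, and temperature were recorded twice daily. Activity scores were assigned as follows: 0, alert and playful; 1, alert but playful only when stimulated; 2, alert but not playful when stimulated; and 3, neither alert nor playful when stimulated. Ferrets that showed signs of severe disease (prolonged fever; diarrhea; nasal discharge interfering with eating, drinking, or breathing; severe lethargy; or neurological signs) or that had weight loss of >20% were euthanized immediately.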
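The activity scale and the weight-loss endpoint described above can be encoded as a small Python sketch for record keeping. The function names and boolean inputs are hypothetical conveniences, not part of the study protocol; only the four score definitions and the >20% weight-loss criterion come from the text.

```python
def activity_score(alert, playful, playful_when_stimulated):
    """Map observations to the 0-3 activity scale described in the text.

    Combinations that the scale does not define raise ValueError.
    """
    if alert and playful:
        return 0  # alert and playful
    if alert and playful_when_stimulated:
        return 1  # alert but playful only when stimulated
    if alert:
        return 2  # alert but not playful when stimulated
    if not playful and not playful_when_stimulated:
        return 3  # neither alert nor playful when stimulated
    raise ValueError("observation does not match any defined activity score")


def weight_loss_percent(baseline_weight, current_weight):
    """Weight loss as a percentage of the baseline weight."""
    if baseline_weight <= 0:
        raise ValueError("baseline weight must be positive")
    return 100.0 * (baseline_weight - current_weight) / baseline_weight


def meets_weight_loss_endpoint(baseline_weight, current_weight, threshold=20.0):
    """True when weight loss strictly exceeds the threshold (20% in the text)."""
    return weight_loss_percent(baseline_weight, current_weight) > threshold
```

For instance, a ferret that falls from 1000 g to 790 g has lost 21% of its baseline weight and meets the endpoint, whereas a fall to 800 g (exactly 20%) does not.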
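H1-SS-np and H1-SS-np' elicited broad antibody responses against group 1 HA subtypes (seasonal and pandemic H1, H2, H5, and H9) in both mice and ferrets, respectively (Figures 3a, 3b, and 3c). In addition, H1-SS-np induced substantial group 2 (H3 and H7) responses, comparable to those against H2 and H5, in half of the mice (Figure 3a, left panel). In both mice and ferrets, antibody responses to the HA stem elicited by H1-SS-np were significantly higher than those elicited by trivalent inactivated influenza vaccine (TIV) (Figure 3b, right panel). Although a considerable response to ferritin was also observed (Figures 3a and 3b, left panels), previous studies have shown that immunization with bacterial ferritin does not induce immunity to autologous ferritin in mice, nor does it diminish HA-specific antibody responses to subsequent immunizations. Measurement of serum neutralizing activity (NT) using a highly sensitive HA-NA lentiviral reporter assay (Wei, C.J., et al. Sci. Transl. Med. 2, 24ra21 (2010)) revealed appreciable activity in both mice and ferrets against the divergent H1N1 strains A/California/04/2009 (2009CA) and A/Singapore/6/1986 (1986SG) and against the homologous 1999NC strain. However, NT against heterosubtypic H5N1 A/Vietnam/1203/2004 (H5N1 2004VN), human-origin H2N2 A/Canada/720/2005 (H2N2 2005CA), H7N9 A/Anhui/1/2013 (H7N9 2013AN), and H9N2 A/Hong Kong/1074/1999 (H9N2 1999HK) was low or undetectable in both mice and ferrets (Figures 3a and 3c). The minimal heterosubtypic neutralization observed despite strong heterosubtypic antibody reactivity is likely due to the precise targeting of a single epitope region that is required for stem neutralization, which makes neutralization more sensitive to minor structural differences than binding to the rest of the HA stem, which is 20 times larger in surface area. TIV-immunized animals had the highest NT against homologous 1999NC in both mice and ferrets, detectable NT against heterologous H1N1 strains, and no NT against heterosubtypic H5N1 (Figure 3b). As expected, TIV-immunized animals had significant hemagglutination inhibition (HAI) titers, and the NT activity elicited by H1-SS-np and H1-SS-np' was independent of HAI.
To assess protection, immunized mice and ferrets were challenged with a high lethal dose of highly pathogenic H5N1 2004VN virus. All naive mice and all mice immunized with empty np died; remarkably, all mice immunized with H1-SS-np survived (Figure 4a). All ferrets immunized with empty ferritin nanoparticles succumbed to infection, and all ferrets prime-boost immunized with H5N1 HA DNA/monovalent inactivated vaccine (MIV) survived (Figure 4b). Consistent with the mouse study, four of six ferrets immunized with the H1N1-based H1-SS-np' survived the H5N1 challenge. Although two of six TIV-immunized ferrets survived, one of the two survivors experienced severe weight loss (Figure 4a), and there was no evidence of an H5 serological response in the other survivor, which had minimal weight loss, suggesting that infection did not occur. With the exception of this one seronegative animal, the TIV-immunized group showed no difference in weight loss or fever compared with the empty ferritin-np controls and, as evidenced by post-challenge activity scores, showed greater illness than the H1-SS-np'-immunized ferrets. Compared with the empty ferritin-immunized controls, day-6 weight loss, fever, and illness based on activity scores were significantly reduced in H1-SS-np'-immunized ferrets (Figure 4). HAI titers against H5N1 2004VN present at day 14 post-challenge in the surviving ferrets indicate that although H1-SS-np' was able to prevent illness, it did not prevent infection. Tables 3 and 4 provide summaries of these immunization studies in mice and ferrets.
Table 3: Post-challenge serum HAI antibody titers against H1N1 1999NC and H5N1 2004VN in mice immunized with H1-SS-np.
*This mouse died 1 day before challenge.
Table 4: Pre-challenge HAI antibody titers against homologous H1N1 1999NC and post-challenge HAI antibody titers against the challenge strain H5N1 2004VN in ferrets immunized with the indicated regimens.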
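The negligible H5N1 NT activity elicited by H1-SS-np' (Figure 3c) did not explain the observed heterosubtypic protection. However, in HA-SS-np'-immunized ferrets there was a correlation between HA antibody titer and survival, and between antibody titer and body weight. To investigate this correlation further, the inventors passively transferred H1-SS-np immune Ig to naive mice (10 mg/animal) 24 hours before challenge with a high lethal dose of H5N1 2004VN virus. The transferred Ig had strong reactivity with group 1 HA subtypes (H1, H2, H5, and H9), weaker binding to group 2 subtypes (H3 and H7), and minimal NT activity (Figures 4d and 4e). The IC50 neutralization titers of H1-SS-np immune Ig against a variety of influenza pseudoviruses are shown in Table 5.
Table 5: IC50 pseudovirus neutralization titers of H1-SS-np immune Ig.
While all mice receiving naive Ig succumbed to infection, 8 of the 10 mice receiving immune Ig were completely protected from lethal heterosubtypic H5N1 challenge. Low serum reactivity to homologous H1 1999NC HA in the two mice that died in the immune Ig group indicates that they may not have received proper Ig administration (Figure 4c).
Together, these data indicate that antibody-mediated protection based on functional mechanisms other than neutralization, such as antibody-dependent cell-mediated cytotoxicity (ADCC) or antibody-dependent complement-mediated lysis, is responsible for the protection elicited by H1-SS-np and H1-SS-np' immunization. Influenza protection in mice by broadly neutralizing HA stem antibodies has been reported to depend on Fc interactions (DiLillo, et al. Nat Med 20, 143-151 (2014)), and cross-reactive ADCC against influenza HA in the absence of neutralization has been reported in both human and macaque plasma (Jegaskanda, S., et al. J Immunol 190, 1837-1848 (2013); Jegaskanda, et al. J. Virol. 87, 5512-5522 (2013); Jegaskanda, et al. J Immunol 193, 469-475 (2014)). Consistent with these reports, the results presented herein suggest that HA stem-based influenza vaccines need not necessarily focus on neutralizing epitopes in order to induce broad protection.
Using structure-based design and avoiding immunodominant responses to the HA head domain, in combination with a nanoparticle antigen-display platform, the inventors successfully generated an HA stem-only nanoparticle vaccine immunogen that elicited antibody-mediated heterosubtypic protective immunity against H5N1 disease in ferrets. These results demonstrate that elicitation of non-neutralizing antibodies by an HA stem-only nanoparticle vaccine can provide broad protection against severe disease and should be used in the development of universal influenza vaccines.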
SEQUENCE LISTING
<110> The United States of America, as represented by the Secretary, Department of Health and Human Services
Mascola, John R.
Boyington, Jeffrey C.
Yassine, Hadi M.
Kwong, Peter D.
Graham, Barney S.
Kanekiyo, Masaru
<120> Stabilized influenza hemagglutinin stem region trimers and uses thereof
<130> 6137NIAID-36-PCT
<140> Not yet assigned
<141> 2015-05-27
<150> 62/003,471
<151> 2014-05-27
<160> 401
<170> PatentIn version 3.5
<210> 1
<211> 504
<212> DNA
<213> Helicobacter pylori
<400> 1
atgctgtccg acatcatcaa gctgctgaac gaacaggtga acaaggagat gcagagctcc 60
aacctgtaca tgagtatgtc tagttggtgt tatacacact cactggacgg cgctgggctg 120
ttcctgtttg atcacgcagc cgaggaatac gaacatgcaa agaaactgat cattttcctg 180
aatgagaaca atgtgcccgt ccagctgact tcaatcagcg cccctgaaca taagttcgag 240
ggcctgaccc agatctttca gaaagcttac gaacacgagc agcatatttc cgaatctatc 300
aacaatattg tggaccacgc cattaagagc aaagatcatg ctaccttcaa ctttctgcag 360
tggtacgtgg ccgagcagca cgaggaggag gtcctgttta aggacatcct ggataaaatc 420
gaactgattg gaaacgagaa tcatggcctg tacctggcag atcagtatgt gaagggcatt 480
gccaagtcca gaaaaagtgg gtca 504
<210> 2
<211> 168
<212> PRT
<213> Helicobacter pylori
<400> 2
Met Leu Ser Asp Ile Ile Lys Leu Leu Asn Glu Gln Val Asn Lys Glu
1 5 10 15
Met Gln Ser Ser Asn Leu Tyr Met Ser Met Ser Ser Trp Cys Tyr Thr
20 25 30
His Ser Leu Asp Gly Ala Gly Leu Phe Leu Phe Asp His Ala Ala Glu
35 40 45
Glu Tyr Glu His Ala Lys Lys Leu Ile Ile Phe Leu Asn Glu Asn Asn
50 55 60
Val Pro Val Gln Leu Thr Ser Ile Ser Ala Pro Glu His Lys Phe Glu
65 70 75 80
Gly Leu Thr Gln Ile Phe Gln Lys Ala Tyr Glu His Glu Gln His Ile
85 90 95
Ser Glu Ser Ile Asn Asn Ile Val Asp His Ala Ile Lys Ser Lys Asp
100 105 110
His Ala Thr Phe Asn Phe Leu Gln Trp Tyr Val Ala Glu Gln His Glu
115 120 125
Glu Glu Val Leu Phe Lys Asp Ile Leu Asp Lys Ile Glu Leu Ile Gly
130 135 140
Asn Glu Asn His Gly Leu Tyr Leu Ala Asp Gln Tyr Val Lys Gly Ile
145 150 155 160
Ala Lys Ser Arg Lys Ser Gly Ser
165
<210> 3
<211> 504
<212> DNA
<213> Helicobacter pylori
<400> 3
tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60
atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120
ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180
aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240
tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300
ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360
ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420
actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480
cagcttgatg atgtcggaca gcat 504
<210> 4
<211> 492
<212> DNA
<213> Influenza A virus
<400> 4
atcatcaagc tgctgaacga acaggtgaac aaggagatgc agagctccaa cctgtacatg 60
agtatgtcta gttggtgtta tacacactca ctggacggcg ctgggctgtt cctgtttgat 120
cacgcagccg aggaatacga acatgcaaag aaactgatca ttttcctgaa tgagaacaat 180
gtgcccgtcc agctgacttc aatcagcgcc cctgaacata agttcgaggg cctgacccag 240
atctttcaga aagcttacga acacgagcag catatttccg aatctatcaa caatattgtg 300
gaccacgcca ttaagagcaa agatcatgct accttcaact ttctgcagtg gtacgtggcc 360
gagcagcacg aggaggaggt cctgtttaag gacatcctgg ataaaatcga actgattgga 420
aacgagaatc atggcctgta cctggcagat cagtatgtga agggcattgc caagtccaga 480
aaaagtgggt ca 492
<210> 5
<211> 165
<212> PRT
<213> Influenza A virus
<400> 5
Asp Ile Ile Lys Leu Leu Asn Glu Gln Val Asn Lys Glu Met Gln Ser
1 5 10 15
Ser Asn Leu Tyr Met Ser Met Ser Ser Trp Cys Tyr Thr His Ser Leu
20 25 30
Asp Gly Ala Gly Leu Phe Leu Phe Asp His Ala Ala Glu Glu Tyr Glu
35 40 45
His Ala Lys Lys Leu Ile Ile Phe Leu Asn Glu Asn Asn Val Pro Val
50 55 60
Gln Leu Thr Ser Ile Ser Ala Pro Glu His Lys Phe Glu Gly Leu Thr
65 70 75 80
Gln Ile Phe Gln Lys Ala Tyr Glu His Glu Gln His Ile Ser Glu Ser
85 90 95
Ile Asn Asn Ile Val Asp His Ala Ile Lys Ser Lys Asp His Ala Thr
100 105 110
Phe Asn Phe Leu Gln Trp Tyr Val Ala Glu Gln His Glu Glu Glu Val
115 120 125
Leu Phe Lys Asp Ile Leu Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn
130 135 140
His Gly Leu Tyr Leu Ala Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser
145 150 155 160
Arg Lys Ser Gly Ser
165
<210> 6
<211> 492
<212> DNA
<213> Influenza A virus
<400> 6
tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60
atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120
ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180
aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240
tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300
ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360
ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420
actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480
cagcttgatg at 492
<210> 7
<211> 1695
<212> DNA
<213> Influenza A virus
<400> 7
atgaaggcca aactgctggt gctgctgtgt acctttaccg ccacctacgc cgacacaatc 60
tgtatcggct accacgccaa caatagcacc gacaccgtgg atacagtgct ggagaagaac 120
gtgaccgtga cccactctgt gaacctgctg gaggacagcc acaatggcaa gctgtgtctg 180
ctgaaaggca ttgcccctct gcagctgggc aattgttctg tggccggatg gattctgggc 240
aaccccgagt gtgagctgct gatttctaag gagagctgga gctacatcgt ggagaccccc 300
aatcctgaga atggcacctg ctaccctggc tacttcgccg attacgagga gctgcgcgag 360
cagctgtcta gcgtgtccag cttcgagaga ttcgagatct tccccaagga gtccagctgg 420
cctaatcaca cagtgacagg cgtgtctgcc agctgtagcc acaacggcaa aagcagcttc 480
taccggaacc tgctgtggct gacaggcaag aatggcctgt accccaacct gagcaagagc 540
tacgtgaaca acaaggaaaa ggaagtgctg gtgctgtggg gagtgcacca ccctcccaac 600
atcggaaatc agcgggccct gtaccacaca gagaacgcct atgtgagcgt ggtgtccagc 660
cactacagca gaagattcac ccccgagatc gccaagagac ccaaagtgag agaccaggag 720
ggccggatca attactactg gaccctgctg gagcctggcg ataccatcat cttcgaggcc 780
aacggcaatc tgatcgcccc ttggtatgcc tttgccctga gcagaggctt tggcagcggc 840
atcatcacaa gcaacgcccc catggatgag tgtgatgcca agtgccagac acctcagggc 900
gccatcaata gcagcctgcc cttccagaat gtgcaccctg tgaccatcgg cgagtgcccc 960
aagtatgtga gaagcgccaa gctgagaatg gtgaccggcc tgagaaacat ccctagcatc 1020
cagagcagag gactgtttgg agccatcgcc ggattcatcg agggaggatg gacaggcatg 1080
gtggatggct ggtacggcta ccaccaccag aatgagcagg gctctggata tgccgccgat 1140
cagaagtcta cccagaacgc catcaacggc atcaccaaca aggtgaacag cgtgatcgag 1200
aagatgaaca cccagtttac cgctgtgggc aaggagttca acaagctgga gcggaggatg 1260
gagaacctga acaagaaggt ggacgacggc tttctggaca tctggaccta caatgccgaa 1320
ctcctggtcc tcctcgagaa tgagaggacc ctggacttcc acgacagcaa cgtgaagaac 1380
ctgtatgaga aggtgaagag ccagctgaag aacaacgcca aggagatcgg caacggctgc 1440
ttcgagttct accacaagtg taacaacgag tgtatggaga gcgtgaagaa cggcacctac 1500
gactacccta agtacagcga ggagagcaag ctgaaccggg agaagatcga tggcgtgaag 1560
ctggagagca tgggcgtgta tcagatcctg gccatctaca gcacagtggc ctcttctctg 1620
gtgctgctgg tgtctctggg cgccatctcc ttttggatgt gctccaacgg cagcctgcag 1680
tgcaggatct gtatc 1695
<210> 8
<211> 565
<212> PRT
<213> Influenza A virus
<400> 8
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Leu Glu Asp Ser His Asn Gly Lys Leu Cys Leu Leu Lys Gly Ile
50 55 60
Ala Pro Leu Gln Leu Gly Asn Cys Ser Val Ala Gly Trp Ile Leu Gly
65 70 75 80
Asn Pro Glu Cys Glu Leu Leu Ile Ser Lys Glu Ser Trp Ser Tyr Ile
85 90 95
Val Glu Thr Pro Asn Pro Glu Asn Gly Thr Cys Tyr Pro Gly Tyr Phe
100 105 110
Ala Asp Tyr Glu Glu Leu Arg Glu Gln Leu Ser Ser Val Ser Ser Phe
115 120 125
Glu Arg Phe Glu Ile Phe Pro Lys Glu Ser Ser Trp Pro Asn His Thr
130 135 140
Val Thr Gly Val Ser Ala Ser Cys Ser His Asn Gly Lys Ser Ser Phe
145 150 155 160
Tyr Arg Asn Leu Leu Trp Leu Thr Gly Lys Asn Gly Leu Tyr Pro Asn
165 170 175
Leu Ser Lys Ser Tyr Val Asn Asn Lys Glu Lys Glu Val Leu Val Leu
180 185 190
Trp Gly Val His His Pro Pro Asn Ile Gly Asn Gln Arg Ala Leu Tyr
195 200 205
His Thr Glu Asn Ala Tyr Val Ser Val Val Ser Ser His Tyr Ser Arg
210 215 220
Arg Phe Thr Pro Glu Ile Ala Lys Arg Pro Lys Val Arg Asp Gln Glu
225 230 235 240
Gly Arg Ile Asn Tyr Tyr Trp Thr Leu Leu Glu Pro Gly Asp Thr Ile
245 250 255
Ile Phe Glu Ala Asn Gly Asn Leu Ile Ala Pro Trp Tyr Ala Phe Ala
260 265 270
Leu Ser Arg Gly Phe Gly Ser Gly Ile Ile Thr Ser Asn Ala Pro Met
275 280 285
Asp Glu Cys Asp Ala Lys Cys Gln Thr Pro Gln Gly Ala Ile Asn Ser
290 295 300
Ser Leu Pro Phe Gln Asn Val His Pro Val Thr Ile Gly Glu Cys Pro
305 310 315 320
Lys Tyr Val Arg Ser Ala Lys Leu Arg Met Val Thr Gly Leu Arg Asn
325 330 335
Ile Pro Ser Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe
340 345 350
Ile Glu Gly Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His
355 360 365
His Gln Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr
370 375 380
Gln Asn Ala Ile Asn Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu
385 390 395 400
Lys Met Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn Lys Leu
405 410 415
Glu Arg Arg Met Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu
420 425 430
Asp Ile Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu
435 440 445
Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys
450 455 460
Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys
465 470 475 480
Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser Val Lys
485 490 495
Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn
500 505 510
Arg Glu Lys Ile Asp Gly Val Lys Leu Glu Ser Met Gly Val Tyr Gln
515 520 525
Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Leu Val
530 535 540
Ser Leu Gly Ala Ile Ser Phe Trp Met Cys Ser Asn Gly Ser Leu Gln
545 550 555 560
Cys Arg Ile Cys Ile
565
<210> 9
<211> 1695
<212> DNA
<213> Influenza A virus
<400> 9
gatacagatc ctgcactgca ggctgccgtt ggagcacatc caaaaggaga tggcgcccag 60
agacaccagc agcaccagag aagaggccac tgtgctgtag atggccagga tctgatacac 120
gcccatgctc tccagcttca cgccatcgat cttctcccgg ttcagcttgc tctcctcgct 180
gtacttaggg tagtcgtagg tgccgttctt cacgctctcc atacactcgt tgttacactt 240
gtggtagaac tcgaagcagc cgttgccgat ctccttggcg ttgttcttca gctggctctt 300
caccttctca tacaggttct tcacgttgct gtcgtggaag tccagggtcc tctcattctc 360
gaggaggacc aggagttcgg cattgtaggt ccagatgtcc agaaagccgt cgtccacctt 420
cttgttcagg ttctccatcc tccgctccag cttgttgaac tccttgccca cagcggtaaa 480
ctgggtgttc atcttctcga tcacgctgtt caccttgttg gtgatgccgt tgatggcgtt 540
ctgggtagac ttctgatcgg cggcatatcc agagccctgc tcattctggt ggtggtagcc 600
gtaccagcca tccaccatgc ctgtccatcc tccctcgatg aatccggcga tggctccaaa 660
cagtcctctg ctctggatgc tagggatgtt tctcaggccg gtcaccattc tcagcttggc 720
gcttctcaca tacttggggc actcgccgat ggtcacaggg tgcacattct ggaagggcag 780
gctgctattg atggcgccct gaggtgtctg gcacttggca tcacactcat ccatgggggc 840
gttgcttgtg atgatgccgc tgccaaagcc tctgctcagg gcaaaggcat accaaggggc 900
gatcagattg ccgttggcct cgaagatgat ggtatcgcca ggctccagca gggtccagta 960
gtaattgatc cggccctcct ggtctctcac tttgggtctc ttggcgatct cgggggtgaa 1020
tcttctgctg tagtggctgg acaccacgct cacataggcg ttctctgtgt ggtacagggc 1080
ccgctgattt ccgatgttgg gagggtggtg cactccccac agcaccagca cttccttttc 1140
cttgttgttc acgtagctct tgctcaggtt ggggtacagg ccattcttgc ctgtcagcca 1200
cagcaggttc cggtagaagc tgcttttgcc gttgtggcta cagctggcag acacgcctgt 1260
cactgtgtga ttaggccagc tggactcctt ggggaagatc tcgaatctct cgaagctgga 1320
cacgctagac agctgctcgc gcagctcctc gtaatcggcg aagtagccag ggtagcaggt 1380
gccattctca ggattggggg tctccacgat gtagctccag ctctccttag aaatcagcag 1440
ctcacactcg gggttgccca gaatccatcc ggccacagaa caattgccca gctgcagagg 1500
ggcaatgcct ttcagcagac acagcttgcc attgtggctg tcctccagca ggttcacaga 1560
gtgggtcacg gtcacgttct tctccagcac tgtatccacg gtgtcggtgc tattgttggc 1620
gtggtagccg atacagattg tgtcggcgta ggtggcggta aaggtacaca gcagcaccag 1680
cagtttggcc ttcat 1695
<210> 10
<211> 1698
<212> DNA
<213> Influenza A virus
<400> 10
atgaaggcta ttttggtcgt gctcctgtac acctttgcca cagccaatgc cgataccctt 60
tgtattggct accatgcaaa caactctacc gatacggtcg acacggtgct cgaaaagaat 120
gttactgtca cccactctgt gaacttgctg gaggataaac acaatggcaa gctctgcaaa 180
ctgcgagggg tggctcccct gcatctggga aaatgtaata ttgccggctg gatactgggt 240
aatccagaat gcgaatcctt gagtacggca tccagttggt cctatatcgt cgagaccccg 300
tcaagtgaca atgggacctg ctacccaggc gacttcattg attatgaaga gctgagggag 360
cagttgtcat ccgtaagcag cttcgaaagg tttgagattt tcccgaaaac tagctcctgg 420
cccaatcatg actctaacaa aggagttact gcagcctgtc ctcatgcggg cgcgaaaagc 480
ttctacaaga acctgatatg gctcgtgaag aaaggcaatt catacccaaa actgtctaag 540
agctacataa acgataaagg gaaagaggtt ctggtgcttt ggggcataca ccacccatct 600
acctcagccg accagcagtc tctgtatcag aacgccgaca catacgtgtt tgtgggcagc 660
tcccgctatt ctaagaagtt caaacccgag atcgccatca gaccaaaggt gagagaccag 720
gaaggaagga tgaattatta ctggaccttg gtcgaacctg gcgataagat aacgtttgag 780
gctacgggca acctggtcgt gccgagatat gcttttgcca tggagaggaa tgcggggagc 840
ggaattatca tcagcgacac tccagttcat gactgtaata ccacatgtca gacaccgaag 900
ggcgccatca acacgagctt gccctttcag aatatacatc caatcacaat cggaaaatgc 960
cccaagtacg tgaaaagcac taaactgaga ctcgccaccg gactcaggaa tatcccaagc 1020
atccagtcac ggggtctgtt cggcgctatc gccggattta ttgaaggcgg ctggacgggg 1080
atggtggacg gttggtacgg ctaccatcat caaaatgagc agggctccgg atacgccgct 1140
gacctgaaat ctacgcagaa tgccatagat gagatcacaa acaaggtcaa tagtgtgata 1200
gaaaaaatga atactcagtt cacagctgtt ggaaaggagt ttaaccacct cgagaagcga 1260
attgagaacc tgaacaagaa ggtggacgat ggctttttgg atatctggac gtataacgct 1320
gagctgcttg ttctgctgga gaacgaaaga acccttgact accacgattc caacgtgaag 1380
aatctgtatg agaaagtgcg aagccagttg aaaaacaacg caaaagaaat aggcaacggc 1440
tgtttcgagt tctaccacaa atgcgataac acctgcatgg agagtgtgaa gaacggaacg 1500
tacgattatc caaaatactc cgaggaggcc aaactcaata gggaggagat agacggtgtt 1560
aagctggagt ccacacgcat ctatcagatt ctggcgatct actctactgt ggcttccagc 1620
ctggtgctgg tcgtttccct tggggcgatc agcttctgga tgtgcagcaa tggctccctg 1680
caatgccgca tctgcatc 1698
<210> 11
<211> 566
<212> PRT
<213> Influenza A virus
<400> 11
Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Ala Thr Ala Asn
1 5 10 15
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Leu Glu Asp Lys His Asn Gly Lys Leu Cys Lys Leu Arg Gly Val
50 55 60
Ala Pro Leu His Leu Gly Lys Cys Asn Ile Ala Gly Trp Ile Leu Gly
65 70 75 80
Asn Pro Glu Cys Glu Ser Leu Ser Thr Ala Ser Ser Trp Ser Tyr Ile
85 90 95
Val Glu Thr Pro Ser Ser Asp Asn Gly Thr Cys Tyr Pro Gly Asp Phe
100 105 110
Ile Asp Tyr Glu Glu Leu Arg Glu Gln Leu Ser Ser Val Ser Ser Phe
115 120 125
Glu Arg Phe Glu Ile Phe Pro Lys Thr Ser Ser Trp Pro Asn His Asp
130 135 140
Ser Asn Lys Gly Val Thr Ala Ala Cys Pro His Ala Gly Ala Lys Ser
145 150 155 160
Phe Tyr Lys Asn Leu Ile Trp Leu Val Lys Lys Gly Asn Ser Tyr Pro
165 170 175
Lys Leu Ser Lys Ser Tyr Ile Asn Asp Lys Gly Lys Glu Val Leu Val
180 185 190
Leu Trp Gly Ile His His Pro Ser Thr Ser Ala Asp Gln Gln Ser Leu
195 200 205
Tyr Gln Asn Ala Asp Thr Tyr Val Phe Val Gly Ser Ser Arg Tyr Ser
210 215 220
Lys Lys Phe Lys Pro Glu Ile Ala Ile Arg Pro Lys Val Arg Asp Gln
225 230 235 240
Glu Gly Arg Met Asn Tyr Tyr Trp Thr Leu Val Glu Pro Gly Asp Lys
245 250 255
Ile Thr Phe Glu Ala Thr Gly Asn Leu Val Val Pro Arg Tyr Ala Phe
260 265 270
Ala Met Glu Arg Asn Ala Gly Ser Gly Ile Ile Ile Ser Asp Thr Pro
275 280 285
Val His Asp Cys Asn Thr Thr Cys Gln Thr Pro Lys Gly Ala Ile Asn
290 295 300
Thr Ser Leu Pro Phe Gln Asn Ile His Pro Ile Thr Ile Gly Lys Cys
305 310 315 320
Pro Lys Tyr Val Lys Ser Thr Lys Leu Arg Leu Ala Thr Gly Leu Arg
325 330 335
Asn Ile Pro Ser Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly
340 345 350
Phe Ile Glu Gly Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr
355 360 365
His His Gln Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser
370 375 380
Thr Gln Asn Ala Ile Asp Glu Ile Thr Asn Lys Val Asn Ser Val Ile
385 390 395 400
Glu Lys Met Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn His
405 410 415
Leu Glu Lys Arg Ile Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe
420 425 430
Leu Asp Ile Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn
435 440 445
Glu Arg Thr Leu Asp Tyr His Asp Ser Asn Val Lys Asn Leu Tyr Glu
450 455 460
Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly
465 470 475 480
Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser Val
485 490 495
Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys Leu
500 505 510
Asn Arg Glu Glu Ile Asp Gly Val Lys Leu Glu Ser Thr Arg Ile Tyr
515 520 525
Gln Ile Leu Ala Ile Tyr Ser Thr Val Ala Ser Ser Leu Val Leu Val
530 535 540
Val Ser Leu Gly Ala Ile Ser Phe Trp Met Cys Ser Asn Gly Ser Leu
545 550 555 560
Gln Cys Arg Ile Cys Ile
565
<210> 12
<211> 1698
<212> DNA
<213> Influenza A virus
<400> 12
gatgcagatg cggcattgca gggagccatt gctgcacatc cagaagctga tcgccccaag 60
ggaaacgacc agcaccaggc tggaagccac agtagagtag atcgccagaa tctgatagat 120
gcgtgtggac tccagcttaa caccgtctat ctcctcccta ttgagtttgg cctcctcgga 180
gtattttgga taatcgtacg ttccgttctt cacactctcc atgcaggtgt tatcgcattt 240
gtggtagaac tcgaaacagc cgttgcctat ttcttttgcg ttgtttttca actggcttcg 300
cactttctca tacagattct tcacgttgga atcgtggtag tcaagggttc tttcgttctc 360
cagcagaaca agcagctcag cgttatacgt ccagatatcc aaaaagccat cgtccacctt 420
cttgttcagg ttctcaattc gcttctcgag gtggttaaac tcctttccaa cagctgtgaa 480
ctgagtattc attttttcta tcacactatt gaccttgttt gtgatctcat ctatggcatt 540
ctgcgtagat ttcaggtcag cggcgtatcc ggagccctgc tcattttgat gatggtagcc 600
gtaccaaccg tccaccatcc ccgtccagcc gccttcaata aatccggcga tagcgccgaa 660
cagaccccgt gactggatgc ttgggatatt cctgagtccg gtggcgagtc tcagtttagt 720
gcttttcacg tacttggggc attttccgat tgtgattgga tgtatattct gaaagggcaa 780
gctcgtgttg atggcgccct tcggtgtctg acatgtggta ttacagtcat gaactggagt 840
gtcgctgatg ataattccgc tccccgcatt cctctccatg gcaaaagcat atctcggcac 900
gaccaggttg cccgtagcct caaacgttat cttatcgcca ggttcgacca aggtccagta 960
ataattcatc cttccttcct ggtctctcac ctttggtctg atggcgatct cgggtttgaa 1020
cttcttagaa tagcgggagc tgcccacaaa cacgtatgtg tcggcgttct gatacagaga 1080
ctgctggtcg gctgaggtag atgggtggtg tatgccccaa agcaccagaa cctctttccc 1140
tttatcgttt atgtagctct tagacagttt tgggtatgaa ttgcctttct tcacgagcca 1200
tatcaggttc ttgtagaagc ttttcgcgcc cgcatgagga caggctgcag taactccttt 1260
gttagagtca tgattgggcc aggagctagt tttcgggaaa atctcaaacc tttcgaagct 1320
gcttacggat gacaactgct ccctcagctc ttcataatca atgaagtcgc ctgggtagca 1380
ggtcccattg tcacttgacg gggtctcgac gatataggac caactggatg ccgtactcaa 1440
ggattcgcat tctggattac ccagtatcca gccggcaata ttacattttc ccagatgcag 1500
gggagccacc cctcgcagtt tgcagagctt gccattgtgt ttatcctcca gcaagttcac 1560
agagtgggtg acagtaacat tcttttcgag caccgtgtcg accgtatcgg tagagttgtt 1620
tgcatggtag ccaatacaaa gggtatcggc attggctgtg gcaaaggtgt acaggagcac 1680
gaccaaaata gccttcat 1698
<210> 13
<211> 1683
<212> DNA
<213> Influenza A virus
<400> 13
atggccatca tctacctgat cctgctgttt acagctgtga gaggcgacca gatctgtatc 60
ggctaccacg ccaacaatag caccgagaag gtggacacca tcctggagag aaacgtgaca 120
gtgacccacg ccaaggacat cctggaaaag acccacaacg gcaagctgtg taagctgaac 180
ggcatccctc ctctggaact gggcgattgt tctatcgccg gatggctgct gggaaacccc 240
gagtgtgata ggctgctgtc tgtgcctgag tggagctaca tcatggagaa ggagaaccct 300
agggacggcc tgtgttaccc tggcagcttc aacgattacg aggagctgaa gcacctgctg 360
tctagcgtga agcacttcga gaaggtgaag atcctgccca aggacagatg gacccagcac 420
acaacaacag gaggaagcag agcctgcgcc gtgtctggca accccagctt cttccggaat 480
atggtgtggc tgaccaagaa gggcagcaat taccctgtgg cccagggcag ctacaataat 540
accagcggcg agcagatgct gatcatctgg ggagtgcacc accctaatga cgagaccgag 600
cagagaaccc tgtaccagaa tgtgggcacc tacgtgtctg tgggcaccag caccctgaat 660
aagagaagca cccccgagat tgccacaaga cccaaggtga acggccaggg aggaagaatg 720
gagttcagct ggaccctgct ggatatgtgg gacaccatca actttgagag caccggcaat 780
ctgatcgccc ctgagtacgg cttcaagatc agcaagagag gcagcagcgg catcatgaaa 840
accgagggca ccctggagaa ttgtgagacc aagtgccaga cacctctggg cgccatcaat 900
accaccctgc ccttccacaa tgtgcaccct ctgaccatcg gcgagtgccc taagtatgtg 960
aagagcgaga agctggtgct ggccacagga ctgagaaacg tgccccagat cgagagcaga 1020
ggcctgtttg gagccatcgc cggattcatc gagggaggat ggcagggaat ggtcgatggc 1080
tggtacggct accaccacag caatgatcag ggctctggct atgccgccga taaggagtct 1140
acccagaagg cctttgacgg catcaccaac aaggtgaaca gcgtgatcga gaagatgaac 1200
acccagtttg aggctgtggg caaggagttt agcaacctgg agcggagact ggagaacctg 1260
aacaagaaga tggaggacgg cttcctggat gtgtggacct acaatgccga actgctggtg 1320
ctgatggaga atgagcggac cctggacttc cacgacagca acgtgaagaa cctgtacgac 1380
aaagtgagga tgcagctgag ggacaacgtg aaggaactgg gcaatggctg cttcgagttc 1440
taccacaagt gtgacgacga gtgtatgaac tccgtgaaga acggcaccta cgactaccct 1500
aagtacgagg aggagagcaa gctgaaccgg aacgagatca agggcgtgaa gctgtctagc 1560
atgggcgtgt atcagatcct ggccatctat gccacagtgg ccggatctct gagcctggca 1620
attatgatgg ctggaatcag cttctggatg tgctccaatg gcagcctgca gtgccggatc 1680
tgt 1683
<210> 14
<211> 564
<212> PRT
<213> Influenza A virus
<400> 14
Met Ala Ile Ile Tyr Leu Ile Leu Leu Phe Thr Ala Val Arg Gly Asp
1 5 10 15
Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Lys Val Asp
20 25 30
Thr Ile Leu Glu Arg Asn Val Thr Val Thr His Ala Lys Asp Ile Leu
35 40 45
Glu Lys Thr His Asn Gly Lys Leu Cys Lys Leu Asn Gly Ile Pro Pro
50 55 60
Leu Glu Leu Gly Asp Cys Ser Ile Ala Gly Trp Leu Leu Gly Asn Pro
65 70 75 80
Glu Cys Asp Arg Leu Leu Ser Val Pro Glu Trp Ser Tyr Ile Met Glu
85 90 95
Lys Glu Asn Pro Arg Asp Gly Leu Cys Tyr Pro Gly Ser Phe Asn Asp
100 105 110
Tyr Glu Glu Leu Lys His Leu Leu Ser Ser Val Lys His Phe Glu Lys
115 120 125
Val Lys Ile Leu Pro Lys Asp Arg Trp Thr Gln His Thr Thr Thr Gly
130 135 140
Gly Ser Arg Ala Cys Ala Val Ser Gly Asn Pro Ser Phe Phe Arg Asn
145 150 155 160
Met Val Trp Leu Thr Lys Lys Gly Ser Asn Tyr Pro Val Ala Lys Gly
165 170 175
Ser Tyr Asn Asn Thr Ser Gly Glu Gln Met Leu Ile Ile Trp Gly Val
180 185 190
His His Pro Asn Asp Glu Thr Glu Gln Arg Thr Leu Tyr Gln Asn Val
195 200 205
Gly Thr Tyr Val Ser Val Gly Thr Ser Thr Leu Asn Lys Arg Ser Thr
210 215 220
Pro Asp Tyr His Ile Ala Thr Arg Pro Lys Val Asn Gly Gln Gly Gly
225 230 235 240
Arg Met Glu Phe Ser Trp Thr Leu Leu Asp Met Trp Asp Thr Ile Asn
245 250 255
Phe Glu Ser Thr Gly Asn Leu Ile Ala Pro Glu Tyr Gly Phe Lys Ile
260 265 270
Ser Lys Arg Gly Ser Ser Gly Ile Met Lys Thr Glu Gly Thr Leu Glu
275 280 285
Asn Cys Glu Thr Lys Cys Gln Thr Pro Leu Gly Ala Ile Asn Thr Thr
290 295 300
Leu Pro Phe His Asn Val His Pro Leu Thr Ile Gly Glu Cys Pro Lys
305 310 315 320
Tyr Val Lys Ser Glu Lys Leu Val Leu Ala Thr Gly Leu Arg Asn Val
325 330 335
Pro Gln Ile Glu Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile
340 345 350
Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His His
355 360 365
Ser Asn Asp Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln
370 375 380
Lys Ala Phe Asp Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys
385 390 395 400
Met Asn Thr Gln Phe Glu Ala Val Gly Lys Glu Phe Ser Asn Leu Glu
405 410 415
Arg Arg Leu Glu Asn Leu Asn Lys Lys Met Glu Asp Gly Phe Leu Asp
420 425 430
Val Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Met Glu Asn Glu Arg
435 440 445
Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys Val
450 455 460
Arg Met Gln Leu Arg Asp Asn Val Lys Glu Leu Gly Asn Gly Cys Phe
465 470 475 480
Glu Phe Tyr His Lys Cys Asp Asp Glu Cys Met Asn Ser Val Lys Asn
485 490 495
Gly Thr Tyr Asp Tyr Pro Lys Tyr Glu Glu Glu Ser Lys Leu Asn Arg
500 505 510
Asn Glu Ile Lys Gly Val Lys Leu Ser Ser Met Gly Val Tyr Gln Ile
515 520 525
Leu Ala Ile Tyr Ala Thr Val Ala Gly Ser Leu Ser Leu Ala Ile Met
530 535 540
Met Ala Gly Ile Ser Phe Trp Met Cys Ser Asn Gly Ser Leu Gln Cys
545 550 555 560
Arg Ile Cys Ile
<210> 15
<211> 1683
<212> DNA
<213> Influenza A virus
<400> 15
acagatccgg cactgcaggc tgccattgga gcacatccag aagctgattc cagccatcat 60
aattgccagg ctcagagatc cggccactgt ggcatagatg gccaggatct gatacacgcc 120
catgctagac agcttcacgc ccttgatctc gttccggttc agcttgctct cctcctcgta 180
cttagggtag tcgtaggtgc cgttcttcac ggagttcata cactcgtcgt cacacttgtg 240
gtagaactcg aagcagccat tgcccagttc cttcacgttg tccctcagct gcatcctcac 300
tttgtcgtac aggttcttca cgttgctgtc gtggaagtcc agggtccgct cattctccat 360
cagcaccagc agttcggcat tgtaggtcca cacatccagg aagccgtcct ccatcttctt 420
gttcaggttc tccagtctcc gctccaggtt gctaaactcc ttgcccacag cctcaaactg 480
ggtgttcatc ttctcgatca cgctgttcac cttgttggtg atgccgtcaa aggccttctg 540
ggtagactcc ttatcggcgg catagccaga gccctgatca ttgctgtggt ggtagccgta 600
ccagccatcg accattccct gccatcctcc ctcgatgaat ccggcgatgg ctccaaacag 660
gcctctgctc tcgatctggg gcacgtttct cagtcctgtg gccagcacca gcttctcgct 720
cttcacatac ttagggcact cgccgatggt cagagggtgc acattgtgga agggcagggt 780
ggtattgatg gcgcccagag gtgtctggca cttggtctca caattctcca gggtgccctc 840
ggttttcatg atgccgctgc tgcctctctt gctgatcttg aagccgtact caggggcgat 900
cagattgccg gtgctctcaa agttgatggt gtcccacata tccagcaggg tccagctgaa 960
ctccattctt cctccctggc cgttcacctt gggtcttgtg gcaatctcgg gggtgcttct 1020
cttattcagg gtgctggtgc ccacagacac gtaggtgccc acattctggt acagggttct 1080
ctgctcggtc tcgtcattag ggtggtgcac tccccagatg atcagcatct gctcgccgct 1140
ggtattattg tagctgccct gggccacagg gtaattgctg cccttcttgg tcagccacac 1200
catattccgg aagaagctgg ggttgccaga cacggcgcag gctctgcttc ctcctgttgt 1260
tgtgtgctgg gtccatctgt ccttgggcag gatcttcacc ttctcgaagt gcttcacgct 1320
agacagcagg tgcttcagct cctcgtaatc gttgaagctg ccagggtaac acaggccgtc 1380
cctagggttc tccttctcca tgatgtagct ccactcaggc acagacagca gcctatcaca 1440
ctcggggttt cccagcagcc atccggcgat agaacaatcg cccagttcca gaggagggat 1500
gccgttcagc ttacacagct tgccgttgtg ggtcttttcc aggatgtcct tggcgtgggt 1560
cactgtcacg tttctctcca ggatggtgtc caccttctcg gtgctattgt tggcgtggta 1620
gccgatacag atctggtcgc ctctcacagc tgtaaacagc aggatcaggt agatgatggc 1680
cat 1683
<210> 16
<211> 1704
<212> DNA
<213> Influenza A virus
<400> 16
atggaaaaga tcgtgctgct gctggccatt gtgagcctgg tgaagagcga ccagatctgc 60
attggctacc acgccaacaa tagcacagag caggtggaca ccatcatgga aaaaaacgtg 120
accgtgaccc acgctcagga catcctggaa aagacccaca acggcaagct gtgtgatctg 180
gacggcgtga agcctctgat cctgagagat tgtagcgtgg ctggatggct gctgggcaac 240
cctatgtgcg acgagttcat caacgtgccc gagtggagct atatcgtgga gaaggccaac 300
cccaccaacg atctgtgtta ccccggcagc ttcaacgatt acgaggaact gaagcacctg 360
ctgtcccgga tcaaccactt cgagaagatc cagatcatcc ccaagtcctc ttggagcgat 420
cacgaagcct ctagcggagt gtctagcgcc tgtccttacc tgggcagccc cagcttcttc 480
agaaacgtgg tgtggctgat caagaagaac agcacctacc ccaccatcaa gaagagctac 540
aacaacacca accaggaaga tctgctggtc ctgtggggaa tccaccaccc taatgatgcc 600
gccgagcaga ccagactgta ccagaacccc accacctata tcagcatcgg caccagcacc 660
ctgaatcaga gactggtgcc caagatcgcc accagatcca aggtgaacgg ccagagcggc 720
aggatggaat tcttctggac catcctgaag cccaacgacg ccatcaactt cgagagcaac 780
ggcaacttta tcgcccctga gtacgcctac aagatcgtga agaagggcga cagcgccatc 840
atgaagagcg agctggaata cggcaactgc aacaccaagt gccagacacc tatgggcgcc 900
atcaacagca gcatgccctt ccacaacatc caccctctga ccatcggcga gtgccctaag 960
tacgtgaaga gcaacagact ggtgctggcc acaggcctga gaaatagccc ccagcgggag 1020
agcagaagaa agaagagggg cctgtttgga gccatcgccg gctttattga aggcggctgg 1080
cagggaatgg tggatggctg gtacggctac caccacagca atgagcaggg ctctggatat 1140
gccgccgaca aagagtctac ccagaaggcc atcgacggcg tcaccaacaa ggtgaacagc 1200
atcatcgaca agatgaacac ccagttcgag gctgtgggca gagagttcaa caacctggaa 1260
cggcggatcg agaacctgaa caagaaaatg gaagatggct tcctggatgt gtggacctac 1320
aatgccgaac tgctggtgct gatggaaaac gagcggaccc tggacttcca cgacagcaac 1380
gtgaagaacc tgtacgacaa agtgcggctg cagctgagag acaacgccaa agagctgggc 1440
aacggctgct tcgagttcta ccacaagtgc gacaacgagt gcatggaaag catccggaac 1500
ggcacctaca actaccctca gtacagcgag gaagccaggc tgaagaggga agagatcagc 1560
ggcgtgaaac tggaatccat cggcacctac cagatcctga gcatctacag cacagtggcc 1620
tcttctctgg ccctggccat tatgatggcc ggactgagcc tgtggatgtg cagcaatggc 1680
agcctgcagt gcaggatctg catc 1704
<210> 17
<211> 568
<212> PRT
<213> Influenza A virus
<400> 17
Met Glu Lys Ile Val Leu Leu Leu Ala Ile Val Ser Leu Val Lys Ser
1 5 10 15
Asp Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Gln Val
20 25 30
Asp Thr Ile Met Glu Lys Asn Val Thr Val Thr His Ala Gln Asp Ile
35 40 45
Leu Glu Lys Thr His Asn Gly Lys Leu Cys Asp Leu Asp Gly Val Lys
50 55 60
Pro Leu Ile Leu Arg Asp Cys Ser Val Ala Gly Trp Leu Leu Gly Asn
65 70 75 80
Pro Met Cys Asp Glu Phe Ile Asn Val Pro Glu Trp Ser Tyr Ile Val
85 90 95
Glu Lys Ala Asn Pro Thr Asn Asp Leu Cys Tyr Pro Gly Ser Phe Asn
100 105 110
Asp Tyr Glu Glu Leu Lys His Leu Leu Ser Arg Ile Asn His Phe Glu
115 120 125
Lys Ile Gln Ile Ile Pro Lys Ser Ser Trp Ser Asp His Glu Ala Ser
130 135 140
Ser Gly Val Ser Ser Ala Cys Pro Tyr Leu Gly Ser Pro Ser Phe Phe
145 150 155 160
Arg Asn Val Val Trp Leu Ile Lys Lys Asn Ser Thr Tyr Pro Thr Ile
165 170 175
Lys Lys Ser Tyr Asn Asn Thr Asn Gln Glu Asp Leu Leu Val Leu Trp
180 185 190
Gly Ile His His Pro Asn Asp Ala Ala Glu Gln Thr Arg Leu Tyr Gln
195 200 205
Asn Pro Thr Thr Tyr Ile Ser Ile Gly Thr Ser Thr Leu Asn Gln Arg
210 215 220
Leu Val Pro Lys Ile Ala Thr Arg Ser Lys Val Asn Gly Gln Ser Gly
225 230 235 240
Arg Met Glu Phe Phe Trp Thr Ile Leu Lys Pro Asn Asp Ala Ile Asn
245 250 255
Phe Glu Ser Asn Gly Asn Phe Ile Ala Pro Glu Tyr Ala Tyr Lys Ile
260 265 270
Val Lys Lys Gly Asp Ser Ala Ile Met Lys Ser Glu Leu Glu Tyr Gly
275 280 285
Asn Cys Asn Thr Lys Cys Gln Thr Pro Met Gly Ala Ile Asn Ser Ser
290 295 300
Met Pro Phe His Asn Ile His Pro Leu Thr Ile Gly Glu Cys Pro Lys
305 310 315 320
Tyr Val Lys Ser Asn Arg Leu Val Leu Ala Thr Gly Leu Arg Asn Ser
325 330 335
Pro Gln Arg Glu Ser Arg Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile
340 345 350
Ala Gly Phe Ile Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr
355 360 365
Gly Tyr His His Ser Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys
370 375 380
Glu Ser Thr Gln Lys Ala Ile Asp Gly Val Thr Asn Lys Val Asn Ser
385 390 395 400
Ile Ile Asp Lys Met Asn Thr Gln Phe Glu Ala Val Gly Arg Glu Phe
405 410 415
Asn Asn Leu Glu Arg Arg Ile Glu Asn Leu Asn Lys Lys Met Glu Asp
420 425 430
Gly Phe Leu Asp Val Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Met
435 440 445
Glu Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu
450 455 460
Tyr Asp Lys Val Arg Leu Gln Leu Arg Asp Asn Ala Lys Glu Leu Gly
465 470 475 480
Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Glu Cys Met Glu
485 490 495
Ser Ile Arg Asn Gly Thr Tyr Asn Tyr Pro Gln Tyr Ser Glu Glu Ala
500 505 510
Arg Leu Lys Arg Glu Glu Ile Ser Gly Val Lys Leu Glu Ser Ile Gly
515 520 525
Thr Tyr Gln Ile Leu Ser Ile Tyr Ser Thr Val Ala Ser Ser Leu Ala
530 535 540
Leu Ala Ile Met Met Ala Gly Leu Ser Leu Trp Met Cys Ser Asn Gly
545 550 555 560
Ser Leu Gln Cys Arg Ile Cys Ile
565
<210> 18
<211> 1704
<212> DNA
<213> Influenza A virus
<400> 18
gatgcagatc ctgcactgca ggctgccatt gctgcacatc cacaggctca gtccggccat 60
cataatggcc agggccagag aagaggccac tgtgctgtag atgctcagga tctggtaggt 120
gccgatggat tccagtttca cgccgctgat ctcttccctc ttcagcctgg cttcctcgct 180
gtactgaggg tagttgtagg tgccgttccg gatgctttcc atgcactcgt tgtcgcactt 240
gtggtagaac tcgaagcagc cgttgcccag ctctttggcg ttgtctctca gctgcagccg 300
cactttgtcg tacaggttct tcacgttgct gtcgtggaag tccagggtcc gctcgttttc 360
catcagcacc agcagttcgg cattgtaggt ccacacatcc aggaagccat cttccatttt 420
cttgttcagg ttctcgatcc gccgttccag gttgttgaac tctctgccca cagcctcgaa 480
ctgggtgttc atcttgtcga tgatgctgtt caccttgttg gtgacgccgt cgatggcctt 540
ctgggtagac tctttgtcgg cggcatatcc agagccctgc tcattgctgt ggtggtagcc 600
gtaccagcca tccaccattc cctgccagcc gccttcaata aagccggcga tggctccaaa 660
caggcccctc ttctttcttc tgctctcccg ctgggggcta tttctcaggc ctgtggccag 720
caccagtctg ttgctcttca cgtacttagg gcactcgccg atggtcagag ggtggatgtt 780
gtggaagggc atgctgctgt tgatggcgcc cataggtgtc tggcacttgg tgttgcagtt 840
gccgtattcc agctcgctct tcatgatggc gctgtcgccc ttcttcacga tcttgtaggc 900
gtactcaggg gcgataaagt tgccgttgct ctcgaagttg atggcgtcgt tgggcttcag 960
gatggtccag aagaattcca tcctgccgct ctggccgttc accttggatc tggtggcgat 1020
cttgggcacc agtctctgat tcagggtgct ggtgccgatg ctgatatagg tggtggggtt 1080
ctggtacagt ctggtctgct cggcggcatc attagggtgg tggattcccc acaggaccag 1140
cagatcttcc tggttggtgt tgttgtagct cttcttgatg gtggggtagg tgctgttctt 1200
cttgatcagc cacaccacgt ttctgaagaa gctggggctg cccaggtaag gacaggcgct 1260
agacactccg ctagaggctt cgtgatcgct ccaagaggac ttggggatga tctggatctt 1320
ctcgaagtgg ttgatccggg acagcaggtg cttcagttcc tcgtaatcgt tgaagctgcc 1380
ggggtaacac agatcgttgg tggggttggc cttctccacg atatagctcc actcgggcac 1440
gttgatgaac tcgtcgcaca tagggttgcc cagcagccat ccagccacgc tacaatctct 1500
caggatcaga ggcttcacgc cgtccagatc acacagcttg ccgttgtggg tcttttccag 1560
gatgtcctga gcgtgggtca cggtcacgtt tttttccatg atggtgtcca cctgctctgt 1620
gctattgttg gcgtggtagc caatgcagat ctggtcgctc ttcaccaggc tcacaatggc 1680
cagcagcagc acgatctttt ccat 1704
<210> 19
<211> 147
<212> DNA
<213> Influenza A virus
<400> 19
atgaaggcca aactgctggt gctgctgtgt acctttaccg ccacctacgc cgacacaatc 60
tgtatcggct accacgccaa caatagcacc gacaccgtgg atacagtgct ggagaagaac 120
gtgaccgtga cccactctgt gaacctg 147
<210> 20
<211> 49
<212> PRT
<213> Influenza A virus
<400> 20
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu
<210> 21
<211> 147
<212> DNA
<213> Influenza A virus
<400> 21
caggttcaca gagtgggtca cggtcacgtt cttctccagc actgtatcca cggtgtcggt 60
gctattgttg gcgtggtagc cgatacagat tgtgtcggcg taggtggcgg taaaggtaca 120
cagcagcacc agcagtttgg ccttcat 147
<210> 22
<211> 678
<212> DNA
<213> Influenza A virus
<400> 22
gatgccaagt gccagacacc tcagggcgcc atcaatagca gcctgccctt ccagaatgtg 60
caccctgtga ccatcggcga gtgccccaag tatgtgagaa gcgccaagct gagaatggtg 120
accggcctga gaaacatccc tagcatccag agcagaggac tgtttggagc catcgccgga 180
ttcatcgagg gaggatggac aggcatggtg gatggctggt acggctacca ccaccagaat 240
gagcagggct ctggatatgc cgccgatcag aagtctaccc agaacgccat caacggcatc 300
accaacaagg tgaacagcgt gatcgagaag atgaacaccc agtttaccgc tgtgggcaag 360
gagttcaaca agctggagcg gaggatggag aacctgaaca agaaggtgga cgacggcttt 420
ctggacatct ggacctacaa tgccgaactc ctggtcctcc tcgagaatga gaggaccctg 480
gacttccacg acagcaacgt gaagaacctg tatgagaagg tgaagagcca gctgaagaac 540
aacgccaagg agatcggcaa cggctgcttc gagttctacc acaagtgtaa caacgagtgt 600
atggagagcg tgaagaacgg cacctacgac taccctaagt acagcgagga gagcaagctg 660
aaccgggaga agatcgat 678
<210> 23
<211> 226
<212> PRT
<213> Influenza A virus
<400> 23
Asp Ala Lys Cys Gln Thr Pro Gln Gly Ala Ile Asn Ser Ser Leu Pro
1 5 10 15
Phe Gln Asn Val His Pro Val Thr Ile Gly Glu Cys Pro Lys Tyr Val
20 25 30
Arg Ser Ala Lys Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
35 40 45
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
50 55 60
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
65 70 75 80
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
85 90 95
Ile Asn Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Asn
100 105 110
Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn Lys Leu Glu Arg Arg
115 120 125
Met Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp
130 135 140
Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu
145 150 155 160
Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser
165 170 175
Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe
180 185 190
Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser Val Lys Asn Gly Thr
195 200 205
Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys
210 215 220
Ile Asp
225
<210> 24
<211> 678
<212> DNA
<213> Influenza A virus
<400> 24
atcgatcttc tcccggttca gcttgctctc ctcgctgtac ttagggtagt cgtaggtgcc 60
gttcttcacg ctctccatac actcgttgtt acacttgtgg tagaactcga agcagccgtt 120
gccgatctcc ttggcgttgt tcttcagctg gctcttcacc ttctcataca ggttcttcac 180
gttgctgtcg tggaagtcca gggtcctctc attctcgagg aggaccagga gttcggcatt 240
gtaggtccag atgtccagaa agccgtcgtc caccttcttg ttcaggttct ccatcctccg 300
ctccagcttg ttgaactcct tgcccacagc ggtaaactgg gtgttcatct tctcgatcac 360
gctgttcacc ttgttggtga tgccgttgat ggcgttctgg gtagacttct gatcggcggc 420
atatccagag ccctgctcat tctggtggtg gtagccgtac cagccatcca ccatgcctgt 480
ccatcctccc tcgatgaatc cggcgatggc tccaaacagt cctctgctct ggatgctagg 540
gatgtttctc aggccggtca ccattctcag cttggcgctt ctcacatact tggggcactc 600
gccgatggtc acagggtgca cattctggaa gggcaggctg ctattgatgg cgccctgagg 660
tgtctggcac ttggcatc 678
<210> 25
<211> 576
<212> DNA
<213> Influenza A virus
<400> 25
gatgccaagt gccagacacc tcagggcgcc atcaatagca gcctgccctt ccagaatgtg 60
caccctgtga ccatcggcga gtgccccaag tatgtgagaa gcgccaagct gagaatggtg 120
accggcctga gaaacatccc tagcatccag agcagaggac tgtttggagc catcgccgga 180
ttcatcgagg gaggatggac aggcatggtg gatggctggt acggctacca ccaccagaat 240
gagcagggct ctggatatgc cgccgatcag aagtctaccc agaacgccat caacggcatc 300
accaacaagg tgaacagcgt gatcgagaag atgtacaatg ccgaactcct ggtcctcctc 360
gagaatgaga ggaccctgga cttccacgac agcaacgtga agaacctgta tgagaaggtg 420
aagagccagc tgaagaacaa cgccaaggag atcggcaacg gctgcttcga gttctaccac 480
aagtgtaaca acgagtgtat ggagagcgtg aagaacggca cctacgacta ccctaagtac 540
agcgaggaga gcaagctgaa ccgggagaag atcgat 576
<210> 26
<211> 193
<212> PRT
<213> Influenza A virus
<400> 26
Asp Ala Lys Cys Gln Thr Pro Gln Gly Ala Ile Asn Ser Ser Leu Pro
1 5 10 15
Phe Gln Asn Val His Pro Val Thr Ile Gly Glu Cys Pro Lys Tyr Val
20 25 30
Arg Ser Ala Lys Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
35 40 45
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
50 55 60
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
65 70 75 80
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
85 90 95
Ile Asn Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Thr
100 105 110
Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp
115 120 125
Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser Gln
130 135 140
Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr
145 150 155 160
His Lys Cys Asn Asn Glu Cys Met Glu Ser Val Lys Asn Gly Thr Tyr
165 170 175
Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys Ile
180 185 190
Asp
<210> 27
<211> 576
<212> DNA
<213> Influenza A virus
<400> 27
atcgatcttc tcccggttca gcttgctctc ctcgctgtac ttagggtagt cgtaggtgcc 60
gttcttcacg ctctccatac actcgttgtt acacttgtgg tagaactcga agcagccgtt 120
gccgatctcc ttggcgttgt tcttcagctg gctcttcacc ttctcataca ggttcttcac 180
gttgctgtcg tggaagtcca gggtcctctc attctcgagg aggaccagga gttcggcatt 240
gtacatcttc tcgatcacgc tgttcacctt gttggtgatg ccgttgatgg cgttctgggt 300
agacttctga tcggcggcat atccagagcc ctgctcattc tggtggtggt agccgtacca 360
gccatccacc atgcctgtcc atcctccctc gatgaatccg gcgatggctc caaacagtcc 420
tctgctctgg atgctaggga tgtttctcag gccggtcacc attctcagct tggcgcttct 480
cacatacttg gggcactcgc cgatggtcac agggtgcaca ttctggaagg gcaggctgct 540
attgatggcg ccctgaggtg tctggcactt ggcatc 576
<210> 28
<211> 570
<212> DNA
<213> Influenza A virus
<400> 28
ctgagaatgg tgaccggcct gagaaacatc cctagcatcc agagcagagg actgtttgga 60
gccatcgccg gattcatcga gggaggatgg acaggcatgg tggatggctg gtacggctac 120
caccaccaga atgagcaggg ctctggatat gccgccgatc agaagtctac ccagaacgcc 180
atcaacggca tcaccaacaa ggtgaacagc gtgatcgaga agatgaacac ccagtttacc 240
gctgtgggca aggagttcaa caagctggag cggaggatgg agaacctgaa caagaaggtg 300
gacgacggct ttctggacat ctggacctac aatgccgaac tcctggtcct cctcgagaat 360
gagaggaccc tggacttcca cgacagcaac gtgaagaacc tgtatgagaa ggtgaagagc 420
cagctgaaga acaacgccaa ggagatcggc aacggctgct tcgagttcta ccacaagtgt 480
aacaacgagt gtatggagag cgtgaagaac ggcacctacg actaccctaa gtacagcgag 540
gagagcaagc tgaaccggga gaagatcgat 570
<210> 29
<211> 194
<212> PRT
<213> Influenza A virus
<400> 29
Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln Arg Glu Thr Ser
1 5 10 15
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
20 25 30
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
35 40 45
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
50 55 60
Ile Asn Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Asn
65 70 75 80
Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn Lys Leu Glu Arg Arg
85 90 95
Met Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp
100 105 110
Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu
115 120 125
Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser
130 135 140
Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe
145 150 155 160
Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser Val Lys Asn Gly Thr
165 170 175
Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys
180 185 190
Ile Asp
<210> 30
<211> 570
<212> DNA
<213> Influenza A virus
<400> 30
atcgatcttc tcccggttca gcttgctctc ctcgctgtac ttagggtagt cgtaggtgcc 60
gttcttcacg ctctccatac actcgttgtt acacttgtgg tagaactcga agcagccgtt 120
gccgatctcc ttggcgttgt tcttcagctg gctcttcacc ttctcataca ggttcttcac 180
gttgctgtcg tggaagtcca gggtcctctc attctcgagg aggaccagga gttcggcatt 240
gtaggtccag atgtccagaa agccgtcgtc caccttcttg ttcaggttct ccatcctccg 300
ctccagcttg ttgaactcct tgcccacagc ggtaaactgg gtgttcatct tctcgatcac 360
gctgttcacc ttgttggtga tgccgttgat ggcgttctgg gtagacttct gatcggcggc 420
atatccagag ccctgctcat tctggtggtg gtagccgtac cagccatcca ccatgcctgt 480
ccatcctccc tcgatgaatc cggcgatggc tccaaacagt cctctgctct ggatgctagg 540
gatgtttctc aggccggtca ccattctcag 570
<210> 31
<211> 468
<212> DNA
<213> Influenza A virus
<400> 31
ctgagaatgg tgaccggcct gagaaacatc cctagcatcc agagcagagg actgtttgga 60
gccatcgccg gattcatcga gggaggatgg acaggcatgg tggatggctg gtacggctac 120
caccaccaga atgagcaggg ctctggatat gccgccgatc agaagtctac ccagaacgcc 180
atcaacggca tcaccaacaa ggtgaacagc gtgatcgaga agatgtacaa tgccgaactc 240
ctggtcctcc tcgagaatga gaggaccctg gacttccacg acagcaacgt gaagaacctg 300
tatgagaagg tgaagagcca gctgaagaac aacgccaagg agatcggcaa cggctgcttc 360
gagttctacc acaagtgtaa caacgagtgt atggagagcg tgaagaacgg cacctacgac 420
taccctaagt acagcgagga gagcaagctg aaccgggaga agatcgat 468
<210> 32
<211> 157
<212> PRT
<213> Influenza A virus
<400> 32
Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln Arg Glu Thr Arg
1 5 10 15
Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp Thr Gly
20 25 30
Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu Gln Gly Ser
35 40 45
Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala Ile Asn Gly Ile
50 55 60
Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Thr Tyr Asn Ala Glu
65 70 75 80
Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp Phe His Asp Ser
85 90 95
Asn Val Lys Asn Leu Tyr Glu Lys Val Lys Ser Gln Leu Lys Asn Asn
100 105 110
Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asn
115 120 125
Asn Glu Cys Met Glu Ser Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys
130 135 140
Tyr Ser Glu Glu Ser Lys Leu Asn Arg Glu Lys Ile Asp
145 150 155
<210> 33
<211> 468
<212> DNA
<213> Influenza A virus
<400> 33
atcgatcttc tcccggttca gcttgctctc ctcgctgtac ttagggtagt cgtaggtgcc 60
gttcttcacg ctctccatac actcgttgtt acacttgtgg tagaactcga agcagccgtt 120
gccgatctcc ttggcgttgt tcttcagctg gctcttcacc ttctcataca ggttcttcac 180
gttgctgtcg tggaagtcca gggtcctctc attctcgagg aggaccagga gttcggcatt 240
gtacatcttc tcgatcacgc tgttcacctt gttggtgatg ccgttgatgg cgttctgggt 300
agacttctga tcggcggcat atccagagcc ctgctcattc tggtggtggt agccgtacca 360
gccatccacc atgcctgtcc atcctccctc gatgaatccg gcgatggctc caaacagtcc 420
tctgctctgg atgctaggga tgtttctcag gccggtcacc attctcag 468
<210> 34
<211> 147
<212> DNA
<213> Influenza A virus
<400> 34
atgaaggcta ttttggtcgt gctcctgtac acctttgcca cagccaatgc cgataccctt 60
tgtattggct accatgcaaa caactctacc gatacggtcg acacggtgct cgaaaagaat 120
gttactgtca cccactctgt gaacttg 147
<210> 35
<211> 49
<212> PRT
<213> Influenza A virus
<400> 35
Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Ala Thr Ala Asn
1 5 10 15
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu
<210> 36
<211> 147
<212> DNA
<213> Influenza A virus
<400> 36
caagttcaca gagtgggtga cagtaacatt cttttcgagc accgtgtcga ccgtatcggt 60
agagttgttt gcatggtagc caatacaaag ggtatcggca ttggctgtgg caaaggtgta 120
caggagcacg accaaaatag ccttcat 147
<210> 37
<211> 672
<212> DNA
<213> Influenza A virus
<400> 37
acatgtcaga caccgaaggg cgccatcaac acgagcttgc cctttcagaa tatacatcca 60
atcacaatcg gaaaatgccc caagtacgtg aaaagcacta aactgagact cgccaccgga 120
ctcaggaata tcccaagcat ccagtcacgg ggtctgttcg gcgctatcgc cggatttatt 180
gaaggcggct ggacggggat ggtggacggt tggtacggct accatcatca aaatgagcag 240
ggctccggat acgccgctga cctgaaatct acgcagaatg ccatagatga gatcacaaac 300
aaggtcaata gtgtgataga aaaaatgaat actcagttca cagctgttgg aaaggagttt 360
aaccacctcg agaagcgaat tgagaacctg aacaagaagg tggacgatgg ctttttggat 420
atctggacgt ataacgctga gctgcttgtt ctgctggaga acgaaagaac ccttgactac 480
cacgattcca acgtgaagaa tctgtatgag aaagtgcgaa gccagttgaa aaacaacgca 540
aaagaaatag gcaacggctg tttcgagttc taccacaaat gcgataacac ctgcatggag 600
agtgtgaaga acggaacgta cgattatcca aaatactccg aggaggccaa actcaatagg 660
gaggagatag ac 672
<210> 38
<211> 224
<212> PRT
<213> Influenza A virus
<400> 38
Thr Cys Gln Thr Pro Lys Gly Ala Ile Asn Thr Ser Leu Pro Phe Gln
1 5 10 15
Asn Ile His Pro Ile Thr Ile Gly Lys Cys Pro Lys Tyr Val Lys Ser
20 25 30
Thr Lys Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Ser Ile Gln
35 40 45
Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp
50 55 60
Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu Gln
65 70 75 80
Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala Ile Asp
85 90 95
Glu Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Asn Thr Gln
100 105 110
Phe Thr Ala Val Gly Lys Glu Phe Asn His Leu Glu Lys Arg Ile Glu
115 120 125
Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp Thr Tyr
130 135 140
Asn Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp Tyr
145 150 155 160
His Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Arg Ser Gln Leu
165 170 175
Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr His
180 185 190
Lys Cys Asp Asn Thr Cys Met Glu Ser Val Lys Asn Gly Thr Tyr Asp
195 200 205
Tyr Pro Lys Tyr Ser Glu Glu Ala Lys Leu Asn Arg Glu Glu Ile Asp
210 215 220
<210> 39
<211> 672
<212> DNA
<213> Influenza A virus
<400> 39
gtctatctcc tccctattga gtttggcctc ctcggagtat tttggataat cgtacgttcc 60
gttcttcaca ctctccatgc aggtgttatc gcatttgtgg tagaactcga aacagccgtt 120
gcctatttct tttgcgttgt ttttcaactg gcttcgcact ttctcataca gattcttcac 180
gttggaatcg tggtagtcaa gggttctttc gttctccagc agaacaagca gctcagcgtt 240
atacgtccag atatccaaaa agccatcgtc caccttcttg ttcaggttct caattcgctt 300
ctcgaggtgg ttaaactcct ttccaacagc tgtgaactga gtattcattt tttctatcac 360
actattgacc ttgtttgtga tctcatctat ggcattctgc gtagatttca ggtcagcggc 420
gtatccggag ccctgctcat tttgatgatg gtagccgtac caaccgtcca ccatccccgt 480
ccagccgcct tcaataaatc cggcgatagc gccgaacaga ccccgtgact ggatgcttgg 540
gatattcctg agtccggtgg cgagtctcag tttagtgctt ttcacgtact tggggcattt 600
tccgattgtg attggatgta tattctgaaa gggcaagctc gtgttgatgg cgcccttcgg 660
tgtctgacat gt 672
<210> 40
<211> 573
<212> DNA
<213> Influenza A virus
<400> 40
acatgtcaga caccgaaggg cgccatcaac acgagcttgc cctttcagaa tatacatcca 60
atcacaatcg gaaaatgccc caagtacgtg aaaagcacta aactgagact cgccaccgga 120
ctcaggaata tcccaagcat ccagtcacgg ggtctgttcg gcgctatcgc cggatttatt 180
gaaggcggct ggacggggat ggtggacggt tggtacggct accatcatca aaatgagcag 240
ggctccggat acgccgctga cctgaaatct acgcagaatg ccatagatga gatcacaaac 300
aaggtcaata gtgtgataga aaaaatgacg tataacgctg agctgcttgt tctgctggag 360
aacgaaagaa cccttgacta ccacgattcc aacgtgaaga atctgtatga gaaagtgcga 420
agccagttga aaaacaacgc aaaagaaata ggcaacggct gtttcgagtt ctaccacaaa 480
tgcgataaca cctgcatgga gagtgtgaag aacggaacgt acgattatcc aaaatactcc 540
gaggaggcca aactcaatag ggaggagata gac 573
<210> 41
<211> 191
<212> PRT
<213> Influenza A virus
<400> 41
Thr Cys Gln Thr Pro Lys Gly Ala Ile Asn Thr Ser Leu Pro Phe Gln
1 5 10 15
Asn Ile His Pro Ile Thr Ile Gly Lys Cys Pro Lys Tyr Val Lys Ser
20 25 30
Thr Lys Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Ser Ile Gln
35 40 45
Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp
50 55 60
Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu Gln
65 70 75 80
Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala Ile Asp
85 90 95
Glu Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Thr Tyr Asn
100 105 110
Ala Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp Tyr His
115 120 125
Asp Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Arg Ser Gln Leu Lys
130 135 140
Asn Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr His Lys
145 150 155 160
Cys Asp Asn Thr Cys Met Glu Ser Val Lys Asn Gly Thr Tyr Asp Tyr
165 170 175
Pro Lys Tyr Ser Glu Glu Ala Lys Leu Asn Arg Glu Glu Ile Asp
180 185 190
<210> 42
<211> 573
<212> DNA
<213> Influenza A virus
<400> 42
gtctatctcc tccctattga gtttggcctc ctcggagtat tttggataat cgtacgttcc 60
gttcttcaca ctctccatgc aggtgttatc gcatttgtgg tagaactcga aacagccgtt 120
gcctatttct tttgcgttgt ttttcaactg gcttcgcact ttctcataca gattcttcac 180
gttggaatcg tggtagtcaa gggttctttc gttctccagc agaacaagca gctcagcgtt 240
atacgtcatt ttttctatca cactattgac cttgtttgtg atctcatcta tggcattctg 300
cgtagatttc aggtcagcgg cgtatccgga gccctgctca ttttgatgat ggtagccgta 360
ccaaccgtcc accatccccg tccagccgcc ttcaataaat ccggcgatag cgccgaacag 420
accccgtgac tggatgcttg ggatattcct gagtccggtg gcgagtctca gtttagtgct 480
tttcacgtac ttggggcatt ttccgattgt gattggatgt atattctgaa agggcaagct 540
cgtgttgatg gcgcccttcg gtgtctgaca tgt 573
<210> 43
<211> 507
<212> DNA
<213> Influenza A virus
<400> 43
ctgagactcg ccaccggact caggaatatc ccaagcatcc agtcacgggg tctgttcggc 60
gctatcgccg gatttattga aggcggctgg acggggatgg tggacggttg gtacggctac 120
catcatcaaa atgagcaggg ctccggatac gccgctgacc tgaaatctac gcagaatgcc 180
atagatgaga tcacaaacaa ggtcaatagt gtgatagaaa aaatgaatac tcagttcaca 240
gctgttggaa aggagtttaa ccacctcgag aagcgaattg agaacctgaa caagaaggtg 300
gacgatggct ttttggatat ctggacgtat aacgctgagc tgcttgttct gctggagaac 360
gaaagaaccc ttgactacca cgattccaac gtgaagaatc tgtatgagaa agtgcgaagc 420
cagttgaaaa acaacgcaaa agaaataggc aacggctgtt tcgagttcta ccacaaatgc 480
gataacacct gcatggagag tgtgaag 507
<210> 44
<211> 190
<212> PRT
<213> Influenza A virus
<400> 44
Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Ser Ile Gln Ser Arg
1 5 10 15
Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp Thr Gly
20 25 30
Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu Gln Gly Ser
35 40 45
Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala Ile Asp Glu Ile
50 55 60
Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Asn Thr Gln Phe Thr
65 70 75 80
Ala Val Gly Lys Glu Phe Asn His Leu Glu Lys Arg Ile Glu Asn Leu
85 90 95
Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp Thr Tyr Asn Ala
100 105 110
Glu Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp Tyr His Asp
115 120 125
Ser Asn Val Lys Asn Leu Tyr Glu Lys Val Arg Ser Gln Leu Lys Asn
130 135 140
Asn Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys
145 150 155 160
Asp Asn Thr Cys Met Glu Ser Val Lys Asn Gly Thr Tyr Asp Tyr Pro
165 170 175
Lys Tyr Ser Glu Glu Ala Lys Leu Asn Arg Glu Glu Ile Asp
180 185 190
<210> 45
<211> 507
<212> DNA
<213> Influenza A virus
<400> 45
cttcacactc tccatgcagg tgttatcgca tttgtggtag aactcgaaac agccgttgcc 60
tatttctttt gcgttgtttt tcaactggct tcgcactttc tcatacagat tcttcacgtt 120
ggaatcgtgg tagtcaaggg ttctttcgtt ctccagcaga acaagcagct cagcgttata 180
cgtccagata tccaaaaagc catcgtccac cttcttgttc aggttctcaa ttcgcttctc 240
gaggtggtta aactcctttc caacagctgt gaactgagta ttcatttttt ctatcacact 300
attgaccttg tttgtgatct catctatggc attctgcgta gatttcaggt cagcggcgta 360
tccggagccc tgctcatttt gatgatggta gccgtaccaa ccgtccacca tccccgtcca 420
gccgccttca ataaatccgg cgatagcgcc gaacagaccc cgtgactgga tgcttgggat 480
attcctgagt ccggtggcga gtctcag 507
<210> 46
<211> 471
<212> DNA
<213> Influenza A virus
<400> 46
ctgagactcg ccaccggact caggaatatc ccaagcatcc agtcacgggg tctgttcggc 60
gctatcgccg gatttattga aggcggctgg acggggatgg tggacggttg gtacggctac 120
catcatcaaa atgagcaggg ctccggatac gccgctgacc tgaaatctac gcagaatgcc 180
atagatgaga tcacaaacaa ggtcaatagt gtgatagaaa aaatgacgta taacgctgag 240
ctgcttgttc tgctggagaa cgaaagaacc cttgactacc acgattccaa cgtgaagaat 300
ctgtatgaga aagtgcgaag ccagttgaaa aacaacgcaa aagaaatagg caacggctgt 360
ttcgagttct accacaaatg cgataacacc tgcatggaga gtgtgaagaa cggaacgtac 420
gattatccaa aatactccga ggaggccaaa ctcaataggg aggagataga c 471
<210> 47
<211> 157
<212> PRT
<213> Influenza A virus
<400> 47
Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Ser Ile Gln Ser Arg
1 5 10 15
Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp Thr Gly
20 25 30
Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu Gln Gly Ser
35 40 45
Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala Ile Asp Glu Ile
50 55 60
Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Thr Tyr Asn Ala Glu
65 70 75 80
Leu Leu Val Leu Leu Glu Asn Glu Arg Thr Leu Asp Tyr His Asp Ser
85 90 95
Asn Val Lys Asn Leu Tyr Glu Lys Val Arg Ser Gln Leu Lys Asn Asn
100 105 110
Ala Lys Glu Ile Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp
115 120 125
Asn Thr Cys Met Glu Ser Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys
130 135 140
Tyr Ser Glu Glu Ala Lys Leu Asn Arg Glu Glu Ile Asp
145 150 155
<210> 48
<211> 471
<212> DNA
<213> Influenza A virus
<400> 48
gtctatctcc tccctattga gtttggcctc ctcggagtat tttggataat cgtacgttcc 60
gttcttcaca ctctccatgc aggtgttatc gcatttgtgg tagaactcga aacagccgtt 120
gcctatttct tttgcgttgt ttttcaactg gcttcgcact ttctcataca gattcttcac 180
gttggaatcg tggtagtcaa gggttctttc gttctccagc agaacaagca gctcagcgtt 240
atacgtcatt ttttctatca cactattgac cttgtttgtg atctcatcta tggcattctg 300
cgtagatttc aggtcagcgg cgtatccgga gccctgctca ttttgatgat ggtagccgta 360
ccaaccgtcc accatccccg tccagccgcc ttcaataaat ccggcgatag cgccgaacag 420
accccgtgac tggatgcttg ggatattcct gagtccggtg gcgagtctca g 471
<210> 49
<211> 141
<212> DNA
<213> Influenza A virus
<400> 49
atggccatca tctacctgat cctgctgttt acagctgtga gaggcgacca gatctgtatc 60
ggctaccacg ccaacaatag caccgagaag gtggacacca tcctggagag aaacgtgaca 120
gtgacccacg ccaaggacat c 141
<210> 50
<211> 47
<212> PRT
<213> Influenza A virus
<400> 50
Met Ala Ile Ile Tyr Leu Ile Leu Leu Phe Thr Ala Val Arg Gly Asp
1 5 10 15
Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Lys Val Asp
20 25 30
Thr Ile Leu Glu Arg Asn Val Thr Val Thr His Ala Lys Asp Ile
35 40 45
<210> 51
<211> 141
<212> DNA
<213> Influenza A virus
<400> 51
gatgtccttg gcgtgggtca ctgtcacgtt tctctccagg atggtgtcca ccttctcggt 60
gctattgttg gcgtggtagc cgatacagat ctggtcgcct ctcacagctg taaacagcag 120
gatcaggtag atgatggcca t 141
<210> 52
<211> 672
<212> DNA
<213> Influenza A virus
<400> 52
aagtgccaga cacctctggg cgccatcaat accaccctgc ccttccacaa tgtgcaccct 60
ctgaccatcg gcgagtgccc taagtatgtg aagagcgaga agctggtgct ggccacagga 120
ctgagaaacg tgccccagat cgagagcaga ggcctgtttg gagccatcgc cggattcatc 180
gagggaggat ggcagggaat ggtcgatggc tggtacggct accaccacag caatgatcag 240
ggctctggct atgccgccga taaggagtct acccagaagg cctttgacgg catcaccaac 300
aaggtgaaca gcgtgatcga gaagatgaac acccagtttg aggctgtggg caaggagttt 360
agcaacctgg agcggagact ggagaacctg aacaagaaga tggaggacgg cttcctggat 420
gtgtggacct acaatgccga actgctggtg ctgatggaga atgagcggac cctggacttc 480
cacgacagca acgtgaagaa cctgtacgac aaagtgagga tgcagctgag ggacaacgtg 540
aaggaactgg gcaatggctg cttcgagttc taccacaagt gtgacgacga gtgtatgaac 600
tccgtgaaga acggcaccta cgactaccct aagtacgagg aggagagcaa gctgaaccgg 660
aacgagatca ag 672
<210> 53
<211> 224
<212> PRT
<213> Influenza A virus
<400> 53
Lys Cys Gln Thr Pro Leu Gly Ala Ile Asn Thr Thr Leu Pro Phe His
1 5 10 15
Asn Val His Pro Leu Thr Ile Gly Glu Cys Pro Lys Tyr Val Lys Ser
20 25 30
Glu Lys Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln Ile Glu
35 40 45
Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp
50 55 60
Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Ser Asn Asp Gln
65 70 75 80
Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala Phe Asp
85 90 95
Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Asn Thr Gln
100 105 110
Phe Glu Ala Val Gly Lys Glu Phe Ser Asn Leu Glu Arg Arg Leu Glu
115 120 125
Asn Leu Asn Lys Lys Met Glu Asp Gly Phe Leu Asp Val Trp Thr Tyr
130 135 140
Asn Ala Glu Leu Leu Val Leu Met Glu Asn Glu Arg Thr Leu Asp Phe
145 150 155 160
His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys Val Arg Met Gln Leu
165 170 175
Arg Asp Asn Val Lys Glu Leu Gly Asn Gly Cys Phe Glu Phe Tyr His
180 185 190
Lys Cys Asp Asp Glu Cys Met Asn Ser Val Lys Asn Gly Thr Tyr Asp
195 200 205
Tyr Pro Lys Tyr Glu Glu Glu Ser Lys Leu Asn Arg Asn Glu Ile Lys
210 215 220
<210> 54
<211> 672
<212> DNA
<213> Influenza A virus
<400> 54
cttgatctcg ttccggttca gcttgctctc ctcctcgtac ttagggtagt cgtaggtgcc 60
gttcttcacg gagttcatac actcgtcgtc acacttgtgg tagaactcga agcagccatt 120
gcccagttcc ttcacgttgt ccctcagctg catcctcact ttgtcgtaca ggttcttcac 180
gttgctgtcg tggaagtcca gggtccgctc attctccatc agcaccagca gttcggcatt 240
gtaggtccac acatccagga agccgtcctc catcttcttg ttcaggttct ccagtctccg 300
ctccaggttg ctaaactcct tgcccacagc ctcaaactgg gtgttcatct tctcgatcac 360
gctgttcacc ttgttggtga tgccgtcaaa ggccttctgg gtagactcct tatcggcggc 420
atagccagag ccctgatcat tgctgtggtg gtagccgtac cagccatcga ccattccctg 480
ccatcctccc tcgatgaatc cggcgatggc tccaaacagg cctctgctct cgatctgggg 540
cacgtttctc agtcctgtgg ccagcaccag cttctcgctc ttcacatact tagggcactc 600
gccgatggtc agagggtgca cattgtggaa gggcagggtg gtattgatgg cgcccagagg 660
tgtctggcac tt 672
<210> 55
<211> 573
<212> DNA
<213> Influenza A virus
<400> 55
aagtgccaga cacctctggg cgccatcaat accaccctgc ccttccacaa tgtgcaccct 60
ctgaccatcg gcgagtgccc taagtatgtg aagagcgaga agctggtgct ggccacagga 120
ctgagaaacg tgccccagat cgagagcaga ggcctgtttg gagccatcgc cggattcatc 180
gagggaggat ggcagggaat ggtcgatggc tggtacggct accaccacag caatgatcag 240
ggctctggct atgccgccga taaggagtct acccagaagg cctttgacgg catcaccaac 300
aaggtgaaca gcgtgatcga gaagatgacc tacaatgccg aactgctggt gctgatggag 360
aatgagcgga ccctggactt ccacgacagc aacgtgaaga acctgtacga caaagtgagg 420
atgcagctga gggacaacgt gaaggaactg ggcaatggct gcttcgagtt ctaccacaag 480
tgtgacgacg agtgtatgaa ctccgtgaag aacggcacct acgactaccc taagtacgag 540
gaggagagca agctgaaccg gaacgagatc aag 573
<210> 56
<211> 191
<212> PRT
<213> Influenza A virus
<400> 56
Lys Cys Gln Thr Pro Leu Gly Ala Ile Asn Thr Thr Leu Pro Phe His
1 5 10 15
Asn Val His Pro Leu Thr Ile Gly Glu Cys Pro Lys Tyr Val Lys Ser
20 25 30
Glu Lys Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln Ile Glu
35 40 45
Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp
50 55 60
Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Ser Asn Asp Gln
65 70 75 80
Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala Phe Asp
85 90 95
Gly Ile Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Thr Tyr Asn
100 105 110
Ala Glu Leu Leu Val Leu Met Glu Asn Glu Arg Thr Leu Asp Phe His
115 120 125
Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys Val Arg Met Gln Leu Arg
130 135 140
Asp Asn Val Lys Glu Leu Gly Asn Gly Cys Phe Glu Phe Tyr His Lys
145 150 155 160
Cys Asp Asp Glu Cys Met Asn Ser Val Lys Asn Gly Thr Tyr Asp Tyr
165 170 175
Pro Lys Tyr Glu Glu Glu Ser Lys Leu Asn Arg Asn Glu Ile Lys
180 185 190
<210> 57
<211> 573
<212> DNA
<213> Influenza A virus
<400> 57
cttgatctcg ttccggttca gcttgctctc ctcctcgtac ttagggtagt cgtaggtgcc 60
gttcttcacg gagttcatac actcgtcgtc acacttgtgg tagaactcga agcagccatt 120
gcccagttcc ttcacgttgt ccctcagctg catcctcact ttgtcgtaca ggttcttcac 180
gttgctgtcg tggaagtcca gggtccgctc attctccatc agcaccagca gttcggcatt 240
gtaggtcatc ttctcgatca cgctgttcac cttgttggtg atgccgtcaa aggccttctg 300
ggtagactcc ttatcggcgg catagccaga gccctgatca ttgctgtggt ggtagccgta 360
ccagccatcg accattccct gccatcctcc ctcgatgaat ccggcgatgg ctccaaacag 420
gcctctgctc tcgatctggg gcacgtttct cagtcctgtg gccagcacca gcttctcgct 480
cttcacatac ttagggcact cgccgatggt cagagggtgc acattgtgga agggcagggt 540
ggtattgatg gcgcccagag gtgtctggca ctt 573
<210> 58
<211> 570
<212> DNA
<213> Influenza A virus
<400> 58
ctggtgctgg ccacaggact gagaaacgtg ccccagatcg agagcagagg cctgtttgga 60
gccatcgccg gattcatcga gggaggatgg cagggaatgg tcgatggctg gtacggctac 120
caccacagca atgatcaggg ctctggctat gccgccgata aggagtctac ccagaaggcc 180
tttgacggca tcaccaacaa ggtgaacagc gtgatcgaga agatgaacac ccagtttgag 240
gctgtgggca aggagtttag caacctggag cggagactgg agaacctgaa caagaagatg 300
gaggacggct tcctggatgt gtggacctac aatgccgaac tgctggtgct gatggagaat 360
gagcggaccc tggacttcca cgacagcaac gtgaagaacc tgtacgacaa agtgaggatg 420
cagctgaggg acaacgtgaa ggaactgggc aatggctgct tcgagttcta ccacaagtgt 480
gacgacgagt gtatgaactc cgtgaagaac ggcacctacg actaccctaa gtacgaggag 540
gagagcaagc tgaaccggaa cgagatcaag 570
<210> 59
<211> 190
<212> PRT
<213> Influenza A virus
<400> 59
Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln Ile Glu Ser Arg
1 5 10 15
Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp Gln Gly
20 25 30
Met Val Asp Gly Trp Tyr Gly Tyr His His Ser Asn Asp Gln Gly Ser
35 40 45
Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala Phe Asp Gly Ile
50 55 60
Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Asn Thr Gln Phe Glu
65 70 75 80
Ala Val Gly Lys Glu Phe Ser Asn Leu Glu Arg Arg Leu Glu Asn Leu
85 90 95
Asn Lys Lys Met Glu Asp Gly Phe Leu Asp Val Trp Thr Tyr Asn Ala
100 105 110
Glu Leu Leu Val Leu Met Glu Asn Glu Arg Thr Leu Asp Phe His Asp
115 120 125
Ser Asn Val Lys Asn Leu Tyr Asp Lys Val Arg Met Gln Leu Arg Asp
130 135 140
Asn Val Lys Glu Leu Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys
145 150 155 160
Asp Asp Glu Cys Met Asn Ser Val Lys Asn Gly Thr Tyr Asp Tyr Pro
165 170 175
Lys Tyr Glu Glu Glu Ser Lys Leu Asn Arg Asn Glu Ile Lys
180 185 190
<210> 60
<211> 570
<212> DNA
<213> Influenza A virus
<400> 60
cttgatctcg ttccggttca gcttgctctc ctcctcgtac ttagggtagt cgtaggtgcc 60
gttcttcacg gagttcatac actcgtcgtc acacttgtgg tagaactcga agcagccatt 120
gcccagttcc ttcacgttgt ccctcagctg catcctcact ttgtcgtaca ggttcttcac 180
gttgctgtcg tggaagtcca gggtccgctc attctccatc agcaccagca gttcggcatt 240
gtaggtccac acatccagga agccgtcctc catcttcttg ttcaggttct ccagtctccg 300
ctccaggttg ctaaactcct tgcccacagc ctcaaactgg gtgttcatct tctcgatcac 360
gctgttcacc ttgttggtga tgccgtcaaa ggccttctgg gtagactcct tatcggcggc 420
atagccagag ccctgatcat tgctgtggtg gtagccgtac cagccatcga ccattccctg 480
ccatcctccc tcgatgaatc cggcgatggc tccaaacagg cctctgctct cgatctgggg 540
cacgtttctc agtcctgtgg ccagcaccag 570
<210> 61
<211> 471
<212> DNA
<213> Influenza A virus
<400> 61
ctggtgctgg ccacaggact gagaaacgtg ccccagatcg agagcagagg cctgtttgga 60
gccatcgccg gattcatcga gggaggatgg cagggaatgg tcgatggctg gtacggctac 120
caccacagca atgatcaggg ctctggctat gccgccgata aggagtctac ccagaaggcc 180
tttgacggca tcaccaacaa ggtgaacagc gtgatcgaga agatgaccta caatgccgaa 240
ctgctggtgc tgatggagaa tgagcggacc ctggacttcc acgacagcaa cgtgaagaac 300
ctgtacgaca aagtgaggat gcagctgagg gacaacgtga aggaactggg caatggctgc 360
ttcgagttct accacaagtg tgacgacgag tgtatgaact ccgtgaagaa cggcacctac 420
gactacccta agtacgagga ggagagcaag ctgaaccgga acgagatcaa g 471
<210> 62
<211> 157
<212> PRT
<213> Influenza A virus
<400> 62
Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln Ile Glu Ser Arg
1 5 10 15
Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp Gln Gly
20 25 30
Met Val Asp Gly Trp Tyr Gly Tyr His His Ser Asn Asp Gln Gly Ser
35 40 45
Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala Phe Asp Gly Ile
50 55 60
Thr Asn Lys Val Asn Ser Val Ile Glu Lys Met Thr Tyr Asn Ala Glu
65 70 75 80
Leu Leu Val Leu Met Glu Asn Glu Arg Thr Leu Asp Phe His Asp Ser
85 90 95
Asn Val Lys Asn Leu Tyr Asp Lys Val Arg Met Gln Leu Arg Asp Asn
100 105 110
Val Lys Glu Leu Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp
115 120 125
Asp Glu Cys Met Asn Ser Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys
130 135 140
Tyr Glu Glu Glu Ser Lys Leu Asn Arg Asn Glu Ile Lys
145 150 155
<210> 63
<211> 471
<212> DNA
<213> Influenza A virus
<400> 63
cttgatctcg ttccggttca gcttgctctc ctcctcgtac ttagggtagt cgtaggtgcc 60
gttcttcacg gagttcatac actcgtcgtc acacttgtgg tagaactcga agcagccatt 120
gcccagttcc ttcacgttgt ccctcagctg catcctcact ttgtcgtaca ggttcttcac 180
gttgctgtcg tggaagtcca gggtccgctc attctccatc agcaccagca gttcggcatt 240
gtaggtcatc ttctcgatca cgctgttcac cttgttggtg atgccgtcaa aggccttctg 300
ggtagactcc ttatcggcgg catagccaga gccctgatca ttgctgtggt ggtagccgta 360
ccagccatcg accattccct gccatcctcc ctcgatgaat ccggcgatgg ctccaaacag 420
gcctctgctc tcgatctggg gcacgtttct cagtcctgtg gccagcacca g 471
<210> 64
<211> 150
<212> DNA
<213> Influenza A virus
<400> 64
gccaccatgg aaaagatcgt gctgctgctg gccattgtga gcctggtgaa gagcgaccag 60
atctgcattg gctaccacgc caacaatagc acagagcagg tggacaccat catggaaaaa 120
aacgtgaccg tgacccacgc tcaggacatc 150
<210> 65
<211> 48
<212> PRT
<213> Influenza A virus
<400> 65
Met Glu Lys Ile Val Leu Leu Leu Ala Ile Val Ser Leu Val Lys Ser
1 5 10 15
Asp Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Gln Val
20 25 30
Asp Thr Ile Met Glu Lys Asn Val Thr Val Thr His Ala Gln Asp Ile
35 40 45
<210> 66
<211> 150
<212> DNA
<213> Influenza A virus
<400> 66
gatgtcctga gcgtgggtca cggtcacgtt tttttccatg atggtgtcca cctgctctgt 60
gctattgttg gcgtggtagc caatgcagat ctggtcgctc ttcaccaggc tcacaatggc 120
cagcagcagc acgatctttt ccatggtggc 150
<210> 67
<211> 681
<212> DNA
<213> Influenza A virus
<400> 67
aagtgccaga cacctatggg cgccatcaac agcagcatgc ccttccacaa catccaccct 60
ctgaccatcg gcgagtgccc taagtacgtg aagagcaaca gactggtgct ggccacaggc 120
ctgagaaata gcccccagcg ggagagcaga agaaagaaga ggggcctgtt tggagccatc 180
gccggcttta ttgaaggcgg ctggcaggga atggtggatg gctggtacgg ctaccaccac 240
agcaatgagc agggctctgg atatgccgcc gacaaagagt ctacccagaa ggccatcgac 300
ggcgtcacca acaaggtgaa cagcatcatc gacaagatga acacccagtt cgaggctgtg 360
ggcagagagt tcaacaacct ggaacggcgg atcgagaacc tgaacaagaa aatggaagat 420
ggcttcctgg atgtgtggac ctacaatgcc gaactgctgg tgctgatgga aaacgagcgg 480
accctggact tccacgacag caacgtgaag aacctgtacg acaaagtgcg gctgcagctg 540
agagacaacg ccaaagagct gggcaacggc tgcttcgagt tctaccacaa gtgcgacaac 600
gagtgcatgg aaagcatccg gaacggcacc tacaactacc ctcagtacag cgaggaagcc 660
aggctgaaga gggaagagat c 681
<210> 68
<211> 227
<212> PRT
<213> Influenza A virus
<400> 68
Lys Cys Gln Thr Pro Met Gly Ala Ile Asn Ser Ser Met Pro Phe His
1 5 10 15
Asn Ile His Pro Leu Thr Ile Gly Glu Cys Pro Lys Tyr Val Lys Ser
20 25 30
Asn Arg Leu Val Leu Ala Thr Gly Leu Arg Asn Ser Pro Gln Arg Glu
35 40 45
Ser Arg Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile
50 55 60
Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His His
65 70 75 80
Ser Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln
85 90 95
Lys Ala Ile Asp Gly Val Thr Asn Lys Val Asn Ser Ile Ile Asp Lys
100 105 110
Met Asn Thr Gln Phe Glu Ala Val Gly Arg Glu Phe Asn Asn Leu Glu
115 120 125
Arg Arg Ile Glu Asn Leu Asn Lys Lys Met Glu Asp Gly Phe Leu Asp
130 135 140
Val Trp Thr Tyr Asn Ala Glu Leu Leu Val Leu Met Glu Asn Glu Arg
145 150 155 160
Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys Val
165 170 175
Arg Leu Gln Leu Arg Asp Asn Ala Lys Glu Leu Gly Asn Gly Cys Phe
180 185 190
Glu Phe Tyr His Lys Cys Asp Asn Glu Cys Met Glu Ser Ile Arg Asn
195 200 205
Gly Thr Tyr Asn Tyr Pro Gln Tyr Ser Glu Glu Ala Arg Leu Lys Arg
210 215 220
Glu Glu Ile
225
<210> 69
<211> 681
<212> DNA
<213> Influenza A virus
<400> 69
gatctcttcc ctcttcagcc tggcttcctc gctgtactga gggtagttgt aggtgccgtt 60
ccggatgctt tccatgcact cgttgtcgca cttgtggtag aactcgaagc agccgttgcc 120
cagctctttg gcgttgtctc tcagctgcag ccgcactttg tcgtacaggt tcttcacgtt 180
gctgtcgtgg aagtccaggg tccgctcgtt ttccatcagc accagcagtt cggcattgta 240
ggtccacaca tccaggaagc catcttccat tttcttgttc aggttctcga tccgccgttc 300
caggttgttg aactctctgc ccacagcctc gaactgggtg ttcatcttgt cgatgatgct 360
gttcaccttg ttggtgacgc cgtcgatggc cttctgggta gactctttgt cggcggcata 420
tccagagccc tgctcattgc tgtggtggta gccgtaccag ccatccacca ttccctgcca 480
gccgccttca ataaagccgg cgatggctcc aaacaggccc ctcttctttc ttctgctctc 540
ccgctggggg ctatttctca ggcctgtggc cagcaccagt ctgttgctct tcacgtactt 600
agggcactcg ccgatggtca gagggtggat gttgtggaag ggcatgctgc tgttgatggc 660
gcccataggt gtctggcact t 681
<210> 70
<211> 582
<212> DNA
<213> Influenza A virus
<400> 70
aagtgccaga cacctatggg cgccatcaac agcagcatgc ccttccacaa catccaccct 60
ctgaccatcg gcgagtgccc taagtacgtg aagagcaaca gactggtgct ggccacaggc 120
ctgagaaata gcccccagcg ggagagcaga agaaagaaga ggggcctgtt tggagccatc 180
gccggcttta ttgaaggcgg ctggcaggga atggtggatg gctggtacgg ctaccaccac 240
agcaatgagc agggctctgg atatgccgcc gacaaagagt ctacccagaa ggccatcgac 300
ggcgtcacca acaaggtgaa cagcatcatc gacaagatga cctacaatgc cgaactgctg 360
gtgctgatgg aaaacgagcg gaccctggac ttccacgaca gcaacgtgaa gaacctgtac 420
gacaaagtgc ggctgcagct gagagacaac gccaaagagc tgggcaacgg ctgcttcgag 480
ttctaccaca agtgcgacaa cgagtgcatg gaaagcatcc ggaacggcac ctacaactac 540
cctcagtaca gcgaggaagc caggctgaag agggaagaga tc 582
<210> 71
<211> 194
<212> PRT
<213> Influenza A virus
<400> 71
Lys Cys Gln Thr Pro Met Gly Ala Ile Asn Ser Ser Met Pro Phe His
1 5 10 15
Asn Ile His Pro Leu Thr Ile Gly Glu Cys Pro Lys Tyr Val Lys Ser
20 25 30
Asn Arg Leu Val Leu Ala Thr Gly Leu Arg Asn Ser Pro Gln Arg Glu
35 40 45
Ser Arg Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile
50 55 60
Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His His
65 70 75 80
Ser Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln
85 90 95
Lys Ala Ile Asp Gly Val Thr Asn Lys Val Asn Ser Ile Ile Asp Lys
100 105 110
Met Thr Tyr Asn Ala Glu Leu Leu Val Leu Met Glu Asn Glu Arg Thr
115 120 125
Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys Val Arg
130 135 140
Leu Gln Leu Arg Asp Asn Ala Lys Glu Leu Gly Asn Gly Cys Phe Glu
145 150 155 160
Phe Tyr His Lys Cys Asp Asn Glu Cys Met Glu Ser Ile Arg Asn Gly
165 170 175
Thr Tyr Asn Tyr Pro Gln Tyr Ser Glu Glu Ala Arg Leu Lys Arg Glu
180 185 190
Glu Ile
<210> 72
<211> 582
<212> DNA
<213> Influenza A virus
<400> 72
gatctcttcc ctcttcagcc tggcttcctc gctgtactga gggtagttgt aggtgccgtt 60
ccggatgctt tccatgcact cgttgtcgca cttgtggtag aactcgaagc agccgttgcc 120
cagctctttg gcgttgtctc tcagctgcag ccgcactttg tcgtacaggt tcttcacgtt 180
gctgtcgtgg aagtccaggg tccgctcgtt ttccatcagc accagcagtt cggcattgta 240
ggtcatcttg tcgatgatgc tgttcacctt gttggtgacg ccgtcgatgg ccttctgggt 300
agactctttg tcggcggcat atccagagcc ctgctcattg ctgtggtggt agccgtacca 360
gccatccacc attccctgcc agccgccttc aataaagccg gcgatggctc caaacaggcc 420
cctcttcttt cttctgctct cccgctgggg gctatttctc aggcctgtgg ccagcaccag 480
tctgttgctc ttcacgtact tagggcactc gccgatggtc agagggtgga tgttgtggaa 540
gggcatgctg ctgttgatgg cgcccatagg tgtctggcac tt 582
<210> 73
<211> 579
<212> DNA
<213> Influenza A virus
<400> 73
ctggtgctgg ccacaggcct gagaaatagc ccccagcggg agagcagaag aaagaagagg 60
ggcctgtttg gagccatcgc cggctttatt gaaggcggct ggcagggaat ggtggatggc 120
tggtacggct accaccacag caatgagcag ggctctggat atgccgccga caaagagtct 180
acccagaagg ccatcgacgg cgtcaccaac aaggtgaaca gcatcatcga caagatgaac 240
acccagttcg aggctgtggg cagagagttc aacaacctgg aacggcggat cgagaacctg 300
aacaagaaaa tggaagatgg cttcctggat gtgtggacct acaatgccga actgctggtg 360
ctgatggaaa acgagcggac cctggacttc cacgacagca acgtgaagaa cctgtacgac 420
aaagtgcggc tgcagctgag agacaacgcc aaagagctgg gcaacggctg cttcgagttc 480
taccacaagt gcgacaacga gtgcatggaa agcatccgga acggcaccta caactaccct 540
cagtacagcg aggaagccag gctgaagagg gaagagatc 579
<210> 74
<211> 193
<212> PRT
<213> Influenza A virus
<400> 74
Leu Val Leu Ala Thr Gly Leu Arg Asn Ser Pro Gln Arg Glu Ser Arg
1 5 10 15
Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
20 25 30
Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Ser Asn
35 40 45
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala
50 55 60
Ile Asp Gly Val Thr Asn Lys Val Asn Ser Ile Ile Asp Lys Met Asn
65 70 75 80
Thr Gln Phe Glu Ala Val Gly Arg Glu Phe Asn Asn Leu Glu Arg Arg
85 90 95
Ile Glu Asn Leu Asn Lys Lys Met Glu Asp Gly Phe Leu Asp Val Trp
100 105 110
Thr Tyr Asn Ala Glu Leu Leu Val Leu Met Glu Asn Glu Arg Thr Leu
115 120 125
Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys Val Arg Leu
130 135 140
Gln Leu Arg Asp Asn Ala Lys Glu Leu Gly Asn Gly Cys Phe Glu Phe
145 150 155 160
Tyr His Lys Cys Asp Asn Glu Cys Met Glu Ser Ile Arg Asn Gly Thr
165 170 175
Tyr Asn Tyr Pro Gln Tyr Ser Glu Glu Ala Arg Leu Lys Arg Glu Glu
180 185 190
Ile
<210> 75
<211> 579
<212> DNA
<213> Influenza A virus
<400> 75
gatctcttcc ctcttcagcc tggcttcctc gctgtactga gggtagttgt aggtgccgtt 60
ccggatgctt tccatgcact cgttgtcgca cttgtggtag aactcgaagc agccgttgcc 120
cagctctttg gcgttgtctc tcagctgcag ccgcactttg tcgtacaggt tcttcacgtt 180
gctgtcgtgg aagtccaggg tccgctcgtt ttccatcagc accagcagtt cggcattgta 240
ggtccacaca tccaggaagc catcttccat tttcttgttc aggttctcga tccgccgttc 300
caggttgttg aactctctgc ccacagcctc gaactgggtg ttcatcttgt cgatgatgct 360
gttcaccttg ttggtgacgc cgtcgatggc cttctgggta gactctttgt cggcggcata 420
tccagagccc tgctcattgc tgtggtggta gccgtaccag ccatccacca ttccctgcca 480
gccgccttca ataaagccgg cgatggctcc aaacaggccc ctcttctttc ttctgctctc 540
ccgctggggg ctatttctca ggcctgtggc cagcaccag 579
<210> 76
<211> 480
<212> DNA
<213> Influenza A virus
<400> 76
ctggtgctgg ccacaggcct gagaaatagc ccccagcggg agagcagaag aaagaagagg 60
ggcctgtttg gagccatcgc cggctttatt gaaggcggct ggcagggaat ggtggatggc 120
tggtacggct accaccacag caatgagcag ggctctggat atgccgccga caaagagtct 180
acccagaagg ccatcgacgg cgtcaccaac aaggtgaaca gcatcatcga caagatgacc 240
tacaatgccg aactgctggt gctgatggaa aacgagcgga ccctggactt ccacgacagc 300
aacgtgaaga acctgtacga caaagtgcgg ctgcagctga gagacaacgc caaagagctg 360
ggcaacggct gcttcgagtt ctaccacaag tgcgacaacg agtgcatgga aagcatccgg 420
aacggcacct acaactaccc tcagtacagc gaggaagcca ggctgaagag ggaagagatc 480
<210> 77
<211> 160
<212> PRT
<213> Influenza A virus
<400> 77
Leu Val Leu Ala Thr Gly Leu Arg Asn Ser Pro Gln Arg Glu Ser Arg
1 5 10 15
Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
20 25 30
Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Ser Asn
35 40 45
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala
50 55 60
Ile Asp Gly Val Thr Asn Lys Val Asn Ser Ile Ile Asp Lys Met Thr
65 70 75 80
Tyr Asn Ala Glu Leu Leu Val Leu Met Glu Asn Glu Arg Thr Leu Asp
85 90 95
Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys Val Arg Leu Gln
100 105 110
Leu Arg Asp Asn Ala Lys Glu Leu Gly Asn Gly Cys Phe Glu Phe Tyr
115 120 125
His Lys Cys Asp Asn Glu Cys Met Glu Ser Ile Arg Asn Gly Thr Tyr
130 135 140
Asn Tyr Pro Gln Tyr Ser Glu Glu Ala Arg Leu Lys Arg Glu Glu Ile
145 150 155 160
<210> 78
<211> 480
<212> DNA
<213> Influenza A virus
<400> 78
gatctcttcc ctcttcagcc tggcttcctc gctgtactga gggtagttgt aggtgccgtt 60
ccggatgctt tccatgcact cgttgtcgca cttgtggtag aactcgaagc agccgttgcc 120
cagctctttg gcgttgtctc tcagctgcag ccgcactttg tcgtacaggt tcttcacgtt 180
gctgtcgtgg aagtccaggg tccgctcgtt ttccatcagc accagcagtt cggcattgta 240
ggtcatcttg tcgatgatgc tgttcacctt gttggtgacg ccgtcgatgg ccttctgggt 300
agactctttg tcggcggcat atccagagcc ctgctcattg ctgtggtggt agccgtacca 360
gccatccacc attccctgcc agccgccttc aataaagccg gcgatggctc caaacaggcc 420
cctcttcttt cttctgctct cccgctgggg gctatttctc aggcctgtgg ccagcaccag 480
<210> 79
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 79
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420
gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645
<210> 80
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 80
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 81
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 81
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240
gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 82
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 82
atgaaggcaa tcctggtcgt cctgctgtat actttcgcta ccgctaacgc tgacaccctg 60
tgcatcggct atcacgctaa caactcaacc gacacagtgg atactgtcct ggagaagaac 120
gtgactgtca cccactctgt gaatctgggc agtggactga ggctggcaac tggactgcga 180
aacatcccac agcgggaaac cagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaacga gcagggatca 300
ggctacgccg ctgacctgaa gagcacacag aatgcaatcg atgaaattac taacatggtg 360
aattccgtca tcgagaaaat gggcagcgga ggctccggaa ccgacctggc agaactgctg 420
gtgctgctgc tgaaccagtg gacactgctg taccacgata gtaacgtgaa gaatctgtat 480
gagaaagtcc gatcacagct gaagaacaat gctaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcgacaa cacctgtatg gagagcgtga aaaatggcac atacgattat 600
cccaagtatt ccgaggaagc caaactgaac agagaggaaa ttgac 645
<210> 83
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 83
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asp Glu Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Tyr His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys
195 200 205
Leu Asn Arg Glu Glu Ile Asp
210 215
<210> 84
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 84
gtcaatttcc tctctgttca gtttggcttc ctcggaatac ttgggataat cgtatgtgcc 60
atttttcacg ctctccatac aggtgttgtc gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttagcattgt tcttcagctg tgatcggact ttctcataca gattcttcac 180
gttactatcg tggtacagca gtgtccactg gttcagcagc agcaccagca gttctgccag 240
gtcggttccg gagcctccgc tgcccatttt ctcgatgacg gaattcacca tgttagtaat 300
ttcatcgatt gcattctgtg tgctcttcag gtcagcggcg tagcctgatc cctgctcgtt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctctggtttc ccgctgtggg atgtttcgca gtccagttgc 480
cagcctcagt ccactgccca gattcacaga gtgggtgaca gtcacgttct tctccaggac 540
agtatccact gtgtcggttg agttgttagc gtgatagccg atgcacaggg tgtcagcgtt 600
agcggtagcg aaagtataca gcaggacgac caggattgcc ttcat 645
<210> 85
<211> 639
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 85
atggctatca tctacctgat cctgctgttc actgctgtgc ggggggacca gatttgcatc 60
ggctaccacg ctaataattc aactgagaag gtggatacta tcctggagcg gaacgtgacc 120
gtcacacacg ctaaagacat tggcagcgga ctggtgctgg caaccggact gaggaatgtc 180
ccacagatcg agtcccgcgg actgttcggc gctatcgcag ggtttattga aggcgggtgg 240
cagggaatga ttgatgggtg gtacggctac caccattcta acgaccaagg aagtggctac 300
gccgctgata aggagagtac tcagaaagcc ttcgatggca tcaccaacat ggtgaattca 360
gtcattgaga agatgggcag cggaggctcc ggaaccgacc tggcagaact gctggtgctg 420
ctgctgaatc agtggacact gctgtttcac gactctaacg tgaagaatct gtatgataaa 480
gtccggatgc agctgagaga caacgtgaag gagctgggga atggatgctt cgaattttac 540
cataagtgcg acgatgagtg tatgaacagt gtcaaaaatg gcacatacga ttatcccaag 600
tatgaggaag agtcaaaact gaaccgaaat gaaatcaag 639
<210> 86
<211> 213
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 86
Met Ala Ile Ile Tyr Leu Ile Leu Leu Phe Thr Ala Val Arg Gly Asp
1 5 10 15
Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Lys Val Asp
20 25 30
Thr Ile Leu Glu Arg Asn Val Thr Val Thr His Ala Lys Asp Ile Gly
35 40 45
Ser Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln Ile Glu
50 55 60
Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp
65 70 75 80
Gln Gly Met Ile Asp Gly Trp Tyr Gly Tyr His His Ser Asn Asp Gln
85 90 95
Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala Phe Asp
100 105 110
Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser Gly
115 120 125
Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn Gln
130 135 140
Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys
145 150 155 160
Val Arg Met Gln Leu Arg Asp Asn Val Lys Glu Leu Gly Asn Gly Cys
165 170 175
Phe Glu Phe Tyr His Lys Cys Asp Asp Glu Cys Met Asn Ser Val Lys
180 185 190
Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Glu Glu Glu Ser Lys Leu Asn
195 200 205
Arg Asn Glu Ile Lys
210
<210> 87
<211> 639
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 87
cttgatttca tttcggttca gttttgactc ttcctcatac ttgggataat cgtatgtgcc 60
atttttgaca ctgttcatac actcatcgtc gcacttatgg taaaattcga agcatccatt 120
ccccagctcc ttcacgttgt ctctcagctg catccggact ttatcataca gattcttcac 180
gttagagtcg tgaaacagca gtgtccactg attcagcagc agcaccagca gttctgccag 240
gtcggttccg gagcctccgc tgcccatctt ctcaatgact gaattcacca tgttggtgat 300
gccatcgaag gctttctgag tactctcctt atcagcggcg tagccacttc cttggtcgtt 360
agaatggtgg tagccgtacc acccatcaat cattccctgc cacccgcctt caataaaccc 420
tgcgatagcg ccgaacagtc cgcgggactc gatctgtggg acattcctca gtccggttgc 480
cagcaccagt ccgctgccaa tgtctttagc gtgtgtgacg gtcacgttcc gctccaggat 540
agtatccacc ttctcagttg aattattagc gtggtagccg atgcaaatct ggtccccccg 600
cacagcagtg aacagcagga tcaggtagat gatagccat 639
<210> 88
<211> 651
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 88
atggaaaaaa tcgtgctgct gctggctatc gtgtccctgg tgaagtccga ccagatctgt 60
attgggtatc atgctaacaa ctccacagaa caggtggata ctatcatgga gaagaacgtg 120
accgtcacac acgctcagga cattggatgg ggactggtcc tggcaaccgg actgagaaat 180
tcaccacaga gggaaagccg gagaaagaaa cgcggactgt tcggcgctat cgcagggttt 240
attgagggcg ggtggcaggg aatggtggat gggtggtacg gctaccacca ttccaacgaa 300
cagggatctg gctacgccgc tgataaggag tctactcaga aagctatcga cggcgtgacc 360
aacatggtca atagtatcat tgataagatg ggctctggag gcagtggaac cgacctggca 420
gagctgctgg tgctgctgct gaaccagtgg acactgctgt tccacgactc taacgtgaag 480
aatctgtatg ataaagtccg actgcagctg cgggacaacg ccaaggaact ggggaatgga 540
tgcttcgagt tctaccataa gtgcgataac gaatgtatgg agagcatccg aaacggcaca 600
tacaattatc cccagtattc cgaggaagct aggctgaaac gcgaggaaat t 651
<210> 89
<211> 217
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 89
Met Glu Lys Ile Val Leu Leu Leu Ala Ile Val Ser Leu Val Lys Ser
1 5 10 15
Asp Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Gln Val
20 25 30
Asp Thr Ile Met Glu Lys Asn Val Thr Val Thr His Ala Gln Asp Ile
35 40 45
Gly Trp Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Ser Pro Gln Arg
50 55 60
Glu Ser Arg Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe
65 70 75 80
Ile Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His
85 90 95
His Ser Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr
100 105 110
Gln Lys Ala Ile Asp Gly Val Thr Asn Met Val Asn Ser Ile Ile Asp
115 120 125
Lys Met Gly Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val
130 135 140
Leu Leu Leu Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys
145 150 155 160
Asn Leu Tyr Asp Lys Val Arg Leu Gln Leu Arg Asp Asn Ala Lys Glu
165 170 175
Leu Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Glu Cys
180 185 190
Met Glu Ser Ile Arg Asn Gly Thr Tyr Asn Tyr Pro Gln Tyr Ser Glu
195 200 205
Glu Ala Arg Leu Lys Arg Glu Glu Ile
210 215
<210> 90
<211> 651
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 90
aatttcctcg cgtttcagcc tagcttcctc ggaatactgg ggataattgt atgtgccgtt 60
tcggatgctc tccatacatt cgttatcgca cttatggtag aactcgaagc atccattccc 120
cagttccttg gcgttgtccc gcagctgcag tcggacttta tcatacagat tcttcacgtt 180
agagtcgtgg aacagcagtg tccactggtt cagcagcagc accagcagct ctgccaggtc 240
ggttccactg cctccagagc ccatcttatc aatgatacta ttgaccatgt tggtcacgcc 300
gtcgatagct ttctgagtag actccttatc agcggcgtag ccagatccct gttcgttgga 360
atggtggtag ccgtaccacc catccaccat tccctgccac ccgccctcaa taaaccctgc 420
gatagcgccg aacagtccgc gtttctttct ccggctttcc ctctgtggtg aatttctcag 480
tccggttgcc aggaccagtc cccatccaat gtcctgagcg tgtgtgacgg tcacgttctt 540
ctccatgata gtatccacct gttctgtgga gttgttagca tgatacccaa tacagatctg 600
gtcggacttc accagggaca cgatagccag cagcagcacg attttttcca t 651
<210> 91
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 91
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420
gtgctgctgc tgaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645
<210> 92
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 92
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 93
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 93
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaaatcca gagtccgctc gttcagcagc agcaccagca gctcagccag 240
gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 94
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 94
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccattct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420
gtgctgatgc tgaaccagtt cactctgctg ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645
<210> 95
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 95
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Ile Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Met Leu
130 135 140
Asn Gln Phe Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 96
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 96
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaacagca gagtgaactg gttcagcatc agcaccagca gctcagccag 240
gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccagaat 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 97
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 97
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcaatgga acaggcggag ctgacctggc tgagctgctg 420
gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645
<210> 98
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 98
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Asn Gly Thr Gly Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 99
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 99
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240
gtcagctccg cctgttccat tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 100
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 100
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcggaaat ggcactggag ctgacctggc tgagctgctg 420
gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645
<210> 101
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 101
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 102
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 102
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240
gtcagctcca gtgccatttc cgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 103
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 103
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcagcgga ggcaacggaa cagacctggc tgagctgctg 420
gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645
<210> 104
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 104
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Asn Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 105
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 105
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240
gtctgttccg ttgcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 106
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 106
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420
gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660
atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720
atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780
gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840
cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900
tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960
cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020
cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080
gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140
agtgggtca 1149
<210> 107
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 107
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 108
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 108
tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60
atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120
ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180
aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240
tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300
ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360
ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420
actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480
cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540
atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600
atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660
gaccttctca tacagattct tcacgttgct atcgtggaac agcagagtcc actggttcag 720
cagcagcacc agcagctcag ccaggtctgt tccggagcct ccgctgccca ttttttcgat 780
gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840
ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900
tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960
tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020
gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080
gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140
ggccttcat 1149
<210> 109
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 109
atgaaggcaa tcctggtcgt cctgctgtat actttcgcta ccgctaacgc tgacaccctg 60
tgcatcggct atcacgctaa caactcaacc gacacagtgg atactgtcct ggagaagaac 120
gtgactgtca cccactctgt gaatctgggc agtggactga ggctggcaac tggactgcga 180
aacatcccac agcgggaaac cagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaacga gcagggatca 300
ggctacgccg ctgacctgaa gagcacacag aatgcaatcg atgaaattac taacatggtg 360
aattccgtca tcgagaaaat gggcagcgga ggctccggaa ccgacctggc agaactgctg 420
gtgctgctgc tgaaccagtg gacactgctg taccacgata gtaacgtgaa gaatctgtat 480
gagaaagtcc gatcacagct gaagaacaat gctaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcgacaa cacctgtatg gagagcgtga aaaatggcac atacgattat 600
cccaagtatt ccgaggaagc caaactgaac agagaggaaa ttgactctgg gggcgacatc 660
atcaagctgc tgaacgaaca ggtcaacaag gagatgcaga gctccaatct gtacatgtcc 720
atgtctagtt ggtgttatac ccactctctg gacggcgctg ggctgttcct gtttgatcac 780
gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaacga gaacaatgtg 840
cccgtccagc tgacatcaat cagcgcccct gaacataagt tcgagggcct gactcagatc 900
tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960
cacgccatta agagtaaaga tcatgctacc ttcaattttc tgcagtggta cgtggccgag 1020
cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080
gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gagccggaaa 1140
agtgggtca 1149
<210> 110
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 110
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asp Glu Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Tyr His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 111
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 111
tgacccactt ttccggctct tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60
atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120
ctcgtgctgc tcggccacgt accactgcag aaaattgaag gtagcatgat ctttactctt 180
aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240
tttctgaaag atctgagtca ggccctcgaa cttatgttca ggggcgctga ttgatgtcag 300
ctggacgggc acattgttct cgttcaggaa aatgatcagt ttctttgcat gttcgtattc 360
ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agagagtggg tataacacca 420
actagacatg gacatgtaca gattggagct ctgcatctcc ttgttgacct gttcgttcag 480
cagcttgatg atgtcgcccc cagagtcaat ttcctctctg ttcagtttgg cttcctcgga 540
atacttggga taatcgtatg tgccattttt cacgctctcc atacaggtgt tgtcgcactt 600
atggtaaaac tcgaagcatc cattcccgat ttctttagca ttgttcttca gctgtgatcg 660
gactttctca tacagattct tcacgttact atcgtggtac agcagtgtcc actggttcag 720
cagcagcacc agcagttctg ccaggtcggt tccggagcct ccgctgccca ttttctcgat 780
gacggaattc accatgttag taatttcatc gattgcattc tgtgtgctct tcaggtcagc 840
ggcgtagcct gatccctgct cgttctgatg gtggtagccg taccacccgt ccaccattcc 900
tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctctgg tttcccgctg 960
tgggatgttt cgcagtccag ttgccagcct cagtccactg cccagattca cagagtgggt 1020
gacagtcacg ttcttctcca ggacagtatc cactgtgtcg gttgagttgt tagcgtgata 1080
gccgatgcac agggtgtcag cgttagcggt agcgaaagta tacagcagga cgaccaggat 1140
tgccttcat 1149
<210> 112
<211> 1143
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 112
atggctatca tctacctgat cctgctgttc actgctgtgc ggggggacca gatttgcatc 60
ggctaccacg ctaataattc aactgagaag gtggatacta tcctggagcg gaacgtgacc 120
gtcacacacg ctaaagacat tggcagcgga ctggtgctgg caaccggact gaggaatgtc 180
ccacagatcg agtcccgcgg actgttcggc gctatcgcag ggtttattga aggcgggtgg 240
cagggaatga ttgatgggtg gtacggctac caccattcta acgaccaagg aagtggctac 300
gccgctgata aggagagtac tcagaaagcc ttcgatggca tcaccaacat ggtgaattca 360
gtcattgaga agatgggcag cggaggctcc ggaaccgacc tggcagaact gctggtgctg 420
ctgctgaatc agtggacact gctgtttcac gactctaacg tgaagaatct gtatgataaa 480
gtccggatgc agctgagaga caacgtgaag gagctgggga atggatgctt cgaattttac 540
cataagtgcg acgatgagtg tatgaacagt gtcaaaaatg gcacatacga ttatcccaag 600
tatgaggaag agtcaaaact gaaccgaaat gaaatcaaga gcgggggcga catcatcaag 660
ctgctgaacg agcaagtgaa taaggaaatg cagagctcca acctgtacat gtccatgtct 720
agttggtgtt atactcactc tctggatggc gccgggctgt tcctgtttga ccacgcagcc 780
gaagagtacg agcatgctaa gaaactgatc attttcctga acgaaaacaa cgtgcccgtc 840
cagctgacat caatcagcgc acctgagcat aagttcgaag gcctgactca gatctttcag 900
aaagcttacg agcacgaaca gcatatttcc gagtctatca acaatattgt ggaccacgcc 960
atcaagagca aagatcatgc taccttcaac tttctgcagt ggtacgtggc cgagcagcac 1020
gaagaggaag tcctgtttaa ggacatcctg gataaaatcg agctgattgg aaacgaaaat 1080
catggcctgt acctggcaga ccagtatgtg aagggcattg ccaagtccag aaaaagtggg 1140
tca 1143
<210> 113
<211> 381
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 113
Met Ala Ile Ile Tyr Leu Ile Leu Leu Phe Thr Ala Val Arg Gly Asp
1 5 10 15
Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Lys Val Asp
20 25 30
Thr Ile Leu Glu Arg Asn Val Thr Val Thr His Ala Lys Asp Ile Gly
35 40 45
Ser Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln Ile Glu
50 55 60
Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp
65 70 75 80
Gln Gly Met Ile Asp Gly Trp Tyr Gly Tyr His His Ser Asn Asp Gln
85 90 95
Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala Phe Asp
100 105 110
Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser Gly
115 120 125
Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn Gln
130 135 140
Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys
145 150 155 160
Val Arg Met Gln Leu Arg Asp Asn Val Lys Glu Leu Gly Asn Gly Cys
165 170 175
Phe Glu Phe Tyr His Lys Cys Asp Asp Glu Cys Met Asn Ser Val Lys
180 185 190
Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Glu Glu Glu Ser Lys Leu Asn
195 200 205
Arg Asn Glu Ile Lys Ser Gly Gly Asp Ile Ile Lys Leu Leu Asn Glu
210 215 220
Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser Met Ser
225 230 235 240
Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe Leu Phe
245 250 255
Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile Ile Phe
260 265 270
Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser Ala Pro
275 280 285
Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala Tyr Glu
290 295 300
His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp His Ala
305 310 315 320
Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp Tyr Val
325 330 335
Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu Asp Lys
340 345 350
Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala Asp Gln
355 360 365
Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 114
<211> 1143
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 114
tgacccactt tttctggact tggcaatgcc cttcacatac tggtctgcca ggtacaggcc 60
atgattttcg tttccaatca gctcgatttt atccaggatg tccttaaaca ggacttcctc 120
ttcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180
gatggcgtgg tccacaatat tgttgataga ctcggaaata tgctgttcgt gctcgtaagc 240
tttctgaaag atctgagtca ggccttcgaa cttatgctca ggtgcgctga ttgatgtcag 300
ctggacgggc acgttgtttt cgttcaggaa aatgatcagt ttcttagcat gctcgtactc 360
ttcggctgcg tggtcaaaca ggaacagccc ggcgccatcc agagagtgag tataacacca 420
actagacatg gacatgtaca ggttggagct ctgcatttcc ttattcactt gctcgttcag 480
cagcttgatg atgtcgcccc cgctcttgat ttcatttcgg ttcagttttg actcttcctc 540
atacttggga taatcgtatg tgccattttt gacactgttc atacactcat cgtcgcactt 600
atggtaaaat tcgaagcatc cattccccag ctccttcacg ttgtctctca gctgcatccg 660
gactttatca tacagattct tcacgttaga gtcgtgaaac agcagtgtcc actgattcag 720
cagcagcacc agcagttctg ccaggtcggt tccggagcct ccgctgccca tcttctcaat 780
gactgaattc accatgttgg tgatgccatc gaaggctttc tgagtactct ccttatcagc 840
ggcgtagcca cttccttggt cgttagaatg gtggtagccg taccacccat caatcattcc 900
ctgccacccg ccttcaataa accctgcgat agcgccgaac agtccgcggg actcgatctg 960
tgggacattc ctcagtccgg ttgccagcac cagtccgctg ccaatgtctt tagcgtgtgt 1020
gacggtcacg ttccgctcca ggatagtatc caccttctca gttgaattat tagcgtggta 1080
gccgatgcaa atctggtccc cccgcacagc agtgaacagc aggatcaggt agatgatagc 1140
cat 1143
<210> 115
<211> 1158
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 115
atggaaaaaa tcgtgctgct gctggctatc gtgtccctgg tgaagtccga ccagatctgt 60
attgggtatc atgctaacaa ctccacagaa caggtggata ctatcatgga gaagaacgtg 120
accgtcacac acgctcagga cattggatgg ggactggtcc tggcaaccgg actgagaaat 180
tcaccacaga gggaaagccg gagaaagaaa cgcggactgt tcggcgctat cgcagggttt 240
attgagggcg ggtggcaggg aatggtggat gggtggtacg gctaccacca ttccaacgaa 300
cagggatctg gctacgccgc tgataaggag tctactcaga aagctatcga cggcgtgacc 360
aacatggtca atagtatcat tgataagatg ggctctggag gcagtggaac cgacctggca 420
gagctgctgg tgctgctgct gaaccagtgg acactgctgt tccacgactc taacgtgaag 480
aatctgtatg ataaagtccg actgcagctg cgggacaacg ccaaggaact ggggaatgga 540
tgcttcgagt tctaccataa gtgcgataac gaatgtatgg agagcatccg aaacggcaca 600
tacaattatc cccagtattc cgaggaagct aggctgaaac gcgaggaaat tagctccggg 660
ggagacatca ttaagctgct gaacgaacag gtgaacaagg agatgcagtc tagtaacctg 720
tacatgagta tgtcaagctg gtgttatact cactcactgg atggcgccgg gctgttcctg 780
tttgaccacg cagccgagga atacgaacat gctaagaaac tgatcatttt cctgaatgag 840
aacaatgtgc ccgtccagct gacatccatc tctgcacctg aacataagtt cgagggcctg 900
actcagatct ttcagaaagc ctacgaacac gagcagcata ttagtgagtc aatcaacaat 960
attgtggacc acgccatcaa gagcaaagat catgctacct tcaattttct gcagtggtac 1020
gtggccgagc agcacgagga agaggtcctg tttaaggaca tcctggataa aatcgaactg 1080
attggaaacg agaatcatgg cctgtacctg gcagaccagt atgtgaaggg cattgccaag 1140
tccaggaaaa gcgggtcc 1158
<210> 116
<211> 386
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 116
Met Glu Lys Ile Val Leu Leu Leu Ala Ile Val Ser Leu Val Lys Ser
1 5 10 15
Asp Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Gln Val
20 25 30
Asp Thr Ile Met Glu Lys Asn Val Thr Val Thr His Ala Gln Asp Ile
35 40 45
Gly Trp Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Ser Pro Gln Arg
50 55 60
Glu Ser Arg Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe
65 70 75 80
Ile Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His
85 90 95
His Ser Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr
100 105 110
Gln Lys Ala Ile Asp Gly Val Thr Asn Met Val Asn Ser Ile Ile Asp
115 120 125
Lys Met Gly Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val
130 135 140
Leu Leu Leu Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys
145 150 155 160
Asn Leu Tyr Asp Lys Val Arg Leu Gln Leu Arg Asp Asn Ala Lys Glu
165 170 175
Leu Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Glu Cys
180 185 190
Met Glu Ser Ile Arg Asn Gly Thr Tyr Asn Tyr Pro Gln Tyr Ser Glu
195 200 205
Glu Ala Arg Leu Lys Arg Glu Glu Ile Ser Ser Gly Gly Asp Ile Ile
210 215 220
Lys Leu Leu Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu
225 230 235 240
Tyr Met Ser Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala
245 250 255
Gly Leu Phe Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys
260 265 270
Lys Leu Ile Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr
275 280 285
Ser Ile Ser Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe
290 295 300
Gln Lys Ala Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn
305 310 315 320
Ile Val Asp His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe
325 330 335
Leu Gln Trp Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys
340 345 350
Asp Ile Leu Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu
355 360 365
Tyr Leu Ala Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser
370 375 380
Gly Ser
385
<210> 117
<211> 1158
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 117
ggacccgctt ttcctggact tggcaatgcc cttcacatac tggtctgcca ggtacaggcc 60
atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcttc 120
ctcgtgctgc tcggccacgt accactgcag aaaattgaag gtagcatgat ctttgctctt 180
gatggcgtgg tccacaatat tgttgattga ctcactaata tgctgctcgt gttcgtaggc 240
tttctgaaag atctgagtca ggccctcgaa cttatgttca ggtgcagaga tggatgtcag 300
ctggacgggc acattgttct cattcaggaa aatgatcagt ttcttagcat gttcgtattc 360
ctcggctgcg tggtcaaaca ggaacagccc ggcgccatcc agtgagtgag tataacacca 420
gcttgacata ctcatgtaca ggttactaga ctgcatctcc ttgttcacct gttcgttcag 480
cagcttaatg atgtctcccc cggagctaat ttcctcgcgt ttcagcctag cttcctcgga 540
atactgggga taattgtatg tgccgtttcg gatgctctcc atacattcgt tatcgcactt 600
atggtagaac tcgaagcatc cattccccag ttccttggcg ttgtcccgca gctgcagtcg 660
gactttatca tacagattct tcacgttaga gtcgtggaac agcagtgtcc actggttcag 720
cagcagcacc agcagctctg ccaggtcggt tccactgcct ccagagccca tcttatcaat 780
gatactattg accatgttgg tcacgccgtc gatagctttc tgagtagact ccttatcagc 840
ggcgtagcca gatccctgtt cgttggaatg gtggtagccg taccacccat ccaccattcc 900
ctgccacccg ccctcaataa accctgcgat agcgccgaac agtccgcgtt tctttctccg 960
gctttccctc tgtggtgaat ttctcagtcc ggttgccagg accagtcccc atccaatgtc 1020
ctgagcgtgt gtgacggtca cgttcttctc catgatagta tccacctgtt ctgtggagtt 1080
gttagcatga tacccaatac agatctggtc ggacttcacc agggacacga tagccagcag 1140
cagcacgatt ttttccat 1158
<210> 118
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 118
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420
gtgctgctgc tgaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660
atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720
atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780
gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840
cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900
tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960
cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020
cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080
gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140
agtgggtca 1149
<210> 119
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 119
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 120
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 120
tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60
atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120
ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180
aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240
tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300
ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360
ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420
actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480
cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540
atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600
atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660
gaccttctca tacagattct tcacgttgct atcgtggaaa tccagagtcc gctcgttcag 720
cagcagcacc agcagctcag ccaggtctgt tccggagcct ccgctgccca ttttttcgat 780
gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840
ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900
tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960
tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020
gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080
gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140
ggccttcat 1149
<210> 121
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 121
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccattct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420
gtgctgatgc tgaaccagtt cactctgctg ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660
atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720
atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780
gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840
cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900
tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960
cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020
cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080
gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140
agtgggtca 1149
<210> 122
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 122
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Ile Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Met Leu
130 135 140
Asn Gln Phe Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 123
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 123
tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60
atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120
ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180
aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240
tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300
ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360
ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420
actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480
cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540
atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600
atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660
gaccttctca tacagattct tcacgttgct atcgtggaac agcagagtga actggttcag 720
catcagcacc agcagctcag ccaggtctgt tccggagcct ccgctgccca ttttttcgat 780
gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840
ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900
tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960
tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020
gacagtcacg ttcttctcca gaatggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080
gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140
ggccttcat 1149
<210> 124
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 124
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcaatgga acaggcggag ctgacctggc tgagctgctg 420
gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660
atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720
atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780
gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840
cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900
tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960
cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020
cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080
gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140
agtgggtca 1149
<210> 125
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 125
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Asn Gly Thr Gly Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 126
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 126
tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60
atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120
ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180
aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240
tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300
ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360
ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420
actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480
cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540
atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600
atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660
gaccttctca tacagattct tcacgttgct atcgtggaac agcagagtcc actggttcag 720
cagcagcacc agcagctcag ccaggtcagc tccgcctgtt ccattgccca ttttttcgat 780
gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840
ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900
tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960
tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020
gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080
gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140
ggccttcat 1149
<210> 127
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 127
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcggaaat ggcactggag ctgacctggc tgagctgctg 420
gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660
atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720
atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780
gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840
cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900
tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960
cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020
cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080
gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140
agtgggtca 1149
<210> 128
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 128
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 129
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 129
tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60
atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120
ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180
aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240
tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300
ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360
ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420
actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480
cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540
atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600
atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660
gaccttctca tacagattct tcacgttgct atcgtggaac agcagagtcc actggttcag 720
cagcagcacc agcagctcag ccaggtcagc tccagtgcca tttccgccca ttttttcgat 780
gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840
ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900
tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960
tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020
gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080
gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140
ggccttcat 1149
<210> 130
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 130
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcagcgga ggcaacggaa cagacctggc tgagctgctg 420
gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660
atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720
atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780
gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840
cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900
tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960
cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020
cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080
gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140
agtgggtca 1149
<210> 131
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 131
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Asn Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 132
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 132
tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60
atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120
ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180
aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240
tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300
ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360
ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420
actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480
cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540
atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600
atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660
gaccttctca tacagattct tcacgttgct atcgtggaac agcagagtcc actggttcag 720
cagcagcacc agcagctcag ccaggtctgt tccgttgcct ccgctgccca ttttttcgat 780
gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840
ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900
tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960
tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020
gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080
gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140
ggccttcat 1149
<210> 133
<211> 33
<212> PRT
<213> Influenza A virus
<400> 133
Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn Lys Leu Glu Arg
1 5 10 15
Arg Met Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile
20 25 30
Trp
<210> 134
<211> 12
<212> PRT
<213> Influenza A virus
<400> 134
Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn
1 5 10
<210> 135
<211> 12
<212> PRT
<213> Influenza A virus
<400> 135
Asn Lys Leu Glu Arg Arg Met Glu Asn Leu Asn Lys
1 5 10
<210> 136
<211> 11
<212> PRT
<213> Influenza A virus
<400> 136
Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp
1 5 10
<210> 137
<211> 33
<212> PRT
<213> Influenza A virus
<400> 137
Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe Asn His Leu Glu Lys
1 5 10 15
Arg Ile Glu Asn Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile
20 25 30
Trp
<210> 138
<211> 11
<212> PRT
<213> Influenza A virus
<400> 138
Asn Thr Gln Phe Thr Ala Val Gly Lys Glu Phe
1 5 10
<210> 139
<211> 11
<212> PRT
<213> Influenza A virus
<400> 139
Phe Asn His Leu Glu Lys Arg Ile Glu Asn Leu
1 5 10
<210> 140
<211> 13
<212> PRT
<213> Influenza A virus
<400> 140
Leu Asn Lys Lys Val Asp Asp Gly Phe Leu Asp Ile Trp
1 5 10
<210> 141
<211> 33
<212> PRT
<213> Influenza A virus
<400> 141
Asn Thr Gln Phe Glu Ala Val Gly Lys Glu Phe Ser Asn Leu Glu Arg
1 5 10 15
Arg Leu Glu Asn Leu Asn Lys Lys Met Glu Asp Gly Phe Leu Asp Val
20 25 30
Trp
<210> 142
<211> 11
<212> PRT
<213> Influenza A virus
<400> 142
Asn Thr Gln Phe Glu Ala Val Gly Lys Glu Phe
1 5 10
<210> 143
<211> 12
<212> PRT
<213> Influenza A virus
<400> 143
Phe Ser Asn Leu Glu Arg Arg Leu Glu Asn Leu Asn
1 5 10
<210> 144
<211> 12
<212> PRT
<213> Influenza A virus
<400> 144
Asn Lys Lys Met Glu Asp Gly Phe Leu Asp Val Trp
1 5 10
<210> 145
<211> 33
<212> PRT
<213> Influenza A virus
<400> 145
Asn Thr Gln Phe Glu Ala Val Gly Arg Glu Phe Asn Asn Leu Glu Arg
1 5 10 15
Arg Ile Glu Asn Leu Asn Lys Lys Met Glu Asp Gly Phe Leu Asp Val
20 25 30
Trp
<210> 146
<211> 11
<212> PRT
<213> Influenza A virus
<400> 146
Asn Thr Gln Phe Glu Ala Val Gly Arg Glu Phe
1 5 10
<210> 147
<211> 12
<212> PRT
<213> Influenza A virus
<400> 147
Phe Asn Asn Leu Glu Arg Arg Ile Glu Asn Leu Asn
1 5 10
<210> 148
<211> 12
<212> PRT
<213> Influenza A virus
<400> 148
Asn Lys Lys Met Glu Asp Gly Phe Leu Asp Val Trp
1 5 10
<210> 149
<211> 53
<212> PRT
<213> Influenza A virus
<400> 149
Lys Val Asn Ser Val Ile Glu Lys Met Asn Thr Gln Phe Thr Ala Val
1 5 10 15
Gly Lys Glu Phe Asn Lys Leu Glu Arg Arg Met Glu Asn Leu Asn Lys
20 25 30
Lys Val Asp Asp Gly Phe Leu Asp Ile Trp Thr Tyr Asn Ala Glu Leu
35 40 45
Leu Val Leu Leu Glu
50
<210> 150
<211> 20
<212> PRT
<213> Influenza A virus
<400> 150
Lys Val Asn Ser Val Ile Glu Lys Met Thr Tyr Asn Ala Glu Leu Leu
1 5 10 15
Val Leu Leu Glu
20
<210> 151
<211> 53
<212> PRT
<213> Influenza A virus
<400> 151
Lys Val Asn Ser Val Ile Glu Lys Met Asn Thr Gln Phe Thr Ala Val
1 5 10 15
Gly Lys Glu Phe Asn His Leu Glu Lys Arg Ile Glu Asn Leu Asn Lys
20 25 30
Lys Val Asp Asp Gly Phe Leu Asp Ile Trp Thr Tyr Asn Ala Glu Leu
35 40 45
Leu Val Leu Leu Glu
50
<210> 152
<211> 20
<212> PRT
<213> Influenza A virus
<400> 152
Lys Val Asn Ser Val Ile Glu Lys Met Thr Tyr Asn Ala Glu Leu Leu
1 5 10 15
Val Leu Leu Glu
20
<210> 153
<211> 53
<212> PRT
<213> Influenza A virus
<400> 153
Lys Val Asn Ser Val Ile Glu Lys Met Asn Thr Gln Phe Glu Ala Val
1 5 10 15
Gly Lys Glu Phe Ser Asn Leu Glu Arg Arg Leu Glu Asn Leu Asn Lys
20 25 30
Lys Met Glu Asp Gly Phe Leu Asp Val Trp Thr Tyr Asn Ala Glu Leu
35 40 45
Leu Val Leu Met Glu
50
<210> 154
<211> 20
<212> PRT
<213> Influenza A virus
<400> 154
Lys Val Asn Ser Val Ile Glu Lys Met Thr Tyr Asn Ala Glu Leu Leu
1 5 10 15
Val Leu Met Glu
20
<210> 155
<211> 53
<212> PRT
<213> Influenza A virus
<400> 155
Lys Val Asn Ser Ile Ile Asp Lys Met Asn Thr Gln Phe Glu Ala Val
1 5 10 15
Gly Arg Glu Phe Asn Asn Leu Glu Arg Arg Ile Glu Asn Leu Asn Lys
20 25 30
Lys Met Glu Asp Gly Phe Leu Asp Val Trp Thr Tyr Asn Ala Glu Leu
35 40 45
Leu Val Leu Met Glu
50
<210> 156
<211> 20
<212> PRT
<213> Influenza A virus
<400> 156
Lys Val Asn Ser Ile Ile Asp Lys Met Thr Tyr Asn Ala Glu Leu Leu
1 5 10 15
Val Leu Met Glu
20
<210> 157
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 157
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420
gtgctgctga tgaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645
<210> 158
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 158
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Met
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 159
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 159
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaaatcca gagtccgctc gttcatcagc agcaccagca gctcagccag 240
gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 160
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 160
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420
gtgctgctga tgaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660
atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720
atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780
gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840
cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900
tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960
cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020
cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080
gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140
agtgggtca 1149
<210> 161
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 161
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Met
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 162
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 162
tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60
atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120
ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180
aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240
tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300
ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360
ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420
actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480
cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540
atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600
atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660
gaccttctca tacagattct tcacgttgct atcgtggaaa tccagagtcc gctcgttcat 720
cagcagcacc agcagctcag ccaggtctgt tccggagcct ccgctgccca ttttttcgat 780
gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840
ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900
tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960
tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020
gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080
gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140
ggccttcat 1149
<210> 163
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 163
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatcgtg 360
aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420
gtgctgctga tcaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645
<210> 164
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 164
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Ile Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 165
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 165
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaaatcca gagtccgctc gttgatcagc agcaccagca gctcagccag 240
gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacga tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 166
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 166
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatcgtg 360
aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420
gtgctgctga tcaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660
atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720
atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780
gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840
cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900
tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960
cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020
cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080
gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140
agtgggtca 1149
<210> 167
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 167
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Ile Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 168
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 168
tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60
atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120
ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180
aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240
tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300
ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360
ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420
actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480
cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540
atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600
atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660
gaccttctca tacagattct tcacgttgct atcgtggaaa tccagagtcc gctcgttgat 720
cagcagcacc agcagctcag ccaggtctgt tccggagcct ccgctgccca ttttttcgat 780
gacagaattc acgatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840
ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900
tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960
tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020
gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080
gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140
ggccttcat 1149
<210> 169
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 169
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacctggtg 360
aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420
gtgctgctga tcaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645
<210> 170
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 170
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 171
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 171
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaaatcca gagtccgctc gttgatcagc agcaccagca gctcagccag 240
gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca ggttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 172
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 172
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacctggtg 360
aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420
gtgctgctga tcaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660
atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720
atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780
gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840
cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900
tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960
cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020
cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080
gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140
agtgggtca 1149
<210> 173
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 173
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 174
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 174
tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60
atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120
ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180
aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240
tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300
ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360
ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420
actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480
cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540
atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600
atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660
gaccttctca tacagattct tcacgttgct atcgtggaaa tccagagtcc gctcgttgat 720
cagcagcacc agcagctcag ccaggtctgt tccggagcct ccgctgccca ttttttcgat 780
gacagaattc accaggttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840
ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900
tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960
tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020
gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080
gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140
ggccttcat 1149
<210> 175
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 175
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacctggtg 360
aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420
gtgctgctgc tgaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645
<210> 176
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 176
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 177
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 177
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaaatcca gagtccgctc gttcagcagc agcaccagca gctcagccag 240
gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca ggttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 178
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 178
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacctggtg 360
aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420
gtgctgctgc tgaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660
atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720
atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780
gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840
cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900
tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960
cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020
cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080
gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140
agtgggtca 1149
<210> 179
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 179
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 180
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 180
tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60
atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120
ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180
aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240
tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300
ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360
ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420
actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480
cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540
atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600
atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660
gaccttctca tacagattct tcacgttgct atcgtggaaa tccagagtcc gctcgttcag 720
cagcagcacc agcagctcag ccaggtctgt tccggagcct ccgctgccca ttttttcgat 780
gacagaattc accaggttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840
ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900
tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960
tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020
gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080
gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140
ggccttcat 1149
<210> 181
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 181
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc ataacaatac ccagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcggaaat ggcactggag ctgacctggc tgagctgctg 420
gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645
<210> 182
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 182
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn
85 90 95
Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 183
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 183
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240
gtcagctcca gtgccatttc cgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgggtatt 360
gttatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 184
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 184
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc ataacaatac ccagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcggaaat ggcactggag ctgacctggc tgagctgctg 420
gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660
atcaagctgc tgaacgaaca ggtgaacaag gagatgaaca gcaccaacct gtacatgagt 720
atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780
gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840
cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900
tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960
cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020
cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080
gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140
agtgggtca 1149
<210> 185
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 185
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn
85 90 95
Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Asn Ser Thr Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 186
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 186
tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60
atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120
ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180
aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240
tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300
ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360
ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420
actagacata ctcatgtaca ggttggtgct gttcatctcc ttgttcacct gttcgttcag 480
cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540
atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600
atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660
gaccttctca tacagattct tcacgttgct atcgtggaac agcagagtcc actggttcag 720
cagcagcacc agcagctcag ccaggtcagc tccagtgcca tttccgccca ttttttcgat 780
gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840
ggcgtagccg ctgccctggg tattgttatg gtggtagccg taccacccgt ccaccattcc 900
tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960
tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020
gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080
gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140
ggccttcat 1149
<210> 187
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 187
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc ataacaatac ccagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcggaaat ggcactggag ctgacctggc tgagctgctg 420
gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645
<210> 188
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 188
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn
85 90 95
Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 189
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 189
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240
gtcagctcca gtgccatttc cgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgggtatt 360
gttatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 190
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 190
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc ataacaatac ccagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcggaaat ggcactggag ctgacctggc tgagctgctg 420
gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660
atcaagctgc tgaacgaaca ggtgaacaag gagatgaaca gcaccaacct gtacatgagt 720
atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780
gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840
cccgtcaacc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900
tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960
cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020
cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080
gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140
agtgggtca 1149
<210> 191
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 191
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn
85 90 95
Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Asn Ser Thr Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Asn Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 192
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 192
tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60
atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120
ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180
aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240
tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300
gttgacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360
ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420
actagacata ctcatgtaca ggttggtgct gttcatctcc ttgttcacct gttcgttcag 480
cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540
atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600
atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660
gaccttctca tacagattct tcacgttgct atcgtggaac agcagagtcc actggttcag 720
cagcagcacc agcagctcag ccaggtcagc tccagtgcca tttccgccca ttttttcgat 780
gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840
ggcgtagccg ctgccctggg tattgttatg gtggtagccg taccacccgt ccaccattcc 900
tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960
tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020
gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080
gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140
ggccttcat 1149
<210> 193
<211> 465
<212> DNA
<213> Aquifex aeolicus
<400> 193
atgcaaattt acgaagggaa actaaccgct gaagggctga ggttcggtat agtggcttcc 60
aggttcaacc acgcactcgt ggatagacta gttgagggag ctatagactg catagtaaga 120
cacgggggaa gggaagaaga cataacgctc gttagagtgc cgggctcctg ggaaattccc 180
gtggctgcgg gagagcttgc gagaaaagag gacatagacg ctgtgatagc gataggagtt 240
ctaataaggg gggctactcc ccactttgat tacatagcct ctgaagtgtc aaaagggctt 300
gcgaaccttt ccttagaact gagaaaaccc ataaccttcg gtgttataac tgcggacacc 360
ttggagcagg cgatagaaag ggcgggaaca aagcacggga ataagggctg ggaagctgca 420
ctttccgcaa tagaaatggc aaacttattt aagagtctga gatga 465
<210> 194
<211> 154
<212> PRT
<213> Aquifex aeolicus
<400> 194
Met Gln Ile Tyr Glu Gly Lys Leu Thr Ala Glu Gly Leu Arg Phe Gly
1 5 10 15
Ile Val Ala Ser Arg Phe Asn His Ala Leu Val Asp Arg Leu Val Glu
20 25 30
Gly Ala Ile Asp Cys Ile Val Arg His Gly Gly Arg Glu Glu Asp Ile
35 40 45
Thr Leu Val Arg Val Pro Gly Ser Trp Glu Ile Pro Val Ala Ala Gly
50 55 60
Glu Leu Ala Arg Lys Glu Asp Ile Asp Ala Val Ile Ala Ile Gly Val
65 70 75 80
Leu Ile Arg Gly Ala Thr Pro His Phe Asp Tyr Ile Ala Ser Glu Val
85 90 95
Ser Lys Gly Leu Ala Asn Leu Ser Leu Glu Leu Arg Lys Pro Ile Thr
100 105 110
Phe Gly Val Ile Thr Ala Asp Thr Leu Glu Gln Ala Ile Glu Arg Ala
115 120 125
Gly Thr Lys His Gly Asn Lys Gly Trp Glu Ala Ala Leu Ser Ala Ile
130 135 140
Glu Met Ala Asn Leu Phe Lys Ser Leu Arg
145 150
<210> 195
<211> 465
<212> DNA
<213> Aquifex aeolicus
<400> 195
tcatctcaga ctcttaaata agtttgccat ttctattgcg gaaagtgcag cttcccagcc 60
cttattcccg tgctttgttc ccgccctttc tatcgcctgc tccaaggtgt ccgcagttat 120
aacaccgaag gttatgggtt ttctcagttc taaggaaagg ttcgcaagcc cttttgacac 180
ttcagaggct atgtaatcaa agtggggagt agcccccctt attagaactc ctatcgctat 240
cacagcgtct atgtcctctt ttctcgcaag ctctcccgca gccacgggaa tttcccagga 300
gcccggcact ctaacgagcg ttatgtcttc ttcccttccc ccgtgtctta ctatgcagtc 360
tatagctccc tcaactagtc tatccacgag tgcgtggttg aacctggaag ccactatacc 420
gaacctcagc ccttcagcgg ttagtttccc ttcgtaaatt tgcat 465
<210> 196
<211> 642
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 196
atgaaggcca agctgctggt gctcctgtgc accttcaccg ccacctacgc cgacaccatc 60
tgcatcggct accacgccaa caacagcacc gacaccgtgg ataccgtgct ggaaaagaac 120
gtgaccgtga cccacagcgt gaacctgggc agcggcctgc ggatggtgac aggcctgcgg 180
aacatccccc agagagagac acggggcctg ttcggcgcca ttgccggctt tatcgagggc 240
ggctggaccg gcatggtgga cgggtggtac ggctaccacc accagaacga gcagggcagc 300
ggctacgccg ccgaccagaa gtccacccag aacgccatca acggcatcac caacatggtg 360
aacagcgtga tcgagaagat gggctccggc ggcagcggca ccgatctggc tgaactgctg 420
gtcctgctgc tgaacgagcg gaccctggac ttccacgaca gcaacgtgaa gaacctgtac 480
gagaaagtga agtcccagct gaagaacaac gccaaagaga tcggcaacgg ctgcttcgag 540
ttctaccaca agtgcaacaa cgagtgcatg gaaagcgtga agaacggcac ctacgactac 600
cccaagtaca gcgaggaaag caagctgaac cgcgagggag gc 642
<210> 197
<211> 214
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 197
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Gly Gly
210
<210> 198
<211> 642
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 198
gcctccctcg cggttcagct tgctttcctc gctgtacttg gggtagtcgt aggtgccgtt 60
cttcacgctt tccatgcact cgttgttgca cttgtggtag aactcgaagc agccgttgcc 120
gatctctttg gcgttgttct tcagctggga cttcactttc tcgtacaggt tcttcacgtt 180
gctgtcgtgg aagtccaggg tccgctcgtt cagcagcagg accagcagtt cagccagatc 240
ggtgccgctg ccgccggagc ccatcttctc gatcacgctg ttcaccatgt tggtgatgcc 300
gttgatggcg ttctgggtgg acttctggtc ggcggcgtag ccgctgccct gctcgttctg 360
gtggtggtag ccgtaccacc cgtccaccat gccggtccag ccgccctcga taaagccggc 420
aatggcgccg aacaggcccc gtgtctctct ctgggggatg ttccgcaggc ctgtcaccat 480
ccgcaggccg ctgcccaggt tcacgctgtg ggtcacggtc acgttctttt ccagcacggt 540
atccacggtg tcggtgctgt tgttggcgtg gtagccgatg cagatggtgt cggcgtaggt 600
ggcggtgaag gtgcacagga gcaccagcag cttggccttc at 642
<210> 199
<211> 1104
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 199
atgaaggcca agctgctggt gctcctgtgc accttcaccg ccacctacgc cgacaccatc 60
tgcatcggct accacgccaa caacagcacc gacaccgtgg ataccgtgct ggaaaagaac 120
gtgaccgtga cccacagcgt gaacctgggc agcggcctgc ggatggtgac aggcctgcgg 180
aacatccccc agagagagac acggggcctg ttcggcgcca ttgccggctt tatcgagggc 240
ggctggaccg gcatggtgga cgggtggtac ggctaccacc accagaacga gcagggcagc 300
ggctacgccg ccgaccagaa gtccacccag aacgccatca acggcatcac caacatggtg 360
aacagcgtga tcgagaagat gggctccggc ggcagcggca ccgatctggc tgaactgctg 420
gtcctgctgc tgaacgagcg gaccctggac ttccacgaca gcaacgtgaa gaacctgtac 480
gagaaagtga agtcccagct gaagaacaac gccaaagaga tcggcaacgg ctgcttcgag 540
ttctaccaca agtgcaacaa cgagtgcatg gaaagcgtga agaacggcac ctacgactac 600
cccaagtaca gcgaggaaag caagctgaac cgcgagggag gcatgcaaat ctacgagggc 660
aagctgacag ccgagggcct gagattcggc atcgtggcca gccggttcaa ccacgccctg 720
gtggacagac tggtggaagg cgccatcgac tgcatcgtgc ggcacggcgg cagagaagag 780
gacatcaccc tggtccgcgt gcccggcagc tgggaaattc ctgtggctgc cggcgagctg 840
gcccggaaag aggatatcga cgccgtcatc gccatcggcg tgctgatcag aggcgccacc 900
ccccacttcg actatatcgc cagcgaggtg tccaagggcc tggccaacct gagcctggaa 960
ctgcggaagc ccatcacctt cggagtgatc accgccgaca ccctggaaca ggccatcgag 1020
agagccggca ccaagcacgg caacaaggga tgggaagccg ccctgagcgc catcgagatg 1080
gccaatctgt tcaagagcct gcgc 1104
<210> 200
<211> 368
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 200
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Gly Gly Met Gln Ile Tyr Glu Gly Lys Leu Thr Ala
210 215 220
Glu Gly Leu Arg Phe Gly Ile Val Ala Ser Arg Phe Asn His Ala Leu
225 230 235 240
Val Asp Arg Leu Val Glu Gly Ala Ile Asp Cys Ile Val Arg His Gly
245 250 255
Gly Arg Glu Glu Asp Ile Thr Leu Val Arg Val Pro Gly Ser Trp Glu
260 265 270
Ile Pro Val Ala Ala Gly Glu Leu Ala Arg Lys Glu Asp Ile Asp Ala
275 280 285
Val Ile Ala Ile Gly Val Leu Ile Arg Gly Ala Thr Pro His Phe Asp
290 295 300
Tyr Ile Ala Ser Glu Val Ser Lys Gly Leu Ala Asn Leu Ser Leu Glu
305 310 315 320
Leu Arg Lys Pro Ile Thr Phe Gly Val Ile Thr Ala Asp Thr Leu Glu
325 330 335
Gln Ala Ile Glu Arg Ala Gly Thr Lys His Gly Asn Lys Gly Trp Glu
340 345 350
Ala Ala Leu Ser Ala Ile Glu Met Ala Asn Leu Phe Lys Ser Leu Arg
355 360 365
<210> 201
<211> 1104
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 201
gcgcaggctc ttgaacagat tggccatctc gatggcgctc agggcggctt cccatccctt 60
gttgccgtgc ttggtgccgg ctctctcgat ggcctgttcc agggtgtcgg cggtgatcac 120
tccgaaggtg atgggcttcc gcagttccag gctcaggttg gccaggccct tggacacctc 180
gctggcgata tagtcgaagt ggggggtggc gcctctgatc agcacgccga tggcgatgac 240
ggcgtcgata tcctctttcc gggccagctc gccggcagcc acaggaattt cccagctgcc 300
gggcacgcgg accagggtga tgtcctcttc tctgccgccg tgccgcacga tgcagtcgat 360
ggcgccttcc accagtctgt ccaccagggc gtggttgaac cggctggcca cgatgccgaa 420
tctcaggccc tcggctgtca gcttgccctc gtagatttgc atgcctccct cgcggttcag 480
cttgctttcc tcgctgtact tggggtagtc gtaggtgccg ttcttcacgc tttccatgca 540
ctcgttgttg cacttgtggt agaactcgaa gcagccgttg ccgatctctt tggcgttgtt 600
cttcagctgg gacttcactt tctcgtacag gttcttcacg ttgctgtcgt ggaagtccag 660
ggtccgctcg ttcagcagca ggaccagcag ttcagccaga tcggtgccgc tgccgccgga 720
gcccatcttc tcgatcacgc tgttcaccat gttggtgatg ccgttgatgg cgttctgggt 780
ggacttctgg tcggcggcgt agccgctgcc ctgctcgttc tggtggtggt agccgtacca 840
cccgtccacc atgccggtcc agccgccctc gataaagccg gcaatggcgc cgaacaggcc 900
ccgtgtctct ctctggggga tgttccgcag gcctgtcacc atccgcaggc cgctgcccag 960
gttcacgctg tgggtcacgg tcacgttctt ttccagcacg gtatccacgg tgtcggtgct 1020
gttgttggcg tggtagccga tgcagatggt gtcggcgtag gtggcggtga aggtgcacag 1080
gagcaccagc agcttggcct tcat 1104
<210> 202
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 202
atgaaggcca agctgctggt gctcctgtgc accttcaccg ccacctacgc cgacaccatc 60
tgcatcggct accacgccaa caacagcacc gacaccgtgg ataccgtgct ggaaaagaac 120
gtgaccgtga cccacagcgt gaacctgggc agcggcctgc ggatggtgac aggcctgcgg 180
aacatccccc agagagagac acggggcctg ttcggcgcca ttgccggctt tatcgagggc 240
ggctggaccg gcatggtgga cgggtggtac ggctaccacc accagaacga gcagggcagc 300
ggctacgccg ccgaccagaa gtccacccag aacgccatca acggcatcac caacatggtg 360
aacagcgtga tcgagaagat gggctccggc ggcagcggca ccgatctggc tgaactgctg 420
gtcctgctgc tgaacgagcg gaccctggac ttccacgaca gcaacgtgaa gaacctgtac 480
gagaaagtga agtcccagct gaagaacaac gccaaagaga tcggcaacgg ctgcttcgag 540
ttctaccaca agtgcaacaa cgagtgcatg gaaagcgtga agaacggcac ctacgactac 600
cccaagtaca gcgaggaaag caagctgaac cgcgagggaa gcggc 645
<210> 203
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 203
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Gly Ser Gly
210 215
<210> 204
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 204
gccgcttccc tcgcggttca gcttgctttc ctcgctgtac ttggggtagt cgtaggtgcc 60
gttcttcacg ctttccatgc actcgttgtt gcacttgtgg tagaactcga agcagccgtt 120
gccgatctct ttggcgttgt tcttcagctg ggacttcact ttctcgtaca ggttcttcac 180
gttgctgtcg tggaagtcca gggtccgctc gttcagcagc aggaccagca gttcagccag 240
atcggtgccg ctgccgccgg agcccatctt ctcgatcacg ctgttcacca tgttggtgat 300
gccgttgatg gcgttctggg tggacttctg gtcggcggcg tagccgctgc cctgctcgtt 360
ctggtggtgg tagccgtacc acccgtccac catgccggtc cagccgccct cgataaagcc 420
ggcaatggcg ccgaacaggc cccgtgtctc tctctggggg atgttccgca ggcctgtcac 480
catccgcagg ccgctgccca ggttcacgct gtgggtcacg gtcacgttct tttccagcac 540
ggtatccacg gtgtcggtgc tgttgttggc gtggtagccg atgcagatgg tgtcggcgta 600
ggtggcggtg aaggtgcaca ggagcaccag cagcttggcc ttcat 645
<210> 205
<211> 1107
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 205
atgaaggcca agctgctggt gctcctgtgc accttcaccg ccacctacgc cgacaccatc 60
tgcatcggct accacgccaa caacagcacc gacaccgtgg ataccgtgct ggaaaagaac 120
gtgaccgtga cccacagcgt gaacctgggc agcggcctgc ggatggtgac aggcctgcgg 180
aacatccccc agagagagac acggggcctg ttcggcgcca ttgccggctt tatcgagggc 240
ggctggaccg gcatggtgga cgggtggtac ggctaccacc accagaacga gcagggcagc 300
ggctacgccg ccgaccagaa gtccacccag aacgccatca acggcatcac caacatggtg 360
aacagcgtga tcgagaagat gggctccggc ggcagcggca ccgatctggc tgaactgctg 420
gtcctgctgc tgaacgagcg gaccctggac ttccacgaca gcaacgtgaa gaacctgtac 480
gagaaagtga agtcccagct gaagaacaac gccaaagaga tcggcaacgg ctgcttcgag 540
ttctaccaca agtgcaacaa cgagtgcatg gaaagcgtga agaacggcac ctacgactac 600
cccaagtaca gcgaggaaag caagctgaac cgcgagggaa gcggcatgca aatctacgag 660
ggcaagctga cagccgaggg cctgagattc ggcatcgtgg ccagccggtt caaccacgcc 720
ctggtggaca gactggtgga aggcgccatc gactgcatcg tgcggcacgg cggcagagaa 780
gaggacatca ccctggtccg cgtgcccggc agctgggaaa ttcctgtggc tgccggcgag 840
ctggcccgga aagaggatat cgacgccgtc atcgccatcg gcgtgctgat cagaggcgcc 900
accccccact tcgactatat cgccagcgag gtgtccaagg gcctggccaa cctgagcctg 960
gaactgcgga agcccatcac cttcggagtg atcaccgccg acaccctgga acaggccatc 1020
gagagagccg gcaccaagca cggcaacaag ggatgggaag ccgccctgag cgccatcgag 1080
atggccaatc tgttcaagag cctgcgc 1107
<210> 206
<211> 369
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 206
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Gly Ser Gly Met Gln Ile Tyr Glu Gly Lys Leu Thr
210 215 220
Ala Glu Gly Leu Arg Phe Gly Ile Val Ala Ser Arg Phe Asn His Ala
225 230 235 240
Leu Val Asp Arg Leu Val Glu Gly Ala Ile Asp Cys Ile Val Arg His
245 250 255
Gly Gly Arg Glu Glu Asp Ile Thr Leu Val Arg Val Pro Gly Ser Trp
260 265 270
Glu Ile Pro Val Ala Ala Gly Glu Leu Ala Arg Lys Glu Asp Ile Asp
275 280 285
Ala Val Ile Ala Ile Gly Val Leu Ile Arg Gly Ala Thr Pro His Phe
290 295 300
Asp Tyr Ile Ala Ser Glu Val Ser Lys Gly Leu Ala Asn Leu Ser Leu
305 310 315 320
Glu Leu Arg Lys Pro Ile Thr Phe Gly Val Ile Thr Ala Asp Thr Leu
325 330 335
Glu Gln Ala Ile Glu Arg Ala Gly Thr Lys His Gly Asn Lys Gly Trp
340 345 350
Glu Ala Ala Leu Ser Ala Ile Glu Met Ala Asn Leu Phe Lys Ser Leu
355 360 365
Arg
<210> 207
<211> 1107
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 207
gcgcaggctc ttgaacagat tggccatctc gatggcgctc agggcggctt cccatccctt 60
gttgccgtgc ttggtgccgg ctctctcgat ggcctgttcc agggtgtcgg cggtgatcac 120
tccgaaggtg atgggcttcc gcagttccag gctcaggttg gccaggccct tggacacctc 180
gctggcgata tagtcgaagt ggggggtggc gcctctgatc agcacgccga tggcgatgac 240
ggcgtcgata tcctctttcc gggccagctc gccggcagcc acaggaattt cccagctgcc 300
gggcacgcgg accagggtga tgtcctcttc tctgccgccg tgccgcacga tgcagtcgat 360
ggcgccttcc accagtctgt ccaccagggc gtggttgaac cggctggcca cgatgccgaa 420
tctcaggccc tcggctgtca gcttgccctc gtagatttgc atgccgcttc cctcgcggtt 480
cagcttgctt tcctcgctgt acttggggta gtcgtaggtg ccgttcttca cgctttccat 540
gcactcgttg ttgcacttgt ggtagaactc gaagcagccg ttgccgatct ctttggcgtt 600
gttcttcagc tgggacttca ctttctcgta caggttcttc acgttgctgt cgtggaagtc 660
cagggtccgc tcgttcagca gcaggaccag cagttcagcc agatcggtgc cgctgccgcc 720
ggagcccatc ttctcgatca cgctgttcac catgttggtg atgccgttga tggcgttctg 780
ggtggacttc tggtcggcgg cgtagccgct gccctgctcg ttctggtggt ggtagccgta 840
ccacccgtcc accatgccgg tccagccgcc ctcgataaag ccggcaatgg cgccgaacag 900
gccccgtgtc tctctctggg ggatgttccg caggcctgtc accatccgca ggccgctgcc 960
caggttcacg ctgtgggtca cggtcacgtt cttttccagc acggtatcca cggtgtcggt 1020
gctgttgttg gcgtggtagc cgatgcagat ggtgtcggcg taggtggcgg tgaaggtgca 1080
caggagcacc agcagcttgg ccttcat 1107
<210> 208
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 208
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccaa gcatccagag cagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420
gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645
<210> 209
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 209
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 210
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 210
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240
gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctctgctctg gatgcttggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 211
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 211
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccaa gcatccagag cagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420
gtgctgctgc tgaaccagtg gactctgctg ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660
atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720
atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780
gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840
cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900
tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960
cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020
cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080
gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140
agtgggtca 1149
<210> 212
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 212
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 213
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 213
tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60
atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120
ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180
aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240
tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300
ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360
ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420
actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480
cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540
atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600
atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660
gaccttctca tacagattct tcacgttgct atcgtggaac agcagagtcc actggttcag 720
cagcagcacc agcagctcag ccaggtctgt tccggagcct ccgctgccca ttttttcgat 780
gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840
ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900
tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctctgc tctggatgct 960
tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattca ctgagtgggt 1020
gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080
gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140
ggccttcat 1149
<210> 214
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 214
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asp Glu Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Tyr His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 215
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 215
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asp Glu Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Tyr His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 216
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 216
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcagcgga ggctccggaa catacaacgc tgagctgctg 420
gtgctgctgc tgaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645
<210> 217
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 217
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 218
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 218
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaaatcca gagtccgctc gttcagcagc agcaccagca gctcagcgtt 240
gtatgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 219
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 219
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca cccactcagt gaatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcagcgga ggctccggaa catacaacgc tgagctgctg 420
gtgctgctgc tgaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660
atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720
atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780
gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840
cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900
tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960
cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020
cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080
gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140
agtgggtca 1149
<210> 220
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 220
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 221
<211> 1151
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 221
rctgacccac tttttctgga cttggcaatg cccttcacat actgatctgc caggtacagg 60
ccatgattct cgtttccaat cagttcgatt ttatccagga tgtccttaaa caggacctcc 120
tcctcgtgct gctcggccac gtaccactgc agaaagttga aggtagcatg atctttgctc 180
ttaatggcgt ggtccacaat attgttgata gattcggaaa tatgctgctc gtgttcgtaa 240
gctttctgaa agatctgggt caggccctcg aacttatgtt caggggcgct gattgaagtc 300
agctggacgg gcacattgtt ctcattcagg aaaatgatca gtttctttgc atgttcgtat 360
tcctcggctg cgtgatcaaa caggaacagc ccagcgccgt ccagtgagtg tgtataacac 420
caactagaca tactcatgta caggttggag ctctgcatct ccttgttcac ctgttcgttc 480
agcagcttga tgatgtcgcc cccactgtca attttctctc gattcagctt actctcttca 540
gaatatttgg gatagtcgta agtgccgttc ttcacagact ccatacattc attgttgcac 600
ttatggtaaa actcgaagca tccattcccg atttctttgg cattgttctt cagctgggat 660
ttgaccttct catacagatt cttcacgttg ctatcgtgga aatccagagt ccgctcgttc 720
agcagcagca ccagcagctc agcgttgtat gttccggagc ctccgctgcc cattttttcg 780
atgacagaat tcaccatgtt agtaatgcca ttgattgcgt tctgtgtaga cttctgatca 840
gcggcgtagc cgctgccctg ctcattctga tggtggtagc cgtaccaccc gtccaccatt 900
cctgtccacc cgccctcaat aaaccctgcg atagcgccga acagtcctct tgtttcccgc 960
tgtgggatgt tgcgcagtcc ggtgaccatc ctcagtccgc tgcccagatt cactgagtgg 1020
gtgacagtca cgttcttctc caggacggta tccactgtgt cggtggagtt gtttgcgtga 1080
tagccgatgc agatagtgtc agcgtaggtt gcggtaaaag tacacagcag gaccagcagt 1140
ttggccttca t 1151
<210> 222
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 222
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 223
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 223
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 224
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 224
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Ile Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 225
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 225
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Ile Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 226
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 226
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 227
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 227
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 228
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 228
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 229
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 229
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 230
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 230
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Met
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 231
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 231
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Met
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 232
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 232
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Gln Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Gln
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 233
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 233
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Gln Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Gln
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 234
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 234
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca ccaactcaac taatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420
gtgctgctgc tgaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgac 645
<210> 235
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 235
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr Asn Ser Thr Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 236
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 236
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaaatcca gagtccgctc gttcagcagc agcaccagca gctcagccag 240
gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattagttga gttggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 237
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 237
atgaaggcca aactgctggt cctgctgtgt acttttaccg caacctacgc tgacactatc 60
tgcatcggct atcacgcaaa caactccacc gacacagtgg ataccgtcct ggagaagaac 120
gtgactgtca ccaactcaac taatctgggc agcggactga ggatggtcac cggactgcgc 180
aacatcccac agcgggaaac aagaggactg ttcggcgcta tcgcagggtt tattgagggc 240
gggtggacag gaatggtgga cgggtggtac ggctaccacc atcagaatga gcagggcagc 300
ggctacgccg ctgatcagaa gtctacacag aacgcaatca atggcattac taacatggtg 360
aattctgtca tcgaaaaaat gggcagcgga ggctccggaa cagacctggc tgagctgctg 420
gtgctgctgc tgaacgagcg gactctggat ttccacgata gcaacgtgaa gaatctgtat 480
gagaaggtca aatcccagct gaagaacaat gccaaagaaa tcgggaatgg atgcttcgag 540
ttttaccata agtgcaacaa tgaatgtatg gagtctgtga agaacggcac ttacgactat 600
cccaaatatt ctgaagagag taagctgaat cgagagaaaa ttgacagtgg gggcgacatc 660
atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga gctccaacct gtacatgagt 720
atgtctagtt ggtgttatac acactcactg gacggcgctg ggctgttcct gtttgatcac 780
gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt tcctgaatga gaacaatgtg 840
cccgtccagc tgacttcaat cagcgcccct gaacataagt tcgagggcct gacccagatc 900
tttcagaaag cttacgaaca cgagcagcat atttccgaat ctatcaacaa tattgtggac 960
cacgccatta agagcaaaga tcatgctacc ttcaactttc tgcagtggta cgtggccgag 1020
cagcacgagg aggaggtcct gtttaaggac atcctggata aaatcgaact gattggaaac 1080
gagaatcatg gcctgtacct ggcagatcag tatgtgaagg gcattgccaa gtccagaaaa 1140
agtgggtca 1149
<210> 238
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 238
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr Asn Ser Thr Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 239
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 239
tgacccactt tttctggact tggcaatgcc cttcacatac tgatctgcca ggtacaggcc 60
atgattctcg tttccaatca gttcgatttt atccaggatg tccttaaaca ggacctcctc 120
ctcgtgctgc tcggccacgt accactgcag aaagttgaag gtagcatgat ctttgctctt 180
aatggcgtgg tccacaatat tgttgataga ttcggaaata tgctgctcgt gttcgtaagc 240
tttctgaaag atctgggtca ggccctcgaa cttatgttca ggggcgctga ttgaagtcag 300
ctggacgggc acattgttct cattcaggaa aatgatcagt ttctttgcat gttcgtattc 360
ctcggctgcg tgatcaaaca ggaacagccc agcgccgtcc agtgagtgtg tataacacca 420
actagacata ctcatgtaca ggttggagct ctgcatctcc ttgttcacct gttcgttcag 480
cagcttgatg atgtcgcccc cactgtcaat tttctctcga ttcagcttac tctcttcaga 540
atatttggga tagtcgtaag tgccgttctt cacagactcc atacattcat tgttgcactt 600
atggtaaaac tcgaagcatc cattcccgat ttctttggca ttgttcttca gctgggattt 660
gaccttctca tacagattct tcacgttgct atcgtggaaa tccagagtcc gctcgttcag 720
cagcagcacc agcagctcag ccaggtctgt tccggagcct ccgctgccca ttttttcgat 780
gacagaattc accatgttag taatgccatt gattgcgttc tgtgtagact tctgatcagc 840
ggcgtagccg ctgccctgct cattctgatg gtggtagccg taccacccgt ccaccattcc 900
tgtccacccg ccctcaataa accctgcgat agcgccgaac agtcctcttg tttcccgctg 960
tgggatgttg cgcagtccgg tgaccatcct cagtccgctg cccagattag ttgagttggt 1020
gacagtcacg ttcttctcca ggacggtatc cactgtgtcg gtggagttgt ttgcgtgata 1080
gccgatgcag atagtgtcag cgtaggttgc ggtaaaagta cacagcagga ccagcagttt 1140
ggccttcat 1149
<210> 240
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 240
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Ile Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Met Leu
130 135 140
Asn Gln Phe Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 241
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 241
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Ile Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Met Leu
130 135 140
Asn Gln Phe Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 242
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 242
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Asn Gly Thr Gly Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 243
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 243
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Asn Gly Thr Gly Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 244
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 244
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 245
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 245
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 246
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 246
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Asn Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 247
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 247
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Asn Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 248
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 248
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn
85 90 95
Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 249
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 249
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn
85 90 95
Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Asn Ser Thr Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 250
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 250
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn
85 90 95
Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 251
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 251
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn
85 90 95
Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Asn Ser Thr Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Asn Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 252
<211> 214
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 252
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Gly Gly
210
<210> 253
<211> 368
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 253
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Gly Gly Met Gln Ile Tyr Glu Gly Lys Leu Thr Ala
210 215 220
Glu Gly Leu Arg Phe Gly Ile Val Ala Ser Arg Phe Asn His Ala Leu
225 230 235 240
Val Asp Arg Leu Val Glu Gly Ala Ile Asp Cys Ile Val Arg His Gly
245 250 255
Gly Arg Glu Glu Asp Ile Thr Leu Val Arg Val Pro Gly Ser Trp Glu
260 265 270
Ile Pro Val Ala Ala Gly Glu Leu Ala Arg Lys Glu Asp Ile Asp Ala
275 280 285
Val Ile Ala Ile Gly Val Leu Ile Arg Gly Ala Thr Pro His Phe Asp
290 295 300
Tyr Ile Ala Ser Glu Val Ser Lys Gly Leu Ala Asn Leu Ser Leu Glu
305 310 315 320
Leu Arg Lys Pro Ile Thr Phe Gly Val Ile Thr Ala Asp Thr Leu Glu
325 330 335
Gln Ala Ile Glu Arg Ala Gly Thr Lys His Gly Asn Lys Gly Trp Glu
340 345 350
Ala Ala Leu Ser Ala Ile Glu Met Ala Asn Leu Phe Lys Ser Leu Arg
355 360 365
<210> 254
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 254
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Gly Ser Gly
210 215
<210> 255
<211> 369
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 255
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Gly Ser Gly Met Gln Ile Tyr Glu Gly Lys Leu Thr
210 215 220
Ala Glu Gly Leu Arg Phe Gly Ile Val Ala Ser Arg Phe Asn His Ala
225 230 235 240
Leu Val Asp Arg Leu Val Glu Gly Ala Ile Asp Cys Ile Val Arg His
245 250 255
Gly Gly Arg Glu Glu Asp Ile Thr Leu Val Arg Val Pro Gly Ser Trp
260 265 270
Glu Ile Pro Val Ala Ala Gly Glu Leu Ala Arg Lys Glu Asp Ile Asp
275 280 285
Ala Val Ile Ala Ile Gly Val Leu Ile Arg Gly Ala Thr Pro His Phe
290 295 300
Asp Tyr Ile Ala Ser Glu Val Ser Lys Gly Leu Ala Asn Leu Ser Leu
305 310 315 320
Glu Leu Arg Lys Pro Ile Thr Phe Gly Val Ile Thr Ala Asp Thr Leu
325 330 335
Glu Gln Ala Ile Glu Arg Ala Gly Thr Lys His Gly Asn Lys Gly Trp
340 345 350
Glu Ala Ala Leu Ser Ala Ile Glu Met Ala Asn Leu Phe Lys Ser Leu
355 360 365
Arg
<210> 256
<211> 211
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 256
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Gly Gly
210
<210> 257
<211> 364
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 257
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Gly Gly Gln Ile Tyr Glu Gly Lys Leu Thr Ala Glu Gly Leu Arg
210 215 220
Phe Gly Ile Val Ala Ser Arg Phe Asn His Ala Leu Val Asp Arg Leu
225 230 235 240
Val Glu Gly Ala Ile Asp Cys Ile Val Arg His Gly Gly Arg Glu Glu
245 250 255
Asp Ile Thr Leu Val Arg Val Pro Gly Ser Trp Glu Ile Pro Val Ala
260 265 270
Ala Gly Glu Leu Ala Arg Lys Glu Asp Ile Asp Ala Val Ile Ala Ile
275 280 285
Gly Val Leu Ile Arg Gly Ala Thr Pro His Phe Asp Tyr Ile Ala Ser
290 295 300
Glu Val Ser Lys Gly Leu Ala Asn Leu Ser Leu Glu Leu Arg Lys Pro
305 310 315 320
Ile Thr Phe Gly Val Ile Thr Ala Asp Thr Leu Glu Gln Ala Ile Glu
325 330 335
Arg Ala Gly Thr Lys His Gly Asn Lys Gly Trp Glu Ala Ala Leu Ser
340 345 350
Ala Ile Glu Met Ala Asn Leu Phe Lys Ser Leu Arg
355 360
<210> 258
<211> 212
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 258
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Gly Ser Gly
210
<210> 259
<211> 365
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 259
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Ser
50 55 60
Ile Gln Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Gly Ser Gly Gln Ile Tyr Glu Gly Lys Leu Thr Ala Glu Gly Leu
210 215 220
Arg Phe Gly Ile Val Ala Ser Arg Phe Asn His Ala Leu Val Asp Arg
225 230 235 240
Leu Val Glu Gly Ala Ile Asp Cys Ile Val Arg His Gly Gly Arg Glu
245 250 255
Glu Asp Ile Thr Leu Val Arg Val Pro Gly Ser Trp Glu Ile Pro Val
260 265 270
Ala Ala Gly Glu Leu Ala Arg Lys Glu Asp Ile Asp Ala Val Ile Ala
275 280 285
Ile Gly Val Leu Ile Arg Gly Ala Thr Pro His Phe Asp Tyr Ile Ala
290 295 300
Ser Glu Val Ser Lys Gly Leu Ala Asn Leu Ser Leu Glu Leu Arg Lys
305 310 315 320
Pro Ile Thr Phe Gly Val Ile Thr Ala Asp Thr Leu Glu Gln Ala Ile
325 330 335
Glu Arg Ala Gly Thr Lys His Gly Asn Lys Gly Trp Glu Ala Ala Leu
340 345 350
Ser Ala Ile Glu Met Ala Asn Leu Phe Lys Ser Leu Arg
355 360 365
<210> 260
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(645)
<400> 260
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg 432
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac 645
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 261
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 261
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 262
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 262
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240
gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 263
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(1155)
<400> 263
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg 432
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
tga 1155
<210> 264
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 264
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 265
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 265
tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60
caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120
ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180
gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240
gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300
agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360
gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420
acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480
gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540
ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600
gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660
ggatttgacc ttctcataca gattcttcac gttgctatcg tggaacagca gagtccactg 720
gttcagcagc agcaccagca gctcagccag gtctgttccg gagcctccgc tgcccatttt 780
ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840
atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900
cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960
ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020
gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080
gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140
cagtttggcc ttcat 1155
<210> 266
<211> 5579
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 266
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960
tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020
cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080
ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140
ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200
accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320
ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380
atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440
ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500
tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560
ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620
ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680
agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740
ctaacatggt gaattctgtc atcgaaaaaa tgggcagcgg aggctccgga acagacctgg 1800
ctgagctgct ggtgctgctg ctgaaccagt ggactctgct gttccacgat agcaacgtga 1860
agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920
gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980
cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040
ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100
tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160
tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220
agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280
tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340
atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400
acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460
tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520
agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760
aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820
tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880
ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940
tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000
caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060
gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120
atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240
cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300
accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360
acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420
cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540
atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600
agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660
acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720
gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840
gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900
gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960
acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020
tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080
ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140
catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200
ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260
ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320
cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380
gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440
ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500
catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560
ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620
tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680
atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740
gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800
gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860
attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920
ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980
ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040
aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100
tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160
cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220
gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280
ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340
ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400
ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460
aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520
aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579
<210> 267
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(645)
<400> 267
atg aag gca atc ctg gtc gtc ctg ctg tat act ttc gct acc gct aac 48
Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Ala Thr Ala Asn
1 5 10 15
gct gac acc ctg tgc atc ggc tat cac gct aac aac tca acc gac aca 96
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat act gtc ctg gag aag aac gtg act gtc acc cac tct gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agt gga ctg agg ctg gca act gga ctg cga aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa acc aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aac 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag gga tca ggc tac gcc gct gac ctg aag agc aca cag aat gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala
100 105 110
atc gat gaa att act aac atg gtg aat tcc gtc atc gag aaa atg ggc 384
Ile Asp Glu Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga acc gac ctg gca gaa ctg ctg gtg ctg ctg ctg 432
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac cag tgg aca ctg ctg tac cac gat agt aac gtg aag aat ctg tat 480
Asn Gln Trp Thr Leu Leu Tyr His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aaa gtc cga tca cag ctg aag aac aat gct aaa gaa atc ggg aat 528
Glu Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc gac aac acc tgt atg gag agc 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser
180 185 190
gtg aaa aat ggc aca tac gat tat ccc aag tat tcc gag gaa gcc aaa 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys
195 200 205
ctg aac aga gag gaa att gac 645
Leu Asn Arg Glu Glu Ile Asp
210 215
<210> 268
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 268
Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Ala Thr Ala Asn
1 5 10 15
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asp Glu Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Tyr His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys
195 200 205
Leu Asn Arg Glu Glu Ile Asp
210 215
<210> 269
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 269
gtcaatttcc tctctgttca gtttggcttc ctcggaatac ttgggataat cgtatgtgcc 60
atttttcacg ctctccatac aggtgttgtc gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttagcattgt tcttcagctg tgatcggact ttctcataca gattcttcac 180
gttactatcg tggtacagca gtgtccactg gttcagcagc agcaccagca gttctgccag 240
gtcggttccg gagcctccgc tgcccatttt ctcgatgacg gaattcacca tgttagtaat 300
ttcatcgatt gcattctgtg tgctcttcag gtcagcggcg tagcctgatc cctgctcgtt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctctggtttc ccgctgtggg atgtttcgca gtccagttgc 480
cagcctcagt ccactgccca gattcacaga gtgggtgaca gtcacgttct tctccaggac 540
agtatccact gtgtcggttg agttgttagc gtgatagccg atgcacaggg tgtcagcgtt 600
agcggtagcg aaagtataca gcaggacgac caggattgcc ttcat 645
<210> 270
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(1155)
<400> 270
atg aag gca atc ctg gtc gtc ctg ctg tat act ttc gct acc gct aac 48
Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Ala Thr Ala Asn
1 5 10 15
gct gac acc ctg tgc atc ggc tat cac gct aac aac tca acc gac aca 96
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat act gtc ctg gag aag aac gtg act gtc acc cac tct gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agt gga ctg agg ctg gca act gga ctg cga aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa acc aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aac 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag gga tca ggc tac gcc gct gac ctg aag agc aca cag aat gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala
100 105 110
atc gat gaa att act aac atg gtg aat tcc gtc atc gag aaa atg ggc 384
Ile Asp Glu Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga acc gac ctg gca gaa ctg ctg gtg ctg ctg ctg 432
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac cag tgg aca ctg ctg tac cac gat agt aac gtg aag aat ctg tat 480
Asn Gln Trp Thr Leu Leu Tyr His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aaa gtc cga tca cag ctg aag aac aat gct aaa gaa atc ggg aat 528
Glu Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc gac aac acc tgt atg gag agc 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser
180 185 190
gtg aaa aat ggc aca tac gat tat ccc aag tat tcc gag gaa gcc aaa 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys
195 200 205
ctg aac aga gag gaa att gac tct ggg ggc gac atc atc aag ctg ctg 672
Leu Asn Arg Glu Glu Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
aac gaa cag gtc aac aag gag atg cag agc tcc aat ctg tac atg tcc 720
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
atg tct agt tgg tgt tat acc cac tct ctg gac ggc gct ggg ctg ttc 768
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
att ttc ctg aac gag aac aat gtg ccc gtc cag ctg aca tca atc agc 864
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
gcc cct gaa cat aag ttc gag ggc ctg act cag atc ttt cag aaa gct 912
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
cac gcc att aag agt aaa gat cat gct acc ttc aat ttt ctg cag tgg 1008
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
gat cag tat gtg aag ggc att gcc aag agc cgg aaa agt ggg tca tga 1152
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
tga 1155
<210> 271
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 271
Met Lys Ala Ile Leu Val Val Leu Leu Tyr Thr Phe Ala Thr Ala Asn
1 5 10 15
Ala Asp Thr Leu Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Leu Ala Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Leu Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asp Glu Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Tyr His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Arg Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Thr Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ala Lys
195 200 205
Leu Asn Arg Glu Glu Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 272
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 272
tcatcatgac ccacttttcc ggctcttggc aatgcccttc acatactgat ctgccaggta 60
caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120
ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaaa ttgaaggtag catgatcttt 180
actcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240
gtaagctttc tgaaagatct gagtcaggcc ctcgaactta tgttcagggg cgctgattga 300
tgtcagctgg acgggcacat tgttctcgtt caggaaaatg atcagtttct ttgcatgttc 360
gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagag agtgggtata 420
acaccaacta gacatggaca tgtacagatt ggagctctgc atctccttgt tgacctgttc 480
gttcagcagc ttgatgatgt cgcccccaga gtcaatttcc tctctgttca gtttggcttc 540
ctcggaatac ttgggataat cgtatgtgcc atttttcacg ctctccatac aggtgttgtc 600
gcacttatgg taaaactcga agcatccatt cccgatttct ttagcattgt tcttcagctg 660
tgatcggact ttctcataca gattcttcac gttactatcg tggtacagca gtgtccactg 720
gttcagcagc agcaccagca gttctgccag gtcggttccg gagcctccgc tgcccatttt 780
ctcgatgacg gaattcacca tgttagtaat ttcatcgatt gcattctgtg tgctcttcag 840
gtcagcggcg tagcctgatc cctgctcgtt ctgatggtgg tagccgtacc acccgtccac 900
cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctctggtttc 960
ccgctgtggg atgtttcgca gtccagttgc cagcctcagt ccactgccca gattcacaga 1020
gtgggtgaca gtcacgttct tctccaggac agtatccact gtgtcggttg agttgttagc 1080
gtgatagccg atgcacaggg tgtcagcgtt agcggtagcg aaagtataca gcaggacgac 1140
caggattgcc ttcat 1155
<210> 273
<211> 5579
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 273
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960
tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020
cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080
ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140
ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200
accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320
ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380
atatcgccac catgaaggca atcctggtcg tcctgctgta tactttcgct accgctaacg 1440
ctgacaccct gtgcatcggc tatcacgcta acaactcaac cgacacagtg gatactgtcc 1500
tggagaagaa cgtgactgtc acccactctg tgaatctggg cagtggactg aggctggcaa 1560
ctggactgcg aaacatccca cagcgggaaa ccagaggact gttcggcgct atcgcagggt 1620
ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaacg 1680
agcagggatc aggctacgcc gctgacctga agagcacaca gaatgcaatc gatgaaatta 1740
ctaacatggt gaattccgtc atcgagaaaa tgggcagcgg aggctccgga accgacctgg 1800
cagaactgct ggtgctgctg ctgaaccagt ggacactgct gtaccacgat agtaacgtga 1860
agaatctgta tgagaaagtc cgatcacagc tgaagaacaa tgctaaagaa atcgggaatg 1920
gatgcttcga gttttaccat aagtgcgaca acacctgtat ggagagcgtg aaaaatggca 1980
catacgatta tcccaagtat tccgaggaag ccaaactgaa cagagaggaa attgactctg 2040
ggggcgacat catcaagctg ctgaacgaac aggtcaacaa ggagatgcag agctccaatc 2100
tgtacatgtc catgtctagt tggtgttata cccactctct ggacggcgct gggctgttcc 2160
tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaacg 2220
agaacaatgt gcccgtccag ctgacatcaa tcagcgcccc tgaacataag ttcgagggcc 2280
tgactcagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340
atattgtgga ccacgccatt aagagtaaag atcatgctac cttcaatttt ctgcagtggt 2400
acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460
tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520
agagccggaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760
aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820
tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880
ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940
tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000
caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060
gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120
atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240
cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300
accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360
acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420
cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540
atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600
agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660
acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720
gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840
gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900
gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960
acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020
tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080
ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140
catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200
ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260
ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320
cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380
gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440
ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500
catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560
ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620
tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680
atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740
gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800
gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860
attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920
ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980
ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040
aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100
tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160
cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220
gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280
ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340
ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400
ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460
aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520
aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579
<210> 274
<211> 639
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(639)
<400> 274
atg gct atc atc tac ctg atc ctg ctg ttc act gct gtg cgg ggg gac 48
Met Ala Ile Ile Tyr Leu Ile Leu Leu Phe Thr Ala Val Arg Gly Asp
1 5 10 15
cag att tgc atc ggc tac cac gct aat aat tca act gag aag gtg gat 96
Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Lys Val Asp
20 25 30
act atc ctg gag cgg aac gtg acc gtc aca cac gct aaa gac att ggc 144
Thr Ile Leu Glu Arg Asn Val Thr Val Thr His Ala Lys Asp Ile Gly
35 40 45
agc gga ctg gtg ctg gca acc gga ctg agg aat gtc cca cag atc gag 192
Ser Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln Ile Glu
50 55 60
tcc cgc gga ctg ttc ggc gct atc gca ggg ttt att gaa ggc ggg tgg 240
Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp
65 70 75 80
cag gga atg att gat ggg tgg tac ggc tac cac cat tct aac gac caa 288
Gln Gly Met Ile Asp Gly Trp Tyr Gly Tyr His His Ser Asn Asp Gln
85 90 95
gga agt ggc tac gcc gct gat aag gag agt act cag aaa gcc ttc gat 336
Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala Phe Asp
100 105 110
ggc atc acc aac atg gtg aat tca gtc att gag aag atg ggc agc gga 384
Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser Gly
115 120 125
ggc tcc gga acc gac ctg gca gaa ctg ctg gtg ctg ctg ctg aat cag 432
Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn Gln
130 135 140
tgg aca ctg ctg ttt cac gac tct aac gtg aag aat ctg tat gat aaa 480
Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys
145 150 155 160
gtc cgg atg cag ctg aga gac aac gtg aag gag ctg ggg aat gga tgc 528
Val Arg Met Gln Leu Arg Asp Asn Val Lys Glu Leu Gly Asn Gly Cys
165 170 175
ttc gaa ttt tac cat aag tgc gac gat gag tgt atg aac agt gtc aaa 576
Phe Glu Phe Tyr His Lys Cys Asp Asp Glu Cys Met Asn Ser Val Lys
180 185 190
aat ggc aca tac gat tat ccc aag tat gag gaa gag tca aaa ctg aac 624
Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Glu Glu Glu Ser Lys Leu Asn
195 200 205
cga aat gaa atc aag 639
Arg Asn Glu Ile Lys
210
<210> 275
<211> 213
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 275
Met Ala Ile Ile Tyr Leu Ile Leu Leu Phe Thr Ala Val Arg Gly Asp
1 5 10 15
Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Lys Val Asp
20 25 30
Thr Ile Leu Glu Arg Asn Val Thr Val Thr His Ala Lys Asp Ile Gly
35 40 45
Ser Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln Ile Glu
50 55 60
Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp
65 70 75 80
Gln Gly Met Ile Asp Gly Trp Tyr Gly Tyr His His Ser Asn Asp Gln
85 90 95
Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala Phe Asp
100 105 110
Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser Gly
115 120 125
Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn Gln
130 135 140
Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys
145 150 155 160
Val Arg Met Gln Leu Arg Asp Asn Val Lys Glu Leu Gly Asn Gly Cys
165 170 175
Phe Glu Phe Tyr His Lys Cys Asp Asp Glu Cys Met Asn Ser Val Lys
180 185 190
Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Glu Glu Glu Ser Lys Leu Asn
195 200 205
Arg Asn Glu Ile Lys
210
<210> 276
<211> 639
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 276
cttgatttca tttcggttca gttttgactc ttcctcatac ttgggataat cgtatgtgcc 60
atttttgaca ctgttcatac actcatcgtc gcacttatgg taaaattcga agcatccatt 120
ccccagctcc ttcacgttgt ctctcagctg catccggact ttatcataca gattcttcac 180
gttagagtcg tgaaacagca gtgtccactg attcagcagc agcaccagca gttctgccag 240
gtcggttccg gagcctccgc tgcccatctt ctcaatgact gaattcacca tgttggtgat 300
gccatcgaag gctttctgag tactctcctt atcagcggcg tagccacttc cttggtcgtt 360
agaatggtgg tagccgtacc acccatcaat cattccctgc cacccgcctt caataaaccc 420
tgcgatagcg ccgaacagtc cgcgggactc gatctgtggg acattcctca gtccggttgc 480
cagcaccagt ccgctgccaa tgtctttagc gtgtgtgacg gtcacgttcc gctccaggat 540
agtatccacc ttctcagttg aattattagc gtggtagccg atgcaaatct ggtccccccg 600
cacagcagtg aacagcagga tcaggtagat gatagccat 639
<210> 277
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(1149)
<400> 277
atg gct atc atc tac ctg atc ctg ctg ttc act gct gtg cgg ggg gac 48
Met Ala Ile Ile Tyr Leu Ile Leu Leu Phe Thr Ala Val Arg Gly Asp
1 5 10 15
cag att tgc atc ggc tac cac gct aat aat tca act gag aag gtg gat 96
Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Lys Val Asp
20 25 30
act atc ctg gag cgg aac gtg acc gtc aca cac gct aaa gac att ggc 144
Thr Ile Leu Glu Arg Asn Val Thr Val Thr His Ala Lys Asp Ile Gly
35 40 45
agc gga ctg gtg ctg gca acc gga ctg agg aat gtc cca cag atc gag 192
Ser Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln Ile Glu
50 55 60
tcc cgc gga ctg ttc ggc gct atc gca ggg ttt att gaa ggc ggg tgg 240
Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp
65 70 75 80
cag gga atg att gat ggg tgg tac ggc tac cac cat tct aac gac caa 288
Gln Gly Met Ile Asp Gly Trp Tyr Gly Tyr His His Ser Asn Asp Gln
85 90 95
gga agt ggc tac gcc gct gat aag gag agt act cag aaa gcc ttc gat 336
Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala Phe Asp
100 105 110
ggc atc acc aac atg gtg aat tca gtc att gag aag atg ggc agc gga 384
Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser Gly
115 120 125
ggc tcc gga acc gac ctg gca gaa ctg ctg gtg ctg ctg ctg aat cag 432
Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn Gln
130 135 140
tgg aca ctg ctg ttt cac gac tct aac gtg aag aat ctg tat gat aaa 480
Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys
145 150 155 160
gtc cgg atg cag ctg aga gac aac gtg aag gag ctg ggg aat gga tgc 528
Val Arg Met Gln Leu Arg Asp Asn Val Lys Glu Leu Gly Asn Gly Cys
165 170 175
ttc gaa ttt tac cat aag tgc gac gat gag tgt atg aac agt gtc aaa 576
Phe Glu Phe Tyr His Lys Cys Asp Asp Glu Cys Met Asn Ser Val Lys
180 185 190
aat ggc aca tac gat tat ccc aag tat gag gaa gag tca aaa ctg aac 624
Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Glu Glu Glu Ser Lys Leu Asn
195 200 205
cga aat gaa atc aag agc ggg ggc gac atc atc aag ctg ctg aac gag 672
Arg Asn Glu Ile Lys Ser Gly Gly Asp Ile Ile Lys Leu Leu Asn Glu
210 215 220
caa gtg aat aag gaa atg cag agc tcc aac ctg tac atg tcc atg tct 720
Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser Met Ser
225 230 235 240
agt tgg tgt tat act cac tct ctg gat ggc gcc ggg ctg ttc ctg ttt 768
Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe Leu Phe
245 250 255
gac cac gca gcc gaa gag tac gag cat gct aag aaa ctg atc att ttc 816
Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile Ile Phe
260 265 270
ctg aac gaa aac aac gtg ccc gtc cag ctg aca tca atc agc gca cct 864
Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser Ala Pro
275 280 285
gag cat aag ttc gaa ggc ctg act cag atc ttt cag aaa gct tac gag 912
Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala Tyr Glu
290 295 300
cac gaa cag cat att tcc gag tct atc aac aat att gtg gac cac gcc 960
His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp His Ala
305 310 315 320
atc aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg tac gtg 1008
Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp Tyr Val
325 330 335
gcc gag cag cac gaa gag gaa gtc ctg ttt aag gac atc ctg gat aaa 1056
Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu Asp Lys
340 345 350
atc gag ctg att gga aac gaa aat cat ggc ctg tac ctg gca gac cag 1104
Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala Asp Gln
355 360 365
tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga tga 1149
Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 278
<211> 381
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 278
Met Ala Ile Ile Tyr Leu Ile Leu Leu Phe Thr Ala Val Arg Gly Asp
1 5 10 15
Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Lys Val Asp
20 25 30
Thr Ile Leu Glu Arg Asn Val Thr Val Thr His Ala Lys Asp Ile Gly
35 40 45
Ser Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Val Pro Gln Ile Glu
50 55 60
Ser Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly Gly Trp
65 70 75 80
Gln Gly Met Ile Asp Gly Trp Tyr Gly Tyr His His Ser Asn Asp Gln
85 90 95
Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr Gln Lys Ala Phe Asp
100 105 110
Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser Gly
115 120 125
Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn Gln
130 135 140
Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Asp Lys
145 150 155 160
Val Arg Met Gln Leu Arg Asp Asn Val Lys Glu Leu Gly Asn Gly Cys
165 170 175
Phe Glu Phe Tyr His Lys Cys Asp Asp Glu Cys Met Asn Ser Val Lys
180 185 190
Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Glu Glu Glu Ser Lys Leu Asn
195 200 205
Arg Asn Glu Ile Lys Ser Gly Gly Asp Ile Ile Lys Leu Leu Asn Glu
210 215 220
Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser Met Ser
225 230 235 240
Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe Leu Phe
245 250 255
Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile Ile Phe
260 265 270
Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser Ala Pro
275 280 285
Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala Tyr Glu
290 295 300
His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp His Ala
305 310 315 320
Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp Tyr Val
325 330 335
Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu Asp Lys
340 345 350
Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala Asp Gln
355 360 365
Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 279
<211> 1149
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 279
tcatcatgac ccactttttc tggacttggc aatgcccttc acatactggt ctgccaggta 60
caggccatga ttttcgtttc caatcagctc gattttatcc aggatgtcct taaacaggac 120
ttcctcttcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180
gctcttgatg gcgtggtcca caatattgtt gatagactcg gaaatatgct gttcgtgctc 240
gtaagctttc tgaaagatct gagtcaggcc ttcgaactta tgctcaggtg cgctgattga 300
tgtcagctgg acgggcacgt tgttttcgtt caggaaaatg atcagtttct tagcatgctc 360
gtactcttcg gctgcgtggt caaacaggaa cagcccggcg ccatccagag agtgagtata 420
acaccaacta gacatggaca tgtacaggtt ggagctctgc atttccttat tcacttgctc 480
gttcagcagc ttgatgatgt cgcccccgct cttgatttca tttcggttca gttttgactc 540
ttcctcatac ttgggataat cgtatgtgcc atttttgaca ctgttcatac actcatcgtc 600
gcacttatgg taaaattcga agcatccatt ccccagctcc ttcacgttgt ctctcagctg 660
catccggact ttatcataca gattcttcac gttagagtcg tgaaacagca gtgtccactg 720
attcagcagc agcaccagca gttctgccag gtcggttccg gagcctccgc tgcccatctt 780
ctcaatgact gaattcacca tgttggtgat gccatcgaag gctttctgag tactctcctt 840
atcagcggcg tagccacttc cttggtcgtt agaatggtgg tagccgtacc acccatcaat 900
cattccctgc cacccgcctt caataaaccc tgcgatagcg ccgaacagtc cgcgggactc 960
gatctgtggg acattcctca gtccggttgc cagcaccagt ccgctgccaa tgtctttagc 1020
gtgtgtgacg gtcacgttcc gctccaggat agtatccacc ttctcagttg aattattagc 1080
gtggtagccg atgcaaatct ggtccccccg cacagcagtg aacagcagga tcaggtagat 1140
gatagccat 1149
<210> 280
<211> 5573
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 280
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960
tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020
cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080
ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140
ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200
accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320
ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380
atatcgccac catggctatc atctacctga tcctgctgtt cactgctgtg cggggggacc 1440
agatttgcat cggctaccac gctaataatt caactgagaa ggtggatact atcctggagc 1500
ggaacgtgac cgtcacacac gctaaagaca ttggcagcgg actggtgctg gcaaccggac 1560
tgaggaatgt cccacagatc gagtcccgcg gactgttcgg cgctatcgca gggtttattg 1620
aaggcgggtg gcagggaatg attgatgggt ggtacggcta ccaccattct aacgaccaag 1680
gaagtggcta cgccgctgat aaggagagta ctcagaaagc cttcgatggc atcaccaaca 1740
tggtgaattc agtcattgag aagatgggca gcggaggctc cggaaccgac ctggcagaac 1800
tgctggtgct gctgctgaat cagtggacac tgctgtttca cgactctaac gtgaagaatc 1860
tgtatgataa agtccggatg cagctgagag acaacgtgaa ggagctgggg aatggatgct 1920
tcgaatttta ccataagtgc gacgatgagt gtatgaacag tgtcaaaaat ggcacatacg 1980
attatcccaa gtatgaggaa gagtcaaaac tgaaccgaaa tgaaatcaag agcgggggcg 2040
acatcatcaa gctgctgaac gagcaagtga ataaggaaat gcagagctcc aacctgtaca 2100
tgtccatgtc tagttggtgt tatactcact ctctggatgg cgccgggctg ttcctgtttg 2160
accacgcagc cgaagagtac gagcatgcta agaaactgat cattttcctg aacgaaaaca 2220
acgtgcccgt ccagctgaca tcaatcagcg cacctgagca taagttcgaa ggcctgactc 2280
agatctttca gaaagcttac gagcacgaac agcatatttc cgagtctatc aacaatattg 2340
tggaccacgc catcaagagc aaagatcatg ctaccttcaa ctttctgcag tggtacgtgg 2400
ccgagcagca cgaagaggaa gtcctgttta aggacatcct ggataaaatc gagctgattg 2460
gaaacgaaaa tcatggcctg tacctggcag accagtatgt gaagggcatt gccaagtcca 2520
gaaaaagtgg gtcatgatga acacgtggga tccagatctg ctgtgccttc tagttgccag 2580
ccatctgttg tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc cactcccact 2640
gtcctttcct aataaaatga ggaaattgca tcgcattgtc tgagtaggtg tcattctatt 2700
ctggggggtg gggtggggca ggacagcaag ggggaggatt gggaagacaa tagcaggcat 2760
gctggggatg cggtgggctc tatgggtacc caggtgctga agaattgacc cggttcctcc 2820
tgggccagaa agaagcaggc acatcccctt ctctgtgaca caccctgtcc acgcccctgg 2880
ttcttagttc cagccccact cataggacac tcatagctca ggagggctcc gccttcaatc 2940
ccacccgcta aagtacttgg agcggtctct ccctccctca tcagcccacc aaaccaaacc 3000
tagcctccaa gagtgggaag aaattaaagc aagataggct attaagtgca gagggagaga 3060
aaatgcctcc aacatgtgag gaagtaatga gagaaatcat agaattttaa ggccatgatt 3120
taaggccatc atggccttaa tcttccgctt cctcgctcac tgactcgctg cgctcggtcg 3180
ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta tccacagaat 3240
caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta 3300
aaaaggccgc gttgctggcg tttttccata ggctccgccc ccctgacgag catcacaaaa 3360
atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc 3420
cccctggaag ctccctcgtg cgctctcctg ttccgaccct gccgcttacc ggatacctgt 3480
ccgcctttct cccttcggga agcgtggcgc tttctcatag ctcacgctgt aggtatctca 3540
gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc gttcagcccg 3600
accgctgcgc cttatccggt aactatcgtc ttgagtccaa cccggtaaga cacgacttat 3660
cgccactggc agcagccact ggtaacagga ttagcagagc gaggtatgta ggcggtgcta 3720
cagagttctt gaagtggtgg cctaactacg gctacactag aagaacagta tttggtatct 3780
gcgctctgct gaagccagtt accttcggaa aaagagttgg tagctcttga tccggcaaac 3840
aaaccaccgc tggtagcggt ggtttttttg tttgcaagca gcagattacg cgcagaaaaa 3900
aaggatctca agaagatcct ttgatctttt ctacggggtc tgacgctcag tggaacgaaa 3960
actcacgtta agggattttg gtcatgagat tatcaaaaag gatcttcacc tagatccttt 4020
taaattaaaa atgaagtttt aaatcaatct aaagtatata tgagtaaact tggtctgaca 4080
gttaccaatg cttaatcagt gaggcaccta tctcagcgat ctgtctattt cgttcatcca 4140
tagttgcctg actcgggggg ggggggcgct gaggtctgcc tcgtgaagaa ggtgttgctg 4200
actcatacca ggcctgaatc gccccatcat ccagccagaa agtgagggag ccacggttga 4260
tgagagcttt gttgtaggtg gaccagttgg tgattttgaa cttttgcttt gccacggaac 4320
ggtctgcgtt gtcgggaaga tgcgtgatct gatccttcaa ctcagcaaaa gttcgattta 4380
ttcaacaaag ccgccgtccc gtcaagtcag cgtaatgctc tgccagtgtt acaaccaatt 4440
aaccaattct gattagaaaa actcatcgag catcaaatga aactgcaatt tattcatatc 4500
aggattatca ataccatatt tttgaaaaag ccgtttctgt aatgaaggag aaaactcacc 4560
gaggcagttc cataggatgg caagatcctg gtatcggtct gcgattccga ctcgtccaac 4620
atcaatacaa cctattaatt tcccctcgtc aaaaataagg ttatcaagtg agaaatcacc 4680
atgagtgacg actgaatccg gtgagaatgg caaaagctta tgcatttctt tccagacttg 4740
ttcaacaggc cagccattac gctcgtcatc aaaatcactc gcatcaacca aaccgttatt 4800
cattcgtgat tgcgcctgag cgagacgaaa tacgcgatcg ctgttaaaag gacaattaca 4860
aacaggaatc gaatgcaacc ggcgcaggaa cactgccagc gcatcaacaa tattttcacc 4920
tgaatcagga tattcttcta atacctggaa tgctgttttc ccggggatcg cagtggtgag 4980
taaccatgca tcatcaggag tacggataaa atgcttgatg gtcggaagag gcataaattc 5040
cgtcagccag tttagtctga ccatctcatc tgtaacatca ttggcaacgc tacctttgcc 5100
atgtttcaga aacaactctg gcgcatcggg cttcccatac aatcgataga ttgtcgcacc 5160
tgattgcccg acattatcgc gagcccattt atacccatat aaatcagcat ccatgttgga 5220
atttaatcgc ggcctcgagc aagacgtttc ccgttgaata tggctcataa caccccttgt 5280
attactgttt atgtaagcag acagttttat tgttcatgat gatatatttt tatcttgtgc 5340
aatgtaacat cagagatttt gagacacaac gtggctttcc cccccccccc attattgaag 5400
catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa 5460
acaaataggg gttccgcgca catttccccg aaaagtgcca cctgacgtct aagaaaccat 5520
tattatcatg acattaacct ataaaaatag gcgtatcacg aggccctttc gtc 5573
<210> 281
<211> 654
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(654)
<400> 281
atg gaa aaa atc gtg ctg ctg ctg gct atc gtg tcc ctg gtg aag tcc 48
Met Glu Lys Ile Val Leu Leu Leu Ala Ile Val Ser Leu Val Lys Ser
1 5 10 15
gac cag atc tgt att ggg tat cat gct aac aac tcc aca gaa cag gtg 96
Asp Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Gln Val
20 25 30
gat act atc atg gag aag aac gtg acc gtc aca cac gct cag gac att 144
Asp Thr Ile Met Glu Lys Asn Val Thr Val Thr His Ala Gln Asp Ile
35 40 45
gga tgg gga ctg gtc ctg gca acc gga ctg aga aat tca cca cag agg 192
Gly Trp Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Ser Pro Gln Arg
50 55 60
gaa agc cgg aga aag aaa cgc gga ctg ttc ggc gct atc gca ggg ttt 240
Glu Ser Arg Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe
65 70 75 80
att gag ggc ggg tgg cag gga atg gtg gat ggg tgg tac ggc tac cac 288
Ile Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His
85 90 95
cat tcc aac gaa cag gga tct ggc tac gcc gct gat aag gag tct act 336
His Ser Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr
100 105 110
cag aaa gct atc gac ggc gtg acc aac atg gtc aat agt atc att gat 384
Gln Lys Ala Ile Asp Gly Val Thr Asn Met Val Asn Ser Ile Ile Asp
115 120 125
aag atg ggc tct gga ggc agt gga acc gac ctg gca gag ctg ctg gtg 432
Lys Met Gly Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val
130 135 140
ctg ctg ctg aac cag tgg aca ctg ctg ttc cac gac tct aac gtg aag 480
Leu Leu Leu Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys
145 150 155 160
aat ctg tat gat aaa gtc cga ctg cag ctg cgg gac aac gcc aag gaa 528
Asn Leu Tyr Asp Lys Val Arg Leu Gln Leu Arg Asp Asn Ala Lys Glu
165 170 175
ctg ggg aat gga tgc ttc gag ttc tac cat aag tgc gat aac gaa tgt 576
Leu Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Glu Cys
180 185 190
atg gag agc atc cga aac ggc aca tac aat tat ccc cag tat tcc gag 624
Met Glu Ser Ile Arg Asn Gly Thr Tyr Asn Tyr Pro Gln Tyr Ser Glu
195 200 205
gaa gct agg ctg aaa cgc gag gaa att agc 654
Glu Ala Arg Leu Lys Arg Glu Glu Ile Ser
210 215
<210> 282
<211> 218
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 282
Met Glu Lys Ile Val Leu Leu Leu Ala Ile Val Ser Leu Val Lys Ser
1 5 10 15
Asp Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Gln Val
20 25 30
Asp Thr Ile Met Glu Lys Asn Val Thr Val Thr His Ala Gln Asp Ile
35 40 45
Gly Trp Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Ser Pro Gln Arg
50 55 60
Glu Ser Arg Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe
65 70 75 80
Ile Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His
85 90 95
His Ser Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr
100 105 110
Gln Lys Ala Ile Asp Gly Val Thr Asn Met Val Asn Ser Ile Ile Asp
115 120 125
Lys Met Gly Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val
130 135 140
Leu Leu Leu Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys
145 150 155 160
Asn Leu Tyr Asp Lys Val Arg Leu Gln Leu Arg Asp Asn Ala Lys Glu
165 170 175
Leu Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Glu Cys
180 185 190
Met Glu Ser Ile Arg Asn Gly Thr Tyr Asn Tyr Pro Gln Tyr Ser Glu
195 200 205
Glu Ala Arg Leu Lys Arg Glu Glu Ile Ser
210 215
<210> 283
<211> 654
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 283
gctaatttcc tcgcgtttca gcctagcttc ctcggaatac tggggataat tgtatgtgcc 60
gtttcggatg ctctccatac attcgttatc gcacttatgg tagaactcga agcatccatt 120
ccccagttcc ttggcgttgt cccgcagctg cagtcggact ttatcataca gattcttcac 180
gttagagtcg tggaacagca gtgtccactg gttcagcagc agcaccagca gctctgccag 240
gtcggttcca ctgcctccag agcccatctt atcaatgata ctattgacca tgttggtcac 300
gccgtcgata gctttctgag tagactcctt atcagcggcg tagccagatc cctgttcgtt 360
ggaatggtgg tagccgtacc acccatccac cattccctgc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc cgcgtttctt tctccggctt tccctctgtg gtgaatttct 480
cagtccggtt gccaggacca gtccccatcc aatgtcctga gcgtgtgtga cggtcacgtt 540
cttctccatg atagtatcca cctgttctgt ggagttgtta gcatgatacc caatacagat 600
ctggtcggac ttcaccaggg acacgatagc cagcagcagc acgatttttt ccat 654
<210> 284
<211> 1164
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(1164)
<400> 284
atg gaa aaa atc gtg ctg ctg ctg gct atc gtg tcc ctg gtg aag tcc 48
Met Glu Lys Ile Val Leu Leu Leu Ala Ile Val Ser Leu Val Lys Ser
1 5 10 15
gac cag atc tgt att ggg tat cat gct aac aac tcc aca gaa cag gtg 96
Asp Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Gln Val
20 25 30
gat act atc atg gag aag aac gtg acc gtc aca cac gct cag gac att 144
Asp Thr Ile Met Glu Lys Asn Val Thr Val Thr His Ala Gln Asp Ile
35 40 45
gga tgg gga ctg gtc ctg gca acc gga ctg aga aat tca cca cag agg 192
Gly Trp Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Ser Pro Gln Arg
50 55 60
gaa agc cgg aga aag aaa cgc gga ctg ttc ggc gct atc gca ggg ttt 240
Glu Ser Arg Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe
65 70 75 80
att gag ggc ggg tgg cag gga atg gtg gat ggg tgg tac ggc tac cac 288
Ile Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His
85 90 95
cat tcc aac gaa cag gga tct ggc tac gcc gct gat aag gag tct act 336
His Ser Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr
100 105 110
cag aaa gct atc gac ggc gtg acc aac atg gtc aat agt atc att gat 384
Gln Lys Ala Ile Asp Gly Val Thr Asn Met Val Asn Ser Ile Ile Asp
115 120 125
aag atg ggc tct gga ggc agt gga acc gac ctg gca gag ctg ctg gtg 432
Lys Met Gly Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val
130 135 140
ctg ctg ctg aac cag tgg aca ctg ctg ttc cac gac tct aac gtg aag 480
Leu Leu Leu Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys
145 150 155 160
aat ctg tat gat aaa gtc cga ctg cag ctg cgg gac aac gcc aag gaa 528
Asn Leu Tyr Asp Lys Val Arg Leu Gln Leu Arg Asp Asn Ala Lys Glu
165 170 175
ctg ggg aat gga tgc ttc gag ttc tac cat aag tgc gat aac gaa tgt 576
Leu Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Glu Cys
180 185 190
atg gag agc atc cga aac ggc aca tac aat tat ccc cag tat tcc gag 624
Met Glu Ser Ile Arg Asn Gly Thr Tyr Asn Tyr Pro Gln Tyr Ser Glu
195 200 205
gaa gct agg ctg aaa cgc gag gaa att agc tcc ggg gga gac atc att 672
Glu Ala Arg Leu Lys Arg Glu Glu Ile Ser Ser Gly Gly Asp Ile Ile
210 215 220
aag ctg ctg aac gaa cag gtg aac aag gag atg cag tct agt aac ctg 720
Lys Leu Leu Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu
225 230 235 240
tac atg agt atg tca agc tgg tgt tat act cac tca ctg gat ggc gcc 768
Tyr Met Ser Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala
245 250 255
ggg ctg ttc ctg ttt gac cac gca gcc gag gaa tac gaa cat gct aag 816
Gly Leu Phe Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys
260 265 270
aaa ctg atc att ttc ctg aat gag aac aat gtg ccc gtc cag ctg aca 864
Lys Leu Ile Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr
275 280 285
tcc atc tct gca cct gaa cat aag ttc gag ggc ctg act cag atc ttt 912
Ser Ile Ser Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe
290 295 300
cag aaa gcc tac gaa cac gag cag cat att agt gag tca atc aac aat 960
Gln Lys Ala Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn
305 310 315 320
att gtg gac cac gcc atc aag agc aaa gat cat gct acc ttc aat ttt 1008
Ile Val Asp His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe
325 330 335
ctg cag tgg tac gtg gcc gag cag cac gag gaa gag gtc ctg ttt aag 1056
Leu Gln Trp Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys
340 345 350
gac atc ctg gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg 1104
Asp Ile Leu Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu
355 360 365
tac ctg gca gac cag tat gtg aag ggc att gcc aag tcc agg aaa agc 1152
Tyr Leu Ala Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser
370 375 380
ggg tcc tga tga 1164
Gly Ser
385
<210> 285
<211> 386
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 285
Met Glu Lys Ile Val Leu Leu Leu Ala Ile Val Ser Leu Val Lys Ser
1 5 10 15
Asp Gln Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Glu Gln Val
20 25 30
Asp Thr Ile Met Glu Lys Asn Val Thr Val Thr His Ala Gln Asp Ile
35 40 45
Gly Trp Gly Leu Val Leu Ala Thr Gly Leu Arg Asn Ser Pro Gln Arg
50 55 60
Glu Ser Arg Arg Lys Lys Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe
65 70 75 80
Ile Glu Gly Gly Trp Gln Gly Met Val Asp Gly Trp Tyr Gly Tyr His
85 90 95
His Ser Asn Glu Gln Gly Ser Gly Tyr Ala Ala Asp Lys Glu Ser Thr
100 105 110
Gln Lys Ala Ile Asp Gly Val Thr Asn Met Val Asn Ser Ile Ile Asp
115 120 125
Lys Met Gly Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val
130 135 140
Leu Leu Leu Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys
145 150 155 160
Asn Leu Tyr Asp Lys Val Arg Leu Gln Leu Arg Asp Asn Ala Lys Glu
165 170 175
Leu Gly Asn Gly Cys Phe Glu Phe Tyr His Lys Cys Asp Asn Glu Cys
180 185 190
Met Glu Ser Ile Arg Asn Gly Thr Tyr Asn Tyr Pro Gln Tyr Ser Glu
195 200 205
Glu Ala Arg Leu Lys Arg Glu Glu Ile Ser Ser Gly Gly Asp Ile Ile
210 215 220
Lys Leu Leu Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu
225 230 235 240
Tyr Met Ser Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala
245 250 255
Gly Leu Phe Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys
260 265 270
Lys Leu Ile Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr
275 280 285
Ser Ile Ser Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe
290 295 300
Gln Lys Ala Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn
305 310 315 320
Ile Val Asp His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe
325 330 335
Leu Gln Trp Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys
340 345 350
Asp Ile Leu Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu
355 360 365
Tyr Leu Ala Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser
370 375 380
Gly Ser
385
<210> 286
<211> 1164
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 286
tcatcaggac ccgcttttcc tggacttggc aatgcccttc acatactggt ctgccaggta 60
caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120
ctcttcctcg tgctgctcgg ccacgtacca ctgcagaaaa ttgaaggtag catgatcttt 180
gctcttgatg gcgtggtcca caatattgtt gattgactca ctaatatgct gctcgtgttc 240
gtaggctttc tgaaagatct gagtcaggcc ctcgaactta tgttcaggtg cagagatgga 300
tgtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct tagcatgttc 360
gtattcctcg gctgcgtggt caaacaggaa cagcccggcg ccatccagtg agtgagtata 420
acaccagctt gacatactca tgtacaggtt actagactgc atctccttgt tcacctgttc 480
gttcagcagc ttaatgatgt ctcccccgga gctaatttcc tcgcgtttca gcctagcttc 540
ctcggaatac tggggataat tgtatgtgcc gtttcggatg ctctccatac attcgttatc 600
gcacttatgg tagaactcga agcatccatt ccccagttcc ttggcgttgt cccgcagctg 660
cagtcggact ttatcataca gattcttcac gttagagtcg tggaacagca gtgtccactg 720
gttcagcagc agcaccagca gctctgccag gtcggttcca ctgcctccag agcccatctt 780
atcaatgata ctattgacca tgttggtcac gccgtcgata gctttctgag tagactcctt 840
atcagcggcg tagccagatc cctgttcgtt ggaatggtgg tagccgtacc acccatccac 900
cattccctgc cacccgccct caataaaccc tgcgatagcg ccgaacagtc cgcgtttctt 960
tctccggctt tccctctgtg gtgaatttct cagtccggtt gccaggacca gtccccatcc 1020
aatgtcctga gcgtgtgtga cggtcacgtt cttctccatg atagtatcca cctgttctgt 1080
ggagttgtta gcatgatacc caatacagat ctggtcggac ttcaccaggg acacgatagc 1140
cagcagcagc acgatttttt ccat 1164
<210> 287
<211> 5588
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 287
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960
tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020
cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080
ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140
ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200
accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320
ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380
atatcgccac catggaaaaa atcgtgctgc tgctggctat cgtgtccctg gtgaagtccg 1440
accagatctg tattgggtat catgctaaca actccacaga acaggtggat actatcatgg 1500
agaagaacgt gaccgtcaca cacgctcagg acattggatg gggactggtc ctggcaaccg 1560
gactgagaaa ttcaccacag agggaaagcc ggagaaagaa acgcggactg ttcggcgcta 1620
tcgcagggtt tattgagggc gggtggcagg gaatggtgga tgggtggtac ggctaccacc 1680
attccaacga acagggatct ggctacgccg ctgataagga gtctactcag aaagctatcg 1740
acggcgtgac caacatggtc aatagtatca ttgataagat gggctctgga ggcagtggaa 1800
ccgacctggc agagctgctg gtgctgctgc tgaaccagtg gacactgctg ttccacgact 1860
ctaacgtgaa gaatctgtat gataaagtcc gactgcagct gcgggacaac gccaaggaac 1920
tggggaatgg atgcttcgag ttctaccata agtgcgataa cgaatgtatg gagagcatcc 1980
gaaacggcac atacaattat ccccagtatt ccgaggaagc taggctgaaa cgcgaggaaa 2040
ttagctccgg gggagacatc attaagctgc tgaacgaaca ggtgaacaag gagatgcagt 2100
ctagtaacct gtacatgagt atgtcaagct ggtgttatac tcactcactg gatggcgccg 2160
ggctgttcct gtttgaccac gcagccgagg aatacgaaca tgctaagaaa ctgatcattt 2220
tcctgaatga gaacaatgtg cccgtccagc tgacatccat ctctgcacct gaacataagt 2280
tcgagggcct gactcagatc tttcagaaag cctacgaaca cgagcagcat attagtgagt 2340
caatcaacaa tattgtggac cacgccatca agagcaaaga tcatgctacc ttcaattttc 2400
tgcagtggta cgtggccgag cagcacgagg aagaggtcct gtttaaggac atcctggata 2460
aaatcgaact gattggaaac gagaatcatg gcctgtacct ggcagaccag tatgtgaagg 2520
gcattgccaa gtccaggaaa agcgggtcct gatgaacacg tgggatccag atctgctgtg 2580
ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa 2640
ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt 2700
aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa 2760
gacaatagca ggcatgctgg ggatgcggtg ggctctatgg gtacccaggt gctgaagaat 2820
tgacccggtt cctcctgggc cagaaagaag caggcacatc cccttctctg tgacacaccc 2880
tgtccacgcc cctggttctt agttccagcc ccactcatag gacactcata gctcaggagg 2940
gctccgcctt caatcccacc cgctaaagta cttggagcgg tctctccctc cctcatcagc 3000
ccaccaaacc aaacctagcc tccaagagtg ggaagaaatt aaagcaagat aggctattaa 3060
gtgcagaggg agagaaaatg cctccaacat gtgaggaagt aatgagagaa atcatagaat 3120
tttaaggcca tgatttaagg ccatcatggc cttaatcttc cgcttcctcg ctcactgact 3180
cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac 3240
ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa 3300
aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg 3360
acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa 3420
gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc 3480
ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac 3540
gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac 3600
cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg 3660
taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt 3720
atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagaa 3780
cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct 3840
cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga 3900
ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg 3960
ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct 4020
tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt 4080
aaacttggtc tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc 4140
tatttcgttc atccatagtt gcctgactcg gggggggggg gcgctgaggt ctgcctcgtg 4200
aagaaggtgt tgctgactca taccaggcct gaatcgcccc atcatccagc cagaaagtga 4260
gggagccacg gttgatgaga gctttgttgt aggtggacca gttggtgatt ttgaactttt 4320
gctttgccac ggaacggtct gcgttgtcgg gaagatgcgt gatctgatcc ttcaactcag 4380
caaaagttcg atttattcaa caaagccgcc gtcccgtcaa gtcagcgtaa tgctctgcca 4440
gtgttacaac caattaacca attctgatta gaaaaactca tcgagcatca aatgaaactg 4500
caatttattc atatcaggat tatcaatacc atatttttga aaaagccgtt tctgtaatga 4560
aggagaaaac tcaccgaggc agttccatag gatggcaaga tcctggtatc ggtctgcgat 4620
tccgactcgt ccaacatcaa tacaacctat taatttcccc tcgtcaaaaa taaggttatc 4680
aagtgagaaa tcaccatgag tgacgactga atccggtgag aatggcaaaa gcttatgcat 4740
ttctttccag acttgttcaa caggccagcc attacgctcg tcatcaaaat cactcgcatc 4800
aaccaaaccg ttattcattc gtgattgcgc ctgagcgaga cgaaatacgc gatcgctgtt 4860
aaaaggacaa ttacaaacag gaatcgaatg caaccggcgc aggaacactg ccagcgcatc 4920
aacaatattt tcacctgaat caggatattc ttctaatacc tggaatgctg ttttcccggg 4980
gatcgcagtg gtgagtaacc atgcatcatc aggagtacgg ataaaatgct tgatggtcgg 5040
aagaggcata aattccgtca gccagtttag tctgaccatc tcatctgtaa catcattggc 5100
aacgctacct ttgccatgtt tcagaaacaa ctctggcgca tcgggcttcc catacaatcg 5160
atagattgtc gcacctgatt gcccgacatt atcgcgagcc catttatacc catataaatc 5220
agcatccatg ttggaattta atcgcggcct cgagcaagac gtttcccgtt gaatatggct 5280
cataacaccc cttgtattac tgtttatgta agcagacagt tttattgttc atgatgatat 5340
atttttatct tgtgcaatgt aacatcagag attttgagac acaacgtggc tttccccccc 5400
cccccattat tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg 5460
tatttagaaa aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga 5520
cgtctaagaa accattatta tcatgacatt aacctataaa aataggcgta tcacgaggcc 5580
ctttcgtc 5588
<210> 288
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(645)
<400> 288
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga aca tac aac gct gag ctg ctg gtg ctg ctg ctg 432
Ser Gly Gly Ser Gly Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac 645
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 289
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 289
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 290
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 290
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaaatcca gagtccgctc gttcagcagc agcaccagca gctcagcgtt 240
gtatgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 291
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(1155)
<400> 291
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga aca tac aac gct gag ctg ctg gtg ctg ctg ctg 432
Ser Gly Gly Ser Gly Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
tga 1155
<210> 292
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 292
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Tyr Asn Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 293
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 293
tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60
caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120
ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180
gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240
gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300
agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360
gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420
acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480
gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540
ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600
gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660
ggatttgacc ttctcataca gattcttcac gttgctatcg tggaaatcca gagtccgctc 720
gttcagcagc agcaccagca gctcagcgtt gtatgttccg gagcctccgc tgcccatttt 780
ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840
atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900
cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960
ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020
gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080
gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140
cagtttggcc ttcat 1155
<210> 294
<211> 5579
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 294
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960
tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020
cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080
ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140
ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200
accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320
ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380
atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440
ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500
tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560
ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620
ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680
agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740
ctaacatggt gaattctgtc atcgaaaaaa tgggcagcgg aggctccgga acatacaacg 1800
ctgagctgct ggtgctgctg ctgaacgagc ggactctgga tttccacgat agcaacgtga 1860
agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920
gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980
cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040
ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100
tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160
tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220
agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280
tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340
atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400
acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460
tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520
agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760
aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820
tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880
ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940
tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000
caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060
gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120
atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240
cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300
accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360
acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420
cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540
atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600
agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660
acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720
gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840
gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900
gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960
acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020
tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080
ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140
catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200
ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260
ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320
cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380
gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440
ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500
catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560
ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620
tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680
atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740
gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800
gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860
attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920
ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980
ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040
aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100
tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160
cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220
gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280
ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340
ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400
ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460
aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520
aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579
<210> 295
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(645)
<400> 295
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg 432
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac 645
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 296
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 296
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 297
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 297
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaaatcca gagtccgctc gttcagcagc agcaccagca gctcagccag 240
gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 298
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(1155)
<400> 298
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg 432
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
tga 1155
<210> 299
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 299
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 300
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 300
tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60
caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120
ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180
gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240
gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300
agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360
gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420
acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480
gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540
ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600
gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660
ggatttgacc ttctcataca gattcttcac gttgctatcg tggaaatcca gagtccgctc 720
gttcagcagc agcaccagca gctcagccag gtctgttccg gagcctccgc tgcccatttt 780
ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840
atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900
cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960
ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020
gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080
gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140
cagtttggcc ttcat 1155
<210> 301
<211> 5579
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 301
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960
tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020
cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080
ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140
ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200
accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320
ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380
atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440
ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500
tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560
ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620
ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680
agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740
ctaacatggt gaattctgtc atcgaaaaaa tgggcagcgg aggctccgga acagacctgg 1800
ctgagctgct ggtgctgctg ctgaacgagc ggactctgga tttccacgat agcaacgtga 1860
agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920
gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980
cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040
ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100
tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160
tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220
agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280
tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340
atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400
acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460
tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520
agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760
aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820
tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880
ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940
tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000
caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060
gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120
atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240
cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300
accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360
acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420
cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540
atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600
agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660
acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720
gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840
gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900
gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960
acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020
tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080
ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140
catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200
ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260
ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320
cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380
gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440
ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500
catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560
ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620
tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680
atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740
gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800
gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860
attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920
ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980
ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040
aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100
tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160
cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220
gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280
ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340
ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400
ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460
aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520
aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579
<210> 302
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(645)
<400> 302
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atc gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Ile Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg atc 432
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile
130 135 140
aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac 645
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 303
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 303
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Ile Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 304
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 304
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaaatcca gagtccgctc gttgatcagc agcaccagca gctcagccag 240
gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacga tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 305
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(1155)
<400> 305
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atc gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Ile Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg atc 432
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile
130 135 140
aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
tga 1155
<210> 306
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 306
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Ile Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 307
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 307
tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60
caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120
ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180
gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240
gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300
agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360
gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420
acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480
gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540
ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600
gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660
ggatttgacc ttctcataca gattcttcac gttgctatcg tggaaatcca gagtccgctc 720
gttgatcagc agcaccagca gctcagccag gtctgttccg gagcctccgc tgcccatttt 780
ttcgatgaca gaattcacga tgttagtaat gccattgatt gcgttctgtg tagacttctg 840
atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900
cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960
ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020
gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080
gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140
cagtttggcc ttcat 1155
<210> 308
<211> 5579
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 308
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960
tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020
cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080
ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140
ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200
accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320
ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380
atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440
ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500
tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560
ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620
ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680
agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740
ctaacatcgt gaattctgtc atcgaaaaaa tgggcagcgg aggctccgga acagacctgg 1800
ctgagctgct ggtgctgctg atcaacgagc ggactctgga tttccacgat agcaacgtga 1860
agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920
gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980
cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040
ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100
tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160
tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220
agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280
tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340
atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400
acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460
tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520
agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760
aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820
tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880
ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940
tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000
caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060
gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120
atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240
cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300
accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360
acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420
cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540
atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600
agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660
acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720
gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840
gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900
gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960
acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020
tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080
ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140
catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200
ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260
ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320
cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380
gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440
ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500
catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560
ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620
tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680
atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740
gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800
gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860
attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920
ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980
ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040
aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100
tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160
cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220
gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280
ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340
ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400
ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460
aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520
aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579
<210> 309
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(645)
<400> 309
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac ctg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg atc 432
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile
130 135 140
aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac 645
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 310
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 310
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 311
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 311
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaaatcca gagtccgctc gttgatcagc agcaccagca gctcagccag 240
gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca ggttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 312
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(1155)
<400> 312
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac ctg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg atc 432
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile
130 135 140
aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
tga 1155
<210> 313
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 313
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Ile
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 314
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 314
tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60
caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120
ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180
gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240
gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300
agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360
gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420
acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480
gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540
ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600
gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660
ggatttgacc ttctcataca gattcttcac gttgctatcg tggaaatcca gagtccgctc 720
gttgatcagc agcaccagca gctcagccag gtctgttccg gagcctccgc tgcccatttt 780
ttcgatgaca gaattcacca ggttagtaat gccattgatt gcgttctgtg tagacttctg 840
atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900
cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960
ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020
gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080
gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140
cagtttggcc ttcat 1155
<210> 315
<211> 5579
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 315
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960
tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020
cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080
ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140
ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200
accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320
ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380
atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440
ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500
tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560
ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620
ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680
agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740
ctaacctggt gaattctgtc atcgaaaaaa tgggcagcgg aggctccgga acagacctgg 1800
ctgagctgct ggtgctgctg atcaacgagc ggactctgga tttccacgat agcaacgtga 1860
agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920
gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980
cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040
ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100
tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160
tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220
agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280
tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340
atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400
acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460
tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520
agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760
aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820
tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880
ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940
tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000
caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060
gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120
atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240
cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300
accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360
acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420
cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540
atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600
agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660
acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720
gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840
gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900
gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960
acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020
tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080
ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140
catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200
ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260
ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320
cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380
gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440
ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500
catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560
ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620
tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680
atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740
gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800
gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860
attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920
ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980
ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040
aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100
tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160
cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220
gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280
ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340
ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400
ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460
aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520
aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579
<210> 316
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(645)
<400> 316
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac ctg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg 432
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac 645
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 317
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 317
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 318
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 318
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaaatcca gagtccgctc gttcagcagc agcaccagca gctcagccag 240
gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca ggttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 319
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(1155)
<400> 319
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac ctg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg 432
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
tga 1155
<210> 320
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 320
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Leu Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 321
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 321
tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60
caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120
ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180
gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240
gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300
agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360
gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420
acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480
gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540
ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600
gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660
ggatttgacc ttctcataca gattcttcac gttgctatcg tggaaatcca gagtccgctc 720
gttcagcagc agcaccagca gctcagccag gtctgttccg gagcctccgc tgcccatttt 780
ttcgatgaca gaattcacca ggttagtaat gccattgatt gcgttctgtg tagacttctg 840
atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900
cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960
ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020
gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080
gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140
cagtttggcc ttcat 1155
<210> 322
<211> 5579
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 322
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960
tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020
cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080
ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140
ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200
accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320
ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380
atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440
ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500
tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560
ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620
ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680
agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740
ctaacctggt gaattctgtc atcgaaaaaa tgggcagcgg aggctccgga acagacctgg 1800
ctgagctgct ggtgctgctg ctgaacgagc ggactctgga tttccacgat agcaacgtga 1860
agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920
gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980
cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040
ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100
tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160
tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220
agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280
tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340
atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400
acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460
tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520
agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760
aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820
tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880
ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940
tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000
caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060
gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120
atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240
cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300
accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360
acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420
cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540
atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600
agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660
acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720
gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840
gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900
gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960
acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020
tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080
ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140
catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200
ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260
ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320
cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380
gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440
ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500
catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560
ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620
tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680
atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740
gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800
gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860
attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920
ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980
ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040
aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100
tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160
cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220
gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280
ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340
ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400
ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460
aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520
aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579
<210> 323
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(645)
<400> 323
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg atg 432
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Met
130 135 140
aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac 645
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 324
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 324
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Met
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 325
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 325
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaaatcca gagtccgctc gttcatcagc agcaccagca gctcagccag 240
gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 326
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(1155)
<400> 326
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg atg 432
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Met
130 135 140
aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
tga 1155
<210> 327
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 327
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Met
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 328
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 328
tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60
caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120
ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180
gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240
gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300
agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360
gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420
acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480
gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540
ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600
gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660
ggatttgacc ttctcataca gattcttcac gttgctatcg tggaaatcca gagtccgctc 720
gttcatcagc agcaccagca gctcagccag gtctgttccg gagcctccgc tgcccatttt 780
ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840
atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900
cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960
ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020
gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080
gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140
cagtttggcc ttcat 1155
<210> 329
<211> 5579
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 329
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960
tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020
cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080
ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140
ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200
accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320
ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380
atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440
ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500
tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560
ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620
ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680
agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740
ctaacatggt gaattctgtc atcgaaaaaa tgggcagcgg aggctccgga acagacctgg 1800
ctgagctgct ggtgctgctg atgaacgagc ggactctgga tttccacgat agcaacgtga 1860
agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920
gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980
cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040
ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100
tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160
tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220
agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280
tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340
atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400
acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460
tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520
agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760
aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820
tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880
ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940
tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000
caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060
gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120
atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240
cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300
accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360
acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420
cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540
atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600
agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660
acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720
gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840
gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900
gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960
acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020
tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080
ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140
catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200
ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260
ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320
cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380
gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440
ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500
catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560
ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620
tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680
atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740
gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800
gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860
attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920
ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980
ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040
aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100
tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160
cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220
gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280
ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340
ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400
ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460
aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520
aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579
<210> 330
<211> 645
<212> DNA
<213> 人工序列
<220>
<223> 合成
<220>
<221> CDS
<222> (1)..(645)
<400> 330
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac cag gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Gln Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg cag 432
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Gln
130 135 140
aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac 645
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 331
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 331
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Gln Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Gln
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 332
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 332
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaaatcca gagtccgctc gttctgcagc agcaccagca gctcagccag 240
gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacct ggttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 333
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(1155)
<400> 333
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac cag gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Gln Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg cag 432
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Gln
130 135 140
aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
tga 1155
<210> 334
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 334
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Gln Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Gln
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 335
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 335
tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60
caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120
ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180
gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240
gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300
agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360
gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420
acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480
gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540
ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600
gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660
ggatttgacc ttctcataca gattcttcac gttgctatcg tggaaatcca gagtccgctc 720
gttctgcagc agcaccagca gctcagccag gtctgttccg gagcctccgc tgcccatttt 780
ttcgatgaca gaattcacct ggttagtaat gccattgatt gcgttctgtg tagacttctg 840
atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900
cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960
ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020
gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080
gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140
cagtttggcc ttcat 1155
<210> 336
<211> 5579
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 336
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960
tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020
cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080
ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140
ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200
accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320
ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380
atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440
ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500
tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560
ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620
ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680
agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740
ctaaccaggt gaattctgtc atcgaaaaaa tgggcagcgg aggctccgga acagacctgg 1800
ctgagctgct ggtgctgctg cagaacgagc ggactctgga tttccacgat agcaacgtga 1860
agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920
gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980
cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040
ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100
tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160
tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220
agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280
tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340
atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400
acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460
tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520
agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760
aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820
tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880
ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940
tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000
caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060
gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120
atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240
cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300
accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360
acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420
cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540
atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600
agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660
acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720
gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840
gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900
gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960
acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020
tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080
ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140
catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200
ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260
ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320
cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380
gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440
ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500
catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560
ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620
tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680
atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740
gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800
gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860
attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920
ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980
ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040
aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100
tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160
cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220
gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280
ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340
ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400
ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460
aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520
aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579
<210> 337
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(645)
<400> 337
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc aac tca act aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr Asn Ser Thr Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg 432
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac 645
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 338
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 338
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr Asn Ser Thr Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 339
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 339
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaaatcca gagtccgctc gttcagcagc agcaccagca gctcagccag 240
gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattagttga gttggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 340
<211> 1155
<212> DNA
<213> 人工序列
<220>
<223> 合成
<220>
<221> CDS
<222> (1)..(1155)
<400> 340
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc aac tca act aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr Asn Ser Thr Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg 432
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac gag cgg act ctg gat ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
tga 1155
<210> 341
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 341
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr Asn Ser Thr Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 342
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 342
tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60
caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120
ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180
gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240
gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300
agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360
gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420
acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480
gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540
ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600
gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660
ggatttgacc ttctcataca gattcttcac gttgctatcg tggaaatcca gagtccgctc 720
gttcagcagc agcaccagca gctcagccag gtctgttccg gagcctccgc tgcccatttt 780
ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840
atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900
cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960
ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattagttga 1020
gttggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080
gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140
cagtttggcc ttcat 1155
<210> 343
<211> 5579
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 343
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960
tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020
cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080
ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140
ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200
accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320
ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380
atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440
ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500
tggagaagaa cgtgactgtc accaactcaa ctaatctggg cagcggactg aggatggtca 1560
ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620
ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680
agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740
ctaacatggt gaattctgtc atcgaaaaaa tgggcagcgg aggctccgga acagacctgg 1800
ctgagctgct ggtgctgctg ctgaacgagc ggactctgga tttccacgat agcaacgtga 1860
agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920
gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980
cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040
ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100
tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160
tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220
agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280
tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340
atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400
acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460
tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520
agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760
aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820
tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880
ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940
tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000
caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060
gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120
atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240
cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300
accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360
acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420
cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540
atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600
agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660
acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720
gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840
gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900
gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960
acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020
tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080
ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140
catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200
ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260
ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320
cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380
gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440
ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500
catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560
ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620
tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680
atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740
gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800
gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860
attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920
ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980
ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040
aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100
tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160
cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220
gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280
ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340
ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400
ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460
aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520
aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579
<210> 344
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(645)
<400> 344
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc att ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Ile Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg atg ctg 432
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Met Leu
130 135 140
aac cag ttc act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Gln Phe Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac 645
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 345
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic construct
<400> 345
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Ile Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Met Leu
130 135 140
Asn Gln Phe Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 346
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 346
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaacagca gagtgaactg gttcagcatc agcaccagca gctcagccag 240
gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccagaat 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 347
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(1155)
<400> 347
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc att ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Ile Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg atg ctg 432
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Met Leu
130 135 140
aac cag ttc act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Gln Phe Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
tga 1155
<210> 348
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 348
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Ile Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Met Leu
130 135 140
Asn Gln Phe Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 349
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 349
tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60
caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120
ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180
gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240
gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300
agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360
gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420
acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480
gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540
ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600
gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660
ggatttgacc ttctcataca gattcttcac gttgctatcg tggaacagca gagtgaactg 720
gttcagcatc agcaccagca gctcagccag gtctgttccg gagcctccgc tgcccatttt 780
ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840
atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900
cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960
ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020
gtgggtgaca gtcacgttct tctccagaat ggtatccact gtgtcggtgg agttgtttgc 1080
gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140
cagtttggcc ttcat 1155
<210> 350
<211> 5579
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 350
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960
tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020
cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080
ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140
ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200
accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320
ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380
atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440
ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccattc 1500
tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560
ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620
ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680
agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740
ctaacatggt gaattctgtc atcgaaaaaa tgggcagcgg aggctccgga acagacctgg 1800
ctgagctgct ggtgctgatg ctgaaccagt tcactctgct gttccacgat agcaacgtga 1860
agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920
gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980
cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040
ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100
tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160
tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220
agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280
tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340
atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400
acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460
tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520
agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760
aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820
tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880
ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940
tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000
caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060
gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120
atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240
cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300
accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360
acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420
cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540
atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600
agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660
acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720
gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840
gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900
gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960
acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020
tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080
ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140
catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200
ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260
ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320
cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380
gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440
ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500
catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560
ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620
tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680
atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740
gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800
gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860
attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920
ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980
ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040
aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100
tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160
cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220
gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280
ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340
ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400
ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460
aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520
aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579
<210> 351
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(645)
<400> 351
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
aat gga aca ggc gga gct gac ctg gct gag ctg ctg gtg ctg ctg ctg 432
Asn Gly Thr Gly Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac 645
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 352
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 352
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Asn Gly Thr Gly Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 353
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 353
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240
gtcagctccg cctgttccat tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 354
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(1155)
<400> 354
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
aat gga aca ggc gga gct gac ctg gct gag ctg ctg gtg ctg ctg ctg 432
Asn Gly Thr Gly Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
tga 1155
<210> 355
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 355
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Asn Gly Thr Gly Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 356
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 356
tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60
caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120
ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180
gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240
gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300
agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360
gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420
acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480
gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540
ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600
gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660
ggatttgacc ttctcataca gattcttcac gttgctatcg tggaacagca gagtccactg 720
gttcagcagc agcaccagca gctcagccag gtcagctccg cctgttccat tgcccatttt 780
ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840
atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900
cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960
ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020
gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080
gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140
cagtttggcc ttcat 1155
<210> 357
<211> 5579
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 357
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960
tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020
cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080
ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140
ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200
accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320
ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380
atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440
ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500
tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560
ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620
ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680
agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740
ctaacatggt gaattctgtc atcgaaaaaa tgggcaatgg aacaggcgga gctgacctgg 1800
ctgagctgct ggtgctgctg ctgaaccagt ggactctgct gttccacgat agcaacgtga 1860
agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920
gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980
cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040
ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100
tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160
tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220
agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280
tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340
atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400
acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460
tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520
agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760
aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820
tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880
ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940
tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000
caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060
gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120
atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240
cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300
accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360
acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420
cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540
atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600
agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660
acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720
gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840
gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900
gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960
acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020
tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080
ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140
catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200
ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260
ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320
cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380
gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440
ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500
catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560
ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620
tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680
atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740
gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800
gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860
attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920
ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980
ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040
aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100
tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160
cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220
gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280
ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340
ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400
ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460
aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520
aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579
<210> 358
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(645)
<400> 358
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
gga aat ggc act gga gct gac ctg gct gag ctg ctg gtg ctg ctg ctg 432
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac 645
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 359
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 359
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 360
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 360
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240
gtcagctcca gtgccatttc cgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 361
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(1155)
<400> 361
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
gga aat ggc act gga gct gac ctg gct gag ctg ctg gtg ctg ctg ctg 432
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
tga 1155
<210> 362
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 362
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 363
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 363
tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60
caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120
ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180
gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240
gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300
agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360
gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420
acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480
gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540
ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600
gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660
ggatttgacc ttctcataca gattcttcac gttgctatcg tggaacagca gagtccactg 720
gttcagcagc agcaccagca gctcagccag gtcagctcca gtgccatttc cgcccatttt 780
ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840
atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900
cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960
ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020
gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080
gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140
cagtttggcc ttcat 1155
<210> 364
<211> 5579
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 364
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960
tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020
cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080
ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140
ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200
accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320
ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380
atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440
ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500
tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560
ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620
ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680
agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740
ctaacatggt gaattctgtc atcgaaaaaa tgggcggaaa tggcactgga gctgacctgg 1800
ctgagctgct ggtgctgctg ctgaaccagt ggactctgct gttccacgat agcaacgtga 1860
agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920
gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980
cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040
ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100
tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160
tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220
agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280
tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340
atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400
acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460
tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520
agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760
aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820
tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880
ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940
tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000
caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060
gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120
atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240
cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300
accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360
acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420
cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540
atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600
agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660
acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720
gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840
gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900
gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960
acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020
tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080
ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140
catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200
ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260
ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320
cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380
gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440
ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500
catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560
ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620
tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680
atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740
gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800
gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860
attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920
ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980
ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040
aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100
tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160
cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220
gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280
ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340
ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400
ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460
aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520
aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579
<210> 365
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(645)
<400> 365
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc aac gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg 432
Ser Gly Gly Asn Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac 645
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 366
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 366
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Asn Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 367
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 367
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240
gtctgttccg ttgcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 368
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(1155)
<400> 368
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
agc gga ggc aac gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg 432
Ser Gly Gly Asn Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
aac gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt 720
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
tga 1155
<210> 369
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 369
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Asn Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 370
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 370
tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60
caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120
ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180
gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240
gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300
agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360
gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420
acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480
gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540
ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600
gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660
ggatttgacc ttctcataca gattcttcac gttgctatcg tggaacagca gagtccactg 720
gttcagcagc agcaccagca gctcagccag gtctgttccg ttgcctccgc tgcccatttt 780
ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840
atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900
cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960
ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020
gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080
gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140
cagtttggcc ttcat 1155
<210> 371
<211> 5579
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 371
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960
tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020
cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080
ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140
ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200
accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320
ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380
atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440
ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500
tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560
ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620
ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac catcagaatg 1680
agcagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740
ctaacatggt gaattctgtc atcgaaaaaa tgggcagcgg aggcaacgga acagacctgg 1800
ctgagctgct ggtgctgctg ctgaaccagt ggactctgct gttccacgat agcaacgtga 1860
agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920
gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980
cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040
ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgcag agctccaacc 2100
tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160
tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220
agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280
tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340
atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400
acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460
tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520
agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760
aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820
tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880
ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940
tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000
caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060
gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120
atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240
cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300
accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360
acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420
cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540
atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600
agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660
acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720
gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840
gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900
gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960
acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020
tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080
ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140
catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200
ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260
ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320
cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380
gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440
ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500
catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560
ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620
tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680
atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740
gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800
gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860
attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920
ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980
ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040
aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100
tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160
cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220
gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280
ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340
ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400
ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460
aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520
aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579
<210> 372
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(645)
<400> 372
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat aac aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn
85 90 95
acc cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
gga aat ggc act gga gct gac ctg gct gag ctg ctg gtg ctg ctg ctg 432
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac 645
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 373
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 373
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn
85 90 95
Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 374
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 374
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240
gtcagctcca gtgccatttc cgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgggtatt 360
gttatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 375
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(1155)
<400> 375
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat aac aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn
85 90 95
acc cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
gga aat ggc act gga gct gac ctg gct gag ctg ctg gtg ctg ctg ctg 432
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
aac gaa cag gtg aac aag gag atg aac agc acc aac ctg tac atg agt 720
Asn Glu Gln Val Asn Lys Glu Met Asn Ser Thr Asn Leu Tyr Met Ser
225 230 235 240
atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
att ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc 864
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
tga 1155
<210> 376
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 376
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn
85 90 95
Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Asn Ser Thr Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 377
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 377
tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60
caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120
ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180
gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240
gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300
agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360
gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420
acaccaacta gacatactca tgtacaggtt ggtgctgttc atctccttgt tcacctgttc 480
gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540
ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600
gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660
ggatttgacc ttctcataca gattcttcac gttgctatcg tggaacagca gagtccactg 720
gttcagcagc agcaccagca gctcagccag gtcagctcca gtgccatttc cgcccatttt 780
ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840
atcagcggcg tagccgctgc cctgggtatt gttatggtgg tagccgtacc acccgtccac 900
cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960
ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020
gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080
gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140
cagtttggcc ttcat 1155
<210> 378
<211> 5579
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 378
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960
tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020
cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080
ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140
ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200
accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320
ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380
atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440
ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500
tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560
ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620
ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac cataacaata 1680
cccagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740
ctaacatggt gaattctgtc atcgaaaaaa tgggcggaaa tggcactgga gctgacctgg 1800
ctgagctgct ggtgctgctg ctgaaccagt ggactctgct gttccacgat agcaacgtga 1860
agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920
gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980
cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040
ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgaac agcaccaacc 2100
tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160
tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220
agaacaatgt gcccgtccag ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280
tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340
atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400
acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460
tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520
agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760
aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820
tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880
ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940
tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000
caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060
gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120
atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240
cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300
accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360
acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420
cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540
atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600
agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660
acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720
gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840
gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900
gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960
acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020
tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080
ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140
catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200
ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260
ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320
cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380
gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440
ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500
catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560
ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620
tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680
atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740
gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800
gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860
attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920
ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980
ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040
aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100
tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160
cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220
gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280
ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340
ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400
ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460
aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520
aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579
<210> 379
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(645)
<400> 379
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat aac aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn
85 90 95
acc cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
gga aat ggc act gga gct gac ctg gct gag ctg ctg gtg ctg ctg ctg 432
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac 645
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 380
<211> 215
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 380
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn
85 90 95
Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp
210 215
<210> 381
<211> 645
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 381
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240
gtcagctcca gtgccatttc cgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgggtatt 360
gttatggtgg tagccgtacc acccgtccac cattcctgtc cacccgccct caataaaccc 420
tgcgatagcg ccgaacagtc ctcttgtttc ccgctgtggg atgttgcgca gtccggtgac 480
catcctcagt ccgctgccca gattcactga gtgggtgaca gtcacgttct tctccaggac 540
ggtatccact gtgtcggtgg agttgtttgc gtgatagccg atgcagatag tgtcagcgta 600
ggttgcggta aaagtacaca gcaggaccag cagtttggcc ttcat 645
<210> 382
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(1155)
<400> 382
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac atc cca cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
cgg gaa aca aga gga ctg ttc ggc gct atc gca ggg ttt att gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggg tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat aac aat 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn
85 90 95
acc cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca 336
Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
gga aat ggc act gga gct gac ctg gct gag ctg ctg gtg ctg ctg ctg 432
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat 480
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
gga tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg 672
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
aac gaa cag gtg aac aag gag atg aac agc acc aac ctg tac atg agt 720
Asn Glu Gln Val Asn Lys Glu Met Asn Ser Thr Asn Leu Tyr Met Ser
225 230 235 240
atg tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc 768
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
ctg ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc 816
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
att ttc ctg aat gag aac aat gtg ccc gtc aac ctg act tca atc agc 864
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Asn Leu Thr Ser Ile Ser
275 280 285
gcc cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct 912
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
tac gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac 960
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
cac gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg 1008
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
tac gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg 1056
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
gat aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca 1104
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
gat cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga 1152
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
tga 1155
<210> 383
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 383
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Asn Asn
85 90 95
Thr Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Gly Asn Gly Thr Gly Ala Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu
210 215 220
Asn Glu Gln Val Asn Lys Glu Met Asn Ser Thr Asn Leu Tyr Met Ser
225 230 235 240
Met Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe
245 250 255
Leu Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile
260 265 270
Ile Phe Leu Asn Glu Asn Asn Val Pro Val Asn Leu Thr Ser Ile Ser
275 280 285
Ala Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala
290 295 300
Tyr Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp
305 310 315 320
His Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp
325 330 335
Tyr Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu
340 345 350
Asp Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala
355 360 365
Asp Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
370 375 380
<210> 384
<211> 1155
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 384
tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60
caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120
ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180
gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240
gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300
agtcaggttg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360
gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420
acaccaacta gacatactca tgtacaggtt ggtgctgttc atctccttgt tcacctgttc 480
gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540
ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600
gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660
ggatttgacc ttctcataca gattcttcac gttgctatcg tggaacagca gagtccactg 720
gttcagcagc agcaccagca gctcagccag gtcagctcca gtgccatttc cgcccatttt 780
ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840
atcagcggcg tagccgctgc cctgggtatt gttatggtgg tagccgtacc acccgtccac 900
cattcctgtc cacccgccct caataaaccc tgcgatagcg ccgaacagtc ctcttgtttc 960
ccgctgtggg atgttgcgca gtccggtgac catcctcagt ccgctgccca gattcactga 1020
gtgggtgaca gtcacgttct tctccaggac ggtatccact gtgtcggtgg agttgtttgc 1080
gtgatagccg atgcagatag tgtcagcgta ggttgcggta aaagtacaca gcaggaccag 1140
cagtttggcc ttcat 1155
<210> 385
<211> 5579
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 385
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960
tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020
cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080
ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140
ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200
accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320
ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380
atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440
ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500
tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560
ccggactgcg caacatccca cagcgggaaa caagaggact gttcggcgct atcgcagggt 1620
ttattgaggg cgggtggaca ggaatggtgg acgggtggta cggctaccac cataacaata 1680
cccagggcag cggctacgcc gctgatcaga agtctacaca gaacgcaatc aatggcatta 1740
ctaacatggt gaattctgtc atcgaaaaaa tgggcggaaa tggcactgga gctgacctgg 1800
ctgagctgct ggtgctgctg ctgaaccagt ggactctgct gttccacgat agcaacgtga 1860
agaatctgta tgagaaggtc aaatcccagc tgaagaacaa tgccaaagaa atcgggaatg 1920
gatgcttcga gttttaccat aagtgcaaca atgaatgtat ggagtctgtg aagaacggca 1980
cttacgacta tcccaaatat tctgaagaga gtaagctgaa tcgagagaaa attgacagtg 2040
ggggcgacat catcaagctg ctgaacgaac aggtgaacaa ggagatgaac agcaccaacc 2100
tgtacatgag tatgtctagt tggtgttata cacactcact ggacggcgct gggctgttcc 2160
tgtttgatca cgcagccgag gaatacgaac atgcaaagaa actgatcatt ttcctgaatg 2220
agaacaatgt gcccgtcaac ctgacttcaa tcagcgcccc tgaacataag ttcgagggcc 2280
tgacccagat ctttcagaaa gcttacgaac acgagcagca tatttccgaa tctatcaaca 2340
atattgtgga ccacgccatt aagagcaaag atcatgctac cttcaacttt ctgcagtggt 2400
acgtggccga gcagcacgag gaggaggtcc tgtttaagga catcctggat aaaatcgaac 2460
tgattggaaa cgagaatcat ggcctgtacc tggcagatca gtatgtgaag ggcattgcca 2520
agtccagaaa aagtgggtca tgatgaacac gtgggatcca gatctgctgt gccttctagt 2580
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 2640
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 2700
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 2760
aggcatgctg gggatgcggt gggctctatg ggtacccagg tgctgaagaa ttgacccggt 2820
tcctcctggg ccagaaagaa gcaggcacat ccccttctct gtgacacacc ctgtccacgc 2880
ccctggttct tagttccagc cccactcata ggacactcat agctcaggag ggctccgcct 2940
tcaatcccac ccgctaaagt acttggagcg gtctctccct ccctcatcag cccaccaaac 3000
caaacctagc ctccaagagt gggaagaaat taaagcaaga taggctatta agtgcagagg 3060
gagagaaaat gcctccaaca tgtgaggaag taatgagaga aatcatagaa ttttaaggcc 3120
atgatttaag gccatcatgg ccttaatctt ccgcttcctc gctcactgac tcgctgcgct 3180
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 3240
cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 3300
accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 3360
acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 3420
cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 3480
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 3540
atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 3600
agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 3660
acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 3720
gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 3780
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 3840
gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 3900
gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 3960
acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 4020
tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 4080
ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 4140
catccatagt tgcctgactc gggggggggg ggcgctgagg tctgcctcgt gaagaaggtg 4200
ttgctgactc ataccaggcc tgaatcgccc catcatccag ccagaaagtg agggagccac 4260
ggttgatgag agctttgttg taggtggacc agttggtgat tttgaacttt tgctttgcca 4320
cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc cttcaactca gcaaaagttc 4380
gatttattca acaaagccgc cgtcccgtca agtcagcgta atgctctgcc agtgttacaa 4440
ccaattaacc aattctgatt agaaaaactc atcgagcatc aaatgaaact gcaatttatt 4500
catatcagga ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa 4560
ctcaccgagg cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg 4620
tccaacatca atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa 4680
atcaccatga gtgacgactg aatccggtga gaatggcaaa agcttatgca tttctttcca 4740
gacttgttca acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc 4800
gttattcatt cgtgattgcg cctgagcgag acgaaatacg cgatcgctgt taaaaggaca 4860
attacaaaca ggaatcgaat gcaaccggcg caggaacact gccagcgcat caacaatatt 4920
ttcacctgaa tcaggatatt cttctaatac ctggaatgct gttttcccgg ggatcgcagt 4980
ggtgagtaac catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagaggcat 5040
aaattccgtc agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc 5100
tttgccatgt ttcagaaaca actctggcgc atcgggcttc ccatacaatc gatagattgt 5160
cgcacctgat tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat 5220
gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt tgaatatggc tcataacacc 5280
ccttgtatta ctgtttatgt aagcagacag ttttattgtt catgatgata tatttttatc 5340
ttgtgcaatg taacatcaga gattttgaga cacaacgtgg ctttcccccc ccccccatta 5400
ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 5460
aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga 5520
aaccattatt atcatgacat taacctataa aaataggcgt atcacgaggc cctttcgtc 5579
<210> 386
<211> 384
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(384)
<400> 386
atg aag gcc aag ctg ctg gtg ctc ctg tgc acc ttc acc gcc acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gcc gac acc atc tgc atc ggc tac cac gcc aac aac agc acc gac acc 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtg ctg gaa aag aac gtg acc gtg acc cac agc gtg aac 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc ggc ctg cgg atg gtg aca ggc ctg cgg aac atc ccc cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
aga gag aca cgg ggc ctg ttc ggc gcc att gcc ggc ttt atc gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggc tgg acc ggc atg gtg gac ggg tgg tac ggc tac cac cac cag aac 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gcc gac cag aag tcc acc cag aac gcc 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aac ggc atc acc aac atg gtg aac agc gtg atc gag aag atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
<210> 387
<211> 128
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 387
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
<210> 388
<211> 384
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 388
gcccatcttc tcgatcacgc tgttcaccat gttggtgatg ccgttgatgg cgttctgggt 60
ggacttctgg tcggcggcgt agccgctgcc ctgctcgttc tggtggtggt agccgtacca 120
cccgtccacc atgccggtcc agccgccctc gataaagccg gcaatggcgc cgaacaggcc 180
ccgtgtctct ctctggggga tgttccgcag gcctgtcacc atccgcaggc cgctgcccag 240
gttcacgctg tgggtcacgg tcacgttctt ttccagcacg gtatccacgg tgtcggtgct 300
gttgttggcg tggtagccga tgcagatggt gtcggcgtag gtggcggtga aggtgcacag 360
gagcaccagc agcttggcct tcat 384
<210> 389
<211> 1110
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(1110)
<400> 389
atg aag gcc aag ctg ctg gtg ctc ctg tgc acc ttc acc gcc acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gcc gac acc atc tgc atc ggc tac cac gcc aac aac agc acc gac acc 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtg ctg gaa aag aac gtg acc gtg acc cac agc gtg aac 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc ggc ctg cgg atg gtg aca ggc ctg cgg aac atc ccc cag 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
aga gag aca cgg ggc ctg ttc ggc gcc att gcc ggc ttt atc gag ggc 240
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
ggc tgg acc ggc atg gtg gac ggg tgg tac ggc tac cac cac cag aac 288
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
gag cag ggc agc ggc tac gcc gcc gac cag aag tcc acc cag aac gcc 336
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
atc aac ggc atc acc aac atg gtg aac agc gtg atc gag aag atg ggc 384
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
tcc ggc ggc agc ggc acc gat ctg gct gaa ctg ctg gtc ctg ctg ctg 432
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
aac gag cgg acc ctg gac ttc cac gac agc aac gtg aag aac ctg tac 480
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
gag aaa gtg aag tcc cag ctg aag aac aac gcc aaa gag atc ggc aac 528
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
ggc tgc ttc gag ttc tac cac aag tgc aac aac gag tgc atg gaa agc 576
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
gtg aag aac ggc acc tac gac tac ccc aag tac agc gag gaa agc aag 624
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
ctg aac cgc gag gga ggc atg caa atc tac gag ggc aag ctg aca gcc 672
Leu Asn Arg Glu Gly Gly Met Gln Ile Tyr Glu Gly Lys Leu Thr Ala
210 215 220
gag ggc ctg aga ttc ggc atc gtg gcc agc cgg ttc aac cac gcc ctg 720
Glu Gly Leu Arg Phe Gly Ile Val Ala Ser Arg Phe Asn His Ala Leu
225 230 235 240
gtg gac aga ctg gtg gaa ggc gcc atc gac tgc atc gtg cgg cac ggc 768
Val Asp Arg Leu Val Glu Gly Ala Ile Asp Cys Ile Val Arg His Gly
245 250 255
ggc aga gaa gag gac atc acc ctg gtc cgc gtg ccc ggc agc tgg gaa 816
Gly Arg Glu Glu Asp Ile Thr Leu Val Arg Val Pro Gly Ser Trp Glu
260 265 270
att cct gtg gct gcc ggc gag ctg gcc cgg aaa gag gat atc gac gcc 864
Ile Pro Val Ala Ala Gly Glu Leu Ala Arg Lys Glu Asp Ile Asp Ala
275 280 285
gtc atc gcc atc ggc gtg ctg atc aga ggc gcc acc ccc cac ttc gac 912
Val Ile Ala Ile Gly Val Leu Ile Arg Gly Ala Thr Pro His Phe Asp
290 295 300
tat atc gcc agc gag gtg tcc aag ggc ctg gcc aac ctg agc ctg gaa 960
Tyr Ile Ala Ser Glu Val Ser Lys Gly Leu Ala Asn Leu Ser Leu Glu
305 310 315 320
ctg cgg aag ccc atc acc ttc gga gtg atc acc gcc gac acc ctg gaa 1008
Leu Arg Lys Pro Ile Thr Phe Gly Val Ile Thr Ala Asp Thr Leu Glu
325 330 335
cag gcc atc gag aga gcc ggc acc aag cac ggc aac aag gga tgg gaa 1056
Gln Ala Ile Glu Arg Ala Gly Thr Lys His Gly Asn Lys Gly Trp Glu
340 345 350
gcc gcc ctg agc gcc atc gag atg gcc aat ctg ttc aag agc ctg cgc 1104
Ala Ala Leu Ser Ala Ile Glu Met Ala Asn Leu Phe Lys Ser Leu Arg
355 360 365
tga tga 1110
<210> 390
<211> 368
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 390
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Ile Pro Gln
50 55 60
Arg Glu Thr Arg Gly Leu Phe Gly Ala Ile Ala Gly Phe Ile Glu Gly
65 70 75 80
Gly Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn
85 90 95
Glu Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala
100 105 110
Ile Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly
115 120 125
Ser Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu
130 135 140
Asn Glu Arg Thr Leu Asp Phe His Asp Ser Asn Val Lys Asn Leu Tyr
145 150 155 160
Glu Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn
165 170 175
Gly Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser
180 185 190
Val Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys
195 200 205
Leu Asn Arg Glu Gly Gly Met Gln Ile Tyr Glu Gly Lys Leu Thr Ala
210 215 220
Glu Gly Leu Arg Phe Gly Ile Val Ala Ser Arg Phe Asn His Ala Leu
225 230 235 240
Val Asp Arg Leu Val Glu Gly Ala Ile Asp Cys Ile Val Arg His Gly
245 250 255
Gly Arg Glu Glu Asp Ile Thr Leu Val Arg Val Pro Gly Ser Trp Glu
260 265 270
Ile Pro Val Ala Ala Gly Glu Leu Ala Arg Lys Glu Asp Ile Asp Ala
275 280 285
Val Ile Ala Ile Gly Val Leu Ile Arg Gly Ala Thr Pro His Phe Asp
290 295 300
Tyr Ile Ala Ser Glu Val Ser Lys Gly Leu Ala Asn Leu Ser Leu Glu
305 310 315 320
Leu Arg Lys Pro Ile Thr Phe Gly Val Ile Thr Ala Asp Thr Leu Glu
325 330 335
Gln Ala Ile Glu Arg Ala Gly Thr Lys His Gly Asn Lys Gly Trp Glu
340 345 350
Ala Ala Leu Ser Ala Ile Glu Met Ala Asn Leu Phe Lys Ser Leu Arg
355 360 365
<210> 391
<211> 1110
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 391
tcatcagcgc aggctcttga acagattggc catctcgatg gcgctcaggg cggcttccca 60
tcccttgttg ccgtgcttgg tgccggctct ctcgatggcc tgttccaggg tgtcggcggt 120
gatcactccg aaggtgatgg gcttccgcag ttccaggctc aggttggcca ggcccttgga 180
cacctcgctg gcgatatagt cgaagtgggg ggtggcgcct ctgatcagca cgccgatggc 240
gatgacggcg tcgatatcct ctttccgggc cagctcgccg gcagccacag gaatttccca 300
gctgccgggc acgcggacca gggtgatgtc ctcttctctg ccgccgtgcc gcacgatgca 360
gtcgatggcg ccttccacca gtctgtccac cagggcgtgg ttgaaccggc tggccacgat 420
gccgaatctc aggccctcgg ctgtcagctt gccctcgtag atttgcatgc ctccctcgcg 480
gttcagcttg ctttcctcgc tgtacttggg gtagtcgtag gtgccgttct tcacgctttc 540
catgcactcg ttgttgcact tgtggtagaa ctcgaagcag ccgttgccga tctctttggc 600
gttgttcttc agctgggact tcactttctc gtacaggttc ttcacgttgc tgtcgtggaa 660
gtccagggtc cgctcgttca gcagcaggac cagcagttca gccagatcgg tgccgctgcc 720
gccggagccc atcttctcga tcacgctgtt caccatgttg gtgatgccgt tgatggcgtt 780
ctgggtggac ttctggtcgg cggcgtagcc gctgccctgc tcgttctggt ggtggtagcc 840
gtaccacccg tccaccatgc cggtccagcc gccctcgata aagccggcaa tggcgccgaa 900
caggccccgt gtctctctct gggggatgtt ccgcaggcct gtcaccatcc gcaggccgct 960
gcccaggttc acgctgtggg tcacggtcac gttcttttcc agcacggtat ccacggtgtc 1020
ggtgctgttg ttggcgtggt agccgatgca gatggtgtcg gcgtaggtgg cggtgaaggt 1080
gcacaggagc accagcagct tggccttcat 1110
<210> 392
<211> 5528
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 392
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960
tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020
cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080
ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140
ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200
accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320
ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380
ccaccatgaa ggccaagctg ctggtgctcc tgtgcacctt caccgccacc tacgccgaca 1440
ccatctgcat cggctaccac gccaacaaca gcaccgacac cgtggatacc gtgctggaaa 1500
agaacgtgac cgtgacccac agcgtgaacc tgggcagcgg cctgcggatg gtgacaggcc 1560
tgcggaacat cccccagaga gagacacggg gcctgttcgg cgccattgcc ggctttatcg 1620
agggcggctg gaccggcatg gtggacgggt ggtacggcta ccaccaccag aacgagcagg 1680
gcagcggcta cgccgccgac cagaagtcca cccagaacgc catcaacggc atcaccaaca 1740
tggtgaacag cgtgatcgag aagatgggct ccggcggcag cggcaccgat ctggctgaac 1800
tgctggtcct gctgctgaac gagcggaccc tggacttcca cgacagcaac gtgaagaacc 1860
tgtacgagaa agtgaagtcc cagctgaaga acaacgccaa agagatcggc aacggctgct 1920
tcgagttcta ccacaagtgc aacaacgagt gcatggaaag cgtgaagaac ggcacctacg 1980
actaccccaa gtacagcgag gaaagcaagc tgaaccgcga gggaggcatg caaatctacg 2040
agggcaagct gacagccgag ggcctgagat tcggcatcgt ggccagccgg ttcaaccacg 2100
ccctggtgga cagactggtg gaaggcgcca tcgactgcat cgtgcggcac ggcggcagag 2160
aagaggacat caccctggtc cgcgtgcccg gcagctggga aattcctgtg gctgccggcg 2220
agctggcccg gaaagaggat atcgacgccg tcatcgccat cggcgtgctg atcagaggcg 2280
ccacccccca cttcgactat atcgccagcg aggtgtccaa gggcctggcc aacctgagcc 2340
tggaactgcg gaagcccatc accttcggag tgatcaccgc cgacaccctg gaacaggcca 2400
tcgagagagc cggcaccaag cacggcaaca agggatggga agccgccctg agcgccatcg 2460
agatggccaa tctgttcaag agcctgcgct gatgaacacg tgggatccag atctgctgtg 2520
ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa 2580
ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt 2640
aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa 2700
gacaatagca ggcatgctgg ggatgcggtg ggctctatgg gtacccaggt gctgaagaat 2760
tgacccggtt cctcctgggc cagaaagaag caggcacatc cccttctctg tgacacaccc 2820
tgtccacgcc cctggttctt agttccagcc ccactcatag gacactcata gctcaggagg 2880
gctccgcctt caatcccacc cgctaaagta cttggagcgg tctctccctc cctcatcagc 2940
ccaccaaacc aaacctagcc tccaagagtg ggaagaaatt aaagcaagat aggctattaa 3000
gtgcagaggg agagaaaatg cctccaacat gtgaggaagt aatgagagaa atcatagaat 3060
tttaaggcca tgatttaagg ccatcatggc cttaatcttc cgcttcctcg ctcactgact 3120
cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac 3180
ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa 3240
aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg 3300
acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa 3360
gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc 3420
ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac 3480
gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac 3540
cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg 3600
taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt 3660
atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagaa 3720
cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct 3780
cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga 3840
ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg 3900
ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct 3960
tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt 4020
aaacttggtc tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc 4080
tatttcgttc atccatagtt gcctgactcg gggggggggg gcgctgaggt ctgcctcgtg 4140
aagaaggtgt tgctgactca taccaggcct gaatcgcccc atcatccagc cagaaagtga 4200
gggagccacg gttgatgaga gctttgttgt aggtggacca gttggtgatt ttgaactttt 4260
gctttgccac ggaacggtct gcgttgtcgg gaagatgcgt gatctgatcc ttcaactcag 4320
caaaagttcg atttattcaa caaagccgcc gtcccgtcaa gtcagcgtaa tgctctgcca 4380
gtgttacaac caattaacca attctgatta gaaaaactca tcgagcatca aatgaaactg 4440
caatttattc atatcaggat tatcaatacc atatttttga aaaagccgtt tctgtaatga 4500
aggagaaaac tcaccgaggc agttccatag gatggcaaga tcctggtatc ggtctgcgat 4560
tccgactcgt ccaacatcaa tacaacctat taatttcccc tcgtcaaaaa taaggttatc 4620
aagtgagaaa tcaccatgag tgacgactga atccggtgag aatggcaaaa gcttatgcat 4680
ttctttccag acttgttcaa caggccagcc attacgctcg tcatcaaaat cactcgcatc 4740
aaccaaaccg ttattcattc gtgattgcgc ctgagcgaga cgaaatacgc gatcgctgtt 4800
aaaaggacaa ttacaaacag gaatcgaatg caaccggcgc aggaacactg ccagcgcatc 4860
aacaatattt tcacctgaat caggatattc ttctaatacc tggaatgctg ttttcccggg 4920
gatcgcagtg gtgagtaacc atgcatcatc aggagtacgg ataaaatgct tgatggtcgg 4980
aagaggcata aattccgtca gccagtttag tctgaccatc tcatctgtaa catcattggc 5040
aacgctacct ttgccatgtt tcagaaacaa ctctggcgca tcgggcttcc catacaatcg 5100
atagattgtc gcacctgatt gcccgacatt atcgcgagcc catttatacc catataaatc 5160
agcatccatg ttggaattta atcgcggcct cgagcaagac gtttcccgtt gaatatggct 5220
cataacaccc cttgtattac tgtttatgta agcagacagt tttattgttc atgatgatat 5280
atttttatct tgtgcaatgt aacatcagag attttgagac acaacgtggc tttccccccc 5340
cccccattat tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg 5400
tatttagaaa aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga 5460
cgtctaagaa accattatta tcatgacatt aacctataaa aataggcgta tcacgaggcc 5520
ctttcgtc 5528
<210> 393
<211> 594
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(594)
<400> 393
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac ggg tca ggc 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Gly Ser Gly
50 55 60
tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat gag 240
Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu
65 70 75 80
cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca atc 288
Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala Ile
85 90 95
aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc agc 336
Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser
100 105 110
gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg aac 384
Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn
115 120 125
cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat gag 432
Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu
130 135 140
aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat gga 480
Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly
145 150 155 160
tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct gtg 528
Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser Val
165 170 175
aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag ctg 576
Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu
180 185 190
aat cga gag aaa att gac 594
Asn Arg Glu Lys Ile Asp
195
<210> 394
<211> 198
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 394
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Gly Ser Gly
50 55 60
Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu
65 70 75 80
Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala Ile
85 90 95
Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser
100 105 110
Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn
115 120 125
Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu
130 135 140
Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly
145 150 155 160
Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser Val
165 170 175
Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu
180 185 190
Asn Arg Glu Lys Ile Asp
195
<210> 395
<211> 594
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 395
gtcaattttc tctcgattca gcttactctc ttcagaatat ttgggatagt cgtaagtgcc 60
gttcttcaca gactccatac attcattgtt gcacttatgg taaaactcga agcatccatt 120
cccgatttct ttggcattgt tcttcagctg ggatttgacc ttctcataca gattcttcac 180
gttgctatcg tggaacagca gagtccactg gttcagcagc agcaccagca gctcagccag 240
gtctgttccg gagcctccgc tgcccatttt ttcgatgaca gaattcacca tgttagtaat 300
gccattgatt gcgttctgtg tagacttctg atcagcggcg tagccgctgc cctgctcatt 360
ctgatggtgg tagccgtacc acccgtccac cattcctgtc cagcctgacc cgttgcgcag 420
tccggtgacc atcctcagtc cgctgcccag attcactgag tgggtgacag tcacgttctt 480
ctccaggacg gtatccactg tgtcggtgga gttgtttgcg tgatagccga tgcagatagt 540
gtcagcgtag gttgcggtaa aagtacacag caggaccagc agtttggcct tcat 594
<210> 396
<211> 1104
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<220>
<221> CDS
<222> (1)..(1104)
<400> 396
atg aag gcc aaa ctg ctg gtc ctg ctg tgt act ttt acc gca acc tac 48
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
gct gac act atc tgc atc ggc tat cac gca aac aac tcc acc gac aca 96
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
gtg gat acc gtc ctg gag aag aac gtg act gtc acc cac tca gtg aat 144
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
ctg ggc agc gga ctg agg atg gtc acc gga ctg cgc aac ggg tca ggc 192
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Gly Ser Gly
50 55 60
tgg aca gga atg gtg gac ggg tgg tac ggc tac cac cat cag aat gag 240
Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu
65 70 75 80
cag ggc agc ggc tac gcc gct gat cag aag tct aca cag aac gca atc 288
Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala Ile
85 90 95
aat ggc att act aac atg gtg aat tct gtc atc gaa aaa atg ggc agc 336
Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser
100 105 110
gga ggc tcc gga aca gac ctg gct gag ctg ctg gtg ctg ctg ctg aac 384
Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn
115 120 125
cag tgg act ctg ctg ttc cac gat agc aac gtg aag aat ctg tat gag 432
Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu
130 135 140
aag gtc aaa tcc cag ctg aag aac aat gcc aaa gaa atc ggg aat gga 480
Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly
145 150 155 160
tgc ttc gag ttt tac cat aag tgc aac aat gaa tgt atg gag tct gtg 528
Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser Val
165 170 175
aag aac ggc act tac gac tat ccc aaa tat tct gaa gag agt aag ctg 576
Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu
180 185 190
aat cga gag aaa att gac agt ggg ggc gac atc atc aag ctg ctg aac 624
Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu Asn
195 200 205
gaa cag gtg aac aag gag atg cag agc tcc aac ctg tac atg agt atg 672
Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser Met
210 215 220
tct agt tgg tgt tat aca cac tca ctg gac ggc gct ggg ctg ttc ctg 720
Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe Leu
225 230 235 240
ttt gat cac gca gcc gag gaa tac gaa cat gca aag aaa ctg atc att 768
Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile Ile
245 250 255
ttc ctg aat gag aac aat gtg ccc gtc cag ctg act tca atc agc gcc 816
Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser Ala
260 265 270
cct gaa cat aag ttc gag ggc ctg acc cag atc ttt cag aaa gct tac 864
Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala Tyr
275 280 285
gaa cac gag cag cat att tcc gaa tct atc aac aat att gtg gac cac 912
Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp His
290 295 300
gcc att aag agc aaa gat cat gct acc ttc aac ttt ctg cag tgg tac 960
Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp Tyr
305 310 315 320
gtg gcc gag cag cac gag gag gag gtc ctg ttt aag gac atc ctg gat 1008
Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu Asp
325 330 335
aaa atc gaa ctg att gga aac gag aat cat ggc ctg tac ctg gca gat 1056
Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala Asp
340 345 350
cag tat gtg aag ggc att gcc aag tcc aga aaa agt ggg tca tga tga 1104
Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
355 360 365
<210> 397
<211> 366
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 397
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Gly Ser Gly
50 55 60
Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu
65 70 75 80
Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala Ile
85 90 95
Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser
100 105 110
Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn
115 120 125
Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu
130 135 140
Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly
145 150 155 160
Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser Val
165 170 175
Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu
180 185 190
Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu Asn
195 200 205
Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser Met
210 215 220
Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe Leu
225 230 235 240
Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile Ile
245 250 255
Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser Ala
260 265 270
Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala Tyr
275 280 285
Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp His
290 295 300
Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp Tyr
305 310 315 320
Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu Asp
325 330 335
Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala Asp
340 345 350
Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
355 360 365
<210> 398
<211> 1104
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 398
tcatcatgac ccactttttc tggacttggc aatgcccttc acatactgat ctgccaggta 60
caggccatga ttctcgtttc caatcagttc gattttatcc aggatgtcct taaacaggac 120
ctcctcctcg tgctgctcgg ccacgtacca ctgcagaaag ttgaaggtag catgatcttt 180
gctcttaatg gcgtggtcca caatattgtt gatagattcg gaaatatgct gctcgtgttc 240
gtaagctttc tgaaagatct gggtcaggcc ctcgaactta tgttcagggg cgctgattga 300
agtcagctgg acgggcacat tgttctcatt caggaaaatg atcagtttct ttgcatgttc 360
gtattcctcg gctgcgtgat caaacaggaa cagcccagcg ccgtccagtg agtgtgtata 420
acaccaacta gacatactca tgtacaggtt ggagctctgc atctccttgt tcacctgttc 480
gttcagcagc ttgatgatgt cgcccccact gtcaattttc tctcgattca gcttactctc 540
ttcagaatat ttgggatagt cgtaagtgcc gttcttcaca gactccatac attcattgtt 600
gcacttatgg taaaactcga agcatccatt cccgatttct ttggcattgt tcttcagctg 660
ggatttgacc ttctcataca gattcttcac gttgctatcg tggaacagca gagtccactg 720
gttcagcagc agcaccagca gctcagccag gtctgttccg gagcctccgc tgcccatttt 780
ttcgatgaca gaattcacca tgttagtaat gccattgatt gcgttctgtg tagacttctg 840
atcagcggcg tagccgctgc cctgctcatt ctgatggtgg tagccgtacc acccgtccac 900
cattcctgtc cagcctgacc cgttgcgcag tccggtgacc atcctcagtc cgctgcccag 960
attcactgag tgggtgacag tcacgttctt ctccaggacg gtatccactg tgtcggtgga 1020
gttgtttgcg tgatagccga tgcagatagt gtcagcgtag gttgcggtaa aagtacacag 1080
caggaccagc agtttggcct tcat 1104
<210> 399
<211> 5528
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 399
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcagattgg 240
ctattggcca ttgcatacgt tgtatccata tcataatatg tacatttata ttggctcatg 300
tccaacatta ccgccatgtt gacattgatt attgactagt tattaatagt aatcaattac 360
ggggtcatta gttcatagcc catatatgga gttccgcgtt acataactta cggtaaatgg 420
cccgcctggc tgaccgccca acgacccccg cccattgacg tcaataatga cgtatgttcc 480
catagtaacg ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac 540
tgcccacttg gcagtacatc aagtgtatca tatgccaagt acgcccccta ttgacgtcaa 600
tgacggtaaa tggcccgcct ggcattatgc ccagtacatg accttatggg actttcctac 660
ttggcagtac atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta 720
catcaatggg cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga 780
cgtcaatggg agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaacaa 840
ctccgcccca ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag 900
agctcgttta gtgaaccgtc agatcgcctg gagacgccat ccacgctgtt ttgacctcca 960
tagaagacac cgggaccgat ccagcctcca tcggctcgca tctctccttc acgcgcccgc 1020
cgccctacct gaggccgcca tccacgccgg ttgagtcgcg ttctgccgcc tcccgcctgt 1080
ggtgcctcct gaactgcgtc cgccgtctag gtaagtttaa agctcaggtc gagaccgggc 1140
ctttgtccgg cgctcccttg gagcctacct agactcagcc ggctctccac gctttgcctg 1200
accctgcttg ctcaactcta gttaacggtg gagggcagtg tagtctgagc agtactcgtt 1260
gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 1320
ggtcttttct gcagtcaccg tcgtcgacac gtgtgatcag atatcgcggc cgctctagag 1380
atatcgccac catgaaggcc aaactgctgg tcctgctgtg tacttttacc gcaacctacg 1440
ctgacactat ctgcatcggc tatcacgcaa acaactccac cgacacagtg gataccgtcc 1500
tggagaagaa cgtgactgtc acccactcag tgaatctggg cagcggactg aggatggtca 1560
ccggactgcg caacgggtca ggctggacag gaatggtgga cgggtggtac ggctaccacc 1620
atcagaatga gcagggcagc ggctacgccg ctgatcagaa gtctacacag aacgcaatca 1680
atggcattac taacatggtg aattctgtca tcgaaaaaat gggcagcgga ggctccggaa 1740
cagacctggc tgagctgctg gtgctgctgc tgaaccagtg gactctgctg ttccacgata 1800
gcaacgtgaa gaatctgtat gagaaggtca aatcccagct gaagaacaat gccaaagaaa 1860
tcgggaatgg atgcttcgag ttttaccata agtgcaacaa tgaatgtatg gagtctgtga 1920
agaacggcac ttacgactat cccaaatatt ctgaagagag taagctgaat cgagagaaaa 1980
ttgacagtgg gggcgacatc atcaagctgc tgaacgaaca ggtgaacaag gagatgcaga 2040
gctccaacct gtacatgagt atgtctagtt ggtgttatac acactcactg gacggcgctg 2100
ggctgttcct gtttgatcac gcagccgagg aatacgaaca tgcaaagaaa ctgatcattt 2160
tcctgaatga gaacaatgtg cccgtccagc tgacttcaat cagcgcccct gaacataagt 2220
tcgagggcct gacccagatc tttcagaaag cttacgaaca cgagcagcat atttccgaat 2280
ctatcaacaa tattgtggac cacgccatta agagcaaaga tcatgctacc ttcaactttc 2340
tgcagtggta cgtggccgag cagcacgagg aggaggtcct gtttaaggac atcctggata 2400
aaatcgaact gattggaaac gagaatcatg gcctgtacct ggcagatcag tatgtgaagg 2460
gcattgccaa gtccagaaaa agtgggtcat gatgaacacg tgggatccag atctgctgtg 2520
ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa 2580
ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt 2640
aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa 2700
gacaatagca ggcatgctgg ggatgcggtg ggctctatgg gtacccaggt gctgaagaat 2760
tgacccggtt cctcctgggc cagaaagaag caggcacatc cccttctctg tgacacaccc 2820
tgtccacgcc cctggttctt agttccagcc ccactcatag gacactcata gctcaggagg 2880
gctccgcctt caatcccacc cgctaaagta cttggagcgg tctctccctc cctcatcagc 2940
ccaccaaacc aaacctagcc tccaagagtg ggaagaaatt aaagcaagat aggctattaa 3000
gtgcagaggg agagaaaatg cctccaacat gtgaggaagt aatgagagaa atcatagaat 3060
tttaaggcca tgatttaagg ccatcatggc cttaatcttc cgcttcctcg ctcactgact 3120
cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac 3180
ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa 3240
aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg 3300
acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa 3360
gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc 3420
ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac 3480
gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac 3540
cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg 3600
taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt 3660
atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagaa 3720
cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct 3780
cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga 3840
ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg 3900
ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct 3960
tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt 4020
aaacttggtc tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc 4080
tatttcgttc atccatagtt gcctgactcg gggggggggg gcgctgaggt ctgcctcgtg 4140
aagaaggtgt tgctgactca taccaggcct gaatcgcccc atcatccagc cagaaagtga 4200
gggagccacg gttgatgaga gctttgttgt aggtggacca gttggtgatt ttgaactttt 4260
gctttgccac ggaacggtct gcgttgtcgg gaagatgcgt gatctgatcc ttcaactcag 4320
caaaagttcg atttattcaa caaagccgcc gtcccgtcaa gtcagcgtaa tgctctgcca 4380
gtgttacaac caattaacca attctgatta gaaaaactca tcgagcatca aatgaaactg 4440
caatttattc atatcaggat tatcaatacc atatttttga aaaagccgtt tctgtaatga 4500
aggagaaaac tcaccgaggc agttccatag gatggcaaga tcctggtatc ggtctgcgat 4560
tccgactcgt ccaacatcaa tacaacctat taatttcccc tcgtcaaaaa taaggttatc 4620
aagtgagaaa tcaccatgag tgacgactga atccggtgag aatggcaaaa gcttatgcat 4680
ttctttccag acttgttcaa caggccagcc attacgctcg tcatcaaaat cactcgcatc 4740
aaccaaaccg ttattcattc gtgattgcgc ctgagcgaga cgaaatacgc gatcgctgtt 4800
aaaaggacaa ttacaaacag gaatcgaatg caaccggcgc aggaacactg ccagcgcatc 4860
aacaatattt tcacctgaat caggatattc ttctaatacc tggaatgctg ttttcccggg 4920
gatcgcagtg gtgagtaacc atgcatcatc aggagtacgg ataaaatgct tgatggtcgg 4980
aagaggcata aattccgtca gccagtttag tctgaccatc tcatctgtaa catcattggc 5040
aacgctacct ttgccatgtt tcagaaacaa ctctggcgca tcgggcttcc catacaatcg 5100
atagattgtc gcacctgatt gcccgacatt atcgcgagcc catttatacc catataaatc 5160
agcatccatg ttggaattta atcgcggcct cgagcaagac gtttcccgtt gaatatggct 5220
cataacaccc cttgtattac tgtttatgta agcagacagt tttattgttc atgatgatat 5280
atttttatct tgtgcaatgt aacatcagag attttgagac acaacgtggc tttccccccc 5340
cccccattat tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg 5400
tatttagaaa aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga 5460
cgtctaagaa accattatta tcatgacatt aacctataaa aataggcgta tcacgaggcc 5520
ctttcgtc 5528
<210> 400
<211> 198
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 400
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Gly Ser Gly
50 55 60
Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu
65 70 75 80
Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala Ile
85 90 95
Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser
100 105 110
Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn
115 120 125
Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu
130 135 140
Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly
145 150 155 160
Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser Val
165 170 175
Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu
180 185 190
Asn Arg Glu Lys Ile Asp
195
<210> 401
<211> 366
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic
<400> 401
Met Lys Ala Lys Leu Leu Val Leu Leu Cys Thr Phe Thr Ala Thr Tyr
1 5 10 15
Ala Asp Thr Ile Cys Ile Gly Tyr His Ala Asn Asn Ser Thr Asp Thr
20 25 30
Val Asp Thr Val Leu Glu Lys Asn Val Thr Val Thr His Ser Val Asn
35 40 45
Leu Gly Ser Gly Leu Arg Met Val Thr Gly Leu Arg Asn Gly Ser Gly
50 55 60
Trp Thr Gly Met Val Asp Gly Trp Tyr Gly Tyr His His Gln Asn Glu
65 70 75 80
Gln Gly Ser Gly Tyr Ala Ala Asp Gln Lys Ser Thr Gln Asn Ala Ile
85 90 95
Asn Gly Ile Thr Asn Met Val Asn Ser Val Ile Glu Lys Met Gly Ser
100 105 110
Gly Gly Ser Gly Thr Asp Leu Ala Glu Leu Leu Val Leu Leu Leu Asn
115 120 125
Gln Trp Thr Leu Leu Phe His Asp Ser Asn Val Lys Asn Leu Tyr Glu
130 135 140
Lys Val Lys Ser Gln Leu Lys Asn Asn Ala Lys Glu Ile Gly Asn Gly
145 150 155 160
Cys Phe Glu Phe Tyr His Lys Cys Asn Asn Glu Cys Met Glu Ser Val
165 170 175
Lys Asn Gly Thr Tyr Asp Tyr Pro Lys Tyr Ser Glu Glu Ser Lys Leu
180 185 190
Asn Arg Glu Lys Ile Asp Ser Gly Gly Asp Ile Ile Lys Leu Leu Asn
195 200 205
Glu Gln Val Asn Lys Glu Met Gln Ser Ser Asn Leu Tyr Met Ser Met
210 215 220
Ser Ser Trp Cys Tyr Thr His Ser Leu Asp Gly Ala Gly Leu Phe Leu
225 230 235 240
Phe Asp His Ala Ala Glu Glu Tyr Glu His Ala Lys Lys Leu Ile Ile
245 250 255
Phe Leu Asn Glu Asn Asn Val Pro Val Gln Leu Thr Ser Ile Ser Ala
260 265 270
Pro Glu His Lys Phe Glu Gly Leu Thr Gln Ile Phe Gln Lys Ala Tyr
275 280 285
Glu His Glu Gln His Ile Ser Glu Ser Ile Asn Asn Ile Val Asp His
290 295 300
Ala Ile Lys Ser Lys Asp His Ala Thr Phe Asn Phe Leu Gln Trp Tyr
305 310 315 320
Val Ala Glu Gln His Glu Glu Glu Val Leu Phe Lys Asp Ile Leu Asp
325 330 335
Lys Ile Glu Leu Ile Gly Asn Glu Asn His Gly Leu Tyr Leu Ala Asp
340 345 350
Gln Tyr Val Lys Gly Ile Ala Lys Ser Arg Lys Ser Gly Ser
355 360 365