本發(fā)明屬于生物技術(shù)領(lǐng)域,具體涉及一種可視化標(biāo)記基因組位點(diǎn)的方法。
背景技術(shù):
在人類細(xì)胞的細(xì)胞核中,大約32億對(duì)堿基組合成龐大的人類基因組,并且進(jìn)一步凝縮形成不同大小的23對(duì)染色體。盡管人類基因組計(jì)劃的完成幫助我們獲得了人類基因組的全部序列信息,但是這僅僅是我們認(rèn)識(shí)人類基因組的結(jié)構(gòu),以及了解由基因組組成的染色質(zhì)三維結(jié)構(gòu)與人類發(fā)育、衰老和疾病等重要生物學(xué)過程之間關(guān)系的第一步。事實(shí)上,已有多項(xiàng)研究證明基因組結(jié)構(gòu)失序是導(dǎo)致衰老和若干嚴(yán)重疾病的重要因素(Zhang,W.,etal.,Aging stem cells.A Werner syndrome stem cell model unveils heterochromatin alterations as a driver of human aging.Science,2015.348(6239):p.1160-3.2.Liu,G.H.,et al.,Progressive degeneration of human neural stem cells caused by pathogenic LRRK2.Nature,2012.491(7425):p.603-7.3.Misteli,T.,Higher-order genome organization in human disease.Cold Spring Harb Perspect Biol,2010.2(8):p.a000794.)。因此,明確人類基因組和由其構(gòu)成的染色質(zhì)的空間結(jié)構(gòu),及其與蛋白質(zhì)和RNA調(diào)節(jié)子直接的相互作用,將非常有助于我們理解導(dǎo)致衰老和人類疾病的細(xì)胞生物學(xué)進(jìn)程(Lopez-Otin,C.,etal.,The hallmarks of aging.Cell,2013.153(6):p.1194-217.2.Misteli,T.,Beyond the sequence:cellular organization of genome function.Cell,2007.128(4):p.787-800.3.Misteli,T.,The cell biology of genomes:bringing the double helix to life.Cell,2013.152(6):p.1209-12.)。
可視化的基因組標(biāo)記技術(shù)可以大大提升對(duì)于基因組和染色質(zhì)結(jié)構(gòu)和功能研究的效率。盡管目前已有多種基因組可視化標(biāo)記技術(shù)應(yīng)用于科研,但是熒光標(biāo)記Lac或Tet系統(tǒng)需要將約10kb的外源大片段整合進(jìn)入基因組目的位點(diǎn),因此存在低標(biāo)記效率和潛在的基因組損傷等問題(Robinett,C.C.,etal.,In vivo localization of DNA sequences and visualization of large-scale chromatin organization using lac operator/repressor recognition.J Cell Biol,1996.135(6Pt 2):p.1685-700.2.Heun,P.,etal.,Chromosome dynamics in the yeast interphase nucleus.Science,2001.294(5549):p.2181-6.);熒光原位雜交(FISH)是目前研究特定序列在基因組中定位的金標(biāo)準(zhǔn),但是該方法需要在化學(xué)固定的細(xì)胞中進(jìn)行操作,因此無法實(shí)現(xiàn)活細(xì)胞中的基因組可視化標(biāo)記(Levsky,J.M.and R.H.Singer,Fluorescence in situ hybridization:past,present and future.J Cell Sci,2003.116(Pt 14):p.2833-8.);盡管近年來興起的CRISPR/Cas9技術(shù)可以實(shí)現(xiàn)端粒等基因組特殊位點(diǎn)的活細(xì)胞精確標(biāo)記,但是細(xì)胞核內(nèi)的高背景導(dǎo)致的低信噪比以及復(fù)雜的系統(tǒng)等缺點(diǎn)導(dǎo)致其無法在腫瘤細(xì)胞系之外的其它人類細(xì)胞(如多能干細(xì)胞和終末分化細(xì)胞等)中實(shí)現(xiàn)基因組位點(diǎn)的精確標(biāo)記(Levsky,J.M.and R.H.Singer,Fluorescence in situ hybridization:past,present and future.J Cell Sci,2003.116(Pt 14):p.2833-8.);另一種被用做基因編輯技術(shù)的轉(zhuǎn)錄激活因子樣效應(yīng)元件(TALE)同樣可以被用于基因組位點(diǎn)的可視化標(biāo)記,然而現(xiàn)有的報(bào)道中既沒有用FISH金標(biāo)準(zhǔn)驗(yàn)證TALE介導(dǎo)的基因組標(biāo)記的正確性,同時(shí)顯示其在不同細(xì)胞類型中標(biāo)記的強(qiáng)異質(zhì)性(Ma,H.,P.Reyes-Gutierrez,and T.Pederson,Visualization of repetitive DNA sequences in human chromosomes with transcription activator-like effectors.Proc Natl Acad Sci USA,2013.110(52):p.21048-53.)。
綜上所述,現(xiàn)有的基因組可視化標(biāo)記技術(shù)都存在缺陷,無法滿足精確標(biāo)記人類細(xì)胞基因組元件和特殊位點(diǎn)的需要。因此,目前無論是科研還是臨床市場(chǎng),都急需具備精確性、簡(jiǎn)便性和長(zhǎng)效性的活細(xì)胞基因組可視化標(biāo)記技術(shù)來彌補(bǔ)空白,從而推動(dòng)人類衰老和重要疾病的基礎(chǔ)研究和臨床診療技術(shù)的發(fā)展。
技術(shù)實(shí)現(xiàn)要素:
本發(fā)明的一個(gè)目的是提供一種用于可視化標(biāo)記基因組位點(diǎn)的試劑盒。
本發(fā)明提供的用于可視化標(biāo)記基因組位點(diǎn)的試劑盒為如下1)-8)中任一種:
1)包括融合蛋白,所述融合蛋白包括用于識(shí)別或結(jié)合基因組靶序列的轉(zhuǎn)錄激活樣效應(yīng)元件蛋白TALE和硫氧還蛋白TRX;
2)包括融合蛋白,所述融合蛋白包括用于識(shí)別或結(jié)合基因組靶序列的轉(zhuǎn)錄激活樣效應(yīng)元件蛋白TALE、熒光蛋白和硫氧還蛋白TRX;
3)包括蛋白組合物,所述蛋白組合物包括用于識(shí)別或結(jié)合基因組靶序列的轉(zhuǎn)錄激活樣效應(yīng)元件蛋白TALE和硫氧還蛋白TRX;
4)包括蛋白組合物,所述蛋白組合物包括用于識(shí)別或結(jié)合基因組靶序列的轉(zhuǎn)錄激活樣效應(yīng)元件蛋白TALE、熒光蛋白和硫氧還蛋白TRX;
5)編碼所述融合蛋白的DNA分子;
6)編碼所述蛋白組合物的DNA分子;
7)包括表達(dá)1)或2)所述融合蛋白的載體;
8)包括表達(dá)3)或4)所述蛋白組合物的載體。
上述試劑盒中,所述靶序列為端粒DNA、著絲粒DNA、核仁組織區(qū)核糖體RNA編碼基因或MUC4蛋白編碼基因。
上述試劑盒中,所述靶序列可以根據(jù)本領(lǐng)域公知常識(shí),選擇它們的部分序列作為靶序列。優(yōu)選的,本發(fā)明選擇了如下序列作為靶序列:
所述端粒DNA的靶序列為序列14;
所述著絲粒DNA的靶序列為序列15;
所述核仁組織區(qū)核糖體RNA編碼基因的靶序列為序列16;
所述MUC4蛋白編碼基因的靶序列為序列17。
上述試劑盒中,
用于識(shí)別或結(jié)合基因組中所述端粒DNA的靶序列的融合蛋白為a1)或a2);
a1)由核定位序列、用于識(shí)別或結(jié)合基因組中端粒DNA的靶序列的轉(zhuǎn)錄激活樣效應(yīng)元件蛋白TALE和硫氧還蛋白TRX融合得到的蛋白質(zhì);所述融合蛋白自N端至C端依次包括核定位序列、用于識(shí)別或結(jié)合基因組中端粒DNA的靶序列的轉(zhuǎn)錄激活樣效應(yīng)元件蛋白TALE和硫氧還蛋白TRX;
a2)由核定位序列、用于識(shí)別或結(jié)合基因組中端粒DNA的靶序列的轉(zhuǎn)錄激活樣效應(yīng)元件蛋白TALE、熒光蛋白和硫氧還蛋白TRX融合得到的蛋白質(zhì);所述融合蛋白自N端至C端依次包括核定位序列、用于識(shí)別或結(jié)合基因組中端粒DNA的靶序列的轉(zhuǎn)錄激活樣效應(yīng)元件蛋白TALE、熒光蛋白和硫氧還蛋白TRX;
或用于識(shí)別或結(jié)合基因組中所述著絲粒DNA的靶序列的融合蛋白為a3)或a4);
a3)由核定位序列、用于識(shí)別或結(jié)合基因組中著絲粒DNA的靶序列的轉(zhuǎn)錄激活樣效應(yīng)元件蛋白TALE和硫氧還蛋白TRX融合得到的蛋白質(zhì);所述融合蛋白自N端至C端依次包括核定位序列、用于識(shí)別或結(jié)合基因組中著絲粒DNA的靶序列的轉(zhuǎn)錄激活樣效應(yīng)元件蛋白TALE和硫氧還蛋白TRX;
a4)由核定位序列、用于識(shí)別或結(jié)合基因組中著絲粒DNA的靶序列的轉(zhuǎn)錄激活樣效應(yīng)元件蛋白TALE、熒光蛋白和硫氧還蛋白TRX融合得到的蛋白質(zhì);所述融合蛋白自N端至C端依次包括核定位序列、用于識(shí)別或結(jié)合基因組中著絲粒DNA的靶序列的轉(zhuǎn)錄激活樣效應(yīng)元件蛋白TALE、熒光蛋白和硫氧還蛋白TRX;
或用于識(shí)別或結(jié)合基因組中所述核仁組織區(qū)核糖體RNA編碼基因的靶序列的融合蛋白由核定位序列、用于識(shí)別或結(jié)合基因組中核仁組織區(qū)核糖體RNA編碼基因的靶序列的轉(zhuǎn)錄激活樣效應(yīng)元件蛋白TALE、熒光蛋白和硫氧還蛋白TRX融合得到的蛋白質(zhì);所述融合蛋白自N端至C端依次包括核定位序列、用于識(shí)別或結(jié)合基因組中核仁組織區(qū)核糖體RNA編碼基因的靶序列的轉(zhuǎn)錄激活樣效應(yīng)元件蛋白TALE、熒光蛋白和硫氧還蛋白TRX;
或用于識(shí)別或結(jié)合基因組中MUC4蛋白編碼基因的靶序列的融合蛋白由核定位序列、用于識(shí)別或結(jié)合基因組中MUC4蛋白編碼基因的靶序列的轉(zhuǎn)錄激活樣效應(yīng)元件蛋白TALE、熒光蛋白和硫氧還蛋白TRX融合得到的蛋白質(zhì);所述融合蛋白自N端至C端依次包括核定位序列、用于識(shí)別或結(jié)合基因組中MUC4蛋白編碼基因的靶序列的轉(zhuǎn)錄激活樣效應(yīng)元件蛋白TALE、熒光蛋白和硫氧還蛋白TRX。
上述試劑盒中,
所述熒光蛋白為EGFP蛋白或mCherry蛋白;
a1)中所述融合蛋白的氨基酸序列為序列4;
a2)中所述融合蛋白的氨基酸序列為序列7;
a3)中所述融合蛋白的氨基酸序列為序列5;
a4)中所述融合蛋白的氨基酸序列為序列8;
用于識(shí)別或結(jié)合基因組中所述核仁組織區(qū)核糖體RNA編碼基因的靶序列的融合蛋白的氨基酸序列為序列10;
用于識(shí)別或結(jié)合基因組中所述MUC4蛋白編碼基因的靶序列的融合蛋白的氨基酸序列為序列12。
上述試劑盒中,
所述融合蛋白的N端還帶有Flag標(biāo)簽序列;所述FLAG標(biāo)簽序列具體為3xFlag標(biāo)簽序列。在實(shí)際應(yīng)用中,本領(lǐng)域技術(shù)人員還可以利用N端的Flag標(biāo)簽序列通過免疫熒光染色實(shí)驗(yàn)的方法實(shí)現(xiàn)細(xì)胞基因組位點(diǎn)的可視化檢測(cè)。
上述試劑盒中,
所述EGFP蛋白的編碼基因?yàn)樾蛄?3第7382-8098位所示的DNA分子;
所述mCherry蛋白的編碼基因?yàn)樾蛄?所示的DNA分子;
所述硫氧還蛋白TRX的編碼基因?yàn)樾蛄?所示的DNA分子;
所述3xFlag標(biāo)簽序列的編碼基因?yàn)樾蛄?第2113-2181位所示的DNA分子;
所述核定位序列的編碼基因?yàn)樾蛄?第2182-2232位所示的DNA分子;
所述用于識(shí)別或結(jié)合基因組中端粒DNA的靶序列的轉(zhuǎn)錄激活樣效應(yīng)元件蛋白TALE的編碼基因?yàn)樾蛄?第2233-4725位所示的DNA分子;
所述用于識(shí)別或結(jié)合基因組中著絲粒DNA的靶序列的轉(zhuǎn)錄激活樣效應(yīng)元件蛋白TALE的編碼基因?yàn)樾蛄?第121-2613位所示的DNA分子;
所述用于識(shí)別或結(jié)合基因組中核仁組織區(qū)核糖體RNA編碼基因的靶序列的轉(zhuǎn)錄激活樣效應(yīng)元件蛋白TALE的編碼基因?yàn)樾蛄?第121-2613位所示的DNA分子;
所述用于識(shí)別或結(jié)合基因組中MUC4蛋白編碼基因的靶序列的轉(zhuǎn)錄激活樣效應(yīng)元件蛋白TALE的編碼基因?yàn)樾蛄?1第121-2613位所示的DNA分子。
上述試劑盒中,
表達(dá)a1)中所述融合蛋白的載體是以用于識(shí)別或結(jié)合端粒DNA的靶序列的TALE載體為骨架載體,將序列3所示的人硫氧還蛋白TRX的編碼基因序列插入骨架載體的KpnI和XhoI位點(diǎn)間得到的載體;其中,所述用于識(shí)別或結(jié)合端粒DNA的靶序列的TALE載體的核苷酸序列為序列1;
表達(dá)a2)中所述融合蛋白的載體是以用于識(shí)別或結(jié)合端粒DNA的靶序列的TTALE載體為骨架載體,將序列6所示的mCherry編碼基因序列插入骨架載體的HpaI和KpnI位點(diǎn)間得到的載體;其中,所述用于識(shí)別或結(jié)合端粒DNA的靶序列的TTALE載體是以所述用于識(shí)別或結(jié)合端粒DNA的靶序列的TALE載體為骨架載體,將序列3所示的人硫氧還蛋白TRX的編碼基因序列插入骨架載體的KpnI和XhoI位點(diǎn)間得到的載體;
表達(dá)a3)中所述融合蛋白的載體是以用于識(shí)別或結(jié)合著絲粒DNA的靶序列的TALE載體為骨架載體,將序列3所示的人硫氧還蛋白的編碼基因序列插入骨架載體的KpnI和XhoI位點(diǎn)間得到載體;其中,所述用于識(shí)別或結(jié)合著絲粒DNA的靶序列的TALE載體的核苷酸序列為將序列1中第2113-4725位所示的DNA片段替換為序列2所示的DNA片段后得到的序列;
表達(dá)a4)中所述融合蛋白的載體是以用于識(shí)別或結(jié)合著絲粒DNA的靶序列的TTALE載體為骨架載體,將序列13第7382-8098位所示的EGFP編碼基因序列插入骨架載體的HpaI和KpnI位點(diǎn)間得到的載體;其中,所述用于識(shí)別或結(jié)合著絲粒DNA的靶序列的TTALE載體是以所述用于識(shí)別或結(jié)合著絲粒DNA的靶序列的TALE載體為骨架載體,將序列3所示的人硫氧還蛋白TRX的編碼基因序列插入骨架載體的KpnI和XhoI位點(diǎn)間得到的載體;
表達(dá)用于識(shí)別或結(jié)合基因組中所述核仁組織區(qū)核糖體RNA編碼基因的靶序列的融合蛋白的載體是以用于識(shí)別或結(jié)合核仁組織區(qū)核糖體RNA編碼基因的靶序列的TALE載體為骨架載體,將序列3所示的人硫氧還蛋白TRX的編碼基因序列插入骨架載體的AscI和XhoI位點(diǎn)間得到的載體;其中,所述用于識(shí)別或結(jié)合核仁組織區(qū)核糖體RNA編碼基因的靶序列的TALE載體的核苷酸序列為將序列1第2113-4725位所示的DNA片段替換為序列9所示的DNA片段,且將序列13第7382-8098位所示的EGFP編碼基因序列插入序列1的HpaI和KpnI位點(diǎn)間后得到的序列;
表達(dá)用于識(shí)別或結(jié)合基因組中MUC4蛋白編碼基因的靶序列的融合蛋白的載體是以用于識(shí)別或結(jié)合MUC4蛋白編碼基因的靶序列的TALE載體為骨架載體,將序列3所示的人硫氧還蛋白TRX的編碼基因序列插入骨架載體的AscI和XhoI位點(diǎn)間得到的載體;其中,所述用于識(shí)別或結(jié)合MUC4蛋白編碼基因的靶序列的TALE載體的核苷酸序列為將序列1第2113-4725位所示的DNA片段替換為序列11所示的DNA片段,且將序列13第7382-8098位所示的EGFP編碼基因序列插入序列1的HpaI和KpnI位點(diǎn)間后得到的序列。
本發(fā)明的第二個(gè)目的是提供上述試劑盒或上述試劑盒中的融合蛋白或上述試劑盒中的蛋白組合物的新用途。
本發(fā)明提供了上述試劑盒或上述試劑盒中的融合蛋白或上述試劑盒中的蛋白組合物在基因組可視化中的應(yīng)用。
本發(fā)明還提供了上述試劑盒或上述試劑盒中的融合蛋白或上述試劑盒中的蛋白組合物在制備基因組可視化的產(chǎn)品中的應(yīng)用。
本發(fā)明還提供了上述試劑盒或上述試劑盒中的融合蛋白或上述試劑盒中的蛋白組合物在如下b1)-b14)中任一種中的應(yīng)用:
b1)可視化標(biāo)記基因組位點(diǎn);
b2)制備可視化標(biāo)記基因組位點(diǎn)的產(chǎn)品;
b3)可視化標(biāo)記細(xì)胞基因組中的端粒;
b4)制備可視化標(biāo)記細(xì)胞基因組中的端粒的產(chǎn)品;
b5)可視化標(biāo)記細(xì)胞基因組中的著絲粒;
b6)制備可視化標(biāo)記細(xì)胞基因組中的著絲粒的產(chǎn)品;
b7)可視化標(biāo)記細(xì)胞基因組中的核仁組織區(qū)核糖體RNA;
b8)制備可視化標(biāo)記細(xì)胞基因組中的核仁組織區(qū)核糖體RNA的產(chǎn)品;
b9)可視化標(biāo)記細(xì)胞基因組中的MUC4編碼基因位點(diǎn);
b10)制備可視化標(biāo)記細(xì)胞基因組中的MUC4編碼基因位點(diǎn)的產(chǎn)品;
b11)可視化檢測(cè)端?;蛑z粒在細(xì)胞分裂不同時(shí)期的動(dòng)態(tài)變化;
b12)制備可視化檢測(cè)端?;蛑z粒在細(xì)胞分裂不同時(shí)期的動(dòng)態(tài)變化的產(chǎn)品;
b13)可視化檢測(cè)端粒在不同細(xì)胞衰老模型的動(dòng)態(tài)變化;
b14)制備可視化檢測(cè)端粒在不同細(xì)胞衰老模型的動(dòng)態(tài)變化的產(chǎn)品。
本發(fā)明還有一個(gè)目的是提供上述試劑盒中的融合蛋白或上述試劑盒中的蛋白組合物的新用途。
本發(fā)明提供了上述試劑盒中的融合蛋白或上述試劑盒中的蛋白組合物在作為基因組可視化工具中的應(yīng)用。
編碼上述試劑盒中的融合蛋白或上述試劑盒中的蛋白組合物的DNA分子在制備基因組可視化工具中的應(yīng)用也屬于本發(fā)明的保護(hù)范圍。
本發(fā)明的最后一個(gè)目的是提供一種可視化標(biāo)記基因組位點(diǎn)的方法。
本發(fā)明提供的可視化標(biāo)記基因組位點(diǎn)的方法包括將上述試劑盒中的融合蛋白或表達(dá)所述融合蛋白的載體導(dǎo)入細(xì)胞中,實(shí)現(xiàn)細(xì)胞基因組位點(diǎn)的可視化。
上述應(yīng)用或上述方法中,
所述細(xì)胞為人或動(dòng)物細(xì)胞;所述人或動(dòng)物細(xì)胞為人腫瘤細(xì)胞、人胚胎腎細(xì)胞、人多能干細(xì)胞、人成體干細(xì)胞、人終末分化細(xì)胞或小鼠OP9細(xì)胞;所述人或動(dòng)物細(xì)胞具體為人腫瘤細(xì)胞系(U2OS、HeLa、MCF7和HepG2)、人胚胎腎細(xì)胞系(HEK293)、人多能干細(xì)胞(胚胎干細(xì)胞hESC)、人多能干細(xì)胞(誘導(dǎo)多能干細(xì)胞iPSC)、成體干細(xì)胞(間充質(zhì)干細(xì)胞)、成體干細(xì)胞(神經(jīng)干細(xì)胞hNSC)、終末分化細(xì)胞(血管平滑肌細(xì)胞hVSMC)或小鼠OP9細(xì)胞。
本發(fā)明通過在傳統(tǒng)的轉(zhuǎn)錄激活樣效應(yīng)元件(TALE)蛋白的C端融合硫氧還蛋白TRX,創(chuàng)建了新型的基因組可視化標(biāo)記工具TTALE。通過實(shí)驗(yàn)證明:TTALE可用于在腫瘤細(xì)胞系、胚胎干細(xì)胞、成體干細(xì)胞、終末分化細(xì)胞等不同類型的人類細(xì)胞中精確標(biāo)記端粒、著絲粒和核糖體RNA編碼序列(Ribosomal DNA,rDNA)等基因組重復(fù)序列,以及編碼基因座(MUC4)。TTALE技術(shù)能夠彌補(bǔ)目前科研和臨床市場(chǎng)缺乏精確性、簡(jiǎn)便性和長(zhǎng)效性的活細(xì)胞基因組可視化標(biāo)記技術(shù)的空白,從而推動(dòng)人類衰老和重要疾病的基礎(chǔ)研究和臨床診療技術(shù)的發(fā)展。
附圖說明
圖1為本發(fā)明通過將硫氧還蛋白TRX融合在TALE的C端建立了能夠精確標(biāo)記細(xì)胞基因組中的端粒(Telomere)和著絲粒(Centromere)的基因組可視化標(biāo)記技術(shù)(TTALE)。其中,A為TTALE標(biāo)記基因組位點(diǎn)的模式圖;B為識(shí)別端粒(Telomere)的TTALE在細(xì)胞核內(nèi)的標(biāo)記結(jié)果,從左到右依次為:FLAG標(biāo)簽標(biāo)記TTALE標(biāo)記端粒的結(jié)果;FISH探針標(biāo)記端粒的結(jié)果;前兩張圖片疊加的結(jié)果,說明TTALE與FISH信號(hào)很好的共定位效果;疊加圖片的局部放大結(jié)果;C為識(shí)別端粒(Telomere)的TTALE在細(xì)胞核內(nèi)標(biāo)記出的熒光點(diǎn)的分布圖;D為特異性標(biāo)記端粒(Telomere)的FISH探針在細(xì)胞核內(nèi)標(biāo)記出的熒光點(diǎn)的分布圖;E為識(shí)別著絲粒(Centromere)的TTALE在細(xì)胞核內(nèi)的標(biāo)記結(jié)果;F為識(shí)別著絲粒(Centromere)的TTALE在細(xì)胞核內(nèi)標(biāo)記出的熒光點(diǎn)的分布圖;G為特異性標(biāo)記著絲粒(Centromere)的FISH探針在細(xì)胞核內(nèi)標(biāo)記出的熒光點(diǎn)的分布圖。
圖2為利用TTALE精確標(biāo)記人腫瘤細(xì)胞系中的端粒(Telomere)和著絲粒(Centromere)。其中,A為識(shí)別端粒(Telomere)的TTALE在處于細(xì)胞間期的人腫瘤細(xì)胞系中的標(biāo)記結(jié)果;B為識(shí)別著絲粒(Centromere)的TTALE在處于細(xì)胞間期的人腫瘤細(xì)胞系中的標(biāo)記結(jié)果;C為用熒光蛋白(EGFP或mCherry)融合表達(dá)的識(shí)別端粒(Telomere)或著絲粒(Centromere)的TTALE同時(shí)標(biāo)記處于細(xì)胞分裂周期不同時(shí)期的HeLa細(xì)胞基因組的結(jié)果。
圖3為利用TTALE精確標(biāo)記人多能干細(xì)胞(胚胎干細(xì)胞hESC或誘導(dǎo)多能干細(xì)胞iPSC)及其衍生的成體干細(xì)胞(間充質(zhì)干細(xì)胞hMSC和神經(jīng)干細(xì)胞hNSC)和終末分化細(xì)胞(神經(jīng)細(xì)胞hNeuron和血管平滑肌細(xì)胞hVSMC)中的端粒(Telomere)和著絲粒(Centromere)。其中,A為識(shí)別著絲粒(Centromere)的TTALE的標(biāo)記結(jié)果;B為識(shí)別端粒(Telomere)的TTALE的標(biāo)記結(jié)果。
圖4為利用TTALE精確標(biāo)記人類細(xì)胞中的核仁組織區(qū)核糖體RNA編碼序列(NOR-rDNA)。其中,A為識(shí)別核仁組織區(qū)核糖體RNA編碼序列的TALE載體的標(biāo)記結(jié)果;B為在hMSC中證明識(shí)別NOR-rDNA的TTALE的標(biāo)記結(jié)果能夠與NOR-rDNA的FISH雜交信號(hào)共定位,并且其標(biāo)記信號(hào)圍繞核仁組織區(qū)的分子標(biāo)志物Nucleolin和Fibrillarin;C為利用TTALE精確標(biāo)記人腫瘤細(xì)胞(HeLa和U2OS)以及人多能干細(xì)胞(hESC,hMSC和hNSC)中的NOR-rDNA。
圖5為利用TTALE精確標(biāo)記人類細(xì)胞中的基因編碼序列(MUC4)。A為與識(shí)別編碼基因位點(diǎn)(MUC4)的TALE載體的標(biāo)記結(jié)果;B為利用TTALE精確標(biāo)記人腫瘤細(xì)胞(HeLa)以及人多能干細(xì)胞(hMSC)中的MUC4基因位點(diǎn);C為用熒光蛋白(mCherry)融合表達(dá)的識(shí)別MUC4基因位點(diǎn)的TTALE標(biāo)記處于細(xì)胞分裂周期不同時(shí)期的HeLa細(xì)胞基因組的結(jié)果。
圖6為利用識(shí)別端粒(Telomere)的TTALE在不同的細(xì)胞衰老模型中可視化標(biāo)記基因組中端粒(Telomere)隨細(xì)胞衰老進(jìn)程的動(dòng)態(tài)變化。A為不同細(xì)胞衰老模型的模式圖;B為利用識(shí)別端粒(Telomere)的TTALE在不同的細(xì)胞衰老模型中可視化標(biāo)記基因組中端粒(Telomere)的結(jié)果;C為WRN基因缺失的間充質(zhì)干細(xì)胞的驗(yàn)證結(jié)果(WRN基因缺失驗(yàn)證實(shí)驗(yàn));D為HGPS患者來源的間充質(zhì)干細(xì)胞的驗(yàn)證結(jié)果(Progerin表達(dá)驗(yàn)證實(shí)驗(yàn));E為三種細(xì)胞衰老模型中端??s短的qPCR驗(yàn)證;F為利用識(shí)別端粒(Telomere)的TTALE在不同的細(xì)胞衰老模型中可視化標(biāo)記基因組中端粒(Telomere)的熒光強(qiáng)度統(tǒng)計(jì)結(jié)果。
圖7為利用識(shí)別端粒(Telomere)的TTALE在動(dòng)物體內(nèi)可視化標(biāo)記基因組中端粒(Telomere)的結(jié)果。A為構(gòu)建表達(dá)EGFP融合的TTALE的慢病毒載體(Lentivirus)的模式圖;B為利用表達(dá)EGFP融合的TTALE的慢病毒載體感染人腫瘤細(xì)胞U2OS的結(jié)果;C為利用表達(dá)EGFP融合的TTALE的慢病毒載體感染小鼠細(xì)胞OP9的結(jié)果;D為利用表達(dá)EGFP融合的TTALE的慢病毒載體(Lentivirus)在小鼠體內(nèi)可視化標(biāo)記基因組中端粒(Telomere)的模式圖;E為利用表達(dá)EGFP融合的TTALE的慢病毒載體(Lentivirus)在小鼠肌肉、肝臟和腦組織中可視化標(biāo)記基因組中端粒(Telomere)的結(jié)果。
具體實(shí)施方式
下述實(shí)施例中所使用的實(shí)驗(yàn)方法如無特殊說明,均為常規(guī)方法。
下述實(shí)施例中所用的材料、試劑等,如無特殊說明,均可從商業(yè)途徑得到。
下述實(shí)施例中的定量試驗(yàn),均設(shè)置三次重復(fù)實(shí)驗(yàn),結(jié)果取平均值。
下述實(shí)施例中的HGPS病人來源的iPSC(簡(jiǎn)稱HGPS-iPSCs):記載于“Liu,G.H.,Barkho,B.Z.,Ruiz,S.,Diep,D.,Qu,J.,Yang,S.L.,Panopoulos,A.D.,Suzuki,K.,Kurian,L.,Walsh,C.,et al.(2011a).Recapitulation of premature ageing with iPSCs from Hutchinson-Gilford progeria syndrome.Nature 472,221-225.”一文,公眾可從申請(qǐng)人處獲得,僅用于重復(fù)本發(fā)明實(shí)驗(yàn)使用。HGPS病人來源的iPSC(HGPS-iPSCs)中的LMNA基因發(fā)生了突變(LMNA基因組序列為GenBank:NG_008692.2;cDNA序列為GenBank:NM_170707.3),突變類型為C1824T(GGCGG突變?yōu)镚GTGG),該位點(diǎn)參比序列為L(zhǎng)MNAcDNA序列GenBank:NM_170707.3中的CDS區(qū)域。
下述實(shí)施例中的HGPS-GC-iPSCs為與HGPS-iPSC具有相同遺傳背景的經(jīng)過基因矯正的對(duì)照細(xì)胞系,記載于“Liu,G.H.,Suzuki,K.,Qu,J.,Sancho-Martinez,I.,Yi,F.,Li,M.,Kumar,S.,Nivet,E.,Kim,J.,Soligalla,R.D.,et al.(2011).Targeted gene correction of laminopathy-associated LMNA mutations in patient-specific iPSCs.Cell Stem Cell 8,688-694.”一文,公眾可從申請(qǐng)人處獲得,僅用于重復(fù)本發(fā)明實(shí)驗(yàn)使用。
下述實(shí)施例中的WRN基因缺失的人胚胎干細(xì)胞系(WS-ESC)由第一發(fā)明人創(chuàng)建,記載于“Zhang,W.et al.Aging stem cells.A Werner syndrome stem cell model unveils heterochromatin alterations as a driver of human aging.Science 348,1160-1163”一文,公眾可從申請(qǐng)人處獲得,僅用于重復(fù)本發(fā)明實(shí)驗(yàn)使用。WRN蛋白對(duì)于細(xì)胞核內(nèi)組成型異染色質(zhì)的結(jié)構(gòu)維持至關(guān)重要,與細(xì)胞衰老密切相關(guān),其功能缺失是成年早衰癥發(fā)生的直接原因,可引起多組織器官衰老和衰老相關(guān)疾病。
下述實(shí)施例中的pLE4載體記載于“Huize Pan,Di Guan,Xiaomeng Liu,Jingyi Li,Lixia Wang,Jun Wu,Junzhi Zhou,Weizhou Zhang,Ruotong Ren,Weiqi Zhang,Ying Li,Jiping Yang,Ying Hao,Tingting Yuan,Guohong Yuan,Hu Wang,Zhenyu Ju,Zhiyong Mao,Jian Li,Jing Qu,Fuchou Tang,Guang-Hui Liu(2016).SIRT6 safeguards human mesenchymal stem cells from oxidative stress by coactivating NRF2.Cell Research 26,190-205.”一文,公眾可從申請(qǐng)人處獲得,僅用于重復(fù)本發(fā)明實(shí)驗(yàn)使用。
下述實(shí)施例中的人胚胎干細(xì)胞H9細(xì)胞系(WT-ESC)是WiCell公司的產(chǎn)品,貨號(hào):WA09(H9)-DL-7。
實(shí)施例1、精確標(biāo)記細(xì)胞基因組中的端粒(Telomere)和著絲粒(Centromere)的基因組可視化標(biāo)記技術(shù)(TTALE)的建立
一、融合表達(dá)TALE和TRX的表達(dá)載體TTALE的構(gòu)建
1、識(shí)別端粒和著絲粒的TALE載體
參照文獻(xiàn)“Zhang,F.,et al.,Efficient construction of sequence-specific TAL effector for modulating mammalian transcription.Nat Biotechnology,2011.29(2):p.149-53,通過Golden Gate Assembly”中的方法利用TALE Toolbox Kit(美國(guó)Addgene,貨號(hào)為1000000019)分別構(gòu)建識(shí)別端粒的TALE載體和識(shí)別著絲粒的TALE載體。
(1)識(shí)別端粒的TALE載體
識(shí)別端粒的TALE載體的核苷酸序列如序列1所示。識(shí)別端粒的TALE載體表達(dá)融合蛋白TALEtelo,融合蛋白TALEtel識(shí)別的端粒區(qū)DNA序列的靶序列如下:5’-AACCCTAACCCTAACCCT-3’(序列14)。
(2)識(shí)別著絲粒的TALE載體
識(shí)別著絲粒的TALE載體的核苷酸序列為將序列1中第2113-4725位所示的DNA片段替換為序列2所示的DNA片段后得到的序列。識(shí)別著絲粒的TALE載體表達(dá)融合蛋白TALEcentro,該融合蛋白N端帶有FLAG標(biāo)簽序列和核定位序列NLS。融合蛋白TALEcentro識(shí)別的著絲粒區(qū)DNA序列的靶序列如下:5’-CCATTCCATTCCATTCCA-3’(序列15)。
2、識(shí)別端粒和著絲粒的TTALE載體
(1)識(shí)別端粒的TTALE載體
以步驟1中的識(shí)別端粒的TALE為骨架載體,將序列3所示的人硫氧還蛋白TRX的編碼基因序列插入骨架載體的KpnI和XhoI位點(diǎn)間,得到重組載體,將其記為識(shí)別端粒的TTALE載體(圖1A)。識(shí)別端粒的TTALE載體表達(dá)融合蛋白TALEtelo-TRX,該融合蛋白N端帶有FLAG標(biāo)簽序列和核定位序列NLS。融合蛋白TALEtelo-TRX的氨基酸序列如序列4所示。
(2)識(shí)別著絲粒的TTALE載體
以步驟1中的識(shí)別著絲粒的TALE為骨架載體,將序列3所示的人硫氧還蛋白的編碼基因序列插入骨架載體的KpnI和XhoI位點(diǎn)間,得到重組載體,將其記為識(shí)別著絲粒的TTALE載體(圖1A)。識(shí)別著絲粒的TTALE載體表達(dá)融合蛋白TALEcentro-TRX,該融合蛋白N端帶有FLAG標(biāo)簽序列和核定位序列NLS。融合蛋白TALEcentro-TRX的氨基酸序列如序列5所示。
二、細(xì)胞基因組中的端粒和著絲粒分布位點(diǎn)檢測(cè)
1、轉(zhuǎn)染
將步驟一中制備的識(shí)別端粒的TTALE載體和識(shí)別著絲粒的TTALE載體分別轉(zhuǎn)染U2OS細(xì)胞(美國(guó)ATCC,貨號(hào):HTB-96),同時(shí)以識(shí)別端粒的TALE載體和識(shí)別著絲粒的TALE載體為對(duì)照載體(NoTRX),轉(zhuǎn)染24-48小時(shí)后,分別得到轉(zhuǎn)染后細(xì)胞。
2、免疫熒光染色實(shí)驗(yàn)檢測(cè)細(xì)胞基因組中的端粒和著絲粒分布位點(diǎn)
利用識(shí)別端粒的TTALE載體、識(shí)別著絲粒的TTALE載體和對(duì)照載體N端的FLAG標(biāo)簽序列進(jìn)行免疫熒光染色實(shí)驗(yàn),檢測(cè)轉(zhuǎn)染后細(xì)胞基因組中的端粒和著絲粒分布位點(diǎn)。具體步驟如下:用4%多聚甲醛(北京鼎國(guó)昌盛生物技術(shù)有限責(zé)任公司,貨號(hào):AR-0211)固定轉(zhuǎn)染后細(xì)胞,然后用含0.4%TritonX100(美國(guó)Sigma公司,貨號(hào)為T9284)的PBS室通透15分鐘,用一抗稀釋液(含有10%驢血清的PBS)室溫封閉30分鐘,用一抗稀釋液配制的小鼠抗FLAG抗體4度孵育過夜,用PBS室溫清洗3次,每次10分鐘,用ALEXA-488標(biāo)記的驢抗小鼠IgG室溫孵育1小時(shí),用PBS室溫清洗3次,每次10分鐘,用1:2000的Hoechst標(biāo)記細(xì)胞核,最后利用熒光顯微鏡進(jìn)行觀察,并利用ImageJ軟件進(jìn)行熒光強(qiáng)度計(jì)算和統(tǒng)計(jì)分析。如需與FISH探針進(jìn)行共定位分析,則在孵育小鼠抗FLAG抗體之后,進(jìn)行FISH探針孵育,再用Biotin標(biāo)記的抗小鼠IgG室溫孵育1小時(shí),最后用ALEXA-488標(biāo)記的Streptavidin(Vectorlabs,貨號(hào)為SA-5488)孵育1小時(shí)。
3、熒光原位雜交實(shí)驗(yàn)檢測(cè)細(xì)胞基因組中的端粒和著絲粒分布位點(diǎn)
利用特異性識(shí)別端粒和著絲粒的FISH探針完成熒光原位雜交實(shí)驗(yàn)。具體步驟為:用4%多聚甲醛固定轉(zhuǎn)染后細(xì)胞,然后用含0.4%TritonX100(美國(guó)Sigma公司,貨號(hào)為T9284)的PBS室通透15分鐘,再用100微克/毫升的RNAase A(美國(guó)Sigma公司,貨號(hào)為83831)于37度孵育30分鐘,用90度5分鐘變性過的濃度為50nM FISH探針(韓國(guó)PANAGENE公司,端粒FISH探針貨號(hào)為F1002;著絲粒FISH探針貨號(hào)為F3003)85度孵育10分鐘,之后于室溫避光孵育過夜,用PBS室溫清洗3次,每次10分鐘,用1:2000的Hoechst標(biāo)記細(xì)胞核,最后利用熒光顯微鏡進(jìn)行觀察,并利用ImageJ軟件進(jìn)行熒光強(qiáng)度計(jì)算和統(tǒng)計(jì)分析。
結(jié)果如圖1所示。從圖1中可以看出:與識(shí)別端?;蛑z粒的TALE(圖1B中的No TRX,圖1E中的No TRX)相比,識(shí)別端?;蛑z粒的TTALE在細(xì)胞核中標(biāo)記的熒光點(diǎn)與FISH探針標(biāo)記的熒光點(diǎn)幾乎完全重合(圖1B和圖1E),并且其熒光點(diǎn)分布與FISH探針標(biāo)記的熒光點(diǎn)的分布非常相似(圖1C,1D,1F,1G),說明本發(fā)明的TTALE能夠精確標(biāo)記人類細(xì)胞基因組中的端?;蛑z粒位點(diǎn)。
實(shí)施例2、TTALE在精確標(biāo)記不同類型人類細(xì)胞中的端粒、著絲粒、核仁組織區(qū)核糖體RNA編碼序列(NOR-rDNA)以及編碼基因位點(diǎn)(MUC4)中的應(yīng)用
一、TTALE在精確標(biāo)記不同類型人類細(xì)胞中的端粒和著絲粒中的應(yīng)用
將實(shí)施例1中的U2OS細(xì)胞分別替換為如下細(xì)胞:人腫瘤細(xì)胞系(MCF7和HepG2,美國(guó)ATCC公司,貨號(hào)分別為HTB-22和HB-8065)、人胚胎腎細(xì)胞系(HEK293,美國(guó)ATCC公司,貨號(hào)為CRL-1573)、人多能干細(xì)胞(胚胎干細(xì)胞hESC,Wicell公司,貨號(hào)為WA-09)、人多能干細(xì)胞(誘導(dǎo)多能干細(xì)胞iPSC,Wicell公司,貨號(hào)為IISH6i-CML17)、成體干細(xì)胞(間充質(zhì)干細(xì)胞,Lonza公司,貨號(hào)為PT-2501)、成體干細(xì)胞(神經(jīng)干細(xì)胞hNSC,Wicell公司,貨號(hào)為NSC-H9)、終末分化細(xì)胞(血管平滑肌細(xì)胞hVSMC,Lonza公司,貨號(hào)為CC-2571),且保持其他步驟不變,然后利用熒光顯微鏡進(jìn)行觀察,并利用ImageJ軟件進(jìn)行熒光強(qiáng)度計(jì)算和統(tǒng)計(jì)分析。
結(jié)果表明:TTALE可精確標(biāo)記不同類型人類細(xì)胞基因組中的端粒和著絲粒,實(shí)現(xiàn)上述各個(gè)細(xì)胞中端粒和著絲粒的可視化(圖2A-B,圖3)。
二、熒光蛋白融合表達(dá)的識(shí)別端粒或著絲粒的TTALE在精確標(biāo)記人類細(xì)胞中的端粒和著絲粒中的應(yīng)用
1、mCherry熒光蛋白融合表達(dá)的識(shí)別端粒的TTALE的制備
以實(shí)施例1中的識(shí)別端粒的TTALE為骨架載體,將序列6所示的mCherry編碼基因序列插入骨架載體的HpaI和KpnI位點(diǎn)間,得到重組載體,將其記作mCherry融合表達(dá)的識(shí)別端粒的TTALE載體。mCherry融合表達(dá)的識(shí)別端粒的TTALE載體(mCherry-TTALEtelo)表達(dá)融合蛋白TTALEtelo-mCherry-TRX,該融合蛋白N端帶有FLAG標(biāo)簽序列和核定位序列NLS。融合蛋白TTALEtelo-mCherry-TRX的氨基酸序列如序列7所示。
2、EGFP熒光蛋白融合表達(dá)的識(shí)別著絲粒的TTALE的制備
以識(shí)別著絲粒的TTALE為骨架載體,將序列13第7382-8098位所示的EGFP編碼基因序列插入骨架載體的HpaI和KpnI位點(diǎn)間,得到EGFP融合表達(dá)的識(shí)別著絲粒的TTALE。EGFP融合表達(dá)的識(shí)別著絲粒的TTALE(EGFP-TTALEcentro)表達(dá)融合蛋白TTALEcentro-EGFP-TRX,該融合蛋白N端帶有FLAG標(biāo)簽序列和核定位序列NLS。融合蛋白TTALEcentro-EGFP-TRX的氨基酸序列如序列8所示。
3、熒光蛋白融合表達(dá)的識(shí)別端?;蛑z粒的TTALE標(biāo)記HeLa細(xì)胞
分別將mCherry融合表達(dá)的識(shí)別端粒的TTALE(mCherry-TTALEtelo)和EGFP融合表達(dá)的識(shí)別著絲粒的TTALE(EGFP-TTALEcentro)轉(zhuǎn)染處于細(xì)胞分裂周期不同時(shí)期的HeLa細(xì)胞中,轉(zhuǎn)染24-48小時(shí)后,用4%多聚甲醛固定細(xì)胞,然后利用熒光顯微鏡進(jìn)行觀察,并利用ImageJ軟件進(jìn)行熒光強(qiáng)度計(jì)算和統(tǒng)計(jì)分析。
結(jié)果表明:熒光蛋白融合表達(dá)的識(shí)別端粒或著絲粒的TTALE可精確標(biāo)記人類細(xì)胞中的端粒和著絲粒,實(shí)現(xiàn)處于細(xì)胞分裂周期不同時(shí)期的細(xì)胞中端粒和著絲粒的可視化(圖2C)。
三、TTALE在精確標(biāo)記不同類型人類細(xì)胞中的核仁組織區(qū)核糖體RNA編碼序列(NOR-rDNA)中的應(yīng)用
1、EGFP融合表達(dá)的識(shí)別核仁組織區(qū)核糖體RNA編碼基因的TALE的制備
參照文獻(xiàn)“Zhang,F.,et al.,Efficient construction of sequence-specific TAL effector for modulating mammalian transcription.Nat Biotechnology,2011.29(2):p.149-53,通過Golden Gate Assembly”中的方法利用TALE Toolbox Kit(美國(guó)Addgene,貨號(hào)為1000000019)構(gòu)建識(shí)別核仁組織區(qū)核糖體RNA編碼序列的TALE載體。
識(shí)別核仁組織區(qū)核糖體RNA編碼基因的TALE載體的核苷酸序列為將序列1第2113-4725位所示的DNA片段替換為序列9所示的DNA片段,且將序列13第7382-8098位所示的EGFP編碼基因序列插入序列1的HpaI和KpnI位點(diǎn)間后得到的序列。識(shí)別核仁組織區(qū)核糖體RNA編碼基因的TALE載體表達(dá)融合蛋白TALErDNA-EGFP,該融合蛋白N端帶有FLAG標(biāo)簽序列和核定位序列NLS。融合蛋白TALErDNA-EGFP識(shí)別的核仁組織區(qū)核糖體RNA編碼基因的靶序列如下:5’-ACCCTACTGATGATGTGT-3’(序列16)。
2、熒光蛋白融合表達(dá)的識(shí)別核仁組織區(qū)核糖體RNA編碼基因的TTALE的制備
以步驟1中的識(shí)別核仁組織區(qū)核糖體RNA編碼基因的TALE為骨架載體,將序列3所示的人硫氧還蛋白TRX的編碼基因序列插入骨架載體的AscI和XhoI位點(diǎn)間,得到重組載體,將其記為識(shí)別核仁組織區(qū)核糖體RNA編碼基因的TTALE載體(EGFP-TTALErDNA)。識(shí)別核仁組織區(qū)核糖體RNA編碼基因的TTALE載體(EGFP-TTALErDNA)表達(dá)融合蛋白TALErDNA-EGFP-TRX。融合蛋白TALErDNA-EGFP-TRX的氨基酸序列如序列10所示。
將識(shí)別核仁組織區(qū)核糖體RNA編碼序列的TTALE載體(EGFP-TTALErDNA)中的EGFP編碼基因替換為序列6所示的mCherry編碼基因,得到識(shí)別核仁組織區(qū)核糖體RNA編碼基因的TTALE載體(mCherry-TTALErDNA)。識(shí)別核仁組織區(qū)核糖體RNA編碼基因的TTALE載體(mCherry-TTALErDNA)表達(dá)融合蛋白TALErDNA-mCherry-TRX。
3、識(shí)別核仁組織區(qū)核糖體RNA編碼基因的TTALE標(biāo)記細(xì)胞
分別將識(shí)別核仁組織區(qū)核糖體RNA編碼基因的TALE載體和識(shí)別核仁組織區(qū)核糖體RNA編碼基因的TTALE轉(zhuǎn)染人腫瘤細(xì)胞(HeLa和U2OS)以及人多能干細(xì)胞(hESC,hMSC和hNSC)中,轉(zhuǎn)染24-48小時(shí)后,用4%多聚甲醛固定細(xì)胞,然后利用熒光顯微鏡進(jìn)行觀察,并利用ImageJ軟件進(jìn)行熒光強(qiáng)度計(jì)算和統(tǒng)計(jì)分析,分別檢測(cè)不同細(xì)胞中的核仁組織區(qū)核糖體RNA編碼基因的熒光分布情況。
結(jié)果表明:與識(shí)別核仁組織區(qū)核糖體RNA編碼基因的TALE載體相比(圖4A),識(shí)別核仁組織區(qū)核糖體RNA編碼基因的TTALE可精確標(biāo)記人腫瘤細(xì)胞(HeLa和U2OS)以及人多能干細(xì)胞(hESC,hMSC和hNSC)中的核糖體RNA,實(shí)現(xiàn)核仁組織區(qū)核糖體RNA編碼基因的可視化(圖4B,4C)。
四、TTALE在精確標(biāo)記不同類型人類細(xì)胞中的編碼基因位點(diǎn)(MUC4)中的應(yīng)用
1、EGFP融合表達(dá)的識(shí)別編碼基因位點(diǎn)(MUC4)的TALE的制備
參照文獻(xiàn)“Zhang,F.,et al.,Efficient construction of sequence-specific TAL effector for modulating mammalian transcription.Nat Biotechnology,2011.29(2):p.149-53,通過Golden Gate Assembly”中的方法利用TALE Toolbox Kit(美國(guó)Addgene,貨號(hào)為1000000019)構(gòu)建識(shí)別編碼基因位點(diǎn)(MUC4)的TALE載體。
識(shí)別編碼基因位點(diǎn)(MUC4)的TALE載體的核苷酸序列為將序列1第2113-4725位所示的DNA片段替換為序列11所示的DNA片段,且將序列13第7382-8098位所示的EGFP編碼基因序列插入序列1的HpaI和KpnI位點(diǎn)間后得到的序列。識(shí)別編碼基因位點(diǎn)(MUC4)TALE載體表達(dá)融合蛋白TALEMUC4-EGFP,該融合蛋白N端帶有FLAG標(biāo)簽序列和核定位序列NLS。融合蛋白TALEMUC4-EGFP識(shí)別的編碼基因位點(diǎn)(MUC4)的靶序列如下:5’-CCTGTCACCGACACTTCC-3’(序列17)。
2、熒光蛋白融合表達(dá)的識(shí)別編碼基因位點(diǎn)(MUC4)的TTALE的制備
以步驟1中的識(shí)別編碼基因位點(diǎn)(MUC4)的TALE為骨架載體,將序列3所示的人硫氧還蛋白TRX的編碼基因序列插入骨架載體的AscI和XhoI位點(diǎn)間,得到重組載體,將其記為識(shí)別編碼基因位點(diǎn)(MUC4)的TTALE載體(EGFP-TTALEMUC4)。識(shí)別編碼基因位點(diǎn)(MUC4)的TTALE載體表達(dá)融合蛋白TTALEMUC4-EGFP-TRX。融合蛋白TTALEMUC4-EGFP-TRX的氨基酸序列如序列12所示。
將識(shí)別編碼基因位點(diǎn)(MUC4)的TTALE載體(EGFP-TTALEMUC4)中的EGFP編碼基因替換為序列6所示的mCherry編碼基因,得到識(shí)別核仁組織區(qū)核糖體RNA編碼序列的TTALE載體(mCherry-TTALEMUC4)。識(shí)別核仁組織區(qū)核糖體RNA編碼序列的TTALE載體(mCherry-TTALE MUC4)表達(dá)融合蛋白TALEMUC4-mCherry-TRX。
3、識(shí)別編碼基因位點(diǎn)(MUC4)的TTALE標(biāo)記細(xì)胞
分別將識(shí)別編碼基因位點(diǎn)(MUC4)的TALE和識(shí)別編碼基因位點(diǎn)(MUC4)的TTALE分別轉(zhuǎn)染處于細(xì)胞間期和細(xì)胞分裂周期的hMSC和HeLa細(xì)胞中,轉(zhuǎn)染24-48小時(shí)后,用4%多聚甲醛固定細(xì)胞,然后利用熒光顯微鏡進(jìn)行觀察,并利用ImageJ軟件進(jìn)行熒光強(qiáng)度計(jì)算和統(tǒng)計(jì)分析。
結(jié)果表明:與識(shí)別編碼基因位點(diǎn)(MUC4)的TALE載體相比(圖5A),識(shí)別編碼基因位點(diǎn)(MUC4)的TTALE可精確標(biāo)記細(xì)胞間期和細(xì)胞分裂周期的hMSC和HeLa細(xì)胞中的編碼基因位點(diǎn)(MUC4),實(shí)現(xiàn)編碼基因位點(diǎn)(MUC4)的可視化(圖5B-C)。
實(shí)施例3、TTALE標(biāo)記不同細(xì)胞衰老模型中端粒在可視化細(xì)胞衰老進(jìn)程的動(dòng)態(tài)變化中的應(yīng)用
1、EGFP融合表達(dá)的識(shí)別端粒的TTALE載體的制備
將序列13第7382-8098位所示的EGFP編碼基因序列替換實(shí)施例2步驟二的1中mCherry熒光蛋白融合表達(dá)的識(shí)別端粒的TTALE載體(mCherry-TTALEtelo)中的mCherry編碼基因序列,得到EGFP融合表達(dá)的識(shí)別端粒的TTALE載體(EGFP-TTALEtelo),該融合蛋白N端帶有FLAG標(biāo)簽序列和核定位序列NLS。
2、EGFP融合表達(dá)的識(shí)別端粒的TTALE載體標(biāo)記不同衰老模型的人類間充質(zhì)干細(xì)胞
(1)不同衰老模型的人類間充質(zhì)干細(xì)胞的建立
1)野生型人間充質(zhì)干細(xì)胞(WT-MSC)和WRN基因缺失的人間充質(zhì)干細(xì)胞(WS-MSC)的制備
本發(fā)明將野生型人胚胎干細(xì)胞H9細(xì)胞系(WT-ESC)和WRN基因缺失的人胚胎干細(xì)胞系(WS-ESC),進(jìn)一步體外定向分化為間充質(zhì)干細(xì)胞(WT-MSC)和WRN基因缺失的人間充質(zhì)干細(xì)胞(WS-MSC),具體方法如下:
A、將野生型人胚胎干細(xì)胞H9細(xì)胞系(WT-ESC)和WRN基因缺失的人胚胎干細(xì)胞系(WS-ESC)分別進(jìn)行擬胚體(EB)分化,獲得擬胚體(EB)。擬胚體(EB)分化具體步驟如下:準(zhǔn)備含有300-500個(gè)細(xì)胞、大小均一的ESC克隆,用室溫PBS(Gibco,10010023)清洗一次,用Dispase(Invitrogen公司,貨號(hào)為17105041)37℃消化20-30min。待ESC克隆形成球體后,用CDF12培養(yǎng)基重懸后,加到低粘附培養(yǎng)板(Corning公司,貨號(hào)3471)中,37℃,5%CO2條件培養(yǎng)1-3天后即形成擬胚體。
B、將步驟A獲得的擬胚體(EB)接種于基質(zhì)膠(matrigel)包被的6孔板中進(jìn)行培養(yǎng),繼續(xù)培養(yǎng)2周至纖維狀細(xì)胞出現(xiàn)。再經(jīng)過一次傳代后,利用流式細(xì)胞術(shù)分選其中的CD73、CD90和CD105均為陽(yáng)性的細(xì)胞類群(圖1),即為野生型間充質(zhì)干細(xì)胞(記為WT-MSC)和WRN基因缺失的人間充質(zhì)干細(xì)胞系(記為WS-MSC)。
2)HGPS病人來源的(HGPS-MSC)和經(jīng)過基因矯正的人間充質(zhì)干細(xì)胞(HGPS-GC-MSC)的制備
本實(shí)施例將HGPS-iPSCs和HGPS-GC-iPSCs進(jìn)一步體外定向分化為間充質(zhì)干細(xì)胞HGPS-MSC和HGPS-GC-MSC。具體方法如下:
分別將HGPS-iPSCs和HGPS-GC-iPSCs進(jìn)行擬胚體(EB)分化,分化14天,將EB接種于基質(zhì)膠(matrigel)包被的6孔板中進(jìn)行培養(yǎng),繼續(xù)培養(yǎng)2周至纖維狀細(xì)胞出現(xiàn)。再經(jīng)過一次傳代后,利用流式細(xì)胞術(shù)分選其中CD73、CD90和CD105均為陽(yáng)性的細(xì)胞類群,即為HGPS病人來源的間充質(zhì)干細(xì)胞(記為HGPS-MSC)和經(jīng)過基因矯正的人間充質(zhì)干細(xì)胞(記為HGPS-GC-MSC)。
3)野生型間充質(zhì)干細(xì)胞的早代細(xì)胞(EP-WT-MSC)與晚代細(xì)胞(LP-WT-MSC)的制備
將步驟(1)中的野生型間充質(zhì)干細(xì)胞(WT-MSC)連續(xù)傳代培養(yǎng)至12代(記為P12代),將P1-P6代細(xì)胞記為早代間充質(zhì)干細(xì)胞(記為EP-WT-MSC),將P10-P12代細(xì)胞記晚代間充質(zhì)干細(xì)胞(記為L(zhǎng)P-WT-MSC)。選取P6和P12代WT-MSC細(xì)胞作為EP-WT-MSC和LP-WT-MSC的代表進(jìn)行下述相關(guān)實(shí)驗(yàn)。
(2)EGFP融合表達(dá)的識(shí)別端粒的TTALE載體標(biāo)記不同衰老模型的人類間充質(zhì)干細(xì)胞
將EGFP融合表達(dá)的識(shí)別端粒的TTALE載體利用化學(xué)轉(zhuǎn)染方法導(dǎo)入不同衰老模型的人類間充質(zhì)干細(xì)胞中(圖6A,6C和6D),轉(zhuǎn)染24-48小時(shí)后,用4%多聚甲醛固定細(xì)胞,然后利用熒光顯微鏡進(jìn)行觀察,并利用ImageJ軟件進(jìn)行熒光強(qiáng)度計(jì)算和統(tǒng)計(jì)分析。
3、qPCR方法檢測(cè)不同衰老模型的人類間充質(zhì)干細(xì)胞中端粒長(zhǎng)度
以步驟2中不同衰老模型的人類間充質(zhì)干細(xì)胞為供試細(xì)胞。分別從供試細(xì)胞中提取基因組DNA,通過實(shí)時(shí)定量PCR的方法檢測(cè)供試細(xì)胞中端粒長(zhǎng)度。其中,以36B4為端粒長(zhǎng)度檢測(cè)的對(duì)照基因。引物序列如下:
Tel-F:5’-GGTTTTTGAGGGTGAGGGTGAGGGTGAGGGTGAGGGT-3’;
Tel-R:5’-TCCCGACTATCCCTATCCCTATCCCTATCCCTATCCCTA-3’;
36B4u:5’-CAGCAAGTGGGAAGGTGTAATCC-3’;
36B4d:5’-CCCATTCTATCATCAACGGGTACAA-3’。
結(jié)果表明,利用識(shí)別端粒的TTALE能夠在不同的細(xì)胞衰老模型中可視化標(biāo)記基因組中端粒,并能夠反映出衰老細(xì)胞中標(biāo)記的端粒的熒光強(qiáng)度顯著降低(圖6B,6F),該結(jié)果與利用qPCR方法檢測(cè)的衰老細(xì)胞中端粒長(zhǎng)度縮短的結(jié)果一致(圖6E),說明TTALE技術(shù)能夠精確地可視化標(biāo)記基因組中端粒隨細(xì)胞衰老進(jìn)程的動(dòng)態(tài)變化。
實(shí)施例4、識(shí)別端粒的TTALE在體外和動(dòng)物體內(nèi)可視化標(biāo)記基因組中端粒中的應(yīng)用
一、識(shí)別端粒的TTALE在體外可視化標(biāo)記基因組中端粒中的應(yīng)用
1、EGFP融合表達(dá)的識(shí)別端粒的TTALE慢病毒載體質(zhì)粒的制備及包裝
將EGFP融合表達(dá)的識(shí)別端粒的TTALE的編碼基因序列插入pLE4載體的限制性內(nèi)切酶酶切位點(diǎn)MluI和SalI之間,且保持pLE4載體的其他序列不變,得到慢病毒載體質(zhì)粒pLE4-EGFP-TTALE(其核苷酸序列如序列13所示)。然后將慢病毒載體質(zhì)粒pLE4-EGFP-TTALE在HEK293T(美國(guó)ATCC,貨號(hào):CRL-3216)細(xì)胞中進(jìn)行慢病毒的病毒包裝,慢病毒包裝質(zhì)粒購(gòu)自Addgene,貨號(hào)如下:psPAX(12260),pMD2.G(12259)。
將慢病毒載體質(zhì)粒pLE4-EGFP-TTALE與包裝質(zhì)粒psPAX和pMD.2G共轉(zhuǎn)染至HEK293T細(xì)胞中,與轉(zhuǎn)染后48小時(shí)收集培養(yǎng)上清,并通過超高速離心純化慢病毒顆粒。
2、EGFP融合表達(dá)的識(shí)別端粒的慢病毒載體質(zhì)粒標(biāo)記人或小鼠細(xì)胞
利用產(chǎn)生表達(dá)EGFP融合的識(shí)別端粒的TTALE的慢病毒感染人U2OS細(xì)胞(美國(guó)ATCC,貨號(hào):HTB-96)和小鼠OP9細(xì)胞(美國(guó)ATCC,貨號(hào):CRL-2749),感染24-72小時(shí)后,用4%多聚甲醛固定細(xì)胞,然后利用熒光顯微鏡進(jìn)行觀察。
結(jié)果表明,利用表達(dá)EGFP融合的識(shí)別端粒的TTALE的慢病毒載體能夠在體外可視化標(biāo)記人和小鼠細(xì)胞基因組中端粒(圖7B,7C)。
二、EGFP融合表達(dá)的識(shí)別端粒的TTALE在動(dòng)物體內(nèi)可視化標(biāo)記基因組中端粒中的應(yīng)用
用Opti-MEM(ThermoFisher公司,貨號(hào):51985042)將步驟一中制備的表達(dá)EGFP融合的識(shí)別端粒的TTALE的慢病毒載體稀釋為108病毒/微升劑量,分別在小鼠的脛骨前肌、肝臟和大腦海馬區(qū)注射5微升病毒液(圖7D),注射7-10天后,分離上述小鼠組織,用4%多聚甲醛固定后進(jìn)行冰凍切片,分別用WGA抗體(肌肉)(ThermoFisher公司,貨號(hào):W32464)、ALB抗體(肝臟)(Abcam公司,貨號(hào):ab8940)和NeuN抗體(大腦海馬區(qū))(Abcam公司,貨號(hào):ab177487)進(jìn)行免疫熒光染色,最后用熒光顯微鏡觀察。免疫熒光染色的具體步驟如下:用4%多聚甲醛(北京鼎國(guó)昌盛生物技術(shù)有限責(zé)任公司,貨號(hào):AR-0211)室溫固定切片10分鐘,然后用含0.4%TritonX100(美國(guó)Sigma公司,貨號(hào)為T9284)的PBS室通透30分鐘,用一抗稀釋液(含有10%驢血清的PBS)室溫封閉45分鐘,用一抗稀釋液配制的WGA抗體(肌肉)(ThermoFisher公司,貨號(hào):W32464)、ALB抗體(肝臟)(Abcam公司,貨號(hào):ab8940)和NeuN抗體(大腦海馬區(qū))(Abcam公司,貨號(hào):ab177487)分別4度孵育過夜,用PBS室溫清洗3次,每次10分鐘,分別用ALEXA-488標(biāo)記的IgG室溫孵育1小時(shí),用PBS室溫清洗3次,每次10分鐘,用1:2000的Hoechst標(biāo)記細(xì)胞核,最后利用熒光顯微鏡進(jìn)行觀察,并利用ImageJ軟件進(jìn)行熒光強(qiáng)度計(jì)算和統(tǒng)計(jì)分析。
結(jié)果表明:利用表達(dá)EGFP融合的識(shí)別端粒的TTALE的慢病毒載體能夠在小鼠不同組織內(nèi)實(shí)現(xiàn)可視化標(biāo)記基因組中端粒(圖7E)。
序列表
<110>中國(guó)科學(xué)院生物物理研究所
<120>一種可視化標(biāo)記基因組位點(diǎn)的方法
<160>17
<210>1
<211>7746bp
<212>DNA
<213>人工序列
<220>
<223>
<400>1
ctcatgacca aaatccctta acgtgagtta cgcgcgcgtc gttccactga gcgtcagacc 60
ccgtagaaaa gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct 120
tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa 180
ctctttttcc gaaggtaact ggcttcagca gagcgcagat accaaatact gttcttctag 240
tgtagccgta gttagcccac cacttcaaga actctgtagc accgcctaca tacctcgctc 300
tgctaatcct gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg 360
actcaagacg atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca 420
cacagcccag cttggagcga acgacctaca ccgaactgag atacctacag cgtgagctat 480
gagaaagcgc cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg 540
tcggaacagg agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc 600
ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc 660
ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc 720
cttttgctca catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg 780
cctttgagtg agctgatacc gctcgccgca gccgaacgac cgagcgcagc gagtcagtga 840
gcgaggaagc ggaaggcgag agtagggaac tgccaggcat caaactaagc agaaggcccc 900
tgacggatgg cctttttgcg tttctacaaa ctctttctgt gttgtaaaac gacggccagt 960
cttaagctcg ggccccctgg gcggttctga taacgagtaa tcgttaatcc gcaaataacg 1020
taaaaacccg cttcggcggg tttttttatg gggggagttt agggaaagag catttgtcag 1080
aatatttaag ggcgcctgtc actttgcttg atatatgaga attatttaac cttataaatg 1140
agaaaaaagc aacgcacttt aaataagata cgttgctttt tcgattgatg aacacctata 1200
attaaactat tcatctatta tttatgattt tttgtatata caatatttct agtttgttaa 1260
agagaattaa gaaaataaat ctcgaaaata ataaagggaa aatcagtttt tgatatcaaa 1320
attatacatg tcaacgataa tacaaaatat aatacaaact ataagatgtt atcagtattt 1380
attatcattt agaataaatt ttgtgtcgcc cttaattgtg agcggataac aattacgagc 1440
ttcatgcaca gtggcgttga cattgattat tgactagtta ttaatagtaa tcaattacgg 1500
ggtcattagt tcatagccca tatatggagt tccgcgttac ataacttacg gtaaatggcc 1560
cgcctggctg accgcccaac gacccccgcc cattgacgtc aataatgacg tatgttccca 1620
tagtaacgcc aatagggact ttccattgac gtcaatgggt ggagtattta cggtaaactg 1680
cccacttggc agtacatcaa gtgtatcata tgccaagtac gccccctatt gacgtcaatg 1740
acggtaaatg gcccgcctgg cattatgccc agtacatgac cttatgggac tttcctactt 1800
ggcagtacat ctacgtatta gtcatcgcta ttaccatggt gatgcggttt tggcagtaca 1860
tcaatgggcg tggatagcgg tttgactcac ggggatttcc aagtctccac cccattgacg 1920
tcaatgggag tttgttttgg caccaaaatc aacgggactt tccaaaatgt cgtaacaact 1980
ccgccccatt gacgcaaatg ggcggtaggc gtgtacggtg ggaggtctat ataagcagag 2040
ctctctggct aactagagaa cccactgctt actggcttat cgaaattaat acgactcact 2100
ataggggcca ccatggacta taaggaccac gacggagact acaaggatca tgatattgat 2160
tacaaagacg atgacgataa gatggcccca aagaagaagc ggaaggtcgg tatccacgga 2220
gtcccagcag ccgtagattt gagaactttg ggatattcac agcagcagca ggaaaagatc 2280
aagcccaaag tgaggtcgac agtcgcgcag catcacgaag cgctggtggg tcatgggttt 2340
acacatgccc acatcgtagc cttgtcgcag caccctgcag cccttggcac ggtcgccgtc 2400
aagtaccagg acatgattgc ggcgttgccg gaagccacac atgaggcgat cgtcggtgtg 2460
gggaaacagt ggagcggagc ccgagcgctt gaggccctgt tgacggtcgc gggagagctg 2520
agagggcctc cccttcagct ggacacgggc cagttgctga agatcgcgaa gcggggagga 2580
gtcacggcgg tcgaggcggt gcacgcgtgg cgcaatgcgc tcacgggagc acccctcaac 2640
ctgaccccag agcaggtcgt ggcaattgcg agcaacatcg ggggaaagca ggcactcgaa 2700
accgtccaga ggttgctgcc tgtgctgtgc caagcgcacg gacttacgcc agagcaggtc 2760
gtggcaattg cgagcaacat cgggggaaag caggcactcg aaaccgtcca gaggttgctg 2820
cctgtgctgt gccaagcgca cggactaacc ccagagcagg tcgtggcaat tgcgagccat 2880
gacgggggaa agcaggcact cgaaaccgtc cagaggttgc tgcctgtgct gtgccaagcg 2940
cacgggttga ccccagagca ggtcgtggca attgcgagcc atgacggggg aaagcaggca 3000
ctcgaaaccg tccagaggtt gctgcctgtg ctgtgccaag cgcacggcct gaccccagag 3060
caggtcgtgg caattgcgag ccatgacggg ggaaagcagg cactcgaaac cgtccagagg 3120
ttgctgcctg tgctgtgcca agcgcacgga ctgacaccag agcaggtcgt ggcaattgcg 3180
agcaacggag ggggaaagca ggcactcgaa accgtccaga ggttgctgcc tgtgctgtgc 3240
caagcgcacg gacttacacc cgaacaagtc gtggcaattg cgagcaacat cgggggaaag 3300
caggcactcg aaaccgtcca gaggttgctg cctgtgctgt gccaagcgca cggacttacg 3360
ccagagcagg tcgtggcaat tgcgagcaac atcgggggaa agcaggcact cgaaaccgtc 3420
cagaggttgc tgcctgtgct gtgccaagcg cacggactaa ccccagagca ggtcgtggca 3480
attgcgagcc atgacggggg aaagcaggca ctcgaaaccg tccagaggtt gctgcctgtg 3540
ctgtgccaag cgcacgggtt gaccccagag caggtcgtgg caattgcgag ccatgacggg 3600
ggaaagcagg cactcgaaac cgtccagagg ttgctgcctg tgctgtgcca agcgcacggc 3660
ctgaccccag agcaggtcgt ggcaattgcg agccatgacg ggggaaagca ggcactcgaa 3720
accgtccaga ggttgctgcc tgtgctgtgc caagcgcacg gactgacacc agagcaggtc 3780
gtggcaattg cgagcaacgg agggggaaag caggcactcg aaaccgtcca gaggttgctg 3840
cctgtgctgt gccaagcgca cggcctcacc ccagagcagg tcgtggcaat tgcgagcaac 3900
atcgggggaa agcaggcact cgaaaccgtc cagaggttgc tgcctgtgct gtgccaagcg 3960
cacggactta cgccagagca ggtcgtggca attgcgagca acatcggggg aaagcaggca 4020
ctcgaaaccg tccagaggtt gctgcctgtg ctgtgccaag cgcacggact aaccccagag 4080
caggtcgtgg caattgcgag ccatgacggg ggaaagcagg cactcgaaac cgtccagagg 4140
ttgctgcctg tgctgtgcca agcgcacggg ttgaccccag agcaggtcgt ggcaattgcg 4200
agccatgacg ggggaaagca ggcactcgaa accgtccaga ggttgctgcc tgtgctgtgc 4260
caagcgcacg gcctgacccc agagcaggtc gtggcaattg cgagccatga cgggggaaag 4320
caggcactcg aaaccgtcca gaggttgctg cctgtgctgt gccaagcgca cggactgaca 4380
ccagagcagg tcgtggcaat tgcgagcaac ggagggggaa agcaggcact cgaaaccgtc 4440
cagaggttgc tgcctgtgct gtgccaagcg cacggactca cgcctgagca ggtagtggct 4500
attgcatcca acatcggggg cagacccgca ctggagtcaa tcgtggccca gctttcgagg 4560
ccggaccccg cgctggccgc actcactaat gatcatcttg tagcgctggc ctgcctcggc 4620
ggacgacccg ccttggatgc ggtgaagaag gggctcccgc acgcgcctgc attgattaag 4680
cggaccaaca gaaggattcc cgagaggaca tcacatcgag tggcaagtta acggcgcgcc 4740
ggtaccctcg agaaaatcag cctcgactgt gccttctagt tgccagccat ctgttgtttg 4800
cccctccccc gtgccttcct tgaccctgga aggtgccact cccactgtcc tttcctaata 4860
aaatgaggaa attgcatcac aacactcaac cctatctcgg tctattcttt tgatttataa 4920
gggattttgc cgatttcggc ctattggtta aaaaatgagc tgatttaaca aaaatttaac 4980
gcgaattaat tctgtggaat gtgtgtcagt tagggtgtgg aaagtcccca ggctccccag 5040
caggcagaag tatgcaaagc atgcatctca attagtcagc aaccaggtgt ggaaagtccc 5100
caggctcccc agcaggcaga agtatgcaaa gcatgcatct caattagtca gcaaccatag 5160
tcccgcccct aactccgccc atcccgcccc taactccgcc cagttccgcc cattctccgc 5220
cccatggctg actaattttt tttatttatg cagaggccga ggccgcctct gcctctgagc 5280
tattccagaa gtagtgagga ggcttttttg gaggcctagg cttttgcaaa aagctcccgg 5340
gagcttgtat atccattttc ggatctgatc agcacgtgat gaaaaagcct gaactcaccg 5400
cgacgtctgt cgagaagttt ctgatcgaaa agttcgacag cgtctccgac ctgatgcagc 5460
tctcggaggg cgaagaatct cgtgctttca gcttcgatgt aggagggcgt ggatatgtcc 5520
tgcgggtaaa tagctgcgcc gatggtttct acaaagatcg ttatgtttat cggcactttg 5580
catcggccgc gctcccgatt ccggaagtgc ttgacattgg ggaattcagc gagagcctga 5640
cctattgcat ctcccgccgt gcacagggtg tcacgttgca agacctgcct gaaaccgaac 5700
tgcccgctgt tctgcagccg gtcgcggagg ccatggatgc gatcgctgcg gccgatctta 5760
gccagacgag cgggttcggc ccattcggac cgcaaggaat cggtcaatac actacatggc 5820
gtgatttcat atgcgcgatt gctgatcccc atgtgtatca ctggcaaact gtgatggacg 5880
acaccgtcag tgcgtccgtc gcgcaggctc tcgatgagct gatgctttgg gccgaggact 5940
gccccgaagt ccggcacctc gtgcacgcgg atttcggctc caacaatgtc ctgacggaca 6000
atggccgcat aacagcggtc attgactgga gcgaggcgat gttcggggat tcccaatacg 6060
aggtcgccaa catcttcttc tggaggccgt ggttggcttg tatggagcag cagacgcgct 6120
acttcgagcg gaggcatccg gagcttgcag gatcgccgcg gctccgggcg tatatgctcc 6180
gcattggtct tgaccaactc tatcagagct tggttgacgg caatttcgat gatgcagctt 6240
gggcgcaggg tcgatgcgac gcaatcgtcc gatccggagc cgggactgtc gggcgtacac 6300
aaatcgcccg cagaagcgcg gccgtctgga ccgatggctg tgtagaagta ctcgccgata 6360
gtggaaaccg acgccccagc actcgtccga gggcaaagga atagcacgtg ctacgagatt 6420
tcgattccac cgccgccttc tatgaaaggt tgggcttcgg aatcgttttc cgggacgccg 6480
gctggatgat cctccagcgc ggggatctca tgctggagtt cttcgcccac cccaacttgt 6540
ttattgcagc ttataatggt tacaaataaa gcaatagcat cacaaatttc acaaataaag 6600
catttttttc actgcattct agttgtggtt tgtccaaact catcaatgta tcttatcatg 6660
tctgtatacc gtcgacctct agctagagct tggcgtaatc atggtcatta ccaatgctta 6720
atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt tgcctgactc 6780
cccgtcgtgt agataactac gatacgggag ggcttaccat ctggccccag cgctgcgatg 6840
ataccgcgag aaccacgctc accggctccg gatttatcag caataaacca gccagccgga 6900
agggccgagc gcagaagtgg tcctgcaact ttatccgcct ccatccagtc tattaattgt 6960
tgccgggaag ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt tgttgccatc 7020
gctacaggca tcgtggtgtc acgctcgtcg tttggtatgg cttcattcag ctccggttcc 7080
caacgatcaa ggcgagttac atgatccccc atgttgtgca aaaaagcggt tagctccttc 7140
ggtcctccga tcgttgtcag aagtaagttg gccgcagtgt tatcactcat ggttatggca 7200
gcactgcata attctcttac tgtcatgcca tccgtaagat gcttttctgt gactggtgag 7260
tactcaacca agtcattctg agaatagtgt atgcggcgac cgagttgctc ttgcccggcg 7320
tcaatacggg ataataccgc gccacatagc agaactttaa aagtgctcat cattggaaaa 7380
cgttcttcgg ggcgaaaact ctcaaggatc ttaccgctgt tgagatccag ttcgatgtaa 7440
cccactcgtg cacccaactg atcttcagca tcttttactt tcaccagcgt ttctgggtga 7500
gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataa gggcgacacg gaaatgttga 7560
atactcatat tcttcctttt tcaatattat tgaagcattt atcagggtta ttgtctcatg 7620
agcggataca tatttgaatg tatttagaaa aataaacaaa taggggtcag tgttacaacc 7680
aattaaccaa ttctgaacat tatcgcgagc ccatttatac ctgaatatgg ctcataacac 7740
cccttg 7746
<210>2
<211>2613bp
<212>DNA
<213>人工序列
<220>
<223>
<400>2
atggactata aggaccacga cggagactac aaggatcatg atattgatta caaagacgat 60
gacgataaga tggccccaaa gaagaagcgg aaggtcggta tccacggagt cccagcagcc 120
gtagatttga gaactttggg atattcacag cagcagcagg aaaagatcaa gcccaaagtg 180
aggtcgacag tcgcgcagca tcacgaagcg ctggtgggtc atgggtttac acatgcccac 240
atcgtagcct tgtcgcagca ccctgcagcc cttggcacgg tcgccgtcaa gtaccaggac 300
atgattgcgg cgttgccgga agccacacat gaggcgatcg tcggtgtggg gaaacagtgg 360
agcggagccc gagcgcttga ggccctgttg acggtcgcgg gagagctgag agggcctccc 420
cttcagctgg acacgggcca gttgctgaag atcgcgaagc ggggaggagt cacggcggtc 480
gaggcggtgc acgcgtggcg caatgcgctc acgggagcac ccctcaacct gaccccagag 540
caggtcgtgg caattgcgag ccatgacggg ggaaagcagg cactcgaaac cgtccagagg 600
ttgctgcctg tgctgtgcca agcgcacgga cttacgccag agcaggtcgt ggcaattgcg 660
agccatgacg ggggaaagca ggcactcgaa accgtccaga ggttgctgcc tgtgctgtgc 720
caagcgcacg gactaacccc agagcaggtc gtggcaattg cgagcaacat cgggggaaag 780
caggcactcg aaaccgtcca gaggttgctg cctgtgctgt gccaagcgca cgggttgacc 840
ccagagcagg tcgtggcaat tgcgagcaac ggagggggaa agcaggcact cgaaaccgtc 900
cagaggttgc tgcctgtgct gtgccaagcg cacggcctga ccccagagca ggtcgtggca 960
attgcgagca acggaggggg aaagcaggca ctcgaaaccg tccagaggtt gctgcctgtg 1020
ctgtgccaag cgcacggact gacaccagag caggtcgtgg caattgcgag ccatgacggg 1080
ggaaagcagg cactcgaaac cgtccagagg ttgctgcctg tgctgtgcca agcgcacgga 1140
cttacacccg aacaagtcgt ggcaattgcg agccatgacg ggggaaagca ggcactcgaa 1200
accgtccaga ggttgctgcc tgtgctgtgc caagcgcacg gacttacgcc agagcaggtc 1260
gtggcaattg cgagcaacat cgggggaaag caggcactcg aaaccgtcca gaggttgctg 1320
cctgtgctgt gccaagcgca cggactaacc ccagagcagg tcgtggcaat tgcgagcaac 1380
ggagggggaa agcaggcact cgaaaccgtc cagaggttgc tgcctgtgct gtgccaagcg 1440
cacgggttga ccccagagca ggtcgtggca attgcgagca acggaggggg aaagcaggca 1500
ctcgaaaccg tccagaggtt gctgcctgtg ctgtgccaag cgcacggcct gaccccagag 1560
caggtcgtgg caattgcgag ccatgacggg ggaaagcagg cactcgaaac cgtccagagg 1620
ttgctgcctg tgctgtgcca agcgcacgga ctgacaccag agcaggtcgt ggcaattgcg 1680
agccatgacg ggggaaagca ggcactcgaa accgtccaga ggttgctgcc tgtgctgtgc 1740
caagcgcacg gcctcacccc agagcaggtc gtggcaattg cgagcaacat cgggggaaag 1800
caggcactcg aaaccgtcca gaggttgctg cctgtgctgt gccaagcgca cggacttacg 1860
ccagagcagg tcgtggcaat tgcgagcaac ggagggggaa agcaggcact cgaaaccgtc 1920
cagaggttgc tgcctgtgct gtgccaagcg cacggactaa ccccagagca ggtcgtggca 1980
attgcgagca acggaggggg aaagcaggca ctcgaaaccg tccagaggtt gctgcctgtg 2040
ctgtgccaag cgcacgggtt gaccccagag caggtcgtgg caattgcgag ccatggaggg 2100
ggaaagcagg cactcgaaac cgtccagagg ttgctgcctg tgctgtgcca agcgcacggc 2160
ctgaccccag agcaggtcgt ggcaattgcg agccatggag ggggaaagca ggcactcgaa 2220
accgtccaga ggttgctgcc tgtgctgtgc caagcgcacg gactgacacc agagcaggtc 2280
gtggcaattg cgagcaacat cgggggaaag caggcactcg aaaccgtcca gaggttgctg 2340
cctgtgctgt gccaagcgca cggactcacg cctgagcagg tagtggctat tgcatccaac 2400
ggagggggca gacccgcact ggagtcaatc gtggcccagc tttcgaggcc ggaccccgcg 2460
ctggccgcac tcactaatga tcatcttgta gcgctggcct gcctcggcgg acgacccgcc 2520
ttggatgcgg tgaagaaggg gctcccgcac gcgcctgcat tgattaagcg gaccaacaga 2580
aggattcccg agaggacatc acatcgagtg gca 2613
<210>3
<211>318bp
<212>DNA
<213>人工序列
<220>
<223>
<400>3
atggtgaagc agatcgagag caagactgct tttcaggaag ccttggacgc tgcaggtgat 60
aaacttgtag tagttgactt ctcagccacg tggtgtgggc cttgcaaaat gatcaagcct 120
ttctttcatt ccctctctga aaagtattcc aacgtgatat tccttgaagt agatgtggat 180
gactgtcagg atgttgcttc agagtgtgaa gtcaaatgca tgccaacatt ccagtttttt 240
aagaagggac aaaaggtggg tgaattttct ggagccaata aggaaaagct tgaagccacc 300
attaatgaat tagtctaa 318
<210>4
<211>979
<212>PRT
<213>人工序列
<220>
<223>
<400>4
Met Asp Tyr Lys Asp His Asp Gly Asp Tyr Lys Asp His Asp Ile Asp
1 5 10 15
Tyr Lys Asp Asp Asp Asp Lys Met Ala Pro Lys Lys Lys Arg Lys Val
20 25 30
Gly Ile His Gly Val Pro Ala Ala Val Asp Leu Arg Thr Leu Gly Tyr
35 40 45
Ser Gln Gln Gln Gln Glu Lys Ile Lys Pro Lys Val Arg Ser Thr Val
50 55 60
Ala Gln His His Glu Ala Leu Val Gly His Gly Phe Thr His Ala His
65 70 75 80
Ile Val Ala Leu Ser Gln His Pro Ala Ala Leu Gly Thr Val Ala Val
85 90 95
Lys Tyr Gln Asp Met Ile Ala Ala Leu Pro Glu Ala Thr His Glu Ala
100 105 110
Ile Val Gly Val Gly Lys Gln Trp Ser Gly Ala Arg Ala Leu Glu Ala
115 120 125
Leu Leu Thr Val Ala Gly Glu Leu Arg Gly Pro Pro Leu Gln Leu Asp
130 135 140
Thr Gly Gln Leu Leu Lys Ile Ala Lys Arg Gly Gly Val Thr Ala Val
145 150 155 160
Glu Ala Val His Ala Trp Arg Asn Ala Leu Thr Gly Ala Pro Leu Asn
165 170 175
Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys
180 185 190
Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala
195 200 205
His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Ile Gly
210 215 220
Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys
225 230 235 240
Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His
245 250 255
Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val
260 265 270
Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala
275 280 285
Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu
290 295 300
Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala
305 310 315 320
Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg
325 330 335
Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val
340 345 350
Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val
355 360 365
Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu
370 375 380
Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu
385 390 395 400
Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr
405 410 415
Pro Glu Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala
420 425 430
Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly
435 440 445
Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys
450 455 460
Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala
465 470 475 480
His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly
485 490 495
Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys
500 505 510
Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His
515 520 525
Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val
530 535 540
Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala
545 550 555 560
Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu
565 570 575
Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala
580 585 590
Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg
595 600 605
Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val
610 615 620
Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val
625 630 635 640
Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu
645 650 655
Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu
660 665 670
Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr
675 680 685
Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala
690 695 700
Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly
705 710 715 720
Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys
725 730 735
Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala
740 745 750
His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Gly Gly
755 760 765
Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys
770 775 780
Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn
785 790 795 800
Ile Gly Gly Arg Pro Ala Leu Glu Ser Ile Val Ala Gln Leu Ser Arg
805 810 815
Pro Asp Pro Ala Leu Ala Ala Leu Thr Asn Asp His Leu Val Ala Leu
820 825 830
Ala Cys Leu Gly Gly Arg Pro Ala Leu Asp Ala Val Lys Lys Gly Leu
835 840 845
Pro His Ala Pro Ala Leu Ile Lys Arg Thr Asn Arg Arg Ile Pro Glu
850 855 860
Arg Thr Ser His Arg Val Ala Ser Tyr Gln Gly Met Val Lys Gln Ile
865 870 875 880
Glu Ser Lys Thr Ala Phe Gln Glu Ala Leu Asp Ala Ala Gly Asp Lys
885 890 895
Leu Val Val Val Asp Phe Ser Ala Thr Trp Cys Gly Pro Cys Lys Met
900 905 910
Ile Lys Pro Phe Phe His Ser Leu Ser Glu Lys Tyr Ser Asn Val Ile
915 920 925
Phe Leu Glu Val Asp Val Asp Asp Cys Gln Asp Val Ala Ser Glu Cys
930 935 940
Glu Val Lys Cys Met Pro Thr Phe Gln Phe Phe Lys Lys Gly Gln Lys
945 950 955 960
Val Gly Glu Phe Ser Gly Ala Asn Lys Glu Lys Leu Glu Ala Thr Ile
965 970 975
Asn Glu Leu
<210>5
<211>972
<212>PRT
<213>人工序列
<220>
<223>
<400>5
Met Asp Tyr Lys Asp His Asp Gly Asp Tyr Lys Asp His Asp Ile Asp
1 5 10 15
Tyr Lys Asp Asp Asp Asp Lys Met Ala Pro Lys Lys Lys Arg Lys Val
20 25 30
Gly Ile His Gly Val Pro Ala Ala Val Asp Leu Arg Thr Leu Gly Tyr
35 40 45
Ser Gln Gln Gln Gln Glu Lys Ile Lys Pro Lys Val Arg Ser Thr Val
50 55 60
Ala Gln His His Glu Ala Leu Val Gly His Gly Phe Thr His Ala His
65 70 75 80
Ile Val Ala Leu Ser Gln His Pro Ala Ala Leu Gly Thr Val Ala Val
85 90 95
Lys Tyr Gln Asp Met Ile Ala Ala Leu Pro Glu Ala Thr His Glu Ala
100 105 110
Ile Val Gly Val Gly Lys Gln Trp Ser Gly Ala Arg Ala Leu Glu Ala
115 120 125
Leu Leu Thr Val Ala Gly Glu Leu Arg Gly Pro Pro Leu Gln Leu Asp
130 135 140
Thr Gly Gln Leu Leu Lys Ile Ala Lys Arg Gly Gly Val Thr Ala Val
145 150 155 160
Glu Ala Val His Ala Trp Arg Asn Ala Leu Thr Gly Ala Pro Leu Asn
165 170 175
Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys
180 185 190
Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala
195 200 205
His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly
210 215 220
Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys
225 230 235 240
Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn
245 250 255
Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val
260 265 270
Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala
275 280 285
Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu
290 295 300
Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala
305 310 315 320
Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg
325 330 335
Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val
340 345 350
Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val
355 360 365
Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu
370 375 380
Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu
385 390 395 400
Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr
405 410 415
Pro Glu Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala
420 425 430
Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly
435 440 445
Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys
450 455 460
Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala
465 470 475 480
His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Gly Gly
485 490 495
Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys
500 505 510
Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His
515 520 525
Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val
530 535 540
Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala
545 550 555 560
Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu
565 570 575
Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala
580 585 590
Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg
595 600 605
Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val
610 615 620
Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val
625 630 635 640
Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu
645 650 655
Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu
660 665 670
Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr
675 680 685
Pro Glu Gln Val Val Ala Ile Ala Ser His Gly Gly Gly Lys Gln Ala
690 695 700
Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly
705 710 715 720
Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Gly Gly Gly Lys
725 730 735
Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala
740 745 750
His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Ile Gly
755 760 765
Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys
770 775 780
Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn
785 790 795 800
Gly Gly Gly Arg Pro Ala Leu Glu Ser Ile Val Ala Gln Leu Ser Arg
805 810 815
Pro Asp Pro Ala Leu Ala Ala Leu Thr Asn Asp His Leu Val Ala Leu
820 825 830
Ala Cys Leu Gly Gly Arg Pro Ala Leu Asp Ala Val Lys Lys Gly Leu
835 840 845
Pro His Ala Pro Ala Leu Ile Lys Arg Thr Asn Arg Arg Ile Pro Glu
850 855 860
Arg Thr Ser His Arg Val Ala Val Ile Arg Glu Trp Ser Arg Ser Arg
865 870 875 880
Ala Arg Leu Leu Phe Arg Lys Pro Trp Thr Leu Gln Val Ile Asn Leu
885 890 895
Leu Thr Ser Gln Pro Arg Gly Val Gly Leu Ala Lys Ser Ser Leu Ser
900 905 910
Phe Ile Pro Ser Leu Lys Ser Ile Pro Thr Tyr Ser Leu Lys Met Trp
915 920 925
Met Thr Val Arg Met Leu Leu Gln Ser Val Lys Ser Asn Ala Cys Gln
930 935 940
His Ser Ser Phe Leu Arg Arg Asp Lys Arg Trp Val Asn Phe Leu Glu
945 950 955 960
Pro Ile Arg Lys Ser Leu Lys Pro Pro Leu Met Asn
965 970
<210>6
<211>711bp
<212>DNA
<213>人工序列
<220>
<223>
<400>6
atggtgagca agggcgagga ggataacatg gccatcatca aggagttcat gcgcttcaag 60
gtgcacatgg agggctccgt gaacggccac gagttcgaga tcgagggcga gggcgagggc 120
cgcccctacg agggcaccca gaccgccaag ctgaaggtga ccaagggtgg ccccctgccc 180
ttcgcctggg acatcctgtc ccctcagttc atgtacggct ccaaggccta cgtgaagcac 240
cccgccgaca tccccgacta cttgaagctg tccttccccg agggcttcaa gtgggagcgc 300
gtgatgaact tcgaggacgg cggcgtggtg accgtgaccc aggactcctc cctgcaggac 360
ggcgagttca tctacaaggt gaagctgcgc ggcaccaact tcccctccga cggccccgta 420
atgcagaaga agaccatggg ctgggaggcc tcctccgagc ggatgtaccc cgaggacggc 480
gccctgaagg gcgagatcaa gcagaggctg aagctgaagg acggcggcca ctacgacgct 540
gaggtcaaga ccacctacaa ggccaagaag cccgtgcagc tgcccggcgc ctacaacgtc 600
aacatcaagt tggacatcac ctcccacaac gaggactaca ccatcgtgga acagtacgaa 660
cgcgccgagg gccgccactc caccggcggc atggacgagc tgtacaagta a 711
<210>7
<211>1229
<212>PRT
<213>人工序列
<220>
<223>
<400>7
Met Asp Tyr Lys Asp His Asp Gly Asp Tyr Lys Asp His Asp Ile Asp
1 5 10 15
Tyr Lys Asp Asp Asp Asp Lys Met Ala Pro Lys Lys Lys Arg Lys Val
20 25 30
Gly Ile His Gly Val Pro Ala Ala Val Asp Leu Arg Thr Leu Gly Tyr
35 40 45
Ser Gln Gln Gln Gln Glu Lys Ile Lys Pro Lys Val Arg Ser Thr Val
50 55 60
Ala Gln His His Glu Ala Leu Val Gly His Gly Phe Thr His Ala His
65 70 75 80
Ile Val Ala Leu Ser Gln His Pro Ala Ala Leu Gly Thr Val Ala Val
85 90 95
Lys Tyr Gln Asp Met Ile Ala Ala Leu Pro Glu Ala Thr His Glu Ala
100 105 110
Ile Val Gly Val Gly Lys Gln Trp Ser Gly Ala Arg Ala Leu Glu Ala
115 120 125
Leu Leu Thr Val Ala Gly Glu Leu Arg Gly Pro Pro Leu Gln Leu Asp
130 135 140
Thr Gly Gln Leu Leu Lys Ile Ala Lys Arg Gly Gly Val Thr Ala Val
145 150 155 160
Glu Ala Val His Ala Trp Arg Asn Ala Leu Thr Gly Ala Pro Leu Asn
165 170 175
Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys
180 185 190
Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala
195 200 205
His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Ile Gly
210 215 220
Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys
225 230 235 240
Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His
245 250 255
Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val
260 265 270
Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala
275 280 285
Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu
290 295 300
Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala
305 310 315 320
Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg
325 330 335
Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val
340 345 350
Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val
355 360 365
Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu
370 375 380
Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu
385 390 395 400
Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr
405 410 415
Pro Glu Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala
420 425 430
Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly
435 440 445
Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys
450 455 460
Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala
465 470 475 480
His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly
485 490 495
Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys
500 505 510
Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His
515 520 525
Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val
530 535 540
Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala
545 550 555 560
Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu
565 570 575
Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala
580 585 590
Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg
595 600 605
Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val
610 615 620
Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val
625 630 635 640
Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu
645 650 655
Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu
660 665 670
Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr
675 680 685
Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala
690 695 700
Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly
705 710 715 720
Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys
725 730 735
Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala
740 745 750
His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Gly Gly
755 760 765
Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys
770 775 780
Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn
785 790 795 800
Ile Gly Gly Arg Pro Ala Leu Glu Ser Ile Val Ala Gln Leu Ser Arg
805 810 815
Pro Asp Pro Ala Leu Ala Ala Leu Thr Asn Asp His Leu Val Ala Leu
820 825 830
Ala Cys Leu Gly Gly Arg Pro Ala Leu Asp Ala Val Lys Lys Gly Leu
835 840 845
Pro His Ala Pro Ala Leu Ile Lys Arg Thr Asn Arg Arg Ile Pro Glu
850 855 860
Arg Thr Ser His Arg Val Ala Ser Tyr Gln Gly Met Val Ser Lys Gly
865 870 875 880
Glu Glu Asp Asn Met Ala Ile Ile Lys Glu Phe Met Arg Phe Lys Val
885 890 895
His Met Glu Gly Ser Val Asn Gly His Glu Phe Glu Ile Glu Gly Glu
900 905 910
Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr Ala Lys Leu Lys Val
915 920 925
Thr Lys Gly Gly Pro Leu Pro Phe Ala Trp Asp Ile Leu Ser Pro Gln
930 935 940
Phe Met Tyr Gly Ser Lys Ala Tyr Val Lys His Pro Ala Asp Ile Pro
945 950 955 960
Asp Tyr Leu Lys Leu Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg Val
965 970 975
Met Asn Phe Glu Asp Gly Gly Val Val Thr Val Thr Gln Asp Ser Ser
980 985 990
Leu Gln Asp Gly Glu Phe Ile Tyr Lys Val Lys Leu Arg Gly Thr Asn
995 1000 1005
Phe Pro Ser Asp Gly Pro Val Met Gln Lys Lys Thr Met Gly Trp
1010 1015 1020
Glu Ala Ser Ser Glu Arg Met Tyr Pro Glu Asp Gly Ala Leu Lys
1025 1030 1035
Gly Glu Ile Lys Gln Arg Leu Lys Leu Lys Asp Gly Gly His Tyr
1040 1045 1050
Asp Ala Glu Val Lys Thr Thr Tyr Lys Ala Lys Lys Pro Val Gln
1055 1060 1065
Leu Pro Gly Ala Tyr Asn Val Asn Ile Lys Leu Asp Ile Thr Ser
1070 1075 1080
His Asn Glu Asp Tyr Thr Ile Val Glu Gln Tyr Glu Arg Ala Glu
1085 1090 1095
Gly Arg His Ser Thr Gly Gly Met Asp Glu Leu Tyr Lys Gly Thr
1100 1105 1110
Ser Gly Leu Arg Ser Arg Ala Gln Ala Ser Asn Ser Met Val Lys
1115 1120 1125
Gln Ile Glu Ser Lys Thr Ala Phe Gln Glu Ala Leu Asp Ala Ala
1130 1135 1140
Gly Asp Lys Leu Val Val Val Asp Phe Ser Ala Thr Trp Cys Gly
1145 1150 1155
Pro Cys Lys Met Ile Lys Pro Phe Phe His Ser Leu Ser Glu Lys
1160 1165 1170
Tyr Ser Asn Val Ile Phe Leu Glu Val Asp Val Asp Asp Cys Gln
1175 1180 1185
Asp Val Ala Ser Glu Cys Glu Val Lys Cys Met Pro Thr Phe Gln
1190 1195 1200
Phe Phe Lys Lys Gly Gln Lys Val Gly Glu Phe Ser Gly Ala Asn
1205 1210 1215
Lys Glu Lys Leu Glu Ala Thr Ile Asn Glu Leu
1220 1225
<210>8
<211>1212
<212>PRT
<213>人工序列
<220>
<223>
<400>8
Met Asp Tyr Lys Asp His Asp Gly Asp Tyr Lys Asp His Asp Ile Asp
1 5 10 15
Tyr Lys Asp Asp Asp Asp Lys Met Ala Pro Lys Lys Lys Arg Lys Val
20 25 30
Gly Ile His Gly Val Pro Ala Ala Val Asp Leu Arg Thr Leu Gly Tyr
35 40 45
Ser Gln Gln Gln Gln Glu Lys Ile Lys Pro Lys Val Arg Ser Thr Val
50 55 60
Ala Gln His His Glu Ala Leu Val Gly His Gly Phe Thr His Ala His
65 70 75 80
Ile Val Ala Leu Ser Gln His Pro Ala Ala Leu Gly Thr Val Ala Val
85 90 95
Lys Tyr Gln Asp Met Ile Ala Ala Leu Pro Glu Ala Thr His Glu Ala
100 105 110
Ile Val Gly Val Gly Lys Gln Trp Ser Gly Ala Arg Ala Leu Glu Ala
115 120 125
Leu Leu Thr Val Ala Gly Glu Leu Arg Gly Pro Pro Leu Gln Leu Asp
130 135 140
Thr Gly Gln Leu Leu Lys Ile Ala Lys Arg Gly Gly Val Thr Ala Val
145 150 155 160
Glu Ala Val His Ala Trp Arg Asn Ala Leu Thr Gly Ala Pro Leu Asn
165 170 175
Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys
180 185 190
Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala
195 200 205
His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly
210 215 220
Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys
225 230 235 240
Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn
245 250 255
Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val
260 265 270
Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala
275 280 285
Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu
290 295 300
Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala
305 310 315 320
Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg
325 330 335
Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val
340 345 350
Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val
355 360 365
Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu
370 375 380
Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu
385 390 395 400
Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr
405 410 415
Pro Glu Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala
420 425 430
Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly
435 440 445
Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys
450 455 460
Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala
465 470 475 480
His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Gly Gly
485 490 495
Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys
500 505 510
Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His
515 520 525
Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val
530 535 540
Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala
545 550 555 560
Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu
565 570 575
Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala
580 585 590
Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg
595 600 605
Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val
610 615 620
Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val
625 630 635 640
Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu
645 650 655
Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu
660 665 670
Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr
675 680 685
Pro Glu Gln Val Val Ala Ile Ala Ser His Gly Gly Gly Lys Gln Ala
690 695 700
Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly
705 710 715 720
Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Gly Gly Gly Lys
725 730 735
Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala
740 745 750
His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Ile Gly
755 760 765
Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys
770 775 780
Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn
785 790 795 800
Gly Gly Gly Arg Pro Ala Leu Glu Ser Ile Val Ala Gln Leu Ser Arg
805 810 815
Pro Asp Pro Ala Leu Ala Ala Leu Thr Asn Asp His Leu Val Ala Leu
820 825 830
Ala Cys Leu Gly Gly Arg Pro Ala Leu Asp Ala Val Lys Lys Gly Leu
835 840 845
Pro His Ala Pro Ala Leu Ile Lys Arg Thr Asn Arg Arg Ile Pro Glu
850 855 860
Arg Thr Ser His Arg Val Ala Val Ile Arg Glu Trp Ala Arg Ala Arg
865 870 875 880
Ser Cys Ser Pro Gly Trp Cys Pro Ser Trp Ser Ser Trp Thr Ala Thr
885 890 895
Thr Ala Thr Ser Ser Ala Cys Pro Ala Arg Ala Arg Ala Met Pro Pro
900 905 910
Thr Ala Ser Pro Ser Ser Ser Ala Pro Pro Ala Ser Cys Pro Cys Pro
915 920 925
Gly Pro Pro Ser Pro Pro Pro Thr Ala Cys Ser Ala Ser Ala Ala Thr
930 935 940
Pro Thr Thr Ser Ser Thr Thr Ser Ser Ser Pro Pro Cys Pro Lys Ala
945 950 955 960
Thr Ser Arg Ser Ala Pro Ser Ser Ser Arg Thr Thr Ala Thr Thr Arg
965 970 975
Pro Ala Pro Arg Ser Ser Arg Ala Thr Pro Trp Thr Ala Ser Ser Arg
980 985 990
Ala Ser Thr Ser Arg Arg Thr Ala Thr Ser Trp Gly Thr Ser Trp Ser
995 1000 1005
Thr Thr Thr Thr Ala Thr Thr Ser Ile Ser Trp Pro Thr Ser Arg
1010 1015 1020
Arg Thr Ala Ser Arg Thr Ser Arg Ser Ala Thr Thr Ser Arg Thr
1025 1030 1035
Ala Ala Cys Ser Ser Pro Thr Thr Thr Ser Arg Thr Pro Pro Ser
1040 1045 1050
Ala Thr Ala Pro Cys Cys Cys Pro Thr Thr Thr Thr Ala Pro Ser
1055 1060 1065
Pro Pro Ala Lys Thr Pro Thr Arg Ser Ala Ile Thr Trp Ser Cys
1070 1075 1080
Trp Ser Ser Pro Pro Pro Gly Ser Leu Ser Ala Trp Thr Ser Cys
1085 1090 1095
Thr Arg Val Pro Pro Asp Ser Asp Leu Glu Leu Lys Leu Arg Ile
1100 1105 1110
Pro Trp Ser Arg Ser Arg Ala Arg Leu Leu Phe Arg Lys Pro Trp
1115 1120 1125
Thr Leu Gln Val Ile Asn Leu Leu Thr Ser Gln Pro Arg Gly Val
1130 1135 1140
Gly Leu Ala Lys Ser Ser Leu Ser Phe Ile Pro Ser Leu Lys Ser
1145 1150 1155
Ile Pro Thr Tyr Ser Leu Lys Met Trp Met Thr Val Arg Met Leu
1160 1165 1170
Leu Gln Ser Val Lys Ser Asn Ala Cys Gln His Ser Ser Phe Leu
1175 1180 1185
Arg Arg Asp Lys Arg Trp Val Asn Phe Leu Glu Pro Ile Arg Lys
1190 1195 1200
Ser Leu Lys Pro Pro Leu Met Asn Ser
1205 1210
<210>9
<211>2613bp
<212>DNA
<213>人工序列
<220>
<223>
<400>9
atggactata aggaccacga cggagactac aaggatcatg atattgatta caaagacgat 60
gacgataaga tggccccaaa gaagaagcgg aaggtcggta tccacggagt cccagcagcc 120
gtagatttga gaactttggg atattcacag cagcagcagg aaaagatcaa gcccaaagtg 180
aggtcgacag tcgcgcagca tcacgaagcg ctggtgggtc atgggtttac acatgcccac 240
atcgtagcct tgtcgcagca ccctgcagcc cttggcacgg tcgccgtcaa gtaccaggac 300
atgattgcgg cgttgccgga agccacacat gaggcgatcg tcggtgtggg gaaacagtgg 360
agcggagccc gagcgcttga ggccctgttg acggtcgcgg gagagctgag agggcctccc 420
cttcagctgg acacgggcca gttgctgaag atcgcgaagc ggggaggagt cacggcggtc 480
gaggcggtgc acgcgtggcg caatgcgctc acgggagcac ccctcaacct gaccccagag 540
caggtcgtgg caattgcgag caacatcggg ggaaagcagg cactcgaaac cgtccagagg 600
ttgctgcctg tgctgtgcca agcgcacgga cttacgccag agcaggtcgt ggcaattgcg 660
agccatgacg ggggaaagca ggcactcgaa accgtccaga ggttgctgcc tgtgctgtgc 720
caagcgcacg gactaacccc agagcaggtc gtggcaattg cgagccatga cgggggaaag 780
caggcactcg aaaccgtcca gaggttgctg cctgtgctgt gccaagcgca cgggttgacc 840
ccagagcagg tcgtggcaat tgcgagccat gacgggggaa agcaggcact cgaaaccgtc 900
cagaggttgc tgcctgtgct gtgccaagcg cacggcctga ccccagagca ggtcgtggca 960
attgcgagca acggaggggg aaagcaggca ctcgaaaccg tccagaggtt gctgcctgtg 1020
ctgtgccaag cgcacggact gacaccagag caggtcgtgg caattgcgag caacatcggg 1080
ggaaagcagg cactcgaaac cgtccagagg ttgctgcctg tgctgtgcca agcgcacgga 1140
cttacacccg aacaagtcgt ggcaattgcg agccatgacg ggggaaagca ggcactcgaa 1200
accgtccaga ggttgctgcc tgtgctgtgc caagcgcacg gacttacgcc agagcaggtc 1260
gtggcaattg cgagcaacgg agggggaaag caggcactcg aaaccgtcca gaggttgctg 1320
cctgtgctgt gccaagcgca cggactaacc ccagagcagg tcgtggcaat tgcgagcaac 1380
aacgggggaa agcaggcact cgaaaccgtc cagaggttgc tgcctgtgct gtgccaagcg 1440
cacgggttga ccccagagca ggtcgtggca attgcgagca acatcggggg aaagcaggca 1500
ctcgaaaccg tccagaggtt gctgcctgtg ctgtgccaag cgcacggcct gaccccagag 1560
caggtcgtgg caattgcgag caacggaggg ggaaagcagg cactcgaaac cgtccagagg 1620
ttgctgcctg tgctgtgcca agcgcacgga ctgacaccag agcaggtcgt ggcaattgcg 1680
agcaacaacg ggggaaagca ggcactcgaa accgtccaga ggttgctgcc tgtgctgtgc 1740
caagcgcacg gcctcacccc agagcaggtc gtggcaattg cgagcaacat cgggggaaag 1800
caggcactcg aaaccgtcca gaggttgctg cctgtgctgt gccaagcgca cggacttacg 1860
ccagagcagg tcgtggcaat tgcgagcaac ggagggggaa agcaggcact cgaaaccgtc 1920
cagaggttgc tgcctgtgct gtgccaagcg cacggactaa ccccagagca ggtcgtggca 1980
attgcgagca acaacggggg aaagcaggca ctcgaaaccg tccagaggtt gctgcctgtg 2040
ctgtgccaag cgcacgggtt gaccccagag caggtcgtgg caattgcgag caacggaggg 2100
ggaaagcagg cactcgaaac cgtccagagg ttgctgcctg tgctgtgcca agcgcacggc 2160
ctgaccccag agcaggtcgt ggcaattgcg agcaacaacg ggggaaagca ggcactcgaa 2220
accgtccaga ggttgctgcc tgtgctgtgc caagcgcacg gactgacacc agagcaggtc 2280
gtggcaattg cgagcaacgg agggggaaag caggcactcg aaaccgtcca gaggttgctg 2340
cctgtgctgt gccaagcgca cggactcacg cctgagcagg tagtggctat tgcatccaac 2400
ggagggggca gacccgcact ggagtcaatc gtggcccagc tttcgaggcc ggaccccgcg 2460
ctggccgcac tcactaatga tcatcttgta gcgctggcct gcctcggcgg acgacccgcc 2520
ttggatgcgg tgaagaaggg gctcccgcac gcgcctgcat tgattaagcg gaccaacaga 2580
aggattcccg agaggacatc acatcgagtg gca 2613
<210>10
<211>871bp
<212>DNA
<213>人工序列
<220>
<223>
<400>10
Met Asp Tyr Lys Asp His Asp Gly Asp Tyr Lys Asp His Asp Ile Asp
1 5 10 15
Tyr Lys Asp Asp Asp Asp Lys Met Ala Pro Lys Lys Lys Arg Lys Val
20 25 30
Gly Ile His Gly Val Pro Ala Ala Val Asp Leu Arg Thr Leu Gly Tyr
35 40 45
Ser Gln Gln Gln Gln Glu Lys Ile Lys Pro Lys Val Arg Ser Thr Val
50 55 60
Ala Gln His His Glu Ala Leu Val Gly His Gly Phe Thr His Ala His
65 70 75 80
Ile Val Ala Leu Ser Gln His Pro Ala Ala Leu Gly Thr Val Ala Val
85 90 95
Lys Tyr Gln Asp Met Ile Ala Ala Leu Pro Glu Ala Thr His Glu Ala
100 105 110
Ile Val Gly Val Gly Lys Gln Trp Ser Gly Ala Arg Ala Leu Glu Ala
115 120 125
Leu Leu Thr Val Ala Gly Glu Leu Arg Gly Pro Pro Leu Gln Leu Asp
130 135 140
Thr Gly Gln Leu Leu Lys Ile Ala Lys Arg Gly Gly Val Thr Ala Val
145 150 155 160
Glu Ala Val His Ala Trp Arg Asn Ala Leu Thr Gly Ala Pro Leu Asn
165 170 175
Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys
180 185 190
Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala
195 200 205
His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly
210 215 220
Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys
225 230 235 240
Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His
245 250 255
Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val
260 265 270
Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala
275 280 285
Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu
290 295 300
Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala
305 310 315 320
Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg
325 330 335
Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val
340 345 350
Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val
355 360 365
Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu
370 375 380
Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu
385 390 395 400
Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr
405 410 415
Pro Glu Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala
420 425 430
Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly
435 440 445
Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys
450 455 460
Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala
465 470 475 480
His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Ile Gly
485 490 495
Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys
500 505 510
Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn
515 520 525
Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val
530 535 540
Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala
545 550 555 560
Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu
565 570 575
Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala
580 585 590
Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg
595 600 605
Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val
610 615 620
Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val
625 630 635 640
Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu
645 650 655
Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu
660 665 670
Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr
675 680 685
Pro Glu Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala
690 695 700
Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly
705 710 715 720
Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys
725 730 735
Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala
740 745 750
His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Gly Gly
755 760 765
Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys
770 775 780
Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn
785 790 795 800
Gly Gly Gly Arg Pro Ala Leu Glu Ser Ile Val Ala Gln Leu Ser Arg
805 810 815
Pro Asp Pro Ala Leu Ala Ala Leu Thr Asn Asp His Leu Val Ala Leu
820 825 830
Ala Cys Leu Gly Gly Arg Pro Ala Leu Asp Ala Val Lys Lys Gly Leu
835 840 845
Pro His Ala Pro Ala Leu Ile Lys Arg Thr Asn Arg Arg Ile Pro Glu
850 855 860
Arg Thr Ser His Arg Val Ala
865 870
<210>11
<211>2613bp
<212>DNA
<213>人工序列
<220>
<223>
<400>11
atggactata aggaccacga cggagactac aaggatcatg atattgatta caaagacgat 60
gacgataaga tggccccaaa gaagaagcgg aaggtcggta tccacggagt cccagcagcc 120
gtagatttga gaactttggg atattcacag cagcagcagg aaaagatcaa gcccaaagtg 180
aggtcgacag tcgcgcagca tcacgaagcg ctggtgggtc atgggtttac acatgcccac 240
atcgtagcct tgtcgcagca ccctgcagcc cttggcacgg tcgccgtcaa gtaccaggac 300
atgattgcgg cgttgccgga agccacacat gaggcgatcg tcggtgtggg gaaacagtgg 360
agcggagccc gagcgcttga ggccctgttg acggtcgcgg gagagctgag agggcctccc 420
cttcagctgg acacgggcca gttgctgaag atcgcgaagc ggggaggagt cacggcggtc 480
gaggcggtgc acgcgtggcg caatgcgctc acgggagcac ccctcaacct gaccccagag 540
caggtcgtgg caattgcgag ccatgacggg ggaaagcagg cactcgaaac cgtccagagg 600
ttgctgcctg tgctgtgcca agcgcacgga cttacgccag agcaggtcgt ggcaattgcg 660
agccatgacg ggggaaagca ggcactcgaa accgtccaga ggttgctgcc tgtgctgtgc 720
caagcgcacg gactaacccc agagcaggtc gtggcaattg cgagcaacgg agggggaaag 780
caggcactcg aaaccgtcca gaggttgctg cctgtgctgt gccaagcgca cgggttgacc 840
ccagagcagg tcgtggcaat tgcgagcaac aacgggggaa agcaggcact cgaaaccgtc 900
cagaggttgc tgcctgtgct gtgccaagcg cacggcctga ccccagagca ggtcgtggca 960
attgcgagca acggaggggg aaagcaggca ctcgaaaccg tccagaggtt gctgcctgtg 1020
ctgtgccaag cgcacggact gacaccagag caggtcgtgg caattgcgag ccatgacggg 1080
ggaaagcagg cactcgaaac cgtccagagg ttgctgcctg tgctgtgcca agcgcacgga 1140
cttacacccg aacaagtcgt ggcaattgcg agcaacatcg ggggaaagca ggcactcgaa 1200
accgtccaga ggttgctgcc tgtgctgtgc caagcgcacg gacttacgcc agagcaggtc 1260
gtggcaattg cgagccatga cgggggaaag caggcactcg aaaccgtcca gaggttgctg 1320
cctgtgctgt gccaagcgca cggactaacc ccagagcagg tcgtggcaat tgcgagccat 1380
gacgggggaa agcaggcact cgaaaccgtc cagaggttgc tgcctgtgct gtgccaagcg 1440
cacgggttga ccccagagca ggtcgtggca attgcgagca acaacggggg aaagcaggca 1500
ctcgaaaccg tccagaggtt gctgcctgtg ctgtgccaag cgcacggcct gaccccagag 1560
caggtcgtgg caattgcgag caacatcggg ggaaagcagg cactcgaaac cgtccagagg 1620
ttgctgcctg tgctgtgcca agcgcacgga ctgacaccag agcaggtcgt ggcaattgcg 1680
agccatgacg ggggaaagca ggcactcgaa accgtccaga ggttgctgcc tgtgctgtgc 1740
caagcgcacg gcctcacccc agagcaggtc gtggcaattg cgagcaacat cgggggaaag 1800
caggcactcg aaaccgtcca gaggttgctg cctgtgctgt gccaagcgca cggacttacg 1860
ccagagcagg tcgtggcaat tgcgagccat gacgggggaa agcaggcact cgaaaccgtc 1920
cagaggttgc tgcctgtgct gtgccaagcg cacggactaa ccccagagca ggtcgtggca 1980
attgcgagca acggaggggg aaagcaggca ctcgaaaccg tccagaggtt gctgcctgtg 2040
ctgtgccaag cgcacgggtt gaccccagag caggtcgtgg caattgcgag caacggaggg 2100
ggaaagcagg cactcgaaac cgtccagagg ttgctgcctg tgctgtgcca agcgcacggc 2160
ctgaccccag agcaggtcgt ggcaattgcg agccatgacg ggggaaagca ggcactcgaa 2220
accgtccaga ggttgctgcc tgtgctgtgc caagcgcacg gactgacacc agagcaggtc 2280
gtggcaattg cgagccatga cgggggaaag caggcactcg aaaccgtcca gaggttgctg 2340
cctgtgctgt gccaagcgca cggactcacg cctgagcagg tagtggctat tgcatccaac 2400
ggagggggca gacccgcact ggagtcaatc gtggcccagc tttcgaggcc ggaccccgcg 2460
ctggccgcac tcactaatga tcatcttgta gcgctggcct gcctcggcgg acgacccgcc 2520
ttggatgcgg tgaagaaggg gctcccgcac gcgcctgcat tgattaagcg gaccaacaga 2580
aggattcccg agaggacatc acatcgagtg gca 2613
<210>12
<211>871
<212>DNA
<213>人工序列
<220>
<223>
<400>12
Met Asp Tyr Lys Asp His Asp Gly Asp Tyr Lys Asp His Asp Ile Asp
1 5 10 15
Tyr Lys Asp Asp Asp Asp Lys Met Ala Pro Lys Lys Lys Arg Lys Val
20 25 30
Gly Ile His Gly Val Pro Ala Ala Val Asp Leu Arg Thr Leu Gly Tyr
35 40 45
Ser Gln Gln Gln Gln Glu Lys Ile Lys Pro Lys Val Arg Ser Thr Val
50 55 60
Ala Gln His His Glu Ala Leu Val Gly His Gly Phe Thr His Ala His
65 70 75 80
Ile Val Ala Leu Ser Gln His Pro Ala Ala Leu Gly Thr Val Ala Val
85 90 95
Lys Tyr Gln Asp Met Ile Ala Ala Leu Pro Glu Ala Thr His Glu Ala
100 105 110
Ile Val Gly Val Gly Lys Gln Trp Ser Gly Ala Arg Ala Leu Glu Ala
115 120 125
Leu Leu Thr Val Ala Gly Glu Leu Arg Gly Pro Pro Leu Gln Leu Asp
130 135 140
Thr Gly Gln Leu Leu Lys Ile Ala Lys Arg Gly Gly Val Thr Ala Val
145 150 155 160
Glu Ala Val His Ala Trp Arg Asn Ala Leu Thr Gly Ala Pro Leu Asn
165 170 175
Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys
180 185 190
Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala
195 200 205
His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly
210 215 220
Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys
225 230 235 240
Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn
245 250 255
Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val
260 265 270
Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala
275 280 285
Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu
290 295 300
Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala
305 310 315 320
Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg
325 330 335
Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val
340 345 350
Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val
355 360 365
Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu
370 375 380
Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu
385 390 395 400
Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr
405 410 415
Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala
420 425 430
Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly
435 440 445
Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys
450 455 460
Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala
465 470 475 480
His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Asn Gly
485 490 495
Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys
500 505 510
Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn
515 520 525
Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val
530 535 540
Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala
545 550 555 560
Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu
565 570 575
Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala
580 585 590
Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg
595 600 605
Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val
610 615 620
Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val
625 630 635 640
Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu
645 650 655
Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu
660 665 670
Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr
675 680 685
Pro Glu Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala
690 695 700
Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly
705 710 715 720
Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys
725 730 735
Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala
740 745 750
His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly
755 760 765
Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys
770 775 780
Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn
785 790 795 800
Gly Gly Gly Arg Pro Ala Leu Glu Ser Ile Val Ala Gln Leu Ser Arg
805 810 815
Pro Asp Pro Ala Leu Ala Ala Leu Thr Asn Asp His Leu Val Ala Leu
820 825 830
Ala Cys Leu Gly Gly Arg Pro Ala Leu Asp Ala Val Lys Lys Gly Leu
835 840 845
Pro His Ala Pro Ala Leu Ile Lys Arg Thr Asn Arg Arg Ile Pro Glu
850 855 860
Arg Thr Ser His Arg Val Ala
865 870
<210>13
<211>10415bp
<212>DNA
<213>人工序列
<220>
<223>
<400>13
caggtggcac ttttcgggga aatgtgcgcg gaacccctat ttgtttattt ttctaaatac 60
attcaaatat gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa 120
aaaggaagag tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat 180
tttgccttcc tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc 240
agttgggtgc acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga 300
gttttcgccc cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg 360
cggtattatc ccgtattgac gccgggcaag agcaactcgg tcgccgcata cactattctc 420
agaatgactt ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag 480
taagagaatt atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc 540
tgacaacgat cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg 600
taactcgcct tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg 660
acaccacgat gcctgtagca atggcaacaa cgttgcgcaa actattaact ggcgaactac 720
ttactctagc ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac 780
cacttctgcg ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg 840
agcgtgggtc tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg 900
tagttatcta cacgacgggg agtcaggcaa ctatggatga acgaaataga cagatcgctg 960
agataggtgc ctcactgatt aagcattggt aactgtcaga ccaagtttac tcatatatac 1020
tttagattga tttaaaactt catttttaat ttaaaaggat ctaggtgaag atcctttttg 1080
ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg 1140
tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc 1200
aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc 1260
tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt 1320
agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc 1380
taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact 1440
caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac 1500
agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagctatgag 1560
aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg 1620
gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg 1680
tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga 1740
gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt 1800
ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct 1860
ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg 1920
aggaagcgga agagcgccca atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt 1980
aatgcagctg gcacgacagg tttcccgact ggaaagcggg cagtgagcgc aacgcaatta 2040
atgtgagtta gctcactcat taggcacccc aggctttaca ctttatgctt ccggctcgta 2100
tgttgtgtgg aattgtgagc ggataacaat ttcacacagg aaacagctat gaccatgatt 2160
acgccaagcg cgcaattaac cctcactaaa gggaacaaaa gctggagctg caagcttaat 2220
gtagtcttat gcaatactct tgtagtcttg caacatggta acgatgagtt agcaacatgc 2280
cttacaagga gagaaaaagc accgtgcatg ccgattggtg gaagtaaggt ggtacgatcg 2340
tgccttatta ggaaggcaac agacgggtct gacatggatt ggacgaacca ctgaattgcc 2400
gcattgcaga gatattgtat ttaagtgcct agctcgatac aataaacggg tctctctggt 2460
tagaccagat ctgagcctgg gagctctctg gctaactagg gaacccactg cttaagcctc 2520
aataaagctt gccttgagtg cttcaagtag tgtgtgcccg tctgttgtgt gactctggta 2580
actagagatc cctcagaccc ttttagtcag tgtggaaaat ctctagcagt ggcgcccgaa 2640
cagggacctg aaagcgaaag ggaaaccaga gctctctcga cgcaggactc ggcttgctga 2700
agcgcgcacg gcaagaggcg aggggcggcg actggtgagt acgccaaaaa ttttgactag 2760
cggaggctag aaggagagag atgggtgcga gagcgtcagt attaagcggg ggagaattag 2820
atcgcgatgg gaaaaaattc ggttaaggcc agggggaaag aaaaaatata aattaaaaca 2880
tatagtatgg gcaagcaggg agctagaacg attcgcagtt aatcctggcc tgttagaaac 2940
atcagaaggc tgtagacaaa tactgggaca gctacaacca tcccttcaga caggatcaga 3000
agaacttaga tcattatata atacagtagc aaccctctat tgtgtgcatc aaaggataga 3060
gataaaagac accaaggaag ctttagacaa gatagaggaa gagcaaaaca aaagtaagac 3120
caccgcacag caagcggccg ctgatcttca gacctggagg aggagatatg agggacaatt 3180
ggagaagtga attatataaa tataaagtag taaaaattga accattagga gtagcaccca 3240
ccaaggcaaa gagaagagtg gtgcagagag aaaaaagagc agtgggaata ggagctttgt 3300
tccttgggtt cttgggagca gcaggaagca ctatgggcgc agcctcaatg acgctgacgg 3360
tacaggccag acaattattg tctggtatag tgcagcagca gaacaatttg ctgagggcta 3420
ttgaggcgca acagcatctg ttgcaactca cagtctgggg catcaagcag ctccaggcaa 3480
gaatcctggc tgtggaaaga tacctaaagg atcaacagct cctggggatt tggggttgct 3540
ctggaaaact catttgcacc actgctgtgc cttggaatgc tagttggagt aataaatctc 3600
tggaacagat tggaatcaca cgacctggat ggagtgggac agagaaatta acaattacac 3660
aagcttaata cactccttaa ttgaagaatc gcaaaaccag caagaaaaga atgaacaaga 3720
attattggaa ttagataaat gggcaagttt gtggaattgg tttaacataa caaattggct 3780
gtggtatata aaattattca taatgatagt aggaggcttg gtaggtttaa gaatagtttt 3840
tgctgtactt tctatagtga atagagttag gcagggatat tcaccattat cgtttcagac 3900
ccacctccca accccgaggg gacccgacag gcccgaagga atagaagaag aaggtggaga 3960
gagagacaga gacagatcca ttcgattagt gaacggatct cgacggttaa cttttaaaag 4020
aaaagggggg attggggggt acagtgcagg ggaaagaata gtagacataa tagcaacaga 4080
catacaaact aaagaattac aaaaacaaat tacaaaaatt caaaatttta tcgatggtac 4140
ctaccgggta ggggaggcgc ttttcccaag gcagtctgga gcatgcgctt tagcagcccc 4200
gctgggcact tggcgctaca caagtggcct ctggctcgca cacattccac atccaccggt 4260
aggcgccaac cggctccgtt ctttggtggc cccttcgcgc caccttctac tcctccccta 4320
gtcaggaagt tcccccccgc cccgcagctc gcgtcgtgca ggacgtgaca aatggaagta 4380
gcacgtctca ctagtctcgt gcagatggac agcaccgctg agcaatggaa gcdggtaggc 4440
ctttggggca gcggccaata gcagctttgc tccttcgctt tctgggctca gaggctggga 4500
aggggtgggt ccgggggcgg gctcaggggc gggctcaggg gcggggcggg cgcccgaagt 4560
cctccggagg cccggcattc tgcacgcttc aaaagcgcac gtctgccgcg ctgttctcct 4620
cttcctcatc tccgggcctt tcgacctgca ctctagagga tccctcgaga ccggtcgcca 4680
ccgagctctc tggctaacta gagaacccac tgcttactgg cttatcgaaa ttaatacgac 4740
tcactatagg ggccaccatg gactataagg accacgacgg agactacaag gatcatgata 4800
ttgattacaa agacgatgac gataagatgg ccccaaagaa gaagcggaag gtcggtatcc 4860
acggagtccc agcagccgta gatttgagaa ctttgggata ttcacagcag cagcaggaaa 4920
agatcaagcc caaagtgagg tcgacagtcg cgcagcatca cgaagcgctg gtgggtcatg 4980
ggtttacaca tgcccacatc gtagccttgt cgcagcaccc tgcagccctt ggcacggtcg 5040
ccgtcaagta ccaggacatg attgcggcgt tgccggaagc cacacatgag gcgatcgtcg 5100
gtgtggggaa acagtggagc ggagcccgag cgcttgaggc cctgttgacg gtcgcgggag 5160
agctgagagg gcctcccctt cagctggaca cgggccagtt gctgaagatc gcgaagcggg 5220
gaggagtcac ggcggtcgag gcggtgcacg cgtggcgcaa tgcgctcacg ggagcacccc 5280
tcaacctgac cccagagcag gtcgtggcaa ttgcgagcaa catcggggga aagcaggcac 5340
tcgaaaccgt ccagaggttg ctgcctgtgc tgtgccaagc gcacggactt acgccagagc 5400
aggtcgtggc aattgcgagc aacatcgggg gaaagcaggc actcgaaacc gtccagaggt 5460
tgctgcctgt gctgtgccaa gcgcacggac taaccccaga gcaggtcgtg gcaattgcga 5520
gccatgacgg gggaaagcag gcactcgaaa ccgtccagag gttgctgcct gtgctgtgcc 5580
aagcgcacgg gttgacccca gagcaggtcg tggcaattgc gagccatgac gggggaaagc 5640
aggcactcga aaccgtccag aggttgctgc ctgtgctgtg ccaagcgcac ggcctgaccc 5700
cagagcaggt cgtggcaatt gcgagccatg acgggggaaa gcaggcactc gaaaccgtcc 5760
agaggttgct gcctgtgctg tgccaagcgc acggactgac accagagcag gtcgtggcaa 5820
ttgcgagcaa cggaggggga aagcaggcac tcgaaaccgt ccagaggttg ctgcctgtgc 5880
tgtgccaagc gcacggactt acacccgaac aagtcgtggc aattgcgagc aacatcgggg 5940
gaaagcaggc actcgaaacc gtccagaggt tgctgcctgt gctgtgccaa gcgcacggac 6000
ttacgccaga gcaggtcgtg gcaattgcga gcaacatcgg gggaaagcag gcactcgaaa 6060
ccgtccagag gttgctgcct gtgctgtgcc aagcgcacgg actaacccca gagcaggtcg 6120
tggcaattgc gagccatgac gggggaaagc aggcactcga aaccgtccag aggttgctgc 6180
ctgtgctgtg ccaagcgcac gggttgaccc cagagcaggt cgtggcaatt gcgagccatg 6240
acgggggaaa gcaggcactc gaaaccgtcc agaggttgct gcctgtgctg tgccaagcgc 6300
acggcctgac cccagagcag gtcgtggcaa ttgcgagcca tgacggggga aagcaggcac 6360
tcgaaccgtc cagaggttgc tgcctgtgct gtgccaagcg cacggactga caccagagca 6420
ggtcgtggca attgcgagca acggaggggg aaagcaggca ctcgaaaccg tccagaggtt 6480
gctgcctgtg ctgtgccaag cgcacggcct caccccagag caggtcgtgg caattgcgag 6540
caacatcggg ggaaagcagg cactcgaaac cgtccagagg ttgctgcctg tgctgtgcca 6600
agcgcacgga cttacgccag agcaggtcgt ggcaattgcg agcaacatcg ggggaaagca 6660
ggcactcgaa accgtccaga ggttgctgcc tgtgctgtgc caagcgcacg gactaacccc 6720
agagcaggtc gtggcaattg cgagccatga cgggggaaag caggcactcg aaaccgtcca 6780
gaggttgctg cctgtgctgt gccaagcgca cgggttgacc ccagagcagg tcgtggcaat 6840
tgcgagccat gacgggggaa agcaggcact cgaaaccgtc cagaggttgc tgcctgtgct 6900
gtgccaagcg cacggcctga ccccagagca ggtcgtggca attgcgagcc atgacggggg 6960
aaagcaggca ctcgaaaccg tccagaggtt gctgcctgtg ctgtgccaag cgcacggact 7020
gacaccagag caggtcgtgg caattgcgag caacggaggg ggaaagcagg cactcgaaac 7080
cgtccagagg ttgctgcctg tgctgtgcca agcgcacgga ctcacgcctg agcaggtagt 7140
ggctattgca tccaacatcg ggggcagacc cgcactggag tcaatcgtgg cccagctttc 7200
gaggccggac cccgcgctgg ccgcactcac taatgatcat cttgtagcgc tggcctgcct 7260
cggcggacga cccgccttgg atgcggtgaa gaaggggctc ccgcacgcgc ctgcattgat 7320
taagcggacc aacagaagga ttcccgagag gacatcacat cgagtggcaa gttatcaggg 7380
aatggtgagc aagggcgagg agctgttcac cggggtggtg cccatcctgg tcgagctgga 7440
cggcgacgta aacggccaca agttcagcgt gtccggcgag ggcgagggcg atgccaccta 7500
cggcaagctg accctgaagt tcatctgcac caccggcaag ctgcccgtgc cctggcccac 7560
cctcgtgacc accctgacct acggcgtgca gtgcttcagc cgctaccccg accacatgaa 7620
gcagcacgac ttcttcaagt ccgccatgcc cgaaggctac gtccaggagc gcaccatctt 7680
cttcaaggac gacggcaact acaagacccg cgccgaggtg aagttcgagg gcgacaccct 7740
ggtgaaccgc atcgagctga agggcatcga cttcaaggag gacggcaaca tcctggggca 7800
caagctggag tacaactaca acagccacaa cgtctatatc atggccgaca agcagaagaa 7860
cggcatcaag gtgaacttca agatccgcca caacatcgag gacggcagcg tgcagctcgc 7920
cgaccactac cagcagaaca cccccatcgg cgacggcccc gtgctgctgc ccgacaacca 7980
ctacctgagc acccagtccg ccctgagcaa agaccccaac gagaagcgcg atcacatggt 8040
cctgctggag ttcgtgaccg ccgccgggat cactctcggc atggacgagc tgtacaagtc 8100
cggactcaga tctcgagctc aagcttcgaa ttccatggtg aagcagatcg agagcaagac 8160
tgcttttcag gaagccttgg acgctgcagg tgataaactt gtagtagttg acttctcagc 8220
cacgtggtgt gggccttgca aaatgatcaa gcctttcttt cattccctct ctgaaaagta 8280
ttccaacgtg atattccttg aagtagatgt ggatgactgt caggatgttg cttcagagtg 8340
tgaagtcaaa tgcatgccaa cattccagtt ttttaagaag ggacaaaagg tgggtgaatt 8400
ttctggagcc aataaggaaa agcttgaagc caccattaat gaattagtct aactcgagag 8460
cggccgcgac gcgtgtcgac aatcaacctc tggattacaa aatttgtgaa agattgactg 8520
gtattcttaa ctatgttgct ccttttacgc tatgtggata cgctgcttta atgcctttgt 8580
atcatgctat tgcttcccgt atggctttca ttttctcctc cttgtataaa tcctggttgc 8640
tgtctcttta tgaggagttg tggcccgttg tcaggcaacg tggcgtggtg tgcactgtgt 8700
ttgctgacgc aacccccact ggttggggca ttgccaccac ctgtcagctc ctttccggga 8760
ctttcgcttt ccccctccct attgccacgg cggaactcat cgccgcctgc cttgcccgct 8820
gctggacagg ggctcggctg ttgggcactg acaattccgt ggtgttgtcg gggaagctga 8880
cgtcctttcc atggctgctc gcctgtgttg ccacctggat tctgcgcggg acgtccttct 8940
gctacgtccc ttcggccctc aatccagcgg accttccttc ccgcggcctg ctgccggctc 9000
tgcggcctct tccgcgtctt cgccttcgcc ctcagacgag tcggatctcc ctttgggccg 9060
cctccccgcc tggaattcga gctcggtacc tttaagacca atgacttaca aggcagctgt 9120
agatcttagc cactttttaa aagaaaaggg gggactggaa gggctaattc actcccaacg 9180
aagacaagat ctgctttttg cttgtactgg gtctctctgg ttagaccaga tctgagcctg 9240
ggagctctct ggctaactag ggaacccact gcttaagcct caataaagct tgccttgagt 9300
gcttcaagta gtgtgtgccc gtctgttgtg tgactctggt aactagagat ccctcagacc 9360
cttttagtca gtgtggaaaa tctctagcag tagtagttca tgtcatctta ttattcagta 9420
tttataactt gcaaagaaat gaatatcaga gagtgagagg aacttgttta ttgcagctta 9480
taatggttac aaataaagca atagcatcac aaatttcaca aataaagcat ttttttcact 9540
gcattctagt tgtggtttgt ccaaactcat caatgtatct tatcatgtct ggctctagct 9600
atcccgcccc taactccgcc cagttccgcc cattctccgc cccatggctg actaattttt 9660
tttatttatg cagaggccga ggccgcctcg gcctctgagc tattccagaa gtagtgagga 9720
ggcttttttg gaggcctagg cttttgcgtc gagacgtacc caattcgccc tatagtgagt 9780
cgtattacgc gcgctcactg gccgtcgttt tacaacgtcg tgactgggaa aaccctggcg 9840
ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt aatagcgaag 9900
aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa tggcgcgacg 9960
cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc gtgaccgcta 10020
cacttgccag cgccctagcg cccgctcctt tcgctttctt cccttccttt ctcgccacgt 10080
tcgccggctt tccccgtcaa gctctaaatc gggggctccc tttagggttc cgatttagtg 10140
ctttacggca cctcgacccc aaaaaacttg attagggtga tggttcacgt agtgggccat 10200
cgccctgata gacggttttt cgccctttga cgttggagtc cacgttcttt aatagtggac 10260
tcttgttcca aactggaaca acactcaacc ctatctcggt ctattctttt gatttataag 10320
ggattttgcc gatttcggcc tattggttaa aaaatgagct gatttaacaa aaatttaacg 10380
cgaattttaa caaaatatta acgtttacaa tttcc 10415
<210>14
<211>18bp
<212>DNA
<213>人工序列
<220>
<223>
<400>14
aaccctaacc ctaaccct 18
<210>15
<211>18bp
<212>DNA
<213>人工序列
<220>
<223>
<400>15
aaccctaacc ctaaccct 18
<210>16
<211>18bp
<212>DNA
<213>人工序列
<220>
<223>
<400>16
accctactga tgatgtgt 18
<210>17
<211>18bp
<212>DNA
<213>人工序列
<220>
<223>
<400>17
cctgtcaccg acacttcc 18