專利名稱:非內(nèi)源的被組成型活化的人g蛋白偶聯(lián)的受體的制作方法
特此聲明,本發(fā)明要求下列在先申請的優(yōu)先權美國專利申請?zhí)?9/170,496(1998年10月13日提出申請)、美國專利申請?zhí)?8/839,449(1997年4月14日提出申請,現(xiàn)在已放棄)、美國專利申請?zhí)?9/060,188(1998年4月14日提出申請)、美國臨時申請?zhí)?0/090,783(1998年6月26日提出申請)和美國臨時申請?zhí)?0/095,677(1998年8月7日提出申請)。前述的每個申請都以全文引入本申請作參考。本發(fā)明的領域本專利申請文件所公開的發(fā)明涉及跨膜受體,具體地講,涉及被改造的人G蛋白偶聯(lián)受體(GPCR),它可被組成型活化。在最優(yōu)選情況下,被改造的人GPCR可用于篩選藥用化合物。本發(fā)明的背景盡管在人體內(nèi)有很多種類的受體,但到目前為止最豐富和最與治療有關的是G蛋白偶聯(lián)受體(GPCR)。據(jù)估計,在人類基因組內(nèi)有大約100,000個基因,它們中的大約2%即2,000個基因被估計用來編碼GPCR。已識別出與其中的約100個GPCR結(jié)合的內(nèi)源配體。由于在發(fā)現(xiàn)內(nèi)源GPCR和發(fā)現(xiàn)它的內(nèi)源配體之間有顯著的時間延遲,因此可預測,其余的1900種GPCR將在它們的內(nèi)源配體被識別之前的很長時間內(nèi)被識別和鑒定。其實,人類基因組計劃正在快速測序人類的100,000個基因,這表明在今后幾年內(nèi),其余的人GPCR將被完全測序。然而,盡管付出了對人類基因組測序的努力,但仍不清楚科學家如何能夠快速、有力和有效率地利用這樣的信息來提高和增強人類的健康狀態(tài)。本發(fā)明正是指向這個重要目的。
包括GPCR在內(nèi),其內(nèi)源配體已被認識的受體被稱為“已知”受體,內(nèi)源配體尚不知曉的受體被稱為“孤兒”受體。這種區(qū)別并不僅是語義上的,尤其是對GPCR來說。GPCR代表著藥物產(chǎn)品開發(fā)的一個重要領域60%的處方藥物開發(fā)自100個已知GPCR中的大約20個。因此,孤兒GPCR將是推進藥物工業(yè)增長、擴張、增強和發(fā)展的機會,就像是金子對于19世紀晚期的加利福尼亞一樣。然而,孤兒受體在涉及新藥物發(fā)現(xiàn)時有一個嚴重缺陷。這是因為,發(fā)現(xiàn)和開發(fā)藥物的傳統(tǒng)途徑既需要利用受體又需要利用它的內(nèi)源配體。因此,迄今為止,孤兒GPCR帶給本領域的只是用于發(fā)現(xiàn)新藥的具有誘惑力且未開發(fā)的資源。
在探索潛在治療藥物的傳統(tǒng)途徑下,一般是受體先被識別。在探索藥物的努力開始之前,一般會啟動精細、費時和昂貴的程序以識別、分離和產(chǎn)生受體的內(nèi)源配體---對于每個受體,此過程可花費3到10年,成本大約為每個受體500萬美元。在探索藥物的傳統(tǒng)工作可開始之前,必須先消耗這些時間和資金。這是因為,傳統(tǒng)的藥物探索技術依賴于所謂的“競爭性結(jié)合檢測”,其中,假定的治療劑是通過受體被“篩選”的,其目的是找到這樣的化合物,它或者能阻止內(nèi)源配體與受體結(jié)合(拮抗劑)、或者能促進或模仿與受體結(jié)合的配體的作用(激活劑)。其總體目標是要識別出這樣的化合物,它們在配體與受體結(jié)合時能阻止細胞活化(拮抗劑),或者當配體與受體適當?shù)亟Y(jié)合時能促進或增強細胞活性(激活劑)。由定義可知,孤兒GPCR的內(nèi)源配體尚未被識別,因此不可能應用傳統(tǒng)藥物發(fā)現(xiàn)技術去發(fā)現(xiàn)針對這些受體的獨特的新治療藥物。正如下面將揭示的,本發(fā)明能夠克服傳統(tǒng)藥物發(fā)現(xiàn)技術所導致的這些和其他嚴格限制。
GPCR都具有一個相同的基元(motif)。所有這些受體具有七個由22到24個疏水氨基酸組成的序列,它們組成七個α螺旋,每個α螺旋都跨過膜(每個跨度都以數(shù)字表示,例如,跨膜-1(TM-1)、跨膜-2(TM-2)等)??缒ぢ菪ㄟ^氨基酸鏈連接,在細胞膜的外部即“細胞外”一邊的氨基酸鏈分別在跨膜-2和跨膜-3、跨膜-4和跨膜-5、跨膜育一小時。接著加入麥胚凝集素小珠(25μl,Amersham),組合物在室溫下再溫育30分鐘,然后試管在1500×g、室溫下離心5分鐘,并在閃爍計數(shù)器上記數(shù)。
另一個花費更少但同樣可適用的方法已被識別,它也可滿足大規(guī)模篩選的需要。Flash platesTM和WallacTM閃爍帶可被用來格式化高處理量的[35S]GTPγS結(jié)合測定。進一步,利用此技術,本方法可用于已知的GPCR,它在通過[35S]GTPγS的結(jié)合監(jiān)測化合物效應的同時,同時監(jiān)測與受體結(jié)合的由氚標記的配體。這之所以可能的是因為Wallacβ計數(shù)器可以把能量窗口切換成監(jiān)測氚和35S標記的探針。本方法也可用于偵查導致受體活化的其他類型的膜活化事件。例如,本方法可用于監(jiān)測許多受體的32P磷酸化(針對G蛋白偶聯(lián)受體和酪氨酸激酶受體)。當膜被離心到孔的底部時,結(jié)合的[35S]GTPγS或32P磷酸化的受體將要活化包被在孔上的閃爍劑。Scinti_帶(Wallac)已被用來展示這個原理。另外,本方法也可通過應用放射標記的配體用來測量與受體結(jié)合的配體。以相似的方式,當放射標記的結(jié)合的配體被離心到孔底時,閃爍帶標記在位于標記的配體附近,這導致活化并被檢測到。
基于前述的程序,比較空白對照(pCMV)、內(nèi)源APJ和非內(nèi)源APJ的代表性結(jié)果被圖示于圖6。2.腺苷酸環(huán)化酶設計用來進行基于細胞的測定的Flash PlateTM腺苷酸環(huán)化酶試劑盒(New England Nuclear;目錄號SMP004A)被改進以應用于未加工的胞漿膜。閃爍板的孔含有閃爍劑包被層,其中含有識別cAMP的特異抗體。在孔中產(chǎn)生的cAMP通過直接和放射性cAMP示蹤物競爭與cAMP抗體結(jié)合而被定量。下面是對測量在表達受體的膜上cAMP水平變化程序的簡短描述。
在轉(zhuǎn)染后大約3天收獲轉(zhuǎn)染細胞。通過在含有20mM pH 7.4的育一小時。接著加入麥胚凝集素小珠(25μl,Amersham),組合物在室溫下再溫育30分鐘,然后試管在1500×g、室溫下離心5分鐘,并在閃爍計數(shù)器上記數(shù)。
另一個花費更少但同樣可適用的方法已被識別,它也可滿足大規(guī)模篩選的需要。Flash platesTM和WallacTM閃爍帶可被用來格式化高處理量的[35S]GTPγS結(jié)合測定。進一步,利用此技術,本方法可用于已知的GPCR,它在通過[35S]GTPγS的結(jié)合監(jiān)測化合物效應的同時,同時監(jiān)測與受體結(jié)合的由氚標記的配體。這之所以可能的是因為Wallacβ計數(shù)器可以把能量窗口切換成監(jiān)測氚和35S標記的探針。本方法也可用于偵查導致受體活化的其他類型的膜活化事件。例如,本方法可用于監(jiān)測許多受體的32P磷酸化(針對G蛋白偶聯(lián)受體和酪氨酸激酶受體)。當膜被離心到孔的底部時,結(jié)合的[35S]GTPγS或32P磷酸化的受體將要活化包被在孔上的閃爍劑。Scinti_帶(Wallac)已被用來展示這個原理。另外,本方法也可通過應用放射標記的配體用來測量與受體結(jié)合的配體。以相似的方式,當放射標記的結(jié)合的配體被離心到孔底時,閃爍帶標記在位于標記的配體附近,這導致活化并被檢測到。
基于前述的程序,比較空白對照(pCMV)、內(nèi)源APJ和非內(nèi)源APJ的代表性結(jié)果被圖示于圖6。2.腺苷酸環(huán)化酶設計用來進行基于細胞的測定的Flash PlateTM腺苷酸環(huán)化酶試劑盒(New England Nuclear;目錄號SMP004A)被改進以應用于未加工的胞漿膜。閃爍板的孔含有閃爍劑包被層,其中含有識別cAMP的特異抗體。在孔中產(chǎn)生的cAMP通過直接和放射性cAMP示蹤物競爭與cAMP抗體結(jié)合而被定量。下面是對測量在表達受體的膜上cAMP水平變化程序的簡短描述。
在轉(zhuǎn)染后大約3天收獲轉(zhuǎn)染細胞。通過在含有20mM pH 7.4的篩選化合物的傳統(tǒng)“信條”需要受體的配體已知。由定義可知,這種方法對于孤兒受體來說是沒有用的。因此,在堅持用這個教條的方法去發(fā)現(xiàn)藥物時實質(zhì)上在教導本領域去廢棄孤兒受體的應用,除非和直到受體的內(nèi)源配體被發(fā)現(xiàn)??紤]到大約有2000個G蛋白偶聯(lián)受體且其中的大多數(shù)是孤兒受體,如此信條就會與具有創(chuàng)造性的唯一且獨特的藥物探索方法相對立。
關于不同種類GPCR的核酸和/或氨基酸序列的信息總結(jié)于下表A。因為本發(fā)明在此公開的一個重要焦點直接指向孤兒GPCR,所以很多下面所引述的參考都是涉及孤兒GPCR的。然而,本列表并非要在法律或在其他意義上進行以下暗示或解釋,即在此公開的本發(fā)明僅可應用于GPCR或者下面所特別列舉的GPCR。此外,一些被分離的受體本身并非是本申請的主題;例如,參考國際互聯(lián)網(wǎng)上列舉GPCR的G蛋白偶聯(lián)受體數(shù)據(jù)庫(署名的發(fā)明人和受讓人都與這個站點沒有任何關系)。其他的GPCR是由本發(fā)明受讓人所有的發(fā)明申請的主題,它們并沒有在下面列出(包括GPR3、GPR6和GPR12;參見美國臨時申請?zhí)?0/094879)表A
正如下面詳細公開的那樣,應用突變盒修飾人GPCR內(nèi)源序列會導致人GPCR的組成型活化。這些非內(nèi)源性的可組成型活化的人GPCR尤其可用于篩選候選化合物,以直接識別例如與藥物相關的化合物。本發(fā)明的概述在此公開的是非內(nèi)源的人G蛋白偶聯(lián)受體,它包含作為(a)最優(yōu)選的氨基酸序列區(qū)域(從C-末端到N-末端走向)和/或(b)最優(yōu)選核酸序列區(qū)域(3’到5’走向)的橫跨GPCR的跨膜-6(TM6)和細胞內(nèi)環(huán)-3(IC3)區(qū)域(a)P1AA15X其中(1)P1是位于GPCR的TM6區(qū)域內(nèi)的一個氨基酸殘基,其中P1是從(i)內(nèi)源的GPCR的脯氨酸殘基和(ii)除脯氨酸之外的非內(nèi)源的氨基酸殘基中選擇而來;(2)AA15是15個氨基酸,它們是從(a)內(nèi)源的GPCR的氨基酸、(b)非內(nèi)源的氨基酸殘基和(c)內(nèi)源的GPCR氨基酸和非內(nèi)源氨基酸的組合中選擇而來,除非位于GPCR的TM6區(qū)域內(nèi)的15個氨基酸殘基都不是脯氨酸;和(3)X是位于所述GPCR的IC3區(qū)域內(nèi)的非內(nèi)源氨基酸殘基,優(yōu)選是從賴氨酸、組氨酸和精氨酸中選擇而來,最優(yōu)選地是賴氨酸,但是如果在X位置的內(nèi)源氨基酸是賴氨酸,那么此時X是非賴氨酸的氨基酸,優(yōu)選是丙氨酸;和/或(b)P密碼子(AA-密碼子)15X密碼子其中(1)P密碼子是位于GPCR的TM6區(qū)域內(nèi)的一個核苷酸序列,其中P密碼子編碼從(i)內(nèi)源的GPCR的脯氨酸殘基和(ii)除脯氨酸之外的非內(nèi)源的氨基酸殘基中選擇出來的氨基酸;(2)(AA-密碼子)15是編碼15個氨基酸的15個密碼子,這些氨基酸是從(a)內(nèi)源的GPCR的氨基酸、(b)非內(nèi)源的氨基酸殘基和(c)內(nèi)源GPCR的氨基酸和非內(nèi)源氨基酸的組合中選擇而來,除非在位于GPCR的TM6區(qū)域內(nèi)的15個內(nèi)源密碼子都不編碼脯氨酸;(3)X密碼子是編碼位于所述GPCR的IC3區(qū)域內(nèi)的氨基酸殘基的核苷酸,其中X密碼子編碼非內(nèi)源氨基酸,它們優(yōu)選是從由賴氨酸、組氨酸和精氨酸中選擇而來,最優(yōu)選地是賴氨酸,但是當在X密碼子位置的內(nèi)源編碼區(qū)域編碼賴氨酸,那么此時X密碼子編碼除賴氨酸以外的氨基酸,優(yōu)選是丙氨酸。
針對這些序列盒所使用的內(nèi)源和非內(nèi)源術語是相對于內(nèi)源GPCR而言。例如,一旦內(nèi)源脯氨酸殘基位于特定GPCR的TM6區(qū)域之內(nèi)且從它算起的第16個氨基酸被識別以用于發(fā)生突變來組成型活化該受體,那么也有可能突變內(nèi)源脯氨酸殘基(即,一旦標記物被定位且將發(fā)生突變的第16個氨基酸被確定,就可能突變標記物本身),盡管最優(yōu)選脯氨酸殘基不被突變。相似地是,盡管最優(yōu)選AA15保持它們的內(nèi)源形式,但這些氨基酸也可被突變。在人GPCR的非內(nèi)源形式中唯一必須被突變的氨基酸是X,即,由從P1開始的第16個殘基構成的內(nèi)源氨基酸不能保持其內(nèi)源形式而必須被突變,這正如在此進一步公開的那樣。重述一遍,盡管優(yōu)選在人GPCR的非內(nèi)源形式中,P1和AA15保持它們的內(nèi)源形式(即,與它們的野生型形式一樣),但一旦X被識別和被突變,任何和/或所有的P1和AA15都可以被突變。這對核苷酸序列也同樣適用。那么,當在位置X的內(nèi)源氨基酸是賴氨酸時,在此GPCR的非內(nèi)源形式中的X是非賴氨酸的氨基酸,優(yōu)選是內(nèi)氨酸。
因此,作為假設的情形,如果內(nèi)源GPCR在上述位置具有如下的內(nèi)源氨基酸序列P-AACCTTGGRRRDDDE-Q那么下面的任何一個情形和假設中的序列盒都將落在公開的范圍之內(nèi)(非內(nèi)源氨基酸以粗體字表示)P-AACCTTGGRRRDDDE-KP-AACCTTHIGPRDDDE-K
P-ADEETTGGPRRDDDE-AP-LLKFMSTWZLVAAPQ-KA-LLKFMSTWZLVAAPQ-K也可能在AA15內(nèi)加入氨基酸殘基,但此方法并沒有特別的進展。其實,在最優(yōu)選的實施方案中,在非內(nèi)源的人GPCR與內(nèi)源的GPCR中唯一的氨基酸差別就是在X位置的氨基酸;此氨基酸自身的突變導致該受體的組成型活化。
因此,在特別優(yōu)選的實施方案中,P1和P密碼子分別是內(nèi)源脯氨酸和編碼脯氨酸的內(nèi)源核苷酸編碼區(qū);X和X密碼子分別是非內(nèi)源賴氨酸或丙氨酸和編碼賴氨酸或丙氨酸的非內(nèi)源核苷酸編碼區(qū),其中最優(yōu)選是賴氨酸。因為最優(yōu)選帶有這些突變的非內(nèi)源人GPCR包含在哺乳動物細胞內(nèi)并用于篩選候選化合物,因此,盡管分離和純化的非內(nèi)源人GPCR是在本發(fā)明公開的范圍之內(nèi),但帶有突變的非內(nèi)源的人GPCR本身并不需要純化和分離(即它們被包含在哺乳動物細胞的細胞膜內(nèi))。涉及非內(nèi)源人GPCR的基因-靶向和轉(zhuǎn)基因非人哺乳動物(優(yōu)選是大鼠和小鼠)也在本發(fā)明范圍之內(nèi);特別是,基因靶向的哺乳動物是最優(yōu)選的,這是因為可把人GPCR的非內(nèi)源形式導入這些動物,以代替非人哺乳動物中的內(nèi)源GPCR編碼區(qū)(產(chǎn)生這樣的非人哺乳動物的技術是周知的,利用人編碼區(qū)替代這些非人哺乳動物的蛋白質(zhì)編碼區(qū);例如,參見美國專利號5,777,194)。
已經(jīng)發(fā)現(xiàn)內(nèi)源人GPCR的這些變化可使GPCR組成型活化,以致于非內(nèi)源的被組成型活化的人GPCR尤其可被用來直接篩選候選化合物,而不需要內(nèi)源配體,這正如在此將要進一步揭示的那樣。因此,應用這些材料的方法和用這些方法識別的產(chǎn)品也在如下的公開范圍之內(nèi)。圖示的簡短說明
圖1表示與G蛋白偶聯(lián)的受體的大體結(jié)構,標出的數(shù)字表示跨膜的螺旋、細胞內(nèi)環(huán)和細胞外環(huán)。
圖2示意典型的G蛋白偶聯(lián)受體的活化和非活化兩種狀態(tài),和活性狀態(tài)與第二信使傳導途徑的偶聯(lián)。
圖3是優(yōu)選的載體pCMV的序列圖示,其中包括限制酶切位點的位置。
圖4是進行如下比較所測得的信號圖示pCMV、組成型活化的非內(nèi)源GPR30對于GPR6介導的CRE-Luc報告基因活化的抑制作用、內(nèi)源GPR30對于GPR6介導的CRE-Luc報告基因活化的抑制作用。
圖5是進行如下比較所測得的信號圖示pCMV、組成型活化的非內(nèi)源GPR17對于GPR3介導的CRE-Luc報告基因活化抑制作用,內(nèi)源GPR17對于GPR3介導的CRE-Luc報告基因活化的抑制作用。
圖6是比較pCMV對照、內(nèi)源APJ和非內(nèi)源APJ所測得的信號圖示。
圖7提供了與其內(nèi)源型進行比較的非內(nèi)源人5-HT2A受體產(chǎn)生IP3的圖解說明。
圖8是GPR1(8A)、GPR30(8B)和APJ(8C)的點印跡結(jié)果。詳細描述本科學文獻涉及受體并采用一些術語來描述對受體具有不同作用的配體。為了清楚和前后一致,在本發(fā)明文獻中將由始至終使用下列定義。在這些定義與這些詞語的其他定義沖突時,選擇下列定義激活劑 意味著激活細胞內(nèi)反應的化合物,此時它們結(jié)合受體或促進GTP與膜結(jié)合。
在此應用的氨基酸縮寫列于下表
部分激活劑 意味著這樣的化合物,它們與受體結(jié)合時,激活細胞內(nèi)反應或者促進GTP與膜結(jié)合的程度低于激活劑。
拮抗劑 意味著這樣的化合物,它和激活劑在同一位點與受體競爭性地結(jié)合,但不激活由受體的活性形式引起的細胞內(nèi)反應,并可因此抑制由激活劑或部分激活劑促進的細胞內(nèi)反應。拮抗劑在沒有激活劑或部分激活劑的情形下并不削弱基本細胞內(nèi)反應。
候選化合物 意味著一個將經(jīng)受篩選技術檢驗的分子(例如但不限于化學化合物)。優(yōu)選的“候選化合物”并不包括對公眾來說已知選自受體的反激活劑、激活劑或拮抗劑的化合物,它們以前已通過非直接的識別方法被確定(“非直接識別的化合物”);更優(yōu)選不包括先前已經(jīng)確定至少在一種哺乳動物中具有治療效果的已被非直接識別的化合物;并且,最優(yōu)選不包括先前已經(jīng)確定的在人體中具有治療用途的已被非直接識別的化合物。
密碼子 意味著一組三個核苷酸(或者核苷酸的等價物),它們一般包括一個與磷酸基團偶聯(lián)的核苷(腺苷(A)、鳥苷(G)、胞苷(C)、尿苷(U)和胸苷(T)),當被翻譯時,它們編碼一個氨基酸。
化合物效應 意味著一個化合物抑制或者刺激受體功能的能力的量度,它與受體結(jié)合親和力相對。一個優(yōu)選的測定化合物效應的方法是通過測量[35S]GTPγS的結(jié)合,這一點將在本發(fā)明的實施例部分中進一步公開。
被組成型活化的受體 意味著易受組成型受體活化的受體。與本發(fā)明在此公開的相一致,一個非內(nèi)源的被組成型活化的人G蛋白偶聯(lián)受體是突變后包括氨基酸盒P1AA15X的受體,這正如在下面詳細描述的那樣。
組成型受體活化 意味著不利用它的內(nèi)源配體或其化學等價物與受體結(jié)合的方法而使在活性狀態(tài)下的受體穩(wěn)定。優(yōu)選地是,被本發(fā)明的組成型受體活化的G蛋白偶聯(lián)受體與GPCR的內(nèi)源形式相比,對組成型活化所測得的信號的反應有至少10%的差異(增高或者降低,如具體情況可能的那樣),更優(yōu)選在如此比較的反應中有大約25%的差異,最優(yōu)選在如此比較的反應中有大約50%的差異。當用于直接識別候選化合物的目的時,最優(yōu)選信號差異至少為50%,以使在內(nèi)源信號和非內(nèi)源信號之間有足夠的差異,從而在被選擇的候選化合物之間產(chǎn)生區(qū)別。在最多的情形下,“差異”將是信號的增加;然而,正如下面詳細將要敘述的那樣,對Gs-偶聯(lián)的GPCR來說,測量的“差異”優(yōu)選是降低。
接觸 意味著把至少兩部分放在一起,無論是在體外系統(tǒng)還是在體內(nèi)系統(tǒng)中。
直接識別或被直接識別,與術語“候選化合物”相聯(lián)系,意味著篩選針對組成型活化的G蛋白偶聯(lián)受體的候選化合物。本術語在任何情形下都不應被解釋或被理解為被包括或包括術語“非直接地識別”或“非直接地被識別”。
內(nèi)源 意味著由物種的基因組天然產(chǎn)生的物質(zhì)。關于內(nèi)源的GPCR,意味著由人體、昆蟲、植物、細菌或病毒天然產(chǎn)生的物質(zhì),這些只作為例證但卻不是限制。與之相對比,術語“非內(nèi)源”在本文中 意味著不是由物種的基因組天然產(chǎn)生的物質(zhì)。例如在其內(nèi)源形式下并非組成型活化的受體,當應用在此公開的盒使之突變并因而使之組成型活化時,此受體被最優(yōu)選地指稱為“非內(nèi)源的被組成型活化的受體”,這只作為例證而不是限制。兩個用語都可被用來描述“體內(nèi)”和“體外”系統(tǒng)。在篩選過程中,內(nèi)源的或非內(nèi)源的受體可被用于體外篩選系統(tǒng),其中受體在哺乳動物的細胞表面表達,這也只作為例證而不是限制。作為進一步的例子而不是限制,當操作哺乳動物的基因組以包括非內(nèi)源組成型活化受體時,可以通過體內(nèi)系統(tǒng)篩選候選化合物。
宿主細胞 意味著能在其中插入質(zhì)粒和/或載體的細胞。在原核宿主細胞情形下,當宿主細胞復制時質(zhì)粒典型地以自主分子方式復制(在一般情況下,質(zhì)粒在復制后被分離出來以被引入真核宿主細胞中);在真核宿主細胞情形下,質(zhì)粒被整合進宿主細胞的細胞DNA中,因而,當真核細胞復制時,質(zhì)粒復制。為在此公開的本發(fā)明的目的,宿主細胞優(yōu)選是真核細胞,更優(yōu)選是哺乳動物細胞,最優(yōu)選地是從293、293T和COS-7細胞中選擇出來的細胞。
非直接地識別或非直接地被識別 意味著發(fā)現(xiàn)藥物的傳統(tǒng)方法,該方法涉及對內(nèi)源受體特異的內(nèi)源配體的識別、篩選針對受體的候選化合物、確定那些干擾或競爭配體-受體相互反應的化合物、測量化合物對至少一個與活化受體相關的第二信使途徑影響的效率。
抑制,與用語“反應”相聯(lián)系,意味著在一個化合物存在時一個反應被降低或阻止,這正好與該化合物不存在時相反。
反激活劑 意味著這樣的化合物,它們與內(nèi)源受體或受體的組成型活化形式結(jié)合,并且將由受體的活性形式引發(fā)的基本細胞內(nèi)反應抑制到正?;A水平以下,該活性水平是在沒有激活劑或部分激活劑的情況下觀察的,或者它們降低GTP與膜的結(jié)合。與在沒有反激活劑情況下的基本反應相比,基本細胞內(nèi)反應在反激活劑的存在下優(yōu)選被抑制至少30%、更優(yōu)選至少50%、最優(yōu)選至少75%。。
已知受體 意味著其特異的內(nèi)源配體已被識別的內(nèi)源受體。
配體 意味著對內(nèi)源的天然產(chǎn)生的受體特異的內(nèi)源的天然產(chǎn)生的分子。
關于內(nèi)源受體的核苷酸和/或氨基酸序列的突變 意味著這些內(nèi)源序列的特定改造,從而使內(nèi)源的非組成型活化受體的突變型能造成受體的組成型活化。對于特定序列的等價物,人受體的后續(xù)突變型被認為是人受體的首次突變的等價物,如果(a)后續(xù)突變型受體的組成型活化水平與受體的首次突變所表明的在本質(zhì)上一樣;和(b)在后續(xù)突變型受體和受體的首次突變之間的序列同源性的百分數(shù)是至少80%,更優(yōu)選地是至少90%,最優(yōu)選地是至少95%。在理想的情況下,考慮到在此公開的用于進行組成型活化的最優(yōu)選的盒包括在內(nèi)源和非內(nèi)源型GPCR之間發(fā)生變化的單一氨基酸和/或密碼子(即X或X密碼子),序列同源性的百分數(shù)應是至少98%。
孤兒受體 意味著這樣的內(nèi)源受體,其特異的內(nèi)源配體尚未被識別或尚未知。
藥物組合物 意味著包括至少一種活性成分的組合物,借助此活性成分可以研究該組合物可在哺乳動物(例如但不限于人體)中特定的效果。本領域的那些普通技術人員將能夠理解和正確評價那些適于確定活性成分是否具有基于技術人員需要的預期效果的技術。
質(zhì)粒 意味著載體和cDNA的結(jié)合體。一般,為cDNA復制和/或表達蛋白質(zhì)的目的將質(zhì)粒引進宿主細胞。
刺激,與術語“反應”相聯(lián)系,意味著當一種化合物存在時比當它不存在時反應增強。
橫跨,涉及被定義的核苷酸序列或者被定義的氨基酸序列,意味著該序列位于至少兩個不同且明確限定的區(qū)域之內(nèi)。例如,在一個長度為10個氨基酸的氨基酸序列中,其中10個中的3個是在GPCR的TM6區(qū),其余的7個是在GPCR的IC3區(qū),這10個氨基酸就可被描述為橫跨GPCR的TM6和IC3區(qū)。
針對cDNA的載體 意味著能夠?qū)⒅辽僖粋€cDNA摻入其中且能導入到宿主細胞中的環(huán)形DNA。
下面部分的順序安排是為了表達效果,而不能被解釋為對下面的公開或權利要求的限制。A.引言受體的傳統(tǒng)研究一直是基于這樣的前置假定(基于歷史),即內(nèi)源配體必須首先被識別,然后才能發(fā)現(xiàn)可以作用于受體的拮抗劑和其他分子。甚至在拮抗劑被首先發(fā)現(xiàn)的情況下,搜索的目光也立即延伸到查找內(nèi)源配體上去。即使在發(fā)現(xiàn)組成型活化受體之后,這種思維模式也一直在受體研究中持續(xù)。在此之前沒有被認識到的是,是受體的活性狀態(tài)對發(fā)現(xiàn)受體的激活劑、部分激活劑和反激活劑是最有用的。對于那些因為受體的過度活化和不夠活化而導致的疾病來說,希望得到的治療藥物是能分別用來減少受體的活性狀態(tài)或增強受體活性的化合物,而并不需要是對抗內(nèi)源配體的拮抗劑。這是因為,一個降低或增強活化態(tài)受體活性的化合物并不需要結(jié)合在和內(nèi)源配體一樣的位點上。因而,正如本發(fā)明的一個方法所說的那樣,對治療性化合物的任何搜索可通過篩選針對配體非依賴性活性態(tài)的化合物而開始。
篩選針對非內(nèi)源的組成型活化的GPCR的候選化合物,這可直接識別與這些細胞表面受體作用的候選化合物,而根本無需先了解或使用此受體的內(nèi)源配體。通過確定表達和/或過度表達這些GPCR的內(nèi)源形式的身體內(nèi)部區(qū)域,有可能確定與這些受體的表達和/或過度表達相關的疾病/紊亂狀態(tài);這種方法在本發(fā)明中得到公開。B.疾病/紊亂識別和/或選擇最優(yōu)選用本發(fā)明的材料識別針對非內(nèi)源的組成型活化的GPCR的反激活劑。如此的反激活劑是治療與這些受體有關的疾病的藥物探索中先導化合物的理想候選者。因為可直接識別針對這些受體的反激活劑、部分激活劑或激活劑,因此有可能開發(fā)和搜索針對與這些受體有關的疾病和紊亂的藥物組合物。例如,檢查患病和正常組織樣品中這些受體的存在,現(xiàn)在不僅僅是學術研究的問題,也是在孤兒受體的情形下通過識別來尋找內(nèi)源配體的研究道路上所致力解決的問題。可在健康和患病組織的寬廣范圍內(nèi)進行組織檢查。如此的組織檢查提供了把特異受體與疾病/紊亂相聯(lián)系的優(yōu)選第一步驟。
優(yōu)選內(nèi)源GPCR的DNA序列被用來制作探針,用于在組織樣品中GPCR表達的放射標記cDNA或RT-PCR識別。在疾病組織中受體的存在,或者與正常組織相比在疾病組織中受體的濃度提高或降低,可被優(yōu)選地用來識別與那種疾病的關聯(lián)。用這種方法也可很好地把受體定位于器官的區(qū)域?;谑荏w被定位于其中的特定組織的已知功能,受體假想的功能性角色可被推導出來。C.“人GPCR脯氨酸標記物”算法規(guī)則和非內(nèi)源的組成型活化的人GPCR的形成在生物技術領域所面臨的許多挑戰(zhàn)中,包括從一個物種搜集遺傳信息并把該信息和其他物種的信息相聯(lián)系的不可預測性---在本領域中沒有比編碼核酸和蛋白質(zhì)的遺傳序列這個問題更能困擾人的。因此,為了一致性并考慮到本領域的高不可預測性,下述發(fā)明用的哺乳動物術語局限于人GPCR---把本發(fā)明應用于其他哺乳動物物種,盡管具有潛在可能,但并不僅僅只是生搬硬套式的應用。
一般來說,當企圖把從一種相關的蛋白質(zhì)序列或物種中得到的普遍“規(guī)則”應用于其他序列或物種時,本領域一般是求助于序列的對齊比較,即把序列線性化并希望在兩個或更多的序列之間發(fā)現(xiàn)相同的區(qū)域。盡管很有用,但此方法并不經(jīng)常能產(chǎn)生有意義的信息。在GPCR情形下,盡管所有GPCR的一般結(jié)構基元是相同的,但TM、EC和IC在長度上的不同導致這樣的對齊方法在從一種GPCR到另一種時變得很困難。因此,盡管可以期望應用一種普遍方法,例如從一種GPCR到另一種的組成型活化方法,但由于從一種到另一種GPCR在序列長度、一致性等上存在很大的不同,一種可普遍適用的和實際上成功的突變對齊方法在本質(zhì)上是不可能的。作個類比,如此的一種方法與這樣一種景況相似讓一位旅行者從A點開始旅程,給他很多指向B點的不同的地圖,但在任何一個地圖上卻沒有任何比例尺或距離的標記物,然后讓旅行者僅利用這些地圖找出通向目的地B的最短和最有效的路徑。在這樣的情況下,通過以下手段被簡化任務擁有(a)在每個地圖上都有一個共同的“地方標記物”,和(b)測量從每一個地方標記物到目的地B的距離的能力,然后,這將容許旅行者選擇從開始點A通向目的地B的最有效的路徑。
在本質(zhì)上,本發(fā)明的一個特點是提供在人GPCR中的這樣一個坐標,它可以容許形成組成型活化的人GPCR。
正如此技術領域中所評價的那樣,細胞的跨膜區(qū)域是高度疏水的;因此,應用通常的疏水測繪技術,本領域的技術人員就可確定GPCR的TM區(qū),特別是TM6(同樣的方法也可用來確定GPCR的EC區(qū)和IC區(qū))。已經(jīng)發(fā)現(xiàn),在人GPCR的TM6區(qū),一個共同的脯氨酸殘基(一般靠近TM6的中間)是組成型活化的“標記物”。通過從脯氨酸標記物數(shù)過15個氨基酸,第16個氨基酸(定位在IC3環(huán)上)在從其內(nèi)源形式突變?yōu)榉莾?nèi)源形式時導致受體的組成型活化。為方便起見,我們把這稱為“人GPCR脯氨酸標記物”算法規(guī)則。盡管在此位置的非內(nèi)源氨基酸可以是任何一種氨基酸,但最優(yōu)選的非內(nèi)源氨基酸還是賴氨酸。盡管并不希望被任何理論所束縛,我們還是相信該位置的本身是獨特的,并且,本位置上的突變會影響受體發(fā)生細成性的活化。
我們注意到,例如,當在第16位置上的內(nèi)源氨基酸已經(jīng)是賴氨酸時(如在GPR4和GPR32中的那樣),那么為了讓X是一個非內(nèi)源氨基酸,它必不是賴氨酸;因此,在內(nèi)源GPCR的第16位位置上是內(nèi)源賴氨酸殘基的情形下,非內(nèi)源GPCR在該位置上優(yōu)選不是賴氨酸的氨基酸,優(yōu)選是丙氨酸、組氨酸和精氨酸。進一步注意到,確定了GPR4看起來被與Gs相聯(lián)并在其內(nèi)源形式下活化(數(shù)據(jù)沒有列出)。
因為僅有20種天然氨基酸(盡管可以利用非天然形成的氨基酸),為此第16位置的替代而選擇特別的非內(nèi)源氨基酸是可行的,并且容許有效選擇適合研究者需要的非內(nèi)源氨基酸。然而,正如提示,在第16位更優(yōu)選的非內(nèi)源氨基酸是賴氨酸、組氨酸、精氨酸和丙氨酸,其中賴氨酸是最優(yōu)選的??烧J為本領域的普通技術人員有能力用熟練的方法改造密碼子的序列以產(chǎn)生所期望的突變。
也發(fā)現(xiàn),在偶然而并非是經(jīng)常的情形下,脯氨酸殘基標記物在TM6中位于W2之后(即,W2P1AA15X),其中W是色氨酸,2是任何氨基酸殘基。
我們的發(fā)現(xiàn)否定了對于本領域常常應用不可預測且復雜的序列對齊方法的需求。其實,盡管在本質(zhì)上是一個規(guī)則,但我們的發(fā)現(xiàn)的重要性就在于它可以容易的方法應用于人GPCR上,其可被本領域技術人員靈巧地簡化,并得到獨特和高度有用的終產(chǎn)品,即被組成型活化的人GPCR。因為需要很多年和很多資金來確定人GPCR的內(nèi)源配體(正由人類基因組計劃揭示),本發(fā)明不僅降低積極探索這種序列信息所必要的時間,也能顯著地節(jié)約成本。本方法能夠真正證實人類基因組計劃的重要性,因為它不僅準許應用基因信息來理解GPCR在疾病等中的角色,也能提供提高人類健康狀況的可能。D.候選化合物的篩選1.GPCR的篩選測定技術當一種G蛋白受體變?yōu)榻M成型活化時,它與G蛋白(例如,Gq、Gs、Gi、Go)偶聯(lián)并刺激釋放GTP,其后GTP與G蛋白結(jié)合。接著,借助受體在正常情況下失活,G蛋白作為GTP酶慢慢地把GTP水解為GDP,然而,包括本發(fā)明的非內(nèi)源的組成型活化的人GPCR在內(nèi),組成型活化的受體繼續(xù)把GDP轉(zhuǎn)化為GTP。GTP非可水解的類似物[35S]GTPγS,可被用來監(jiān)測與表達組成型活化受體的膜上的G蛋白加強了的結(jié)合。據(jù)報道,[35S]GTPγS可被用來監(jiān)測在配體存在或不存在的情形下G蛋白與膜的偶聯(lián)。在本領域中著名和可行的其他例證中有此種監(jiān)測的一個例證,它由Traynor和Nahorski在1995年所報道。本測定系統(tǒng)的一個優(yōu)選的應用是為了初步篩選候選化合物,因為本系統(tǒng)對所有蛋白-偶聯(lián)受體一般可行,而不考慮與受體的細胞內(nèi)結(jié)構域相互作用的那一種特別的G蛋白。B2.特定的GPCR篩選測定技術C 一旦應用“一般”G蛋白偶聯(lián)的受體測定方法(即篩選是激活劑、部分激活劑或反激活劑的化合物的方法)識別出候選化合物,優(yōu)選進一步篩選以確認作用在受體位點的化合物。例如,應用“一般”測定方法識別的化合物可以不與受體結(jié)合,但也可以僅僅從細胞內(nèi)結(jié)構域與G蛋白“解偶聯(lián)”。a.Gs和GiGs刺激腺苷酸環(huán)化酶。另一方面,Gi(和Go)抑制該酶。腺苷酸環(huán)化酶催化ATP向cAMP的轉(zhuǎn)化;因此,與Gs蛋白偶聯(lián)的組成型活化的GPCR與升高的細胞內(nèi)cAMP水平相關聯(lián)。在另一方面,與Gi(或Go)蛋白偶聯(lián)的組成型活化的GPCR與降低的細胞內(nèi)cAMP水平相關聯(lián)。一般情況參見“突觸傳導的非直接機制(Indirect Mechanisms ofSynaptic Transmission)”,第8章,叢神經(jīng)到大腦(From Neuron ToBrain)(第三版),Nichols,J.G.等編,Sinauer Associates,Inc.(1992)。因此,檢測cAMP的方法可被用來確定一個競爭性的化合物是否是受體的反激活劑(即這樣的一個化合物將能降低cAMP的水平)等。本領域已知的測定cAMP的不同方法可以被利用;最優(yōu)選的方法依賴于在基于ELISA的方法中應用抗-cAMP的抗體??杀粦玫牧硪活悳y定方法是一種全細胞第二信使報告基因系統(tǒng)測定法?;蛏系膯幼域?qū)動由一個特別的基因所編碼的蛋白質(zhì)的表達。環(huán)AMP通過以下步驟促進基因的表達,即它響應促進cAMP的DNA結(jié)合蛋白或轉(zhuǎn)錄因子(CREB)的結(jié)合,轉(zhuǎn)錄因子接著在被稱為cAMP效應元件的特別位點與肩動子結(jié)合并驅(qū)動基因表達。報告基因系統(tǒng)可被構建為具有一個啟動子,該啟動子在報告基因的前面含有多個cAMP效應兀件,例如β-半乳糖苷酶或熒光素酶。因而,一個被組成型活化的連接Gs的受體引起cAMP的積累,cAMP接著激活報告蛋白質(zhì)的基因和表達。β-半乳糖苷酶或熒光素酶等報告蛋白質(zhì)可用標準生化方法檢測到(Chen等,1995)。對于偶聯(lián)Gi(或Go)的GPCR來說,它能降低cAMP水平,一種篩選反激活劑等的方法是基于應用與Gs相連接的受體(并因而降低cAMP水平),該篩選方法在實施例部分被公開,其針對GPR17和GPR30。b.Go和GqGo和Gq與磷脂酶C的活化相聯(lián)系,磷脂酶隨后水解磷酸酯PIP2,并釋放兩種細胞內(nèi)信使二酰甘油(DAG)和肌醇-1,4,5-三磷酸(IP3)。積累增加的IP3與Gq-和Go-關聯(lián)的受體相關聯(lián)。一般情況參見“突觸傳導的非直接機制(Indirect Mechanisms of Synaptic Transmission)”,第8章,從神經(jīng)到大腦(From Neuron To Brain)(第三版),Nichols,J.G.等編,Sinauer Associates,Inc.(1992)。測定IP3積累的方法可被用來確定一個候選化合物是否是例如針對Gq-或Go-關聯(lián)受體等的反激活劑(即如此的化合物能降低IP3的水平)。Gq關聯(lián)受體也可用AP1報告基因測定方法來檢測,因為Gq依賴的磷脂酶C引起含有AP1元件的基因活化;因而,活化的Gq關聯(lián)受體將導致如此基因的表達增高,而其反激活劑將導致如此表達的降低,激活劑將導致如此表達的升高。進行如此測定的商業(yè)可得的方法是可得的。E.藥物化學在一般但并非經(jīng)常的情況下對候選化合物直接識別與通過組合化學技術產(chǎn)生的化合物聯(lián)合使用,其中隨機制備幾千種化合物用于此分析。如此篩選的結(jié)果一般將是具有獨特中心結(jié)構的化合物;其后,這些化合物圍繞著一個優(yōu)選的中心結(jié)構而被優(yōu)選進行額外的化學修飾,以進一步加強其藥用性質(zhì)。這樣的技術在該領域中是已知的,并不需要在本專利文件中詳細描述。F.藥物細合物為進一步開發(fā)而選擇出的候選化合物可應用本領域周知的技術制劑成藥物組合物。適宜的藥物可接受的載體在本領域中是可得的;例如,參見Remington’s Pharmaceuctical Sciences,第16版,1980,MackPublishing Co.(Oslo等編)。G.其他應用盡管公開的非內(nèi)源人GPCR的一個優(yōu)選的應用是為了直接識別作為反激活劑、激活劑或部分激活劑(優(yōu)選地作為藥物使用)的候選化合物,這些受體也可被用于研究之用。例如,導入這些受體的體外或體內(nèi)系統(tǒng)可被用來闡釋和理解受體在正常和患病的人體狀況中的作用,也可理解當它應用于理解信號級聯(lián)反應時組成型活化的角色。這些非內(nèi)源受體的一個價值由于其獨特的特點是它們作為研究工具的用途被強化,公開的受體可被用來理解特殊受體在人體中的作用,即使在其內(nèi)源配體被識別之前。公開的受體的其他應用對于本領域的技術人員將是明顯的,特別是當他們閱讀了本申請文件之后。實施例下面的實施例是為了說明而非限制本發(fā)明的目的。根據(jù)本發(fā)明文件的敘述,可以在基于與TM6中的脯氨酸殘基有關的位置的人GPCR的IC3環(huán)中應用突變盒以組成型活化受體,并且,盡管在此公開了特定的核苷酸和氨基酸序列,但可認為本領域中的那些普通技術人員擁有這樣的能力,即通過對這些序列的簡單的修飾就可得到與下面所報道的相同或基本相似的結(jié)果。基于技術人員特殊需要的特殊的序列突變方法是在技術人員的了解范圍之內(nèi)。實施例1制備人內(nèi)源GPCR不同的GPCR應用在如下的實施例中。一些人內(nèi)源GPCR在表達載體中提供(如下面致謝),其他的人內(nèi)源GPCR是用公眾可得的序列信息從頭合成的。1.GPR1(GenBank入藏登記號U13666)GPR1的人cDNA序列由Brian O’Dowd(多倫多大學)在pRcCMV中提供。GPR1 cDNA(1.4kb片段)作為一個NdeI-XbaI片段被從pRcCMV載體中切下,并被亞克隆進pCMV載體的NdeI-XbaI位點(參見圖3)。人GPR1的核苷酸(SEQ ID NO1)和氨基酸(SEQ ID NO2)序列其后被確定和證實。2.GPR4(GenBank入藏登記號L36148,U35399,U21051)GPR4的人cDNA序列由Brian O’Dowd(多倫多大學)在pRcCMV中提供。GPR1 cDNA(1.4kb片段)作為一個ApaI(平端)-XbaI片段被從pRcCMV載體中切下,并被亞克隆進(把大部分5’未翻譯區(qū)除去)pCMV載體的HindIII(平端)-XbaI位點。人GPR4的核苷酸序列(SEQ ID NO3)和氨基酸(SEQ ID NO4)序列其后被確定和證實。3.GPR5(GenBank入藏登記號L36149)按如下步驟產(chǎn)生人GPR5的cDNA并將其克隆進pCMV表達載體以基因組DNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中用每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃1分鐘;64℃ 1分鐘;72℃ 1.5分鐘。5’PCR引物包括一個EcoR I位點,其序列為5’-TATGAATTCAGATGCTCTAAACGTCCCTGC-3’(SEQ IDNO5)3’引物包括BamH I位點,其序列為5’-TCCGGATCCACCTGCACCTGCGCCTGCACC-3’(SEQ IDNO6)1.1kb PCR片段被用EcoR I和BamH I消化并被克隆進pCMV表達載體的EcoR I-BamH I位點。人GPR5的核酸(SEQ ID NO7)和氨基酸(SEQ ID NO8)序列其后被確定和證實。4.GPR7(GenBank入藏登記號U22491)按如下步驟產(chǎn)生人GPR7的cDNA并把其克隆進pCMV表達載體PCR條件-以基因組DNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;62℃ 1分鐘;72℃ 1分鐘20秒。5’PCR引物包括一個Hind III位點,且其序列為5’-GCAAGCTTGGGGGACGCCAGGTCGCCGGCT-3’(SEQ IDNO9)3’引物包括BamH I位點,其序列為5’-GCGGATCCGGACGCTGGGGGAGTCAGGCTGC-3’(SEQ IDNO10)1.1kb PCR片段被用Hind III和BamH I消化并被克隆進pCMV表達載體的Hind III-BamH I位點。人GPR7的核酸(SEQ ID NO11)和氨基酸(SEQ ID NO12)序列其后被確定和證實。5.GPR8(GenBank入藏登記號U22492)按如下步驟產(chǎn)生人GPR8的cDNA并把其克隆進pCMV表達載體以基因組DNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;62℃ 1分鐘;72℃ 1分鐘20秒。5’PCR引物包括一個EcoR I位點,其序列為5’-CGGAATTCGTCAACGGTCCCAGCTACAATG-3’(SEQ IDNO13)3’引物包括BamH I位點,其序列為5’-ATGGATCCCAGGCCCTTCAGCACCGCAATAT-3’(SEQ IDNO14)1.1kb PCR片段被用EcoR I和BamH I消化并被克隆進pCMV表達載體的EcoR I-BamH I位點。所有測序的4個cDNA克隆包含可能的多態(tài)現(xiàn)象,它涉及第206個氨基酸從Arg轉(zhuǎn)變?yōu)镚ln。暫且不論這個差別,人GPR8的核酸(SEQ ID NO15)和氨基酸(SEQ ID NO16)序列其后被確定和證實。6.GPR9(GenBank入藏登記號X95876)按如下步驟產(chǎn)生人GPR9的cDNA并把其克隆進pCMV表達載體以克隆為模板(由Brian O’Dowd提供),用pfu聚合酶(Stratagene)和制造商提供的加有10%DMSO的緩沖系統(tǒng)進行PCR,其中使用每種引物0.25μM、4種核苷酸每種0.5mM。循環(huán)條件是進行25個循環(huán)94℃ 1分鐘;56℃ 1分鐘;72℃ 2.5分鐘。5’PCR引物包括一個EcoRI位點,其序列為5’-ACGAATTCAGCCATGGTCCTTGAGGTGAGTGACCACCAAGTGCTAAAT-3’(SEQ ID NO17)3’引物包括BamH I位點,其序列為5’-GAGGATCCTGGAATGCGGGGAAGTCAG-3’(SEQ ID NO18)1.2kb PCR片段被用EcoR I消化并被克隆進pCMV表達載體的EcoR I-Sam I位點。人GPR9的核酸(SEQ ID NO19)和氨基酸(SEQ IDNO20)序列其后被確定和證實。7.GPR9-6(GenBank入藏登記號U45982)按如下步驟產(chǎn)生人GPR9-6的cDNA并把其克隆進pCMV表達載體以基因組DNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;62℃ 1分鐘;72℃ 1分鐘20秒。5’PCR引物被用如下序列激酶化5’-TTAAGCTTGACCTAATGCCATCTTGTGTCC-3’(SEQ IDNO21)3’引物包括BamH I位點,其序列為
5’-TTGGATCCAAAAGAACCATGCACCTCAGAG-3’(SEQ IDNO22)1.2kb PCR片段被用BamH I消化并被克隆進pCMV表達載體的EcoRV-BamH I位點。人GPR9-6的核酸(SEQ ID NO23)和氨基酸(SEQID NO24)序列其后被確定和證實。8.GPR10(GenBank入藏登記號U32672)GPR10的人cDNA序列由Brian O’Dowd(多倫多大學)在pRcCMV中提供。GPR10 cDNA(1.3kb片段)作為一個EcoRI-XbaI片段被從pRcCMV載體中切下,并被亞克隆進pCMV載體的EcoRI-XbaI位點。人GPR10的核酸(SEQ ID NO25)和氨基酸(SEQ ID NO26)序列其后被確定和證實。9.GPR15(GenBank入藏登記號U34806)GPR15的人cDNA序列由Brian O’Dowd(多倫多大學)在pCDNA3中提供。GPR15 cDNA(1.5kb片段)作為一個HindIII-Bam片段被從pCDNA3載體中切下,并被亞克隆進pCMV載體的HindIII-Bam位點。人GPR15的核酸(SEQ ID NO27)和氨基酸(SEQ ID NO28)序列其后被確定和證實。10.GPR17(GenBank入藏登記號Z94154)按如下步驟產(chǎn)生人GPR17的cDNA并把其克隆進pCMV表達載體以基因組DNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;56℃ 1分鐘72℃ 1分鐘20秒。5’PCR引物包括一個EcoR I位點,其序列為5’-CTAGAATTCTGACTCCAGCCAAAGCATGAAT-3’(SEQ IDNO29)3’引物包括BamH I位點,其序列為
5’-GCTGGATCCTAAACAGTCTGCGCTCGGCCT-3’(SEQ IDNO30)1.1kb PCR片段被用EcoR I和BamH I消化并被克隆進pCMV表達載體的EcoR I-BamH I位點。人GPR17的核酸(SEQ ID NO31)和氨基酸(SEQ ID NO32)序列其后被確定和證實。11.GPR18(GenBank入藏登記號L42324)按如下步驟產(chǎn)生人GPR18的cDNA并把其克隆進pCMV表達載體以基因組DNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;54℃ 1分鐘;72℃ 1分鐘20秒。5’PCR引物被用如下序列激酶化5’-ATAAGATGATCACCCTGAACAATCAAGAT-3’(SEQ IDNO33)3’引物包括EcoR I位點,其序列為5’-TCCGAATTCATAACATTTCACTGTTTATATTGC-3’(SEQ IDNO34)1.0kb PCR片段被用EcoR I消化并被克隆進pCMV表達載體的平端-EcoR I位點。所有8個被測序的cDNA克隆含有4種可能的多態(tài),其中涉及第12位的氨基酸從Thr變?yōu)镻ro,第86位的氨基酸從Ala變?yōu)镚lu,第97位的氨基酸從Ile變?yōu)長eu,第310位的氨基酸從Leu變?yōu)镸et。暫且不論這些改變,人GPR18的核酸(SEQ ID NO35)和氨基酸(SEQ ID NO36)序列其后被確定和證實。12.GPR20(GenBank入藏登記號U66579)按如下步驟產(chǎn)生人GPR20的cDNA并把其克隆進pCMV表達載體以基因組DNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸中每個0.2mM。循環(huán)條件是30個循環(huán)94℃ 1分鐘;62℃ 1分鐘;72℃ 1分鐘20秒。5’PCR引物被用如下序列激酶化5’-CCAAGCTTCCAGGCCTGGGGTGTGCTGG-3’(SEQ IDNO37)3’引物包括BamH I位點,其序列為5’-ATGGATCCTGACCTTCGGCCCCTGGCAGA-3’(SEQ IDNO38)1.2kb PCR片段被用BamH I消化并被克隆進pCMV表達載體的EcoRV-BamH I位點。人GPR20的核酸(SEQ ID NO39)和氨基酸(SEQID NO40)序列其后被確定和證實。13.GPR21(GenBank入藏登記號U66580)按如下步驟產(chǎn)生人GPR21的cDNA并把其克隆進pCMV表達載體以基因組DNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;62℃ 1分鐘;72℃ 1分鐘20秒。5’PCR引物被用如下序列激酶化5’-GAGAATTCACTCCTGAGCTCAAGATGAACT-3’(SEQ IDNO41)3’引物包括BamH I位點,其序列為5’-CGGGATCCCCGTAACTGAGCCACTTCAGAT-3’(SEQ IDNO42)1.1kb PCR片段被用BamH I消化并被克隆進pCMV表達載體的EcoRV-BamH I位點。人GPR21的核酸(SEQ ID NO43)和氨基酸(SEQID NO44)序列其后被確定和證實。14.GPR22(GenBank入藏登記號U66581)按如下步驟產(chǎn)生人GPR22的cDNA并把其克隆進pCMV表達載體以基因組DNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;50℃ 1分鐘;72℃ 1.5分鐘。5’PCR引物被用如下序列激酶化5’-TCCCCCGGGAAAAAAACCAACTGCTCCAAA-3’(SEQ IDNO45)3’引物包括BamH I位點,其序列為5’-TAGGATCCATTTGAATGTGGATTTGGTGAAA-3’(SEQ IDNO46)1.38kb PCR片段被用BamH I消化并被克隆進pCMV表達載體的EcoRV-BamH I位點。人GPR22的核酸(SEQ ID NO47)和氨基酸(SEQID NO48)序列其后被確定和證實。15.GPR24(GenBank入藏登記號U71092)按如下步驟產(chǎn)生人GPR24的cDNA并把其克隆進pCMV表達載體以基因組DNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;56℃ 1分鐘;72℃ 1分鐘20秒。5’PCR引物含有具有如下序列的Hind III位點5’-GTGAAGCTTGCCTCTGGTGCCTGCAGGAGG-3’(SEQ IDNO49)3’引物包括EcoR I位點,其序列為5’-GCAGAATTCCCGGTGGCGTGTTGTGGTGCCC-3’(SEQ IDNO50)1.3kb PCR片段被用Hind III和EcoR I消化并被克隆進pCMV表達載體的Hind III和EcoR I位點。人GPR24的核酸(SEQ ID NO51)和氨基酸(SEQ ID NO52)序列其后被確定和證實。16.GPR30(GenBank入藏登記號U63917)按如下步驟產(chǎn)生人GPR30的cDNA并克隆化從基因組DNA中擴增GPR30(1128bp的長度)的編碼序列,并應用下面的引物5’GGCGGATCCATGGATGTGACTTCCCAA-3’(SEQ ID NO53)和5’GGCGGATCCCTACACGGCACTGCTGAA-3’(SEQ IDNO54)。
然后用“TOPO-TA克隆試劑盒”(Invitrogen,#K4500-01)跟隨制造商的指令把擴增的產(chǎn)品克隆進商業(yè)可得的載體PCR2.1(Invitrogen)中。用BamH I消化來釋放全長GPR30插入子,用瓊脂糖凝膠電泳使之從載體中分離,用Sephaglas BandprepTM試劑盒(Pharmacia,#27-9285-01)按照制造商的指令純化。人GPR30的核酸(SEQ ID NO55)和氨基酸(SEQ ID NO56)序列其后被確定和證實。17.GPR31(GenBank入藏登記號U65402)按如下步驟產(chǎn)生人GPR31的cDNA并把其克隆進pCMV表達載體以基因組DNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;58℃ 1分鐘;72℃ 2分鐘。5’PCR引物含有具有如下序列的EcoR I位點5’-AAGGAATTCACGGCCGGGTGATGCCATTCCC-3’(SEQ IDNO57)3’引物包括BamH I位點,其序列為5’-GGTGGATCCATAAACACGGGCGTTGAGGAC-3’(SEQ IDNO58)1.0kb PCR片段被用EcoR I和BamH I消化并被克隆進pCMV表達載體的EcoR I-BamH I位點。人GPR31的核酸(SEQ ID NO59)和氨基酸(SEQ ID NO60)序列其后被確定和證實。18.GPR32(GenBank入藏登記號AF045764)按如下步驟產(chǎn)生人GPR32的cDNA并把其克隆進pCMV表達載體以基因組DNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;56℃ 1分鐘;72℃ 1分鐘20秒。5’PCR引物含有具有如下序列的EcoR I位點5’-TAAGAATTCCATAAAAATTATGGAATGG-3’(SEQ IDNO243)3’引物包括BamH I位點,其序列為5’-CCAGGATCCAGCTGAAGTCTTCCATCATTC-3’(SEQ IDNO244)1.1kb PCR片段被用EcoR I和BamH I消化并被克隆進pCMV表達載體的EcoR I-BamH I位點。人GPR32的核酸(SEQ ID NO245)和氨基酸(SEQ ID NO246)序列其后被確定和證實。19.GPR40(GenBank入藏登記號AF024687)按如下步驟產(chǎn)生人GPR40的cDNA并把其克隆進pCMV表達載體以基因組DNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;65℃ 1分鐘;72℃ 1分鐘10秒。5’PCR引物含有具有如下序列的EcoR I位點5’-GCAGAATTCGGCGGCCCCATGGACCTGCCCCC-3’(SEQ IDNO247)3’引物包括BamH I位點,其序列為5’-GCTGGATCCCCCGAGCAGTGGCGTTACTTC-3’(SEQ IDNO248)1kb PCR片段被用EcoR I和BamH I消化并被克隆進pCMV表達載體的EcoR I-BamH I位點。人GPR40的核酸(SEQ ID NO249)和氨基酸(SEQ ID NO250)序列其后被確定和證實。20.GPR41(GenBank入藏登記號AF024688)按如下步驟產(chǎn)生人GPR41的cDNA并把其克隆進pCMV表達載體以基因組DNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;65℃ 1分鐘;72℃ 1分鐘10秒。5’PCR引物含有具有如下序列的Hind III位點5’-CTCAAGCTTACTCTCTCTCACCAGTGGCCAC-3’(SEQ IDNO251)3’引物被用下列序列激酶化5’-CCCTCCTCCCCCGGAGGACCTAGC-3’(SEQ ID NO252)1kb PCR片段被用Hind III消化并被克隆進pCMV表達載體的Hind III-平端位點。人GPR41的核酸(SEQ ID NO253)和氨基酸(SEQ IDNO254)序列其后被確定和證實。21.GPR43(GenBank入藏登記號AF024690)按如下步驟產(chǎn)生人GPR43的cDNA并把其克隆進pCMV表達載體以基因組DNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;65℃ 1分鐘;72℃ 1分鐘10秒。5’PCR引物含有具有如下序列的Hind III位點5’-TTTAAGCTTCCCCTCCAGGATGCTGCCGGAC-3’(SEQ IDNO255)3’引物包括EcoR I位點,其序列為5’-GGCGAATTCTGAAGGTCCAGGGAAACTGCTA-3’(SEQ IDNO256)1kb PCR片段被用Hind III和EcoR I消化并被克隆進pCMV表達載體的Hind III-EcoR I位點。人GPR43的核酸(SEQ ID NO257)和氨基酸(SEQ ID NO258)序列其后被確定和證實。22.APJ(GenBank入藏登記號U03642)人APJ的cDNA(在pRcCMV載體中)由Brian O’Dowd(多倫多大學)提供。人APJ的cDNA作為一個EcoR I-XbaI(平端)片段被從pRcCMV載體中切下,并被亞克隆進pCMV載體的EcoR I-Smal位點。人APJ的核苷酸(SEQ ID NO61)和氨基酸(SEQ ID NO62)序列其后被確定和證實。23.BLR1(GenBank入藏登記號X68149)按如下步驟產(chǎn)生人BLR1的cDNA并把其克隆進pCMV表達載體以胸腺cDNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;62℃ 1分鐘;72℃ 1分鐘20秒。5’PCR引物含有具有如下序列的EcoR I位點5’-TGAGAATTCTGGTGACTCACAGCCGGCACAG-3’(SEQ IDNO63)3’引物包括BamH I位點,其序列為5’-GCCGGATCCAAGGAAAAGCAGCAATAAAAGG-3’(SEQ IDNO64)1.2kb PCR片段被用EcoR I和BamH I消化并被克隆進pCMV表達載體的EcoR I-BamH I位點。人BLR1的核酸(SEQ ID NO65)和氨基酸(SEQ ID NO66)序列其后被確定和證實。24.CEPR(GenBank入藏登記號U77827)按如下步驟產(chǎn)生人CEPR的cDNA并把其克隆進pCMV表達載體以基因組cDNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;65℃ 1分鐘;72℃ 1分鐘20秒。5’PCR引物被如下序列激酶化5’-CAAAGCTTGAAAGCTGCACGGTGCAGAGAC-3’(SEQ IDNO67)3’引物包括BamH I位點,其序列為5’-GCGGATCCCGAGTCACACCCTGGCTGGGCC-3’(SEQ IDNO68)1.2kb PCR片段被用BamH I消化并被克隆進pCMV表達載體的EcoRV-BamH I位點。人CEPR的核酸(SEQ ID NO69)和氨基酸(SEQ IDNO70)序列其后被確定和證實。25.EBIl(GenBank入藏登記號L31581)按如下步驟產(chǎn)生人EBI1的cDNA并把其克隆進pCMV表達載體以胸腺cDNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;62℃ 1分鐘;72℃ 1分鐘20秒。5’PCR引物包括EcoR I位點,其序列為5’-ACAGAATTCCTGTGTGGTTTTACCGCCCAG-3’(SEQ IDNO71)3’引物包括BamH I位點,其序列為5’-CTCGGATCCAGGCAGAAGAGTCGCCTATGG-3’(SEQ IDNO72)
1.2kb PCR片段被用EcoR I和BamH I消化并被克隆進pCMV表達載體的EcoR I-BamH I位點。人EBI1的核酸(SEQ ID NO73)和氨基酸(SEQ ID NO74)序列其后被確定和證實。26.EBI2(GenBank入藏登記號L08177)按如下步驟產(chǎn)生人EBI2的cDNA并把其克隆進pCMV表達載體以cDNA克隆為模板(由Kevin Lynch提供,University of VirginiaHealth Sciences Center;應用的載體未被本資料識別),用pfu聚合酶(Stratagene)和制造商提供的緩沖系統(tǒng)并輔以10%DMSO進行PCR,其中使用每個引物0.25μM、4種核苷酸中每種0.5mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;60℃ 1分鐘;72℃ 1分鐘20秒。5’PCR引物包括EcoR I位點,其序列為5’-CTGGAATTCACCTGGACCACCACCAATGGATA-3’(SEQ IDNO75)3’引物包括BamH I位點,其序列為5’-CTCGGATCCTGCAAAGTTTGTCATACAGTT-3’(SEQ IDNO76)1.2kb PCR片段被用EcoR I和BamH I消化并被克隆進pCMV表達載體的EcoR I-BamH I位點。人EBI2的核酸(SEQ ID NO77)和氨基酸(SEQ ID NO78)序列其后被確定和證實。27.ETBR-LP2(GenBank入藏登記號D38449)按如下步驟產(chǎn)生人ETBR-LP2的cDNA并把其克隆進pCMV表達載體以腦cDNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;65℃ 1分鐘;72℃ 1.5分鐘。5’PCR引物包括EcoR I位點,其序列為5’-CTGGAATTCTCCTGCTCATCCAGCCATGCGG-3’(SEQ IDNO79)3’引物包括BamH I位點,其序列為5’-CCTGGATCCCCACCCCTACTGGGGCCTCAG-3’(SEQ IDNO80)1.5kb PCR片段被用EcoR I和BamH I消化并被克隆進pCMV表達載體的EcoR I-BamH I位點。人ETBR-LP2的核酸(SEQ ID NO81)和氨基酸(SEQ ID NO82)序列其后被確定和證實。28.GHSR(GenBank入藏登記號U60179)按如下步驟產(chǎn)生人GHSR的cDNA并把其克隆進pCMV表達載體以海馬cDNA為模板,用TaqPlus Precision聚合酶(Stratagene)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;68℃ 1分鐘;72℃ 1分鐘10秒。對于第一輪PCR,5’PCR引物序列為5’-ATGTGGAACGCGACGCCCAGCG-3’(SEQ ID NO83)3’引物序列為5’-TCATGTATTAATACTAGATTCT-3’(SEQ ID NO84)2毫升第一輪PCR產(chǎn)物被用作第二輪PCR模板,其中5’引物被如下序列所激酶化5’-TACCATGTGGAACGCGACGCCCAGCGAAGAGCCGGGGT-3’(SEQ ID NO85)3’PCR引物包括EcoR I位點,其序列為5’-CGGAATTCATGTATTAATACTAGATTCTGTCCAGGCCCG-3’(SEQ ID NO86)1.1kb PCR片段被用EcoR I消化并被克隆進pCMV表達載體的平端-EcoR I位點。人GHSR的核酸(SEQ ID NO87)和氨基酸(SEQ IDNO88)序列其后被確定和證實。29.GPCR-CNS(GenBank入藏登記號AFO17262)按如下步驟產(chǎn)生人GPCR-CNS的cDNA并把其克隆進pCMV表達載體以腦cDNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘65℃ 1分鐘;72℃ 2分鐘。5’PCR引物包括Hind III位點,其序列為5’-GCAAGCTTGTGCCCTCACCAAGCCATGCGAGCC-3’(SEQID NO89)3’引物包括EcoR I位點,其序列為5’-CGGAATTCAGCAATGAGTTCCGACAGAAGC-3’(SEQ IDNO90)1.9kb PCR片段被用Hind III和EcoR I消化并被克隆進pCMV表達載體的Hind III-EcoR I位點。所有9個被測序的克隆包含涉及一個S284C變化的潛在多態(tài)現(xiàn)象。暫且不論這個差別,人GPCR-CNS的核酸(SEQ ID NO91)和氨基酸(SEQ ID NO92)序列其后被確定和證實。30.GPR-NGA(GenBank入藏登記號U55312)按如下步驟產(chǎn)生人GPR-NGA的cDNA并把其克隆進pCMV表達載體以基因組cDNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;56℃ 1分鐘;72℃ 1.5分鐘。5’PCR引物包括EcoR I位點,其序列為5’-CAGAATTCAGAGAAAAAAAGTGAATATGGTTTTT-3’(SEQID NO93)
3’引物包括BamH I位點,其序列為5’-TTGGATCCCTGGTGCATAACAATTGAAAGAAT-3’(SEQ IDNO94)1.3kb PCR片段被用EcoR I和BamH I消化并被克隆進pCMV表達載體的EcoR I-BamH I位點。人GPR-NGA的核酸(SEQ ID NO95)和氨基酸(SEQ ID NO96)序列其后被確定和證實。31.H9(GenBank入藏登記號U52219)按如下步驟產(chǎn)生人HB954的cDNA并把其克隆進pCMV表達載體以垂體cDNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;62℃ 1分鐘;72℃ 2分鐘。5’PCR引物包括Hind III位點,其序列為5’-GGAAAGCTTAACGATCCCCAGGAGCAACAT-3’(SEQ IDNO97)3’引物包括BamH I位點,其序列為5’-CTGGGATCCTACGAGAGCATTTTTCACACAG-3’(SEQ IDNO98)1.9kb PCR片段被用Hind III和BamH I消化并被克隆進pCMV表達載體的Hind III-BamH I位點。與公開的序列相比較,還識別出一個不同的同種型并稱其為“H9b”,其在細胞質(zhì)尾部有一個12個bp的框插入。兩個同種型都含有涉及氨基酸P320S和G448A改變的潛在多態(tài)現(xiàn)象。同種型H9a含有另一個涉及氨基酸S493N改變的潛在多態(tài)現(xiàn)象,而同種型H9b含有另兩個額外的涉及氨基酸I502T和A532T(相當于同種型H9a的氨基酸528)改變的潛在多態(tài)現(xiàn)象。人H9的核酸(SEQ ID NO99)和氨基酸(SEQ ID NO100)序列其后被確定和證實(在下面的部分,兩個同種型都依據(jù)人GPCR脯氨酸標記物規(guī)則進行突變)。32.HB954(GenBank入藏登記號D38449)按如下步驟產(chǎn)生人HB954的cDNA并把其克隆進pCMV表達載體以腦cDNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;58℃ 1分鐘;72℃ 2分鐘。5’PCR引物包括Hind III位點,其序列為5’-TCCAAGCTTCGCCATGGGACATAACGGGAGCT-3’(SEQ IDNO101)3’引物包括EcoR I位點,其序列為5’-CGTGAATTCCAAGAATTTACAATCCTTGCT-3’(SEQ IDNO102)1.6kb PCR片段被用Hind III和EcoR I消化并被克隆進pCMV表達載體的Hind III-EcoR I位點。人HB954的核酸(SEQ ID NO103)和氨基酸(SEQ ID NO104)序列其后被確定和證實。33.HG38(GenBank入藏登記號AF062006)按如下步驟產(chǎn)生人HB38的cDNA并把其克隆進pCMV表達載體以腦cDNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;56℃ 1分鐘;72℃ 1分鐘30秒。進行兩次PCR反應以分別獲得5’和3’片段。對于5’片段,5’PCR引物包括Hind III位點,其序列為5’-CCCAAGCTTCGGGCACCATGGACACCTCCC-3’(SEQ IDNO259)
3’引物包括BamH I位點,其序列為5’-ACAGGATCCAAATGCACAGCACTGGTAAGC-3’(SEQ IDNO260)這個5’的1.5kb PCR片段被用Hind III和BamH I消化并被克隆進pCMV表達載體的Hind III-BamH I位點。對于3’引物,5’PCR引物被如下序列激酶化5’-CTATAACTGGGTTACATGGTTTAAC-3’(SEQ ID NO261)3’引物包括EcoR I位點,其序列為5’-TTTGAATTCACATATTAATTAGAGACATGG-3’(SEQ IDNO262)1.4kb的3’PCR片段被用EcoR I消化并被亞克隆進pCMV表達載體的平端-EcoR I位點。接著5’片段和3’片段通過一個共同的EcoRV位點被連接在一起,得到全長cDNA克隆。人HG38的核酸(SEQ IDNO263)和氨基酸(SEQ ID NO264)序列其后被確定和證實。34.HM74(GenBank入藏登記號D10923)按如下步驟產(chǎn)生人HM74的cDNA并把其克隆進pCMV表達載體以基因組cDNA或者胸腺cDNA為模板,用rTth聚合酶(PerkinElmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;65℃ 1分鐘;72℃ 1分鐘20秒。5’PCR引物包括EcoR I位點,其序列為5’-GGAGAATTCACTAGGCGAGGCGCTCCATC-3’(SEQ IDNO105)3’引物被如下序列激酶化5’-GGAGGATCCAGGAAACCTTAGGCCGAGTCC-3’(SEQ IDNO106)1.3kb PCR片段被用EcoR I消化并被克隆進pCMV表達載體的EcoR I-Sma I位點。測序的克隆揭示了涉及一個N94K改造的潛在多態(tài)現(xiàn)象。暫且不論這個差別,人HM74的核酸(SEQ ID NO107)和氨基酸(SEQ ID NO108)序列其后被確定和證實。35.MIG(GenBank入藏登記號AFO44600和AFO44601)按如下步驟產(chǎn)生人MIG的cDNA并把其克隆進pCMV表達載體以基因組cDNA為模板,用TaqPlus Precision聚合酶(Stratagene)(第一輪PCR)或pfu聚合酶(Stratagene)(第二輪PCR)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸每種0.2mM(TaqPlus Precision)或0.5mM(pfu)。當用pfu時,在緩沖液中包括10%的DMSO。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;65℃ 1分鐘;在72℃(a)第一輪PCR 1分鐘和(b)第二輪PCR 2分鐘。因為在編碼區(qū)有一個內(nèi)含子,分別應用兩套引物以產(chǎn)生重疊的5’和3’片段。5’片段PCR引物是5’-ACCATGGCTTGCAATGGCAGTGCGGCCAGGGGGCACT-3’(外部有義)(SEQ ID NO109)和5’-CGACCAGGACAAACAGCATCTTGGTCACTTGTCTCCGGC-3’(內(nèi)部反義)(SEQ ID NO110)。
3’片段PCR引物為5’-GACCAAGATGCTGTTTGTCCTGGTCGTGGTGTTTGGCAT-3’(內(nèi)部有義)(SEQ ID NO111)和5’-CGGAATTCAGGATGGATCGGTCTCTTGCTGCGCCT-3’(具有EcoRI位點的外部反義)(SEQ ID NO112)。
通過下列方法把5’和3’片段連接在一起應用第一輪PCR做模板,并用激酶化的外部有義引物和外部反義引物進行第二輪PCR。1.2kbPCR片段被用EcoR I消化并被克隆進pCMV表達載體的平端-EcoR I位點。人MIG的核酸(SEQ ID NO113)和氨基酸(SEQ ID NO114)序列其后被確定和證實。36.OGR1(GenBank入藏登記號U48405)按如下步驟產(chǎn)生人OGR1的cDNA并把其克隆進pCMV表達載體以基因組cDNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸中每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;65℃ 1分鐘;72℃ 1分鐘20秒。5’PCR引物被如下序列激酶化5’-GGAAGCTTCAGGCCCAAAGATGGGGAACAT-3’(SEQ IDNO115)3’引物包括BamH I位點,其序列為5’-GTGGATCCACCCGCGGAGGACCCAGGCTAG-3’(SEQ IDNO116)1.1kb PCR片段被用BamH I消化并被克隆進pCMV表達載體的EcoRV-BamH I位點。人OGR1的核酸(SEQ ID NO117)和氨基酸(SEQID NO118)序列其后被確定和證實。37. 5-羥色胺5HT2A編碼人內(nèi)源5HT2A受體的cDNA通過RT-PCR而得到應用人腦poly-A+RNA;來自5′未翻譯區(qū)的5′引物,其具有如下Xho I限制位點5′-GACCTCGAGTCCTTCTACACCTCATC-3′(SEQ ID NO119)來自3′未翻譯區(qū)具有如下Xba I位點的3′引物5′-TGCTCTAGATTCCAGATAGGTGAAAACTTG-3′(SEQ IDNO120)
用TaqPlusTMPrecision聚合酶(Stratagene)或rTthTM聚合酶(PerkinElmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;57℃ 1分鐘;72℃ 2分鐘。1.5kb的PCR片段被用Xba I消化并被亞克隆進pBluescript的EcoRV-Xba I位點。得到的cDNA克隆被完全測序,發(fā)現(xiàn)它編碼被公布序列的兩種氨基酸的變化。第一個是在N-末端細胞外結(jié)構域的一個T25N突變;第二個是一個H452Y突變。因為使用兩個不同商業(yè)來源(從Stratagene得到TaqPlusTM和從PerkinElmer得到的rTthTM)的Taq多聚酶通過兩個獨立的PCR反應得到cDNA克隆且cDNA克隆含有相同的兩個突變,所以這些突變好象是代表序列多態(tài)現(xiàn)象而不是PCR錯誤。除了這些例外,人5HT2A的核酸(SEQ IDNO121)和氨基酸(SEQ ID NO122)序列其后被確定和證實。38. 5-羥色胺5HT2C編碼人內(nèi)源5HT2C受體的cDNA通過RT-PCR而得到其中應用人腦poly-A+RNA。從5′和3’未翻譯區(qū)得到5′和3’引物,其序列為5′-GACCTCGAGGTTGCTTAAGACTGAAGC-3′(SEQ ID NO123)5′-ATTTCTAGACATATGTAGCTTGTACCG-3′(SEQ ID NO124)人5HT2C的核酸(SEQ ID NO125)和氨基酸(SEQ ID NO126)序列其后被確定和證實。39.V28(GenBank入藏登記號U20350)按如下步驟產(chǎn)生人V28的cDNA并把其克隆進pCMV表達載體以腦cDNA為模板,用rTth聚合酶(Perkin Elmer)和制造商提供的緩沖系統(tǒng)進行PCR,其中使用每個引物0.25μM、4種核苷酸每種0.2mM。循環(huán)條件是進行30個循環(huán)94℃ 1分鐘;65℃ 1分鐘;72℃ 1分鐘20秒。5’PCR引物引物包括Hind III位點,其序列為5’-GGTAAGCTTGGCAGTCCACGCCAGGCCTTC-3’(SEQ IDNO127)3’引物包括EcoR I位點,其序列為5’-TCCGAATTCTCTGTAGACACAAGGCTTTGG-3’(SEQ IDNO128)1.1kb PCR片段被用Hind III和EcoR I消化并被克隆進pCMV表達載體的Hind III-EcoR I位點。人V28的核酸(SEQ ID NO129)和氨基酸(SEQ ID NO130)序列其后被確定和證實。實施例2制備非內(nèi)源的人GPCR1.定點誘變把建立在此處公開的人GPCR脯氨酸標記物方法基礎上的誘變使用于前述的內(nèi)源人GPCR,應用Transformer Site-Directed Mutagenesis試劑盒(Clontech)并按照生產(chǎn)商的說明進行。對于此誘變方法,使用一個突變探針和一個選擇標記物探針(除非另外說明,SEQ ID NO132的探針始終是同樣的),用于特定序列的這些探針序列列于下表B(括號中的數(shù)字是SEQ ID NO)。為方便起見,引入人GPCR的密碼子突變也被以標準的形式標出表B
然后,非內(nèi)源的人GPCR被測序,得到且被證實的核酸和氨基酸序列被列于本發(fā)明所附的“序列列表”,如下的表C為其摘要表C
2.利用脯氨酸標記物算法規(guī)則的其他突變方法APJ、5-羥色胺5HT2A、5-羥色胺5HT2C和GPR30盡管上述定點誘變方法是特別優(yōu)選地,但其他方法也可以用來創(chuàng)造如此的突變;那些熟悉本領域的人員知道用所選擇的方法突變GPCR以適合技術人員的特別需要。a.APJ制備非內(nèi)源的人APJ受體通過突變L247K而完成。合成兩個含有此突變的寡核苷酸5’-GGCTTAAGAGCATCATCGTGGTGCTGGTG-3’(SEQ ID NO233)5’-GTCACCACCAGCACCACGATGATGCTCTTAAGCC-3’(SEQID NO234)兩個寡核苷酸被退火,并被用來取代人的內(nèi)源APJ的NaeI-BstEII片段,以產(chǎn)生非內(nèi)源的人APJ。b.5-羥色胺5HT2A包含點突變C322K的cDNA通過利用包括氨基酸322的限制酶位點Sph I構建。含有C322K突變的引物5’-CAAAGAAAGTACTGGGCATCGTCTTCTTCCT-3’(SEQ IDNO235)與從受體的3’未翻譯區(qū)得到的引物一起應用5’-TGCTCTAGATTCCAGATAGGTGAAAACTTG-3’(SEQ IDNO236)以進行PCR(在上述條件下)。得到的PCR片段然后被用來經(jīng)T4多聚酶補平的Sph I位點去替代內(nèi)源5HT2AcDNA的3’末端。c.5-羥色胺5HT2C包含S310K突變的cDNA通過用編碼目的突變的合成雙鏈寡核苷酸去替代包括氨基酸310的Sty I限制片段而構建。應用的意義鏈具有如下序列5’-CTAGGGGCACCATGCAGGCTATCAACAATGAAAGAAAAGCTAAGAAAGTC-3’(SEQ ID NO237)應用的反義鏈具有如下序列5’-CAAGGACTTTCTTAGCTTTTCTTTCATTGTTGATAGCCTGCATGGTGCCC-3’(SEQ ID NO238)d.GPR30在產(chǎn)生非內(nèi)源GPR30之前,幾個獨立的pCR2.1/GPR30分離物被整體測序以識別不發(fā)生經(jīng)PCR產(chǎn)生的突變的克隆。沒有突變的克隆用EcoR I消化,并通過用EcoR I消化pCI-Neo并把從pCR2.1/GPR30得到的EcoR I-釋放的GPR30片段亞克隆,將內(nèi)源GPR30 cDNA片段轉(zhuǎn)移進由CMV驅(qū)動表達的質(zhì)粒pCI-Neo(Promega),以產(chǎn)生pCI/GPR30。其后,按照制造商的說明,用Quick-ChangeTM定點誘變試劑盒(Stratagene,#200518)把位于密碼子258上的亮氨酸突變?yōu)橘嚢彼?,引物如?’-CGGCGGCAGAAGGCGAAACGCATGATCCTCGCGGT-3’(SEQ ID NO239)和5’-ACCGCGAGGATCATGCGTTTCGCCTTCTGCCGCCG-3’(SEQID NO240)實施例3(內(nèi)源和突變的)受體表達盡管在本領域中有多種細胞可用于蛋白質(zhì)的表達,但最優(yōu)選應用的是哺乳動物細胞。據(jù)預測,其基本原因是實用性,即例如表達GPCR的酵母細胞的應用,有可能把一種非哺乳動物細胞引入到程序中,此細胞可能不(其實,對于酵母來說,是不)包括偶聯(lián)受體、遺傳機制和分泌途徑,而這些是經(jīng)過進化用于哺乳動物系統(tǒng)的。因此,在非哺乳動物細胞中得到的結(jié)果,盡管是可能有用的,但并不如從哺乳動物細胞中得到的結(jié)果優(yōu)選。在哺乳動物細胞中,COS-7、293和293T細胞是特別優(yōu)選的,盡管應用的特定哺乳動物細胞可按技術人員的特別需要而被判定。
除非在此說明,應用如下的程序表達內(nèi)源和非內(nèi)源人GPCR。表D列舉用于表達GPCR的哺乳動物細胞和數(shù)量。
表D
第一天,哺乳動物細胞被接種到板上。第二天,準備兩支試管(比例是每板用于一支試管)通過混合20μg DNA(例如pCMV載體、帶有內(nèi)源受體cDNA的pCMV載體、帶有非內(nèi)源受體cDNA的pCMV載體)在1.2ml無血清的DMEM(Irvine Scientific,Irvine,CA)來制備試管A;通過混合120μl lipofectamine(Gibco BRL)在1.2ml無血清DMEM中制備試管B。把試管A和B互傾混合(幾次),然后在室溫下溫育30-45分鐘。組合物被稱為“轉(zhuǎn)染組合物”。植出的細胞用1XPBS洗滌,然后加入10ml無血清的DMEM。把2.4ml轉(zhuǎn)染組合物加入到細胞中去,然后在37℃/5% CO2下溫育4小時。接著通過抽吸移去轉(zhuǎn)染組合物,然后加入25ml的DMEM/10%胎牛血清。接著細胞在37℃/5% CO2溫育。72小時后收獲細胞并用來進行分析。1.Gi偶聯(lián)受體與Gs偶聯(lián)受體共轉(zhuǎn)染對于GPR320來說,已經(jīng)確定本受體與G蛋白Gi偶聯(lián)。已知Gi抑制腺苷酸環(huán)化酶,這對ATP向cAMP的轉(zhuǎn)化催化是必須的。因此,非內(nèi)源的、被組成型活化的GPR30將被期望與cAMP水平的下降相關。盡管可通過測定下降的cAMP水平對非內(nèi)源的、可被組成型活化的GPR30進行測定證實,但它可通過聯(lián)合應用Gs偶聯(lián)受體而優(yōu)選地被測定。例如,Gs偶聯(lián)的受體將刺激腺苷酸環(huán)化酶,并因而與cAMP的升高相關聯(lián)。本專利申請的受讓人已經(jīng)發(fā)現(xiàn)孤兒受體GPR6是一個內(nèi)源的、被組成型活化的GPCR。GPR6與Gs偶聯(lián)。因此,當被共轉(zhuǎn)染時,可以很容易證實由一個假想的GPR30突變導致其組成型活化即當與內(nèi)源的、被組成型活化的GPR6/非內(nèi)源的、被組成型活化的GPR30(這導致一個相對更低水平的cAMP)相比時,內(nèi)源的、被組成型活化的GPR6/內(nèi)源的非組成型活化的GPR30細胞將導致cAMP水平的升高。探查cAMP的測定方法可用來確定一個候選化合物是否是例如針對Gs關聯(lián)受體的反激活劑(即這樣的一個化合物將降低cAMP的水平)或是Gi關聯(lián)受體(或Go關聯(lián)受體)的反向激活劑(即此候選化合物將增加cAMP的水平)。在本領域中已知有多種測量cAMP的方法可被應用;一個優(yōu)選的方法依賴于抗-cAMP抗體的應用。另一個最優(yōu)選的方法是利用全細胞第二信使報告基因系統(tǒng)試驗?;蛏系膯幼右l(fā)特定基因編碼的蛋白質(zhì)表達。環(huán)AMP促進基因表達的過程,是它先促進響應cAMP的DNA結(jié)合蛋白質(zhì)或轉(zhuǎn)錄因子(CREB)的結(jié)合,其后它們在被稱為是cAMP效應元件的特殊位點與啟動子結(jié)合并促進基因的表達。報告基因系統(tǒng)可被構建為含有一個啟動子,該啟動子在報告基因如β-半乳糖苷酶或熒光素酶前含有多個cAMP效應元件。因而,一個被活化的受體例如GPR6就引起cAMP的積聚,這然后又活化基因和促進報告蛋白質(zhì)的表達。最優(yōu)選293細胞與GPR6(或另外一個Gs偶聯(lián)受體)和GPR30(或另外一個Gi偶聯(lián)受體)質(zhì)粒共轉(zhuǎn)染,優(yōu)選比例為1∶1,最優(yōu)選比例為1∶4。因為GPR6是一個內(nèi)源的、被組成型活化的受體并可刺激cAMP的產(chǎn)生,所以GPR6可強烈地激活報告基因和它的表達。報告蛋白質(zhì)例如β-半乳糖苷酶或熒光素酶可接著被用標準生化測定方法檢測到(Chen等,1995)。內(nèi)源的、組成型活化的GPR6與內(nèi)源的、非組成型活化的GPR30共轉(zhuǎn)染可導致熒光素酶報告蛋白的增加;相反地,內(nèi)源的、組成型活化的GPR6與非內(nèi)源的、組成型活化的GPR30共轉(zhuǎn)染可導致熒光素酶表達的急劇下降。在本領域中已知幾個報告基因質(zhì)粒并可用于測量第二信使試驗。據(jù)認為,對于熟練的技術人員來說,很容易基于技術人員的特別需要為一個特別的基因表達確定合適的報告質(zhì)粒。盡管可將多種可得的細胞用于表達,但哺乳動物細胞是最優(yōu)選的,在這些類型中,293細胞是最優(yōu)選的。293細胞被報告基因質(zhì)粒pCRE-Luc/GPR6和非內(nèi)源的、組成型活化的GPR30轉(zhuǎn)染,其中使用Mammalian TransfectionTM試劑盒(Stratagene,#200285)CaPO4沉淀程序,并依照制造商的指令(關于公開的內(nèi)源GPR6序列,參見28 Genomics 347(1995))。沉淀含有400ng報告基因、80ng CMV-表達質(zhì)粒(具有1∶4的GPR6對內(nèi)源GPR30或非內(nèi)源GPR30的比例)和20ng CMV-SEAP(編碼分泌的堿性磷酸酶的轉(zhuǎn)染控制質(zhì)粒)。把50%的沉淀分到96孔組織培養(yǎng)板(包含4×104細胞/孔)的3個孔中;其余的50%丟棄。第二天上午換培養(yǎng)液。轉(zhuǎn)染開始48小時后,細胞被裂解,并按每個制造商的指令用LucliteTM試劑盒(Packard,Cat.#6016911)、Trilux 1450 MicrobetaTM液體閃爍和發(fā)光計數(shù)器(Wallac)檢測熒光素酶的活性。用GraphPad Prism 2.0a(GraphPadSoftware Inc.)分析數(shù)據(jù)。
對于已被確定是偶聯(lián)Gi的GPR17來說,可基于在本質(zhì)上是應用另一個偶聯(lián)Gs的內(nèi)源受體GPR3,來對前述方法加以改進并使用。(參見23 Genomics 609(1994)和24 Genomics 391(1994))。最優(yōu)選應用293細胞。這些細胞被植到96孔培養(yǎng)板上,密度為每孔2×104細胞,在第二天按照制造商的指令用Lipofectamine Reagent(BRL)轉(zhuǎn)染。按如下程序為每6孔的轉(zhuǎn)染制備一個DNA/脂的組合物在100μl DMEM中的260ng質(zhì)粒DNA與在100μl DMEM中的脂輕輕混合(260ng質(zhì)粒DNA含有200ng 8×CRE-Luc報告質(zhì)粒(參見下面),50ng pCMV含有內(nèi)源受體或非內(nèi)源受體或僅有pCMV,10ng GPRS表達質(zhì)粒(GPRS在pcDNA3(Invitrogen)中)。8×CRE-Luc報告基因質(zhì)粒按如下方法制備通過在pβgal-Basic載體(Clontech)中的BglV-Hind III位點克隆大鼠生長激素釋放抑制因子啟動子(-71/+51),得到載體SRIF-β-gal。用腺病毒模板AdpCF126CCRE8(參見7 Human Gene Therapy 1883(1996))通過PCR得到cAMP效應元件的8個拷貝,把它克隆進SRIF-β-gal載體的Kpn-BglV位點,得到8×CRE-β-gal報告基因載體。8×CRE-Luc報告基因質(zhì)粒是通過在HindIII-BamH I位點用從pGL3-基本載體(Promega)中得到的熒光素酶基因置換8×CRE-β-gal報告基因載體中的β-半乳糖苷酶基因而得到。在室溫溫育30分鐘之后,DNA/脂組合物用400μlDMEM稀釋,把100μl得到的被稀釋的組合物加到每個孔中。在細胞培養(yǎng)溫育箱中溫育4小時之后,在每個孔中加入帶有10%FCS的100μlDMEM。第二天上午,被轉(zhuǎn)化的細胞被換成每孔使用帶有10%FCS的200μl DMEM。8小時之后,在一次PBS洗滌之后,每孔被換成100μl無酚紅的DMEM。下一天用LucLiteTM報告基因測定試劑盒(Packard),按照制造商的指令測定熒光素酶活性,在1450 MicrobetaTM閃爍發(fā)光計數(shù)器(Wallac)上記數(shù)。
圖4表明,組成型活化的GPR30抑制GPR6介導的CRE-Luc報告基因在293細胞中的活化。在表達載體pCMV中,測定熒光素酶為大約4.1相對光單位。內(nèi)源GPR30表達的熒光素酶是在大約8.5相對光單位,而非內(nèi)源的、組成型活化的GPR30(L258K)分別表達大約3.8和3.1相對光單位的熒光素酶。用內(nèi)源GPR30與內(nèi)源GPR6以1∶4的比例共轉(zhuǎn)染,可顯著地增加熒光素酶的表達到大約104.1相對光單位。用非內(nèi)源GPR30(L258K)與內(nèi)源GPR6以相同的比例共轉(zhuǎn)染,可顯著地降低表達,它們分別在大約18.2和29.5相對光單位。對于GPR17,當與GPR3共轉(zhuǎn)染時,也可觀察到相似的結(jié)果,如在圖5中所示的那樣。實施例3確定非內(nèi)源GPCR的組成性活性的測定方法A.細胞膜結(jié)合試驗1.[35S]GTPγS試驗當G蛋白偶聯(lián)受體在其活性狀態(tài),并作為配體結(jié)合或者作為組成型活化的結(jié)果時,受體與G蛋白偶聯(lián)并刺激GDP的釋放和其后GTP與G蛋白的結(jié)合。G蛋白-受體復合物的α亞基作為GTP酶并慢慢地水解GTP為GDP,在此點受體通常發(fā)生失活。組成型活化受體繼續(xù)把GDP轉(zhuǎn)化為GTP。不可水解的GTP類似物[35S]GTPγS,可被用來展示[35S]GTPγS與表達組成型活化受體的膜的增強的結(jié)合。應用[35S]GTPγS結(jié)合測定組成型活化的優(yōu)點是(a)它對所有G蛋白偶聯(lián)受體是普遍適用的;(b)它鄰近細胞膜表面,在此處較少可能揀到遇到影響細胞內(nèi)級聯(lián)反應的分子。
此試驗利用G蛋白偶聯(lián)受體的刺激[35S]GTPγS與表達相關受體的細胞膜結(jié)合的能力。因此本測定可用于直接識別法去篩選針對已知、孤兒和組成型活化G蛋白偶聯(lián)受體的候選化合物。本測定是普遍的并可用于針對所有G蛋白偶聯(lián)受體的藥物發(fā)現(xiàn)。
GTPγS試驗在20mM HEPES、1至大約20mM的MgCl2(盡管20mM是優(yōu)選的,但這個劑量可針對結(jié)果的最優(yōu)化進行調(diào)整)、pH7.4、含有在0.3和1.2nM之間的[35S]GTPγS(盡管1.2是優(yōu)選的,但這個劑量可針對結(jié)果的最優(yōu)進行調(diào)整)、12.5到75μg膜蛋白(例如,COS-7細胞表達受體,本劑量可為最優(yōu)化進行調(diào)整,盡管75μg是優(yōu)選的)和1μM GDP(這個劑量可針對結(jié)果的最優(yōu)化進行改造)的結(jié)合緩沖液中溫-6和跨膜-7之間(這些分別被稱為“細胞外”區(qū)1、2和3(EC-1、EC-2和EC-3))。在細胞膜內(nèi)部即“細胞內(nèi)”一邊,跨膜螺旋也通過氨基酸鏈進行連接,這些氨基酸鏈分別在跨膜-1和跨膜-2、跨膜-3和跨膜-4、跨膜-5和跨膜-6之間(這些分別被稱為“細胞內(nèi)”區(qū)1、2和3(IC-1、IC-2和IC-3))。受體的“羧基”(“C”)端是在細胞內(nèi)的區(qū)域,受體的“氨基”(“N”)端在細胞外的區(qū)域。圖1描繪了與G蛋白偶聯(lián)的受體的一般結(jié)構。
一般來說,當內(nèi)源配體與受體結(jié)合時(經(jīng)常被稱為受體的“活化”),細胞內(nèi)區(qū)域的構象發(fā)生變化,以容許細胞內(nèi)區(qū)域和細胞內(nèi)“G-蛋白”進行偶聯(lián)。盡管存在其他G蛋白,但當前已被識別的G-蛋白是Gq、Gs、Gi和Go。內(nèi)源配體活化的GPCR與G-蛋白的偶聯(lián)引發(fā)一個信號級聯(lián)過程(被稱為“信號傳導”)。在通常情形下,信號傳導最終導致細胞活化或細胞抑制。據(jù)認為,受體的IC-3環(huán)與羧基端都和G蛋白相互作用。本發(fā)明的一個重要焦點就涉及GPCR的跨膜-6(TM6)區(qū)域和細胞內(nèi)-3(IC3)區(qū)域。
在生理條件下,GPCR存在于細胞膜上,并在“非活化”狀態(tài)和“活化”狀態(tài)這兩種不同構象之間保持平衡。如在圖2中所圖示的那樣,在非活性狀態(tài)下的受體不能與細胞內(nèi)信號傳導途徑相偶聯(lián)以產(chǎn)生生物學反應。受體構象向活性狀態(tài)的轉(zhuǎn)變就使它與傳導途徑相偶聯(lián)(通過G-蛋白)并產(chǎn)生生物學反應。
受體可被內(nèi)源配體或藥物等化合物穩(wěn)定在活性狀態(tài)。近來的發(fā)現(xiàn)提供了除內(nèi)源配體或藥物之外能夠促進和穩(wěn)定受體到活性狀態(tài)構象的方法,這包括但不限于對受體的氨基酸序列的修飾。這些方法通過模仿與受體結(jié)合的內(nèi)源配體的作用來有效地穩(wěn)定活性狀態(tài)的受體。通過如此的配體非依賴性方法形成的穩(wěn)定被稱為“組成型受體活化”。
如上所述,將孤兒受體用于篩選目的是不可能的。這是因為涉及HEPES和10mM MgCl2的緩沖液中均質(zhì)化懸浮的細胞來制備細胞膜。均質(zhì)化是在冰上用Brinkman PolytronTM進行大約10秒鐘。得到的均質(zhì)化物在4℃、49,000×g離心15分鐘。得到的沉淀物接著在含有20mMpH 7.4的HEPES和0.1mM EDTA緩沖液中懸浮,均質(zhì)化10秒鐘,然后在4℃、49,000×g離心15分鐘。得到的沉淀可被貯藏在-4℃?zhèn)溆?。在測量的當天,膜沉淀物在室溫下被緩慢解凍,在含有20mM pH 7.4的HEPES和10mM MgCl2的緩沖液中重新懸浮(這些數(shù)量可被優(yōu)化,盡管在此列舉的數(shù)值是優(yōu)選的),得到最終蛋白質(zhì)濃度為0.60mg/ml(重新懸浮的膜放置在冰上備用)。
按照制造商的指令制備和維持cAMP標準品和檢測緩沖液(含有2μCi示蹤物[125I cAMP(100μl)]的11ml檢測緩沖液)。為篩選用的試驗緩沖液被新鮮制備,它含有20Mm pH 7.4的HEPES、10mM MgCl2、20mM(Sigma)、0.1單位/ml肌酸磷酸激酶(Sigma)、50μM GTP(Sigma)和0.2mM ATP(Sigma);試驗緩沖液可在冰上貯存?zhèn)溆谩J紫燃尤?0μl試驗緩沖液、接著加入50μl膜懸浮物到NEN Flash Plate,以開始試驗。得到的測定組合物在室溫下溫育60分鐘,然后加入100μl檢測緩沖液。培養(yǎng)板接著再溫育2-4小時,然后用Wallac MicroBeta液閃計數(shù)器記數(shù)。cAMP/孔的數(shù)值從標準cAMP曲線外推,該曲線包括在每個測定板之內(nèi)。應用前述的分析MIG的測定方法。B.基于報告基因的測定1.CREB報告基因測定(Gs偶聯(lián)受體)檢測Gs刺激的方法依賴于轉(zhuǎn)化因子CREB的已知性質(zhì),它是以cAMP依賴的方式被活化的。應用PathDetect CREB trans-ReportingSystem(Stratagene,Catalogue #219010)來檢測在293和293T細胞中Gs偶聯(lián)的活性。用上述系統(tǒng)的質(zhì)粒成分和編碼內(nèi)源的或突變的受體的指明的表達質(zhì)粒轉(zhuǎn)染細胞,其使用哺乳動物細胞轉(zhuǎn)染試劑盒(Stratagene,Catalogue #200285)并按照制造商的指令。簡短而言,400ng pFR-Luc(熒光素酶報告基因質(zhì)粒含有Gal4識別序列)、40ng pFA2-CREB(Gal4-CREB融合蛋白含有Gal4 DNA結(jié)合域)、80ng CMV-受體表達質(zhì)粒(包括受體)和20ng CMV-SEAP(分泌的堿性磷酸酶表達質(zhì)粒;堿性磷酸酶活性在轉(zhuǎn)染細胞的培養(yǎng)基中測量,以控制在樣品間轉(zhuǎn)染效率的變化)在磷酸鈣沉淀中按照試劑盒的指令進行混合。把沉淀的一半等量地分布在96孔培養(yǎng)板的3個孔中,保持細胞過夜,第二天上午置換新鮮培養(yǎng)基。轉(zhuǎn)染后48小時,如上述GPR30系統(tǒng)所說的方法處理細胞并測定熒光素酶活性。此測定用于GHSR。2.AP1報告基因測定(Gq偶聯(lián)的受體)測定Gq刺激依賴的方法依賴于Gq依賴的磷脂酶C已知的特性,即它可引起在其啟動子含有AP1元件的基因活化。按照上述CREB報告基因測定所說的程序,使用Pathdetect AP-1 cis-Reporting System(Stratagene,Catalogue #219073),其中只是將磷酸鈣沉淀的組分改為410ng pAPl-Luc、80ng受體表達質(zhì)粒和20ng CMV-SEAP。本測定用于ETBR-LP2。C.細胞內(nèi)IP3累積的測定在第一天,含有5-羥色胺受體(內(nèi)源的和突變的)的細胞被接種于24孔培養(yǎng)板上,一般是1×105細胞/孔。在第二天轉(zhuǎn)染細胞,首先混合在50μl/孔無血清DMEM中的0.25μg DNA和在50μl/孔無血清DMEM中的2μl lipofectamine。輕輕地混合溶液并在室溫下溫育15-30分鐘。用0.5ml PBS洗滌細胞,把400μl無血清培養(yǎng)基與轉(zhuǎn)染培養(yǎng)基混合并加到細胞中。然后在37℃/5% CO2下溫育細胞3-4小時,再移去轉(zhuǎn)染培養(yǎng)基,替換為1ml/孔常規(guī)培養(yǎng)基。在第三天,用3H-肌醇標記細胞。簡短地說,移去培養(yǎng)基,細胞用0.5ml PBS洗滌,接著加入0.5ml/孔無肌醇/無血清培養(yǎng)基(GIBCO BRL)和0.25μ Ci/孔3H-肌醇,在37℃/5% CO2下溫育細胞16-18小時。在第四天,用0.5ml PBS洗滌細胞,加入0.45ml試驗培養(yǎng)基,其中含有無肌醇/無血清培養(yǎng)基10μM巴吉林10mM氯化鋰或0.4ml試驗培養(yǎng)基和50μl 10×ketaserin(ket)以得到10μM的終濃度。然后在37℃溫育細胞30分鐘。用0.5ml PBS洗滌細胞,加入200μl/孔新鮮的/冰冷的終止液(1M KOH、18mM硼酸鈉、3.8mM EDTA)。溶液在冰上放置5-10分鐘或直到細胞被溶解,然后用200μl新鮮的/冰冷的中和液(7.5%HCl)中和。然后把裂解物轉(zhuǎn)移到1.5ml離心管中,加入1ml/管氯仿/甲醇(1∶2)。然后使溶液渦旋15秒鐘,把上層上樣至Biorad AGl-X8陰離子交換樹脂(100-200目)。首先,樹脂以1∶1.25 W/V的比例用水洗滌,向柱中加載0.9ml的上層溶液。用10ml 5mM肌醇和10ml 5mM的硼酸鈉/60mM甲酸鈉洗滌柱子。肌醇三磷酸酯被洗提入液閃管中,其中含有10ml液閃雞尾,它有2ml 0.1M甲酸/1M甲酸銨。通過用10ml 0.1M甲酸/3M甲酸銨洗滌和用ddH20洗滌兩次來再生交換柱,柱子貯存在4℃的水中。
圖7圖示了從包括C322K突變的人5-HT2A受體產(chǎn)生IP3。盡管這些結(jié)果表明脯氨酸突變算法規(guī)則可組成型活化本受體,但為了應用這樣一個受體而篩選識別可能的治療物,優(yōu)選更大的差別。然而,因為活化的受體可被用來理解和闡釋組成型活化的角色和用來識別可被進一步檢查的化合物,我們相信這個差別本身在區(qū)分人5-HT2A受體的內(nèi)源和非內(nèi)源形式時是有用的。D.結(jié)果概要檢測到的GPCR結(jié)果列于表E,其中百分數(shù)的增加表示,觀察到的非內(nèi)源GPCR結(jié)果與內(nèi)源GPCR相比較時的百分數(shù)差異;這些結(jié)果后面的括號里面標明的是應用的檢測方法。進一步,應用的測定系統(tǒng)在括號中列出(并且,在應用不同的宿主細胞時,兩者都列出)。這些結(jié)果表明,可應用多種方法確定人GPCR的非內(nèi)源形式的組成型活化。相信本領域的熟練技術人員在基于前述并參考本領域中的信息后可具有選擇和/或最佳化適于研究者的特別需要的特別測定方法的能力。
表E
實施例6內(nèi)源孤兒GPCR的組織分布應用商業(yè)可得的人組織斑點印跡系統(tǒng),探查內(nèi)源孤兒GPCR以確定這些受體被定位的區(qū)域。除非在下面指明,完整受體cDNA(放射標記的)被用作探針按照制造商的指令,應用Prime-It IITM隨機引物標記試劑盒(Stratagene,#300385),用完全受體cDNA(從載體中切下)產(chǎn)生放射標記的探針。把人RNA Master BlotTM(Clontech,#7770-1)與GPCR放射標記的探針雜交,并按照制造商的指令在嚴格的條件下洗滌。印跡曝光于Kodak BioMax放射自顯影底片,在-80℃過夜。
代表性的斑點印跡結(jié)果在表8中以GPR1(8A)、GPR30(8B)和APJ(8C)列出,針對所有受體的結(jié)果摘要列于表F。
表F
基于前述的信息,注意到可評定人GPCR在患病組織中的分布;然后,在“正?!焙突疾〗M織中的對比性評定可被用來確定在疾病狀態(tài)下一個特別受體的過度表達和不足表達的可能性。當希望利用人GPCR的非內(nèi)源形式進行篩選以直接識別可能與治療相關的候選化合物時,注意到反激活劑可用于治療由特定人GPCR過度表達引起的疾病和紊亂,而激活劑或部分激活劑對于治療特定人GPCR不足表達引起的疾病和紊亂是有用的。
正如被期望的,利用本領域技術人員周知的技術(例如,原位雜交),更詳細的受體細胞定位可被用于識別特殊細胞,其中感興趣的受體在特殊細胞所在的組織中表達。
預期本發(fā)明文件提到的每一個專利、申請和印刷出版物,都以全文作為參考引入。
正如本領域的技術人員將認知的那樣,可以對本發(fā)明的優(yōu)選實施方案做多種變化和修飾而不背離本發(fā)明的精神。預期所有這些變化都落在本發(fā)明的范圍之內(nèi)。
雖然本領域的普通技術人員可以得到許多不同的載體為內(nèi)源和非內(nèi)源GPCR的目的使用,但最好是用pCMV載體。按照國際承認用于專利程序的微生物保存布達佩斯條約,該載體于1998年10月13日保存在美國典型培養(yǎng)物保藏中心(American Type Culture Collection)(ATCC)(10801 University Blvd,Manassas,VA20110-2209 USA7)。該載體由ATCC在1998年__月__日進行了檢驗并在1998年__年__日測定了其存活性。ATCC為pCMV給出了下列保藏號---。
序列表(1)一般資料(i)申請人多米尼克·P·比漢;德里克·T·查默斯;廖王蓁(ii)發(fā)明名稱非內(nèi)源的被組成型活化的人G蛋白偶聯(lián)的受體(iii)序列數(shù)280(iv)通訊地址(A)收信人阿瑞那制藥公司(B)Nancy Ridge大道6166號(C)城市圣地亞哥(D)州加利福尼亞州(E)國家美國(F)郵編92121(v)計算機可讀形式(A)介質(zhì)類型軟盤(B)計算機IBM個人兼容機(C)操作系統(tǒng)PC-DOS/MS-DOS(D)軟件PatentIn Release#1.0,#1.30版(vi)本申請資料(A)申請?zhí)?B)申請日(C)分類號(viii)代理人信息(A)姓名Burgoon,Richard P.(B)登記號34787(ix)電訊信息(A)電話(858)453-7200(B)電傳(858)453-7210(2)SEQ ID NO1的資料(i)序列特征(A)長度1068個堿基對(B)類型核酸(C)鏈型單鏈
(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO1的序列描述ATGGAAGATT TGGAGGAAAC ATTATTTGAA GAATTTGAAA ACTATTCCTA TGACCTAGAC 60TATTACTCTC TGGAGTCTGA TTTGGAGGAG AAAGTCCAGC TGGGAGTTGT TCACTGGGTC 120TCCCTGGTGT TATATTGTTT GGCTTTTGTT CTGGGAATTC CAGGAAATGC CATCGTCATT 180TGGTTCACGG GGCTCAAGTG GAAGAAGACA GTCACCACTC TGTGGTTCCT CAATCTAGCC 240ATTGCGGATT TCATTTTTCT TCTCTTTCTG CCCCTGTACA TCTCCTATGT GGCCATGAAT 300TTCCACTGGC CCTTTGGCAT CTGGCTGTGC AAAGCCAATT CCTTCACTGC CCAGTTGAAC 360ATGTTTGCCA GTGTTTTTTT CCTGACAGTG ATCAGCCTGG ACCACTATAT CCACTTGATC 420CATCCTGTCT TATCTCATCG GCATCGAACC CTCAAGAACT CTCTGATTGT CATTATATTC 480ATCTGGCTTT TGGCTTCTCT AATTGGCGGT CCTGCCCTGT ACTTCCGGGA CACTGTGGAG 540TTCAATAATC ATACTCTTTG CTATAACAAT TTTCAGAAGC ATGATCCTGA CCTCACTTTG 600ATCAGGCACC ATGTTCTGAC TTGGGTGAAA TTTATCATTG GCTATCTCTT CCCTTTGCTA 660ACAATGAGTA TTTGCTACTT GTGTCTCATC TTCAAGGTGA AGAAGCGAAC AGTCCTGATC 720TCCAGTAGGC ATTTCTGGAC AATTCTGGTT GTGGTTGTGG CCTTTGTGGT TTGCTGGACT 780CCTTATCACC TGTTTAGCAT TTGGGAGCTC ACCATTCACC ACAATAGCTA TTCCCACCAT 840GTGATGCAGG CTGGAATCCC CCTCTCCACT GGTTTGGCAT TCCTCAATAG TTGCTTGAAC 900CCCATCCTTT ATGTCCTAAT TAGTAAGAAG TTCCAAGCTC GCTTCCGGTC CTCAGTTGCT 960GAGATACTCA AGTACACACT GTGGGAAGTC AGCTGTTCTG GCACAGTGAG TGAACAGCTC 1020AGGAACTCAG AAACCAAGAA TCTGTGTCTC CTGGAAACAG CTCAATAA 1068(3)SEQ ID NO2的資料(i)序列特征(A)長度355個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO2的序列描述Met Glu Asp Leu Glu Glu Thr Leu Phe Glu Glu Phe Glu Asn Tyr Ser1 5 10 15Tyr Asp Leu Asp Tyr Tyr Ser Leu Glu Ser Asp Leu Glu Glu Lys Val20 25 30Gln Leu Gly Val Val His Trp Val Ser Leu Val Leu Tyr Cys Leu Ala35 40 45Phe Val Leu Gly Ile Pro Gly Asn Ala Ile Val Ile Trp Phe Thr Gly50 55 60Leu Lys Trp Lys Lys Thr Val Thr Thr Leu Trp Phe Leu Asn Leu Ala65 70 75 80Ile Ala Asp Phe Ile Phe Leu Leu Phe Leu Pro Leu Tyr Ile Ser Tyr
85 90 95Val Ala Met Asn Phe His Trp Pro Phe Gly Ile Trp Leu Cys Lys Ala100 105 110Asn Ser Phe Thr Ala Gln Leu Asn Met Phe Ala Ser Val Phe Phe Leu115 120 125Thr Val Ile Ser Leu Asp His Tyr Ile His Leu Ile His Pro Val Leu130 135 140Ser His Arg His Arg Thr Leu Lys Asn Ser Leu Ile Val Ile Ile Phe145 150 155 160Ile Trp Leu Leu Ala Ser Leu Ile Gly Gly Pro Ala Leu Tyr Phe Arg165 170 175Asp Thr Val Glu Phe Asn Asn His Thr Leu Cys Tyr Asn Asn Phe Gln180 185 190Lys His Asp Pro Asp Leu Thr Leu Ile Arg His His Val Leu Thr Trp195 200 205Val Lys Phe Ile Ile Gly Tyr Leu Phe Pro Leu Leu Thr Met Ser Ile210 215 220Cys Tyr Leu Cys Leu Ile Phe Lys Val Lys Lys Arg Thr Val Leu Ile225 230 235 240Ser Ser Arg His Phe Trp Thr Ile Leu Val Val Val Val Ala Phe Val245 250 255Val Cys Trp Thr Pro Tyr His Leu Phe Ser Ile Trp Glu Leu Thr Ile260 265 270His His Asn Ser Tyr Ser His His Val Met Gln Ala Gly Ile Pro Leu275 280 285Ser Thr Gly Leu Ala Phe Leu Asn Ser Cys Leu Asn Pro Ile Leu Tyr290 295 300Val Leu Ile Ser Lys Lys Phe Gln Ala Arg Phe Arg Ser Ser Val Ala305 310 315 320Glu Ile Leu Lys Tyr Thr Leu Trp Glu Val Ser Cys Ser Gly Thr Val325 330 335Ser Glu Gln Leu Arg Asn Ser Glu Thr Lys Asn Leu Cys Leu Leu Glu340 345 350Thr Ala Gln
355(4)SEQ ID NO3的資料(i)序列特征(A)長度1089個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO3的序列描述ATGGGCAACC ACACGTGGGA GGGCTGCCAC GTGGACTCGC GCGTGGACCA CCTCTTTCCG 60CCATCCCTCT ACATCTTTGT CATCGGCGTG GGGCTGCCCA CCAACTGCCT GGCTCTGTGG 120GCGGCCTACC GCCAGGTGCA ACAGCGCAAC GAGCTGGGCG TCTACCTGAT GAACCTCAGC 180ATCGCCGACC TGCTGTACAT CTGCACGCTG CCGCTGTGGG TGGACTACTT CCTGCACCAC 240GACAACTGGA TCCACGGCCC CGGGTCCTGC AAGCTCTTTG GGTTCATCTT CTACACCAAT 300ATCTACATCA GCATCGCCTT CCTGTGCTGC ATCTCGGTGG ACCGCTACCT GGCTGTGGCC 360CACCCACTCC GCTTCGCCCG CCTGCGCCGC GTCAAGACCG CCGTGGCCGT GAGCTCCGTG 420GTCTGGGCCA CGGAGCTGGG CGCCAACTCG GCGCCCCTGT TCCATGACGA GCTCTTCCGA 480GACCGCTACA ACCACACCTT CTGCTTTGAG AAGTTCCCCA TGGAAGGCTG GGTGGCCTGG 540ATGAACCTCT ATCGGGTGTT CGTGGGCTTC CTCTTCCCGT GGGCGCTCAT GCTGCTGTCG 600TACCGGGGCA TCCTGCGGGC CGTGCGGGGC AGCGTGTCCA CCGAGCGCCA GGAGAAGGCC 660AAGATCAAGC GGCTGGCCCT CAGCCTCATC GCCATCGTGC TGGTCTGCTT TGCGCCCTAT 720CACGTGCTCT TGCTGTCCCG CAGCGCCATC TACCTGGGCC GCCCCTGGGA CTGCGGCTTC 780GAGGAGCGCG TCTTTTCTGC ATACCACAGC TCACTGGCTT TCACCAGCCT CAACTGTGTG 840GCGGACCCCA TCCTCTACTG CCTGGTCAAC GAGGGCGCCC GCAGCGATGT GGCCAAGGCC 900CTGCACAACC TGCTCCGCTT TCTGGCCAGC GACAAGCCCC AGGAGATGGC CAATGCCTCG 960CTCACCCTGG AGACCCCACT CACCTCCAAG AGGAACAGCA CAGCCAAAGC CATGACTGGC 1020AGCTGGGCGG CCACTCCGCC TTCCCAGGGG GACCAGGTGC AGCTGAAGAT GCTGCCGCCA 1080GCACAATGA 1089(5)SEQ ID NO4的資料(i)序列特征(A)長度362個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO4的序列描述Met Gly Asn His Thr Trp Glu Gly Cys His Val Asp Ser Arg Val Asp1 5 10 15His Leu Phe Pro Pro Ser Leu Tyr Ile Phe Val Ile Gly Val Gly Leu20 25 30Pro Thr Asn Cys Leu Ala Leu Trp Ala Ala Tyr Arg Gln Val Gln Gln35 40 45Arg Asn Glu Leu Gly Val Tyr Leu Met Asn Leu Ser Ile Ala Asp Leu50 55 60Leu Tyr Ile Cys Thr Leu Pro Leu Trp Val Asp Tyr Phe Leu His His65 70 75 80Asp Asn Trp Ile His Gly Pro Gly Ser Cys Lys Leu Phe Gly Phe Ile85 90 95Phe Tyr Thr Asn Ile Tyr Ile Ser Ile Ala Phe Leu Cys Cys Ile Ser100 105 110Val Asp Arg Tyr Leu Ala Val Ala His Pro Leu Arg Phe Ala Arg Leu115 120 125Arg Arg Val Lys Thr Ala Val Ala Val Ser Ser Val Val Trp Ala Thr130 135 140Glu Leu Gly Ala Asn Ser Ala Pro Leu Phe His Asp Glu Leu Phe Arg145 150 155 160Asp Arg Tyr Asn His Thr Phe Cys Phe Glu Lys Phe Pro Met Glu Gly165 170 175Trp Val Ala Trp Met Asn Leu Tyr Arg Val Phe Val Gly Phe Leu Phe180 185 190Pro Trp Ala Leu Met Leu Leu Ser Tyr Arg Gly Ile Leu Arg Ala Val195 200 205Arg Gly Ser Val Ser Thr Glu Arg Gln Glu Lys Ala Lys Ile Lys Arg210 215 220Leu Ala Leu Ser Leu Ile Ala Ile Val Leu Val Cys Phe Ala Pro Tyr225 230 235 240His Val Leu Leu Leu Ser Arg Ser Ala Ile Tyr Leu Gly Arg Pro Trp245 250 255Asp Cys Gly Phe Glu Glu Arg Val Phe Ser Ala Tyr His Ser Ser Leu260 265 270Ala Phe Thr Ser Leu Asn Cys Val Ala Asp Pro Ile Leu Tyr Cys Leu275 280 285Val Asn Glu Gly Ala Arg Ser Asp Val Ala Lys Ala Leu His Asn Leu290 295 300Leu Arg Phe Leu Ala Ser Asp Lys Pro Gln Glu Met Ala Asn Ala Ser305 310 315 320Leu Thr Leu Glu Thr Pro Leu Thr Ser Lys Arg Asn Ser Thr Ala Lys325 330 335Ala Met Thr Gly Ser Trp Ala Ala Thr Pro Pro Ser Gln Gly Asp Gln340 345 350Val Gln Leu Lys Met Leu Pro Pro Ala Gln355 360(6)SEQ ID NO5的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO5的序列描述TATGAATTCA GATGCTCTAA ACGTCCCTGC 30(7)SEQ ID NO6的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO6的序列描述TCCGGATCCA CCTGCACCTG CGCCTGCACC 30(8)SEQ ID NO7的資料(i)序列特征(A)長度1002個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO7的序列描述ATGGAGTCCT CAGGCAACCC AGAGAGCACC ACCTTTTTTT ACTATGACCT TCAGAGCCAG 60CCGTGTGAGA ACCAGGCCTG GGTCTTTGCT ACCCTCGCCA CCACTGTCCT GTACTGCCTG 120GTGTTTCTCC TCAGCCTAGT GGGCAACAGC CTGGTCCTGT GGGTCCTGGT GAAGTATGAG 180AGCCTGGAGT CCCTCACCAA CATCTTCATC CTCAACCTGT GCCTCTCAGA CCTGGTGTTC 240GCCTGCTTGT TGCCTGTGTG GATCTCCCCA TACCACTGGG GCTGGGTGCT GGGAGACTTC 300CTCTGCAAAC TCCTCAATAT GATCTTCTCC ATCAGCCTCT ACAGCAGCAT CTTCTTCCTG 360ACCATCATGA CCATCCACCG CTACCTGTCG GTAGTGAGCC CCCTCTCCAC CCTGCGCGTC 420CCCACCCTCC GCTGCCGGGT GCTGGTGACC ATGGCTGTGT GGGTAGCCAG CATCCTGTCC 480TCCATCCTCG ACACCATCTT CCACAAGGTG CTTTCTTCGG GCTGTGATTA TTCCGAACTC 540ACGTGGTACC TCACCTCCGT CTACCAGCAC AACCTCTTCT TCCTGCTGTC CCTGGGGATT 600ATCCTGTTCT GCTACGTGGA GATCCTCAGG ACCCTGTTCC GCTCACGCTC CAAGCGGCGC 660CACCGCACGG TCAAGCTCAT CTTCGCCATC GTGGTGGCCT ACTTCCTCAG CTGGGGTCCC 720TACAACTTCA CCCTGTTTCT GCAGACGCTG TTTCGGACCC AGATCATCCG GAGCTGCGAG 780GCCAAACAGC AGCTAGAATA CGCCCTGCTC ATCTGCCGCA ACCTCGCCTT CTCCCACTGC 840TGCTTTAACC CGGTGCTCTA TGTCTTCGTG GGGGTCAAGT TCCGCACACA CCTGAAACAT 900GTTCTCCGGC AGTTCTGGTT CTGCCGGCTG CAGGCACCCA GCCCAGCCTC GATCCCCCAC 960TCCCCTGGTG CCTTCGCCTA TGAGGGCGCC TCCTTCTACT GA1002(9)SEQ ID NO8的資料(i)序列特征(A)長度333個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO8的序列描述Met Glu Ser Ser Gly Asn Pro Glu Ser Thr Thr Phe Phe Tyr Tyr Asp1 5 10 15Leu Gln Ser Gln Pro Cys Glu Asn Gln Ala Trp Val Phe Ala Thr Leu20 25 30Ala Thr Thr Val Leu Tyr Cys Leu Val Phe Leu Leu Ser Leu Val Gly35 40 45Asn Ser Leu Val Leu Trp Val Leu Val Lys Tyr Glu Ser Leu Glu Ser50 55 60Leu Thr Asn Ile Phe Ile Leu Ash Leu Cys Leu Ser Asp Leu Val Phe65 70 75 80Ala Cys Leu Leu Pro Val Trp Ile Ser Pro Tyr His Trp Gly Trp Val85 90 95Leu Gly Asp Phe Leu Cys Lys Leu Leu Asn Met Ile Phe Ser Ile Ser100 105 110Leu Tyr Ser Ser Ile Phe Phe Leu Thr Ile Met Thr Ile His Arg Tyr115 120 125Leu Ser Val Val Ser Pro Leu Ser Thr Leu Arg Val Pro Thr Leu Arg130 135 140Cys Arg Val Leu Val Thr Met Ala Val Trp Val Ala Ser Ile Leu Ser145 150 155 160Ser Ile Leu Asp Thr Ile Phe His Lys Val Leu Ser Ser Gly Cys Asp165 170 175Tyr Ser Glu Leu Thr Trp Tyr Leu Thr Ser Val Tyr Gln His Asn Leu180 185 190Phe Phe Leu Leu Ser Leu Gly Ile Ile Leu Phe Cys Tyr Val Glu Ile195 200 205Leu Arg Thr Leu Phe Arg Ser Arg Ser Lys Arg Arg His Arg Thr Val210 215 220Lys Leu Ile Phe Ala Ile Val Val Ala Tyr Phe Leu Ser Trp Gly Pro225 230 235 240Tyr Asn Phe Thr Leu Phe Leu Gln Thr Leu Phe Arg Thr Gln Ile Ile245 250 255Arg Ser Cys Glu Ala Lys Gln Gln Leu Glu Tyr Ala Leu Leu Ile Cys260 265 270Arg Asn Leu Ala Phe Ser His Cys Cys Phe Asn Pro Val Leu Tyr Val275 280 285Phe Val Gly Val Lys Phe Arg Thr His Leu Lys His Val Leu Arg Gln290 295 300Phe Trp Phe Cys Arg Leu Gln Ala Pro Ser Pro Ala Ser Ile Pro His305 310 315 320Ser Pro Gly Ala Phe Ala Tyr Glu Gly Ala Ser Phe Tyr325 330(10)SEQ ID NO9的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO9的序列描述GCAAGCTTGG GGGACGCCAG GTCGCCGGCT 30(11)SEQ ID NO10的資料(i)序列特征(A)長度31個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO10的序列描述gcggatccgg acgctggggg agtcaggctg c 31(12) SEQ ID NO11的資料(i)序列特征(A)長度987個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO11的序列描述ATGGACAACG CCTCGTTCTC GGAGCCCTGG CCCGCCAACG CATCGGGCCC GGACCCGGCG 60CTGAGCTGCT CCAACGCGTC GACTCTGGCG CCGCTGCCGG CGCCGCTGGC GGTGGCTGTA 120CCAGTTGTCT ACGCGGTGAT CTGCGCCGTG GGTCTGGCGG GCAACTCCGC CGTGCTGTAC 180GTGTTGCTGC GGGCGCCCCG CATGAAGACC GTCACCAACC TGTTCATCCT CAACCTGGCC 240ATCGCCGACG AGCTCTTCAC GCTGGTGCTG CCCATCAACA TCGCCGACTT CCTGCTGCGG 300CAGTGGCCCT TCGGGGAGCT CATGTGCAAG CTCATCGTGG CTATCGACCA GTACAACACC 360TTCTCCAGCC TCTACTTCCT CACCGTCATG AGCGCCGACC GCTACCTGGT GGTGTTGGCC 420ACTGCGGAGT CGCGCCGGGT GGCCGGCCGC ACCTACAGCG CCGCGCGCGC GGTGAGCCTG 480GCCGTGTGGG GGATCGTCAC ACTCGTCGTG CTGCCCTTCG CAGTCTTCGC CCGGCTAGAC 540GACGAGCAGG GCCGGCGCCA GTGCGTGCTA GTCTTTCCGC AGCCCGAGGC CTTCTGGTGG 600CGCGCGAGCC GCCTCTACAC GCTCGTGCTG GGCTTCGCCA TCCCCGTGTC CACCATCTGT 660GTCCTCTATA CCACCCTGCT GTGCCGGCTG CATGCCATGC GGCTGGACAG CCACGCCAAG 720GCCCTGGAGC GCGCCAAGAA GCGGGTGACC TTCCTGGTGG TGGCAATCCT GGCGGTGTGC 780CTCCTCTGCT GGACGCCCTA CCACCTGAGC ACCGTGGTGG CGCTCACCAC CGACCTCCCG 840CAGACGCCGC TGGTCATCGC TATCTCCTAC TTCATCACCA GCCTGACGTA CGCCAACAGC 900TGCCTCAACC CCTTCCTCTA CGCCTTCCTG GACGCCAGCT TCCGCAGGAA CCTCCGCCAG 960CTGATAACTT GCCGCGCGGC AGCCTGA 987(13)SEQ ID NO12的資料(i)序列特征
(A)長度328個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO12的序列描述Met Asp Asn Ala Ser Phe Ser Glu Pro Trp Pro Ala Asn Ala Ser Gly1 5 10 15Pro Asp Pro Ala Leu Ser Cys Ser Asn Ala Ser Thr Leu Ala Pro Leu20 25 30Pro Ala Pro Leu Ala Val Ala Val Pro Val Val Tyr Ala Val Ile Cys35 40 45Ala Val Gly Leu Ala Gly Asn Ser Ala Val Leu Tyr Val Leu Leu Arg50 55 60Ala Pro Arg Met Lys Thr Val Thr Asn Leu Phe Ile Leu Asn Leu Ala65 70 75 80Ile Ala Asp Glu Leu Phe Thr Leu Val Leu Pro Ile Asn Ile Ala Asp85 90 95Phe Leu Leu Arg Gln Trp Pro Phe Gly Glu Leu Met Cys Lys Leu Ile100 105 110Val Ala Ile Asp Gln Tyr Asn Thr Phe Ser Ser Leu Tyr Phe Leu Thr115 120 125Val Met Ser Ala Asp Arg Tyr Leu Val Val Leu Ala Thr Ala Glu Ser130 135 140Arg Arg Val Ala Gly Arg Thr Tyr Ser Ala Ala Arg Ala Val Ser Leu145 150 155 160Ala Val Trp Gly Ile Val Thr Leu Val Val Leu Pro Phe Ala Val Phe165 170 175Ala Arg Leu Asp Asp Glu Gln Gly Arg Arg Gln Cys Val Leu Val Phe180 185 190Pro Gln Pro Glu Ala Phe Trp Trp Arg Ala Ser Arg Leu Tyr Thr Leu195 200 205Val Leu Gly Phe Ala Ile Pro Val Ser Thr Ile Cys Val Leu Tyr Thr210 215 220Thr Leu Leu Cys Arg Leu His Ala Met Arg Leu Asp Ser His Ala Lys225 230 235 240Ala Leu Glu Arg Ala Lys Lys Arg Val Thr Phe Leu Val Val Ala Ile245 250 255Leu Ala Val Cys Leu Leu Cys Trp Thr Pro Tyr His Leu Ser Thr Val260 265 270Val Ala Leu Thr Thr Asp Leu Pro Gln Thr Pro Leu Val Ile Ala Ile275 280 285Ser Tyr Phe Ile Thr Ser Leu Thr Tyr Ala Asn Ser Cys Leu Asn Pro290295 300Phe Leu Tyr Ala Phe Leu Asp Ala Ser Phe Arg Arg Asn Leu Arg Gln305 310 315 320Leu Ile Thr Cys Arg Ala Ala Ala325(14)SEQ ID NO13的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO13的序列描述CGGAATTCGT CAACGGTCCC AGCTACAATG 30(15)SEQ ID NO14的資料(i)序列特征(A)長度31個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO14的序列描述ATGGATCCCA GGCCCTTCAG CACCGCAATA T31(16)SEQ ID NO15的資料(i)序列特征
(A)長度1002個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO15的序列描述ATGCAGGCCG CTGGGCACCC AGAGCCCCTT GACAGCAGGG GCTCCTTCTC CCTCCCCACG 60ATGGGTGCCA ACGTCTCTCA GGACAATGGC ACTGGCCACA ATGCCACCTT CTCCGAGCCA 120CTGCCGTTCC TCTATGTGCT CCTGCCCGCC GTGTACTCCG GGATCTGTGC TGTGGGGCTG 180ACTGGCAACA CGGCCGTCAT CCTTGTAATC CTAAGGGCGC CCAAGATGAA GACGGTGACC 240AACGTGTTCA TCCTGAACCT GGCCGTCGCC GACGGGCTCT TCACGCTGGT ACTGCCCGTC 300AACATCGCGG AGCACCTGCT GCAGTACTGG CCCTTCGGGG AGCTGCTCTG CAAGCTGGTG 360CTGGCCGTCG ACCACTACAA CATCTTCTCC AGCATCTACT TCCTAGCCGT GATGAGCGTG 420GACCGATACC TGGTGGTGCT GGCCACCGTG AGGTCCCGCC ACATGCCCTG GCGCACCTAC 480CGGGGGGCGA AGGTCGCCAG CCTGTGTGTC TGGCTGGGCG TCACGGTCCT GGTTCTGCCC 540TTCTTCTCTT TCGCTGGCGT CTACAGCAAC GAGCTGCAGG TCCCAAGCTG TGGGCTGAGC 600TTCCCGTGGC CCGAGCGGGT CTGGTTCAAG GCCAGCCGTG TCTACACTTT GGTCCTGGGC 660TTCGTGCTGC CCGTGTGCAC CATCTGTGTG CTCTACACAG ACCTCCTGCG CAGGCTGCGG 720GCCGTGCGGC TCCGCTCTGG AGCCAAGGCT CTAGGCAAGG CCAGGCGGAA GGTGACCGTC 780CTGGTCCTCG TCGTGCTGGC CGTGTGCCTC CTCTGCTGGA CGCCCTTCCA CCTGGCCTCT 840GTCGTGGCCC TGACCACGGA CCTGCCCCAG ACCCCACTGG TCATCAGTAT GTCCTACGTC 900ATCACCAGCC TCACGTACGC CAACTCGTGC CTGAACCCCT TCCTCTACGC CTTTCTAGAT 960GACAACTTCC GGAAGAACTT CCGCAGCATA TTGCGGTGCT GA1002(17)SEQ ID NO16的資料(i)序列特征(A)長度333個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO16的序列描述Met Gln Ala Ala Gly His Pro Glu Pro Leu Asp Ser Arg Gly Ser Phe1 5 10 15Ser Leu Pro Thr Met Gly Ala Asn Val Ser Gln Asp Asn Gly Thr Gly20 25 30His Asn Ala Thr Phe Ser Glu Pro Leu Pro Phe Leu Tyr Val Leu Leu35 40 45Pro Ala Val Tyr Ser Gly Ile Cys Ala Val Gly Leu Thr Gly Asn Thr50 55 60Ala Val Ile Leu Val Ile Leu Arg Ala Pro Lys Met Lys Thr Val Thr65 70 75 80Asn Val Phe Ile Leu Asn Leu Ala Mal Ala Asp Gly Leu Phe Thr Leu85 90 95Val Leu Pro Val Asn Ile Ala Glu His Leu Leu Gln Tyr Trp Pro Phe100 105 110Gly Glu Leu Leu Cys Lys Leu Val Leu Ala Val Asp His Tyr Asn Ile115 120 125Phe Ser Ser Ile Tyr Phe Leu Ala Val Met Ser Val Asp Arg Tyr Leu130 135 140Val Val Leu Ala Thr Val Arg Ser Arg His Met Pro Trp Arg Thr Tyr145 150 155 160Arg Gly Ala Lys Val Ala Ser Leu Cys Val Trp Leu Gly Val Thr Val165 170 175Leu Val Leu Pro Phe Phe Ser Phe Ala Gly Val Tyr Ser Asn Glu Leu180 185 190Gln Val Pro Ser Cys Gly Leu Ser Phe Pro Trp Pro Glu Arg Val Trp195 200 205Phe Lys Ala Ser Arg Val Tyr Thr Leu Val Leu Gly Phe Val Leu Pro210 215 220Val Cys Thr Ile Cys Val Leu Tyr Thr Asp Leu Leu Arg Arg Leu Arg225 230 235 240Ala Val Arg Leu Arg Ser Gly Ala Lys Ala Leu Gly Lys Ala Arg Arg245 250 255Lys Val Thr Val Leu Val Leu Val Val Leu Ala Mal Cys Leu Leu Cys260 265 270Trp Thr Pro Phe His Leu Ala Ser Val Mal Ala Leu Thr Thr Asp Leu275 280 285Pro Gln Thr Pro Leu Val Ile Ser Met Ser Tyr Val Ile Thr Ser Leu290 295 300Thr Tyr Ala Asn Ser Cys Leu Asn Pro Phe Leu Tyr Ala Phe Leu Asp305 310 315 320Asp Asn Phe Arg Lys Asn Phe Arg Ser Ile Leu Arg Cys325 330(18)SEQ ID NO17的資料(i)序列特征(A)長度48個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO17的序列描述ACGAATTCAG CCATGGTCCT TGAGGTGAGT GACCACCAAG TGCTAAAT 48(19)SEQ ID NO18的資料(i)序列特征(A)長度27個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO18的序列描述GAGGATCCTG GAATGCGGGG AAGTCAG27(20)SEQ ID NO19的資料(i)序列特征(A)長度1107個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO19的序列描述ATGGTCCTTG AGGTGAGTGA CCACCAAGTG CTAAATGACG CCGAGGTTGC CGCCCTCCTG 60GAGAACTTCA GCTCTTCCTA TGACTATGGA GAAAACGAGA GTGACTCGTG CTGTACCTCC 120CCGCCCTGCC CACAGGACTT CAGCCTGAAC TTCGACCGGG CCTTCCTGCC AGCCCTCTAC 180AGCCTCCTCT TTCTGCTGGG GCTGCTGGGC AACGGCGCGG TGGCAGCCGT GCTGCTGAGC 240CGGCGGACAG CCCTGAGCAG CACCGACACC TTCCTGCTCC ACCTAGCTGT AGCAGACACG 300CTGCTGGTGC TGACACTGCC GCTCTGGGCA GTGGACGCTG CCGTCCAGTG GGTCTTTGGC 360TCTGGCCTCT GCAAAGTGGC AGGTGCCCTC TTCAACATCA ACTTCTACGC AGGAGCCCTC 420CTGCTGGCCT GCATCAGCTT TGACCGCTAC CTGAACATAG TTCATGCCAC CCAGCTCTAC 480CGCCGGGGGC CCCCGGCCCG CGTGACCCTC ACCTGCCTGG CTGTCTGGGG GCTCTGCCTG 540CTTTTCGCCC TCCCAGACTT CATCTTCCTG TCGGCCCACC ACGACGAGCG CCTCAACGCC 600ACCCACTGCC AATACAACTT CCCACAGGTG GGCCGCACGG CTCTGCGGGT GCTGCAGCTG 660GTGGCTGGCT TTCTGCTGCC CCTGCTGGTC ATGGCCTACT GCTATGCCCA CATCCTGGCC 720GTGCTGCTGG TTTCCAGGGG CCAGCGGCGC CTGCGGGCCA TGCGGCTGGT GGTGGTGGTC 780GTGGTGGCCT TTGCCCTCTG CTGGACCCCC TATCACCTGG TGGTGCTGGT GGACATCCTC 840ATGGACCTGG GCGCTTTGGC CCGCAACTGT GGCCGAGAAA GCAGGGTAGA CGTGGCCAAG 900TCGGTCACCT CAGGCCTGGG CTACATGCAC TGCTGCCTCA ACCCGCTGCT CTATGCCTTT 960GTAGGGGTCA AGTTCCGGGA GCGGATGTGG ATGCTGCTCT TGCGCCTGGG CTGCCCCAAC 1020CAGAGAGGGC TCCAGAGGCA GCCATCGTCT TCCCGCCGGG ATTCATCCTG GTCTGAGACC 1080TCAGAGGCCT CCTACTCGGG CTTGTGA 1107(21)SEQ ID NO20的資料(i)序列特征(A)長度368個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO20的序列描述Met Val Leu Glu Val Ser Asp His Gln Val Leu Asn Asp Ala Glu Val1 5 10 15Ala Ala Leu Leu Glu Asn Phe Ser Ser Ser Tyr Asp Tyr Gly Glu Asn20 25 30Glu Ser Asp Ser Cys Cys Thr Ser Pro Pro Cys Pro Gln Asp Phe Ser35 40 45Leu Asn Phe Asp Arg Ala Phe Leu Pro Ala Leu Tyr Ser Leu Leu Phe50 55 60Leu Leu Gly Leu Leu Gly Asn G1y Ala Val Ala Ala Val Leu Leu Ser65 70 75 80Arg Arg Thr Ala Leu Ser Ser Thr Asp Thr Phe Leu Leu His Leu Ala85 90 95Val Ala Asp Thr Leu Leu Val Leu Thr Leu Pro Leu Trp Ala Val Asp100 105 110Ala Ala Val Gln Trp Val Phe Gly Ser Gly Leu Cys Lys Val Ala Gly115 120 125Ala Leu Phe Asn Ile Asn Phe Tyr Ala Gly Ala Leu Leu Leu Ala Cys130 135 140Ile Ser Phe Asp Arg Tyr Leu Asn Ile Val His Ala Thr Gln Leu Tyr145 150 155 160Arg Arg Gly Pro Pro Ala Arg Val Thr Leu Thr Cys Leu Ala Val Trp165 170 175Gly Leu Cys Leu Leu Phe Ala Leu Pro Asp Phe Ile Phe Leu Ser Ala
180 185 190His His Asp Glu Arg Leu Asn Ala Thr His Cys Gln Tyr Asn Phe Prol95 200 205Gln Val Gly Arg Thr Ala Leu Arg Val Leu Gln Leu Val Ala Gly Phe210 215 220Leu Leu Pro Leu Leu Val Met Ala Tyr Cys Tyr Ala His Ile Leu Ala225 230 235 240Val Leu Leu Val Ser Arg Gly Gln Arg Arg Leu Arg Ala Met Arg Leu245 250 255Val Val Val Val Val Val Ala Phe Ala Leu Cys Trp Thr Pro Tyr His260 265 270Leu Val Val Leu Val Asp Ile Leu Met Asp Leu Gly Ala Leu Ala Arg275 280 285Asn Cys Gly Arg Glu Ser Arg Val Asp Val Ala Lys Ser Val Thr Ser290 295 300Gly Leu Gly Tyr Met His Cys Cys Leu Asn Pro Leu Leu Tyr Ala Phe305 310 315 320Val Gly Val Lys Phe Arg Glu Arg Met Trp Met Leu Leu Leu Arg Leu325 330 335Gly Cys Pro Asn Gln Arg Gly Leu Gln Arg Gln Pro Ser Ser Ser Arg340 345 350Arg Asp Ser Ser Trp Ser Glu Thr Ser Glu Ala Ser Tyr Ser Gly Leu355 360 365(22)SEQ ID NO21的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO21的序列描述TTAAGCTTGA CCTAATGCCA TCTTGTGTCC 30(23)SEQ ID NO22的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組)(xi)SEQ ID NO22的序列描述TTGGATCCAA AAGAACCATG CACCTCAGAG 30(24)SEQ ID NO23的資料(i)序列特征(A)長度1074個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO23的序列描述ATGGCTGATG ACTATGGCTC TGAATCCACA TCTTCCATGG AAGACTACGT TAACTTCAAC 60TTCACTGACT TCTACTGTGA GAAAAACAAT GTCAGGCAGT TTGCGAGCCA TTTCCTCCCA 120CCCTTGTACT GGCTCGTGTT CATCGTGGGT GCCTTGGGCA ACAGTCTTGT TATCCTTGTC 180TACTGGTACT GCACAAGAGT GAAGACCATG ACCGACATGT TCCTTTTGAA TTTGGCAATT 240GCTGACCTCC TCTTTCTTGT CACTCTTCCC TTCTGGGCCA TTGCTGCTGC TGACCAGTGG 300AAGTTCCAGA CCTTCATGTG CAAGGTGGTC AACAGCATGT ACAAGATGAA CTTCTACAGC 360TGTGTGTTGC TGATCATGTG CATCAGCGTG GACAGGTACA TTGCCATTGC CCAGGCCATG 420AGAGCACATA CTTGGAGGGA GAAAAGGCTT TTGTACAGCA AAATGGTTTG CTTTACCATC 480TGGGTATTGG CAGCTGCTCT CTGCATCCCA GAAATCTTAT ACAGCCAAAT CAAGGAGGAA 540TCCGGCATTG CTATCTGCAC CATGGTTTAC CCTAGCGATG AGAGCACCAA ACTGAAGTCA 600GCTGTCTTGA CCCTGAAGGT CATTCTGGGG TTCTTCCTTC CCTTCGTGGT CATGGCTTGC 660TGCTATACCA TCATCATTCA CACCCTGATA CAAGCCAAGA AGTCTTCCAA GCACAAAGCC 720CTAAAAGTGA CCATCACTGT CCTGACCGTC TTTGTCTTGT CTCAGTTTCC CTACAACTGC 780ATTTTGTTGG TGCAGACCAT TGACGCCTAT GCCATGTTCA TCTCCAACTG TGCCGTTTCC 840ACCAACATTG ACATCTGCTT CCAGGTCACC CAGACCATCG CCTTCTTCCA CAGTTGCCTG 900AACCCTGTTC TCTATGTTTT TGTGGGTGAG AGATTCCGCC GGGATCTCGT GAAAACCCTG 960AAGAACTTGG GTTGCATCAG CCAGGCCCAG TGGGTTTCAT TTACAAGGAG AGAGGGAAGC 1020TTGAAGCTGT CGTCTATGTT GCTGGAGACA ACCTCAGGAG CACTCTCCCT CTGA 1074(25)SEQ ID NO24的資料(i)序列特征(A)長度357個氨基酸(B)類型氨基酸(C)鏈型
(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO24的序列描述Met Ala Asp Asp Tyr Gly Ser Glu Ser Thr Ser Ser Met Glu Asp Tyr1 5 10 15Val Asn Phe Asn Phe Thr Asp Phe Tyr Cys Glu Lys Asn Asn Val Arg20 25 30Gln Phe Ala Ser His Phe Leu Pro Pro Leu Tyr Trp Leu Val Phe Ile35 40 45Val Gly Ala Leu Gly Asn Ser Leu Val Ile Leu Val Tyr Trp Tyr Cys50 55 60Thr Arg Val Lys Thr Met Thr Asp Met Phe Leu Leu Asn Leu Ala Ile65 70 75 80Ala Asp Leu Leu Phe Leu Val Thr Leu Pro Phe Trp Ala Ile Ala Ala85 90 95Ala Asp Gln Trp Lys Phe Gln Thr Phe Met Cys Lys Val Val Asn Ser100 105 110Met Tyr Lys Met Asn Phe Tyr Ser Cys Val Leu Leu Ile Met Cys Ile115 120 125Ser Val Asp Arg Tyr Ile Ala Ile Ala Gln Ala Met Arg Ala His Thr130 135 140Trp Arg Glu Lys Arg Leu Leu Tyr Ser Lys Met Val Cys Phe Thr Ile145 150 155 160Trp Val Leu Ala Ala Ala Leu Cys Ile Pro Glu Ile Leu Tyr Ser Gln165 170 175Ile Lys Glu Glu Ser Gly Ile Ala Ile Cys Thr Met Val Tyr Pro Ser180 185 190Asp Glu Ser Thr Lys Leu Lys Ser Ala Val Leu Thr Leu Lys Val Ile195 200 205Leu Gly Phe Phe Leu Pro Phe Val Val Met Ala Cys Cys Tyr Thr Ile210 215 220Ile Ile His Thr Leu Ile Gln Ala Lys Lys Ser Ser Lys His Lys Ala225 230 235 240Leu Lys Val Thr Ile Thr Val Leu Thr Val Phe Val Leu Ser Gln Phe245 250 255Pro Tyr Asn Cys Ile Leu Leu Val Gln Thr Ile Asp Ala Tyr Ala Met260 265 270Phe Ile Ser Asn Cys Ala Val Ser Thr Asn Ile Asp Ile Cys Phe Gln275 280 285Val Thr Gln Thr Ile Ala Phe Phe His Ser Cys Leu Asn Pro Val Leu290 295 300Tyr Val Phe Val Gly Glu Arg Phe Arg Arg Asp Leu Val Lys Thr Leu305 310 315 320Lys Asn Leu Gly Cys Ile Ser Gln Ala Gln Trp Val Ser Phe Thr Arg325 330 335Arg Glu Gly Ser Leu Lys Leu Ser Ser Met Leu Leu Glu Thr Thr Ser340 345 350Gly Ala Leu Ser Leu355(26)SEQ ID NO25的資料(i)序列特征(A)長度1110個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO25的序列描述A1(27)SEQ ID NO26的資料(i)序列特征(A)長度369個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO26的序列描述Met Ala Ser Ser Thr Thr Arg Gly Pro Arg Val Ser Asp Leu Phe Ser1 5 10 15Gly Leu Pro Pro Ala Val Thr Thr Pro Ala Asn Gln Ser Ala Glu Ala20 25 30Ser Ala Gly Asn Gly Ser Val Ala Gly Ala Asp Ala Pro Ala Val Thr35 40 45Pro Phe Gln Ser Leu Gln Leu Val His Gln Leu Lys Gly Leu Ile Val50 55 60Leu Leu Tyr Ser Val Val Val Val Val Gly Leu Val Gly Asn Cys Leu65 70 75 80Leu Val Leu Val Ile Ala Arg Val Pro Arg Leu His Asn Val Thr Asn85 90 95Phe Leu Ile Gly Asn Leu Ala Leu Ser Asp Val Leu Met Cys Thr Ala100 105 110Cys Val Pro Leu Thr Leu Ala Tyr Ala Phe Glu Pro Arg Gly Trp Val115 120 125Phe Gly Gly Gly Leu Cys His Leu Val Phe Phe Leu Gln Pro Val Thr130 135 140Val Tyr Val Ser Val Phe Thr Leu Thr Thr Ile Ala Val Asp Arg Tyr145 150 155 160Val Val Leu Val His Pro Leu Arg Arg Ala Ser Arg Cys Ala Ser Ala165 170 175Tyr Ala Val Leu Ala Ile Trp Ala Leu Ser Ala Val Leu Ala Leu Pro180 185 190Pro Ala Val His Thr Tyr His Val Glu Leu Lys Pro His Asp Val Arg195 200 205Leu Cys Glu Glu Phe Trp Gly Ser Gln Glu Arg Gln Arg Gln Leu Tyr210 215 220Ala Trp Gly Leu Leu Leu Val Thr Tyr Leu Leu Pro Leu Leu Val Ile225 230 235 240Leu Leu Ser Tyr Val Arg Val Ser Val Lys Leu Arg Asn Arg Val Val245 250 255Pro Gly Cys Val Thr Gln Ser Gln Ala Asp Trp Asp Arg Ala Arg Arg260 265 270Arg Arg Thr Phe Cys Leu Leu Val Val Val Val Val Val Phe Ala Val275 280 285Cys Trp Leu Pro Leu His Val Phe Asn Leu Leu Arg Asp Leu Asp Pro290 295 300His Ala Ile Asp Pro Tyr Ala Phe Gly Leu Val Gln Leu Leu Cys His305 310 315 320Trp Leu Ala Met Ser Ser Ala Cys Tyr Asn Pro Phe lle Tyr Ala Trp325 330 335Leu His Asp Ser Phe Arg Glu Glu Leu Arg Lys Leu Leu Val Ala Trp340 345 350Pro Arg Lys Ile Ala Pro His Gly Gln Asn Met Thr Val Ser Val Val355 360 365Ile(28)SEQ ID NO27的資料(i)序列特征(A)長度1083個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO27的序列描述ATGGACCCAG AAGAAACTTC AGTTTATTTG GATTATTACT ATGCTACGAG CCCAAACTCT 60GACATCAGGG AGACCCACTC CCATGTTCCT TACACCTCTG TCTTCCTTCC AGTCTTTTAC 120ACAGCTGTGT TCCTGACTGG AGTGCTGGGG AACCTTGTTC TCATGGGAGC GTTGCATTTC 180AAACCCGGCA GCCGAAGACT GATCGACATC TTTATCATCA ATCTGGCTGC CTCTGACTTC 240ATTTTTCTTG TCACATTGCC TCTCTGGGTG GATAAAGAAG CATCTCTAGG ACTGTGGAGG 300ACGGGCTCCT TCCTGTGCAA AGGGAGCTCC TACATGATCT CCGTCAATAT GCACTGCAGT 360GTCCTCCTGC TCACTTGCAT GAGTGTTGAC CGCTACCTGG CCATTGTGTG GCCAGTCGTA 420TCCAGGAAAT TCAGAAGGAC AGACTGTGCA TATGTAGTCT GTGCCAGCAT CTGGTTTATC 480TCCTGCCTGC TGGGGTTGCC TACTCTTCTG TCCAGGGAGC TCACGCTGAT TGATGATAAG 540CCATACTGTG CAGAGAAAAA GGCAACTCCA ATTAAACTCA TATGGTCCCT GGTGGCCTTA 600ATTTTCACCT TTTTTGTCCC TTTGTTGAGC ATTGTGACCT GCTACTGTTG CATTGCAAGG 660AAGCTGTGTG CCCATTACCA GCAATCAGGA AAGCACAACA AAAAGCTGAA GAAATCTATA 720AAGATCATCT TTATTGTCGT GGCAGCCTTT CTTGTCTCCT GGCTGCCCTT CAATACTTTC 780AAGTTCCTGG CCATTGTCTC TGGGTTGCGG CAAGAACACT ATTTACCCTC AGCTATTCTT 840CAGCTTGGTA TGGAGGTGAG TGGACCCTTG GCATTTGCCA ACAGCTGTGT CAACCCTTTC 900ATTTACTATA TCTTCGACAG CTACATCCGC CGGGCCATTG TCCACTGCTT GTGCCCTTGC 960CTGAAAAACT ATGACTTTGG GAGTAGCACT GAGACATCAG ATAGTCACCT CACTAAGGCT 1020CTCTCCACCT TCATTCATGC AGAAGATTTT GCCAGGAGGA GGAAGAGGTC TGTGTCACTC 1080TAA 1083(29)SEQ ID NO28的資料(i)序列特征(A)長度360個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO28的序列描述Met Asp Pro Glu Glu Thr Ser Val Tyr Leu Asp Tyr Tyr Tyr Ala Thr1 5 10 15Ser Pro Asn Ser Asp Ile Arg Glu Thr His Ser His Val Pro Tyr Thr20 25 30Ser Val Phe Leu Pro Val Phe Tyr Thr Ala Val Phe Leu Thr Gly Val35 40 45Leu Gly Asn Leu Val Leu Met Gly Ala Leu His Phe Lys Pro Gly Ser50 55 60Arg Arg Leu Ile Asp Ile Phe Ile Ile Asn Leu Ala Ala Ser Asp Phe65 70 75 80Ile Phe Leu Val Thr Leu Pro Leu Trp Val Asp Lys Glu Ala Ser Leu85 90 95Gly Leu Trp Arg Thr Gly Ser Phe Leu Cys Lys Gly Ser Ser Tyr Met100 105 110Ile Ser Val Asn Met His Cys Ser Val Leu Leu Leu Thr Cys Met Ser115 120 125Val Asp Arg Tyr Leu Ala Ile Val Trp Pro Val Val Ser Arg Lys Phe130 135 140Arg Arg Thr Asp Cys Ala Tyr Val Val Cys Ala Ser Ile Trp Phe Ile145 150 155 160Ser Cys Leu Leu Gly Leu Pro Thr Leu Leu Ser Arg Glu Leu Thr Leu165 170 175Ile Asp Asp Lys Pro Tyr Cys Ala Glu Lys Lys Ala Thr Pro Ile Lys180 185 190Leu Ile Trp Ser Leu Val Ala Leu Ile Phe Thr Phe Phe Val Pro Leu195 200 205Leu Ser Ile Val Thr Cys Tyr Cys Cys Ile Ala Arg Lys Leu Cys Ala210 215 220His Tyr Gln Gln Ser Gly Lys His Asn Lys Lys Leu Lys Lys Ser Ile225 230 235 240Lys Ile Ile Phe Ile Val Val Ala Ala Phe Leu Val Ser Trp Leu Pro245 250 255Phe Asn Thr Phe Lys Phe Leu Ala Ile Val Ser Gly Leu Arg Gln Glu260 265 270His Tyr Leu Pro Ser Ala Ile Leu Gln Leu Gly Met Glu Val Ser Gly275 280 285Pro Leu Ala Phe Ala Asn Ser Cys Val Asn Pro Phe Ile Tyr Tyr Ile290 295 300Phe Asp Ser Tyr Ile Arg Arg Ala Ile Val His Cys Leu Cys Pro Cys305 310 315 320Leu Lys Asn Tyr Asp Phe Gly Ser Ser Thr Glu Thr Ser Asp Ser His325 330 335Leu Thr Lys Ala Leu Ser Thr Phe Ile His Ala Glu Asp Phe Ala Arg340 345 350Arg Arg Lys Arg Ser Val Ser Leu355 360(30)SEQ ID NO29的資料(i)序列特征(A)長度31個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO29的序列描述CTAGAATTCT GACTCCAGCC AAAGCATGAA T 3l(31)SEQ ID NO30的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO30的序列描述GCTGGATCCT AAACAGTCTG CGCTCGGCCT 30(32)SEQ ID NO31的資料(i)序列特征(A)長度1020個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO31的序列描述ATGAATGGCC TTGAAGTGGC TCCCCCAGGT CTGATCACCA ACTTCTCCCT GGCCACGGCA 60GAGCAATGTG GCCAGGAGAC GCCACTGGAG AACATGCTGT TCGCCTCCTT CTACCTTCTG 120GATTTTATCC TGGCTTTAGT TGGCAATACC CTGGCTCTGT GGCTTTTCAT CCGAGACCAC 180AAGTCCGGGA CCCCGGCCAA CGTGTTCCTG ATGCATCTGG CCGTGGCCGA CTTGTCGTGC 240GTGCTGGTCC TGCCCACCCG CCTGGTCTAC CACTTCTCTG GGAACCACTG GCCATTTGGG 300GAAATCGCAT GCCGTCTCAC CGGCTTCCTC TTCTACCTCA ACATGTACGC CAGCATCTAC 360TTCCTCACCT GCATCAGCGC CGACCGTTTC CTGGCCATTG TGCACCCGGT CAAGTCCCTC 420AAGCTCCGCA GGCCCCTCTA CGCACACCTG GCCTGTGCCT TCCTGTGGGT GGTGGTGGCT 480GTGGCCATGG CCCCGCTGCT GGTGAGCCCA CAGACCGTGC AGACCAACCA CACGGTGGTC 540TGCCTGCAGC TGTACCGGGA GAAGGCCTCC CACCATGCCC TGGTGTCCCT GGCAGTGGCC 600TTCACCTTCC CGTTCATCAC CACGGTCACC TGCTACCTGC TGATCATCCG CAGCCTGCGG 660CAGGGCCTGC GTGTGGAGAA GCGCCTCAAG ACCAAGGCAG TGCGCATGAT CGCCATAGTG 720CTGGCCATCT TCCTGGTCTG CTTCGTGCCC TACCACGTCA ACCGCTCCGT CTACGTGCTG 780CACTACCGCA GCCATGGGGC CTCCTGCGCC ACCCAGCGCA TCCTGGCCCT GGCAAACCGC 840ATCACCTCCT GCCTCACCAG CCTCAACGGG GCACTCGACC CCATCATGTA TTTCTTCGTG 900GCTGAGAAGT TCCGCCACGC CCTGTGCAAC TTGCTCTGTG GCAAAAGGCT CAAGGGCCCG 960CCCCCCAGCT TCGAAGGGAA AACCAACGAG AGCTCGCTGA GTGCCAAGTC AGAGCTGTGA 1020(33)SEQ ID NO32的資料(i)序列特征(A)長度339個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO32的序列描述Met Asn Gly Leu Glu Val Ala Pro Pro Gly Leu Ile Thr Asn Phe Ser1 5 10 15Leu Ala Thr Ala Glu Gln Cys Gly Gln Glu Thr Pro Leu Glu Asn Met20 25 30Leu Phe Ala Ser Phe Tyr Leu Leu Asp Phe Ile Leu Ala Leu Val Gly35 40 45Asn Thr Leu Ala Leu Trp Leu Phe Ile Arg Asp His Lys Ser Gly Thr50 55 60Pro Ala Asn Val Phe Leu Met His Leu Ala Val Ala Asp Leu Ser Cys65 70 75 80Val Leu Val Leu Pro Thr Arg Leu Val Tyr His Phe Ser Gly Asn His85 90 95Trp Pro Phe Gly Glu Ile Ala Cys Arg Leu Thr Gly Phe Leu Phe Tyr100 105 110Leu Asn Met Tyr Ala Ser Ile Tyr Phe Leu Thr Cys Ile Ser Ala Asp115 120 125Arg Phe Leu Ala Ile Val His Pro Val Lys Ser Leu Lys Leu Arg Arg130 135 140Pro Leu Tyr Ala His Leu Ala Cys Ala Phe Leu Trp Val Val Val Ala145 150 155 160Val Ala Met Ala Pro Leu Leu Val Ser Pro Gln Thr Val Gln Thr Asn165 170 175His Thr Val Val Cys Leu Gln Leu Tyr Arg Glu Lys Ala Ser His His180 185 190Ala Leu Val Ser Leu Ala Val Ala Phe Thr Phe Pro Phe Ile Thr Thr195 200 205Val Thr Cys Tyr Leu Leu Ile Ile Arg Ser Leu Arg Gln Gly Leu Arg210 215 220Val Glu Lys Arg Leu Lys Thr Lys Ala Val Arg Met Ile Ala Ile Val225 230 235 240Leu Ala Ile Phe Leu Val Cys Phe Val Pro Tyr His Val Asn Arg Ser245 250 255Val Tyr Val Leu His Tyr Arg Ser His Gly Ala Ser Cys Ala Thr Gln260 265 270Arg Ile Leu Ala Leu Ala Asn Arg Ile Thr Ser Cys Leu Thr Ser Leu275 280 285Asn Gly Ala Leu Asp Pro Ile Met Tyr Phe Phe Val Ala Glu Lys Phe290 295 300Arg His Ala Leu Cys Asn Leu Leu Cys Gly Lys Arg Leu Lys Gly Pro305 310 315 320Pro Pro Ser Phe Glu Gly Lys Thr Asn Glu Ser Ser Leu Ser Ala Lys325 330 335Ser Glu Leu(34)SEQ ID NO33的資料(i)序列特征(A)長度29個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO33的序列描述ATAAGATGAT CACCCTGAAC AATCAAGAT 29(35)SEQ ID NO34的資料(i)序列特征(A)長度33個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO34的序列描述TCCGAATTCA TAACATTTCA CTGTTTATAT TGC 33(36)SEQ ID NO35的資料(i)序列特征(A)長度996個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO35的序列描述ATGATCACCC TGAACAATCA AGATCAACCT GTCACTTTTA ACAGCTCACA TCCAGATGAA 60TACAAAATTG CAGCCCTTGT CTTCTATAGC TGTATCTTCA TAATTGGATT ATTTGTTAAC 120ATCACTGCAT TATGGGTTTT CAGTTGTACC ACCAAGAAGA GAACCACGGT AACCATCTAT 180ATGATGAATG TGGCATTAGT GGACTTGATA TTTATAATGA CTTTACCCTT TCGAATGTTT 240TATTATGCAA AAGATGCATG GCCATTTGGA GAGTACTTCT GCCAGATTAT TGGAGCTCTC 300ACAGTGTTTT ACCCAAGCAT TGCTTTATGG CTTCTTGCCT TTATTAGTGC TGACAGATAC 360ATGGCCATTG TACAGCCGAA GTACGCCAAA GAACTTAAAA ACACGTGCAA AGCCGTGCTG 420GCGTGTGTGG GAGTCTGGAT AATGACCCTG ACCACGACCA CCCCTCTGCT ACTGCTCTAT 480AAAGACCCAG ATAAAGACTC CACTCCCGCC ACCTGCCTCA AGATTTCTGA CATCATCTAT 540CTAAAAGCTG TGAACGTGCT GAACCTCACT CGACTGACAT TTTTTTTCTT GATTCCTTTG 600TTCATCATGA TTGGGTGCTA CTTGGTCATT ATTCATAATC TCCTTCACGG CAGGACGTCT 660AAGCTGAAAC CCAAAGTCAA GGAGAAGTCC ATAAGGATCA TCATCACGCT GCTGGTGCAG 720GTGCTCGTCT GCTTTATGCC CTTCCACATC TGTTTCGCTT TCCTGATGCT GGGAACGGGG 780GAGAACAGTT ACAATCCCTG GGGAGCCTTT ACCACCTTCC TCATGAACCT CAGCACGTGT 840CTGGATGTGA TTCTCTACTA CATCGTTTCA AAACAATTTC AGGCTCGAGT CATTAGTGTC 900ATGCTATACC GTAATTACCT TCGAAGCCTG CGCAGAAAAA GTTTCCGATC TGGTAGTCTA 960AGGTCACTAA GCAATATAAA CAGTGAAATG TTATGA 996(37)SEQ ID NO36的資料(i)序列特征(A)長度331個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO36的序列描述Met Ile Thr Leu Asn Asn Gln Asp Gln Pro Val Thr Phe Asn Ser Ser1 5 10 15His Pro Asp Glu Tyr Lys Ile Ala Ala Leu Val Phe Tyr Ser Cys Ile20 25 30Phe Ile Ile Gly Leu Phe Val Asn Ile Thr Ala Leu Trp Val Phe Ser35 40 45Cys Thr Thr Lys Lys Arg Thr Thr Val Thr Ile Tyr Met Met Asn Val50 55 60Ala Leu Val Asp Leu Ile Phe Ile Met Thr Leu Pro Phe Arg Met Phe65 70 75 80Tyr Tyr Ala Lys Asp Ala Trp Pro Phe Gly Glu Tyr Phe Cys Gln Ile85 90 95Ile Gly Ala Leu Thr Val Phe Tyr Pro Ser Ile Ala Leu Trp Leu Leu100 105 110Ala Phe Ile Ser Ala Asp Arg Tyr Met Ala Ile Val Gln Pro Lys Tyr115 120 125Ala Lys Glu Leu Lys Asn Thr Cys Lys Ala Val Leu Ala Cys Val Gly
130 135 140Val Trp Ile Met Thr Leu Thr Thr Thr Thr Pro Leu Leu Leu Leu Tyr145 150 155 160Lys Asp Pro Asp Lys Asp Ser Thr Pro Ala Thr Cys Leu Lys Ile Ser165 170 175Asp Ile Ile Tyr Leu Lys Ala Val Asn Val Leu Asn Leu Thr Arg Leu180 185 190Thr Phe Phe Phe Leu Ile Pro Leu Phe Ile Met Ile Gly Cys Tyr Leu195 200 205Val Ile Ile His Asn Leu Leu His Gly Arg Thr Ser Lys Leu Lys Pro210 215220Lys Val Lys Glu Lys Ser Ile Arg Ile Ile Ile Thr Leu Leu Val Gln225 230 235 240Val Leu Val Cys Phe Met Pro Phe His Ile Cys Phe Ala Phe Leu Met245 250 255Leu Gly Thr Gly Glu Asn Ser Tyr Asn Pro Trp Gly Ala Phe Thr Thr260 265 270Phe Leu Met Asn Leu Ser Thr Cys Leu Asp Val Ile Leu Tyr Tyr Ile275 280 285Val Ser Lys Gln Phe Gln Ala Arg Val Ile Ser Val Met Leu Tyr Arg290 295 300Asn Tyr Leu Arg Ser Leu Arg Arg Lys Ser Phe Arg Ser Gly Ser Leu305 310 315 320Arg Ser Leu Ser Asn Ile Asn Ser Glu Met Leu330(38)SEQ ID NO37的資料(i)序列特征(A)長度28個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO37的序列描述CCAAGCTTCC AGGCCTGGGG TGTGCTGG28(39)SEQ ID NO38的資料(i)序列特征(A)長度29個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO38的序列描述ATGGATCCTG ACCTTCGGCC CCTGGCAGA 29(40)SEQ ID NO39的資料(i)序列特征(A)長度1077個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO39的序列描述ATGCCCTCTG TGTCTCCAGC GGGGCCCTCG GCCGGGGCAG TCCCCAATGC CACCGCAGTG 60ACAACAGTGC GGACCAATGC CAGCGGGCTG GAGGTGCCCC TGTTCCACCT GTTTGCCCGG 120CTGGACGAGG AGCTGCATGG CACCTTCCCA GGCCTGTGCG TGGCGCTGAT GGCGGTGCAC 180GGAGCCATCT TCCTGGCAGG GCTGGTGCTC AACGGGCTGG CGCTGTACGT CTTCTGCTGC 240CGCACCCGGG CCAAGACACC CTCAGTCATC TACACCATCA ACCTGGTGGT GACCGATCTA 300CTGGTAGGGC TGTCCCTGCC CACGCGCTTC GCTGTGTACT ACGGCGCCAG GGGCTGCCTG 360CGCTGTGCCT TCCCGCACGT CCTCGGTTAC TTCCTCAACA TGCACTGCTC CATCCTCTTC 420CTCACCTGCA TCTGCGTGGA CCGCTACCTG GCCATCGTGC GGCCCGAAGG CTCCCGCCGC 480TGCCGCCAGC CTGCCTGTGC CAGGGCCGTG TGCGCCTTCG TGTGGCTGGC CGCCGGTGCC 540GTCACCCTGT CGGTGCTGGG CGTGACAGGC AGCCGGCCCT GCTGCCGTGT CTTTGCGCTG 600ACTGTCCTGG AGTTCCTGCT GCCCCTGCTG GTCATCAGCG TGTTTACCGG CCGCATCATG 660TGTGCACTGT CGCGGCCGGG TCTGCTCCAC CAGGGTCGCC AGCGCCGCGT GCGGGCCATG 720CAGCTCCTGC TCACGGTGCT CATCATCTTT CTCGTCTGCT TCACGCCCTT CCACGCCCGC 780CAAGTGGCCG TGGCGCTGTG GCCCGACATG CCACACCACA CGAGCCTCGT GGTCTACCAC 840GTGGCCGTGA CCCTCAGCAG CCTCAACAGC TGCATGGACC CCATCGTCTA CTGCTTCGTC 900ACCAGTGGCT TCCAGGCCAC CGTCCGAGGC CTCTTCGGCC AGCACGGAGA GCGTGAGCCC 960AGCAGCGGTG ACGTGGTCAG CATGCACAGG AGCTCCAAGG GCTCAGGCCG TCATCACATC 1020CTCAGTGCCG GCCCTCACGC CCTCACCCAG GCCCTGGCTA ATGGGCCCGA GGCTTAG1077(41)SEQ ID NO40的資料(i)序列特征(A)長度358個氨基酸(B)類型氨基酸(C)鏈型
(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO40的序列描述Met Pro Ser Val Ser Pro Ala Gly Pro Ser Ala Gly Ala Val Pro Asn1 5 10 15Ala Thr Ala Val Thr Thr Val Arg Thr Asn Ala Ser Gly Leu Glu Val20 25 30Pro Leu Phe His Leu Phe Ala Arg Leu Asp Glu Glu Leu His Gly Thr35 40 45Phe Pro Gly Leu Cys Val Ala Leu Met Ala Val His Gly Ala Ile Phe50 55 60Leu Ala Gly Leu Val Leu Asn Gly Leu Ala Leu Tyr Val Phe Cys Cys65 70 75 80Arg Thr Arg Ala Lys Thr Pro Ser Val Ile Tyr Thr Ile Asn Leu Val85 90 95Val Thr Asp Leu Leu Val Gly Leu Ser Leu Pro Thr Arg Phe Ala Val100 105 110Tyr Tyr Gly Ala Arg Gly Cys Leu Arg Cys Ala Phe Pro His Val Leu115 120 125Gly Tyr Phe Leu Asn Met His Cys Ser Ile Leu Phe Leu Thr Cys Ile130 135 140Cys Val Asp Arg Tyr Leu Ala Ile Val Arg Pro Glu Ala Pro Ala Ala145 150 155 160Cys Arg Gln Pro Ala Cys Ala Arg Ala Val Cys Ala Phe Val Trp Leu165 170 175Ala Ala Gly Ala Val Thr Leu Ser Val Leu Gly Val Thr Gly Ser Arg180 185 190Pro Cys Cys Arg Val Phe Ala Leu Thr Val Leu Glu Phe Leu Leu Pro195 200 205Leu Leu Val Ile Ser Val Phe Thr Gly Arg Ile Met Cys Ala Leu Ser210 215 220Arg Pro Gly Leu Leu His Gln Gly Arg Gln Arg Arg Val Arg Ala Met225 230 235 240Gln Leu Leu Leu Thr Val Leu Ile Ile Phe Leu Val Cys Phe Thr Pro245 250 255Phe His Ala Arg Gln Val Ala Val Ala Leu Trp Pro Asp Met Pro His260 265 270His Thr Ser Leu Val Val Tyr His Val Ala Val Thr Leu Ser Ser Leu275 280 285Asn Ser Cys Met Asp Pro Ile Val Tyr Cys Phe Val Thr Ser Gly Phe290 295 300Gln Ala Thr Val Arg Gly Leu Phe Gly Gln His Gly Glu Arg Glu Pro305 310 315 320Ser Ser Gly Asp Val Val Ser Met His Arg Ser Ser Lys Gly Ser Gly325 330 335Arg His His Ile Leu Ser Ala Gly Pro His Ala Leu Thr Gln Ala Leu340 345 350Ala Asn Gly Pro Glu Ala355(42)SEQ ID NO41的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO41的序列描述GAGAATTCAC TCCTGAGCTC AAGATGAACT 30(43)SEQ ID NO42的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO42的序列描述CGGGATCCCC GTAACTGAGC CACTTCAGAT 30(44)SEQ ID NO43的資料(i)序列特征(A)長度1050個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO43的序列描述ATGAACTCCA CCTTGGATGG TAATCAGAGC AGCCACCCTT TTTGCCTCTT GGCATTTGGC 60TATTTGGAAA CTGTCAATTT TTGCCTTTTG GAAGTATTGA TTATTGTCTT TCTAACTGTA 120TTGATTATTT CTGGCAACAT CATTGTGATT TTTGTATTTC ACTGTGCACC TTTGTTGAAC 180CATCACACTA CAAGTTATTT TATCCAGACT ATGGCATATG CTGACCTTTT TGTTGGGGTG 240AGCTGCGTGG TCCCTTCTTT ATCACTCCTC CATCACCCCC TTCCAGTAGA GGAGTCCTTG 300ACTTGCCAGA TATTTGGTTT TGTAGTATCA GTTCTGAAGA GCGTCTCCAT GGCTTCTCTG 360GCCTGTATCA GCATTGATAG ATACATTGCC ATTACTAAAC CTTTAACCTA TAATACTCTG 420GTTACACCCT GGAGACTACG CCTGTGTATT TTCCTGATTT GGCTATACTC GACCCTGGTC 480TTCCTGCCTT CCTTTTTCCA CTGGGGCAAA CCTGGATATC ATGGAGATGT GTTTCAGTGG 540TGTGCGGAGT CCTGGCACAC CGACTCCTAC TTCACCCTGT TCATCGTGAT GATGTTATAT 600GCCCCAGCAG CCCTTATTGT CTGCTTCACC TATTTCAACA TCTTCCGCAT CTGCCAACAG 660CACACAAAGG ATATCAGCGA AAGGCAAGCC CGCTTCAGCA GCCAGAGTGG GGAGACTGGG 720GAAGTGCAGG CCTGTCCTGA TAAGCGCTAT GCCATGGTCC TGTTTCGAAT CACTAGTGTA 780TTTTACATCC TCTGGTTGCC ATATATCATC TACTTCTTGT TGGAAAGCTC CACTGGCCAC 840AGCAACCGCT TCGCATCCTT CTTGACCACC TGGCTTGCTA TTAGTAACAG TTTCTGCAAC 900TGTGTAATTT ATAGTCTCTC CAACAGTGTA TTCCAAAGAG GACTAAAGCG CCTCTCAGGG 960GCTATGTGTA CTTCTTGTGC AAGTCAGACT ACAGCCAACG ACCCTTACAC AGTTAGAAGC 1020AAAGGCCCTC TTAATGGATG TCATATCTGA 1050(45)SEQ ID NO44的資料(i)序列特征(A)長度349個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO44的序列描述Met Asn Ser Thr Leu Asp Gly Asn Gln Ser Ser His Pro Phe Cys Leu1 5 10 15Leu Ala Phe Gly Tyr Leu Glu Thr Val Asn Phe Cys Leu Leu Glu Val20 25 30Leu Ile Ile Val Phe Leu Thr Val Leu Ile Ile Ser Gly Asn Ile Ile35 40 45Val Ile Phe Val Phe His Cys Ala Pro Leu Leu Asn His His Thr Thr
50 55 60Ser Tyr Phe Ile Gln Thr Met Ala Tyr Ala Asp Leu Phe Val Gly Val65 70 75 80Ser Cys Val Val Pro Ser Leu Ser Leu Leu His His Pro Leu Pro Val85 90 95Glu G1u Ser Leu Thr Cys Gln Ile Phe Gly Phe Val Val Ser Val Leu100 105 110Lys Ser Val Ser Met Ala Ser Leu Ala Cys Ile Ser Ile Asp Arg Tyr115 120 125Ile Ala Ile Thr Lys Pro Leu Thr Tyr Asn Thr Leu Val Thr Pro Trp130 135 140Arg Leu Arg Leu Cys Ile Phe Leu Ile Trp Leu Tyr Ser Thr Leu Val145 150 155 160Phe Leu Pro Ser Phe Phe His Trp Gly Lys Pro Gly Tyr His Gly Asp165 170 175Val Phe Gln Trp Cys Ala Glu Ser Trp His Thr Asp Ser Tyr Phe Thr180 185 190Leu Phe Ile Val Met Met Leu Tyr Ala Pro Ala Ala Leu Ile Val Cys195 200 205Phe Thr Tyr Phe Asn Ile Phe Arg Ile Cys Gln Gln His Thr Lys Asp210 215 220Ile Ser Glu Arg Gln Ala Arg Phe Ser Ser Gln Ser Gly Glu Thr Gly225 230 235 240Glu Val Gln Ala Cys Pro Asp Lys Arg Tyr Ala Met Val Leu Phe Arg245 250 255Ile Thr Ser Val Phe Tyr Ile Leu Trp Leu Pro Tyr Ile Ile Tyr Phe260 265 270Leu Leu Glu Ser Ser Thr Gly His Ser Asn Arg Phe Ala Ser Phe Leu275 280 285Thr Thr Trp Leu Ala Ile Ser Asn Ser Phe Cys Asn Cys Val Ile Tyr290 295 300Ser Leu Ser Asn Ser Val Phe Gln Arg Gly Leu Lys Arg Leu Ser Gly305 310 315 320Ala Met Cys Thr Ser Cys Ala Ser Gln Thr Thr Ala Asn Asp Pro Tyr
325 330 335Thr Val Arg Ser Lys Gly Pro Leu Asn Gly Cys His Ile345(46)SEQ ID NO45的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO45的序列描述TCCCCCGGGA AAAAAACCAA CTGCTCCAAA 30(47)SEQ ID NO46的資料(i)序列特征(A)長度31個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO46的序列描述TAGGATCCAT TTGAATGTGG ATTTGGTGAA A 31(48)SEQ ID NO47的資料(i)序列特征(A)長度1302個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO47的序列描述WCHTRVHGNR AWCANMWRWM MWDNMAHMGS BDRVWNNYMK DDWBHHRTAN GKGBBKDWMH 60ATMWNWWNNN WMHASRTHHT MSNWRMANRG ARTMSNWRMA NRGARABACK SATGTGTTTT 120TCTCCCATTC TGGAAATCAA CATGCAGTCT GAATCTAACA TTACAGTGCG AGATGACATT 180GATGACATCA ACACCAATAT GTACCAACCA CTATCATATC CGTTAAGCTT TCAAGTGTCT 240CTCACCGGAT TTCTTATGTT AGAAATTGTG TTGGGACTTG GCAGCAACCT CACTGTATTG 300GTACTTTACT GCATGAAATC CAACTTAATC AACTCTGTCA GTAACATTAT TACAATGAAT 360CTTCATGTAC TTGATGTAAT AATTTGTGTG GGATGTATTC CTCTAACTAT AGTTATCCTT 420CTGCTTTCAC TGGAGAGTAA CACTGCTCTC ATTTGCTGTT TCCATGAGGC TTGTGTATCT 480TTTGCAAGTG TCTCAACAGC AATCAACGTT TTTGCTATCA CTTTGGACAG ATATGACATC 540TCTGTAAAAC CTGCAAACCG AATTCTGACA ATGGGCAGAG CTGTAATGTT AATGATATCC 600ATTTGGATTT TTTCTTTTTT CTCTTTCCTG ATTCCTTTTA TTGAGGTAAA TTTTTTCAGT 660CTTCAAAGTG GAAATACCTG GGAAAACAAG ACACTTTTAT GTGTCAGTAC AAATGAATAC 720TACACTGAAC TGGGAATGTA TTATCACCTG TTAGTACAGA TCCCAATATT CTTTTTCACT 780GTTGTAGTAA TGTTAATCAC ATACACCAAA ATACTTCAGG CTCTTAATAT TCGAATAGGC 840ACAAGATTTT CAACAGGGCA GAAGAAGAAA GCAAGAAAGA AAAAGACAAT TTCTCTAACC 900ACACAACATG AGGCTACAGA CATGTCACAA AGCAGTGGTG GGAGAAATGT AGTCTTTGGT 960GTAAGAACTT CAGTTTCTGT AATAATTGCC CTCCGGCGAG CTGTGAAACG ACACCGTGAA 1020CGACGAGAAA GACAAAAGAG AGTCTTCAGG ATGTCTTTAT TGATTATTTC TACATTTCTT 1080CTCTGCTGGA CACCAATTTC TGTTTTAAAT ACCACCATTT TATGTTTAGG CCCAAGTGAC 1140CTTTTAGTAA AATTAAGATT GTGTTTTTTA GTCATGGCTT ATGGAACAAC TATATTTCAC 1200CCTCTATTAT ATGCATTCAC TAGACAAAAA TTTCAAAAGG TCTTGAAAAG TAAAATGAAA 1260AAGCGAGTTG TTTCTATAGT AGAAGCTGAT CCCCTGCCTA ATAATGCTGT AATACACAAC 1320TCTTGGATAG ATCCCAAAAG AAACAAAAAA ATTACCTTTG AAGATAGTGA AATAAGAGAA 1380AAACGTTTAG TGCCTCAGGT TGTCACAGAC TAG 1413(49)SEQ ID NO48的資料(i)序列特征(A)長度433個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO48的序列描述Met Cys Phe Ser Pro Ile Leu Glu Ile Asn Met Gln Ser Glu Ser Asn1 5 10 15Ile Thr Val Arg Asp Asp Ile Asp Asp Ile Asn Thr Asn Met Tyr Gln20 25 30Pro Leu Ser Tyr Pro Leu Ser Phe Gln Val Ser Leu Thr Gly Phe Leu35 40 45Met Leu Glu Ile Val Leu Gly Leu Gly Ser Asn Leu Thr Val Leu Val50 55 60Leu Tyr Cys Met Lys Ser Asn Leu Ile Asn Ser Val Ser Asn Ile Ile65 70 75 80Thr Met Asn Leu His Val Leu Asp Val Ile Ile Cys Val Gly Cys Ile85 90 95Pro Leu Thr Ile Val Ile Leu Leu Leu Ser Leu Glu Ser Asn Thr Ala100 105 110Leu Ile Cys Cys Phe His Glu Ala Cys Val Ser Phe Ala Ser Val Ser115 120 125Thr Ala Ile Asn Val Phe Ala Ile Thr Leu Asp Arg Tyr Asp Ile Ser130 135 140Val Lys Pro Ala Asn Arg Ile Leu Thr Met Gly Arg Ala Val Met Leu145 150 155 160Met Ile Ser Ile Trp Ile Phe Ser Phe Phe Ser Phe Leu Ile Pro Phe165 170 175Ile Glu Val Asn Phe Phe Ser Leu Gln Ser Gly Asn Thr Trp Glu Asn180 185 190Lys Thr Leu Leu Cys Val Ser Thr Asn Glu Tyr Tyr Thr Glu Leu Gly195 200 205Met Tyr Tyr His Leu Leu Val Gln Ile Pro Ile Phe Phe Phe Thr Val210 215 220Val Val Met Leu Ile Thr Tyr Thr Lys Ile Leu Gln Ala Leu Asn Ile225 230 235 240Arg Ile Gly Thr Arg Phe Ser Thr Gly Gln Lys Lys Lys Ala Arg Lys245 250 255Lys Lys Thr Ile Ser Leu Thr Thr Gln His Glu Ala Thr Asp Met Ser260 265 270Gln Ser Ser Gly Gly Arg Asn Val Val Phe Gly Val Arg Thr Ser Val275 280 285Ser Val Ile Ile Ala Leu Arg Arg Ala Val Lys Arg His Arg Glu Arg290 295 300Arg Glu Arg Gln Lys Arg Val Phe Arg Met Ser Leu Leu Ile Ile Ser305 310 315 320Thr Phe Leu Leu Cys Trp Thr Pro Ile Ser Val Leu Asn Thr Thr Ile325 330 335Leu Cys Leu Gly Pro Ser Asp Leu Leu Val Lys Leu Arg Leu Cys Phe340 345 350Leu Val Met Ala Tyr Gly Thr Thr Ile Phe His Pro Leu Leu Tyr Ala355 360 365Phe Thr Arg Gln Lys Phe Gln Lys Val Leu Lys Ser Lys Met Lys Lys370 375 380Arg Val Val Ser Ile Val Glu Ala Asp Pro Leu Pro Asn Asn Ala Val385 390 395 400Ile His Asn Ser Trp Ile Asp Pro Lys Arg Asn Lys Lys Ile Thr Phe405 410 415Glu Asp Ser Glu Ile Arg Glu Lys Arg Leu Val Pro Gln Val Val Thr420 425 430Asp(50)SEQ ID NO49的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO49的序列描述GTGAAGCTTG CCTCTGGTGC CTGCAGGAGG 30(51)SEQ ID NO50的資料(i)序列特征(A)長度31個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO50的序列描述GCAGAATTCC CGGTGGCGTG TTGTGGTGCC C 31(52)SEQ ID NO51的資料(i)序列特征(A)長度1209個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO51的序列描述ATGTTGTGTC CTTCCAAGAC AGATGGCTCA GGGCACTCTG GTAGGATTCA CCAGGAAACT 60CATGGAGAAG GGAAAAGGGA CAAGATTAGC AACAGTGAAG GGAGGGAGAA TGGTGGGAGA 120GGATTCCAGA TGAACGGTGG GTCGCTGGAG GCTGAGCATG CCAGCAGGAT GTCAGTTCTC 180AGAGCAAAGC CCATGTCAAA CAGCCAACGC TTGCTCCTTC TGTCCCCAGG ATCACCTCCT 240CGCACGGGGA GCATCTCCTA CATCAACATC ATCATGCCTT CGGTGTTCGG CACCATCTGC 300CTCCTGGGCA TCATCGGGAA CTCCACGGTC ATCTTCGCGG TCGTGAAGAA GTCCAAGCTG 360CACTGGTGCA ACAACGTCCC CGACATCTTC ATCATCAACC TCTCGGTAGT AGATCTCCTC 420TTTCTCCTGG GCATGCCCTT CATGATCCAC CAGCTCATGG GCAATGGGGT GTGGCACTTT 480GGGGAGACCA TGTGCACCCT CATCACGGCC ATGGATGCCA ATAGTCAGTT CACCAGCACC 540TACATCCTGA CCGCCATGGC CATTGACCGC TACCTGGCCA CTGTCCACCC CATCTCTTCC 600ACGAAGTTCC GGAAGCCCTC TGTGGCCACC CTGGTGATCT GCCTCCTGTG GGCCCTCTCC 660TTCATCAGCA TCACCCCTGT GTGGCTGTAT GCCAGACTCA TCCCCTTCCC AGGAGGTGCA 720GTGGGCTGCG GCATACGCCT GCCCAACCCA GACACTGACC TCTACTGGTT CACCCTGTAC 780CAGTTTTTCC TGGCCTTTGC CCTGCCTTTT GTGGTCATCA CAGCCGCATA CGTGAGGATC 840CTGCAGCGCA TGACGTCCTC AGTGGCCCCC GCCTCCCAGC GCAGCATCCG GCTGCGGACA 900AAGAGGGTGA CCCGCACAGC CATCGCCATC TGTCTGGTCT TCTTTGTGTG CTGGGCACCC 960TACTATGTGC TACAGCTGAC CCAGTTGTCC ATCAGCCGCC CGACCCTCAC CTTTGTCTAC 1020TTATACAATG CGGCCATCAG CTTGGGCTAT GCCAACAGCT GCCTCAACCC CTTTGTGTAC 1080ATCGTGCTCT GTGAGACGTT CCGCAAACGC TTGGTCCTGT CGGTGAAGCC TGCAGCCCAG 1140GGGCAGCTTC GCGCTGTCAG CAACGCTCAG ACGGCTGACG AGGAGAGGAC AGAAAGCAAA 1200GGCACCTGA 1209(53)SEQ ID NO52的資料(i)序列特征(A)長度402個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO52的序列描述Met Leu Cys Pro Ser Lys Thr Asp Gly Ser Gly His Ser Gly Arg Ile1 5 10 15His Gln Glu Thr His Gly Glu Gly Lys Arg Asp Lys Ile Ser Asn Ser20 25 30Glu Gly Arg Glu Asn Gly Gly Arg Gly Phe Gln Met Asn Gly Gly Ser35 40 45Leu Glu Ala Glu His Ala Ser Arg Met Ser Val Leu Arg Ala Lys Pro50 55 60Met Ser Asn Ser Gln Arg Leu Leu Leu Leu Ser Pro Gly Ser Pro Pro65 70 75 80Arg Thr Gly Ser Ile Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe85 90 95Gly Thr Ile Cys Leu Leu Gly Ile Ile Gly Asn Ser Thr Val Ile Phe100 105 110Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp115 120 125Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly130 135 140Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe145 150 155 160Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln165 170 175Phe Thr Ser Thr Tyr Ile Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu180 185 190Ala Thr Val His Pro lle Ser Ser Thr Lys Phe Arg Lys Pro Ser Val195 200 205Ala Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser Ile210 215 220Thr Pro Val Trp Leu Tyr Ala Arg Leu Ile Pro Phe Pro Gly Gly Ala225 230 235 240Val Gly Cys Gly Ile Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp245 250 255Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val260 265 270Ile Thr Ala Ala Tyr Val Arg Ile Leu Gln Arg Met Thr Ser Ser Val275 280 285Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Thr290 295 300Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro305 310 315 320Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu325 330 335Thr Phe Val Tyr Leu Tyr Asn Ala Ala Ile Ser Leu Gly Tyr Ala Asn340 345 350Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg355 360 365Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg370 375 380Ala Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys385 390 395 400Gly Thr(54)SEQ ID NO53的資料(i)序列特征(A)長度27個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO53的序列描述GGCGGATCCA TGGATGTGAC TTCCCAA27(55)SEQ ID NO54的資料(i)序列特征(A)長度27個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO54的序列描述GGCGGATCCC TACACGGCAC TGCTGAA 27(56)SEQ ID NO55的資料(i)序列特征(A)長度1128個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO55的序列描述ATGGATGTGA CTTCCCAAGC CCGGGGCGTG GGCCTGGAGA TGTACCCAGG CACCGCGCAC 60GCTGCGGCCC CCAACACCAC CTCCCCCGAG CTCAACCTGT CCCACCCGCT CCTGGGCACC 120GCCCTGGCCA ATGGGACAGG TGAGCTCTCG GAGCACCAGC AGTACGTGAT CGGCCTGTTC 180CTCTCGTGCC TCTACACCAT CTTCCTCTTC CCCATCGGCT TTGTGGGCAA CATCCTGATC 240CTGGTGGTGA ACATCAGCTT CCGCGAGAAG ATGACCATCC CCGACCTGTA CTTCATCAAC 300CTGGCGGTGG CGGACCTCAT CCTGGTGGCC GACTCCCTCA TTGAGGTGTT CAACCTGCAC 360GAGCGGTACT ACGACATCGC CGTCCTGTGC ACCTTCATGT CGCTCTTCCT GCAGGTCAAC 420ATGTACAGCA GCGTCTTCTT CCTCACCTGG ATGAGCTTCG ACCGCTACAT CGCCCTGGCC 480AGGGCCATGC GCTGCAGCCT GTTCCGCACC AAGCACCACG CCCGGCTGAG CTGTGGCCTC 540ATCTGGATGG CATCCGTGTC AGCCACGCTG GTGCCCTTCA CCGCCGTGCA CCTGCAGCAC 600ACCGACGAGG CCTGCTTCTG TTTCGCGGAT GTCCGGGAGG TGCAGTGGCT CGAGGTCACG 660CTGGGCTTCA TCGTGCCCTT CGCCATCATC GGCCTGTGCT ACTCCCTCAT TGTCCGGGTG 720CTGGTCAGGG CGCACCGGCA CCGTGGGCTG CGGCCCCGGC GGCAGAAGGC GCTCCGCATG 780ATCCTCGCGG TGGTGCTGGT CTTCTTCGTC TGCTGGCTGC CGGAGAACGT CTTCATCAGC 840GTGCACCTCC TGCAGCGGAC GCAGCCTGGG GCCGCTCCCT GCAAGCAGTC TTTCCGCCAT 900GCCCACCCCC TCACGGGCCA CATTGTCAAC CTCGCCGCCT TCTCCAACAG CTGCCTAAAC 960CCCCTCATCT ACAGCTTTCT CGGGGAGACC TTCAGGGACA AGCTGAGGCT GTACATTGAG 1020CAGAAAACAA ATTTGCCGGC CCTGAACCGC TTCTGTCACG CTGCCCTGAA GGCCGTCATT 1080CCAGACAGCA CCGAGCAGTC GGATGTGAGG TTCAGCAGTG CCGTGTGA 1128(57)SEQ ID NO56的資料(i)序列特征(A)長度375個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO56的序列描述Met Asp Val Thr Ser Gln Ala Arg Gly Val Gly Leu Glu Met Tyr Pro1 5 10 15Gly Thr Ala His Ala Ala Ala Pro Asn Thr Thr Ser Pro Glu Leu Asn20 25 30Leu Ser His Pro Leu Leu Gly Thr Ala Leu Ala Asn Gly Thr Gly Glu35 40 45Leu Ser Glu His Gln Glr Tyr Val Ile Gly Leu Phe Leu Ser Cys Leu50 55 60Tyr Thr Ile Phe Leu Phe Pro Ile Gly Phe Val Gly Asn Ile Leu Ile65 70 75 80Leu Val Val Asn Ile Ser Phe Arg Glu Lys Met Thr Ile Pro Asp Leu85 90 95Tyr Phe Ile Asn Leu Ala Val Ala Asp Leu Ile Leu Val Ala Asp Ser100 105 110Leu Ile Glu Val Phe Asn Leu His Glu Arg Tyr Tyr Asp Ile Ala Val115 120 125Leu Cys Thr Phe Met Ser Leu Phe Leu Gln Val Asn Met Tyr Ser Ser130 135 140Val Phe Phe Leu Thr Trp Met Ser Phe Asp Arg Tyr Ile Ala Leu Ala145 150 155 160Arg Ala Met Arg Cys Ser Leu Phe Arg Thr Lys His His Ala Arg Leu165 170 175Ser Cys Gly Leu Ile Trp Met Ala Ser Val Ser Ala Thr Leu Val Pro180 185 190Phe Thr Ala Val His Leu Gln His Thr Asp Glu Ala Cys Phe Cys Phe195 200 205Ala Asp Val Arg Glu Val Gln Trp Leu Glu Val Thr Leu Gly Phe Ile210 215 220Val Pro Phe Ala Ile Ile Gly Leu Cys Tyr Ser Leu Ile Val Arg Val225 230 235 240Leu Val Arg Ala His Arg His Arg Gly Leu Arg Pro Arg Arg Gln Lys245 250 255Ala Leu Arg Met Ile Leu Ala Val Val Leu Val Phe Phe Val Cys Trp260 265 270Leu Pro Glu Asn Val Phe Ile Ser Val His Leu Leu Gln Arg Thr Gln275 280 285Pro Gly Ala Ala Pro Cys Lys Gln Ser Phe Arg His Ala His Pro Leu290 295 300Thr Gly His Ile Val Asn Leu Ala Ala Phe Ser Asn Ser Cys Leu Asn305 310 315 320Pro Leu Ile Tyr Ser Phe Leu Gly Glu Thr Phe Arg Asp Lys Leu Arg325 330 335Leu Tyr Ile Glu Gln Lys Thr Asn Leu Pro Ala Leu Asn Arg Phe Cys340 345 350His Ala Ala Leu Lys Ala Val Ile Pro Asp Ser Thr Glu Gln Ser Asp355 360 365Val Arg Phe Ser Ser Ala Val370 375(58)SEQ ID NO57的資料(i)序列特征(A)長度31個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO57的序列描述AAGGAATTCA CGGCCGGGTG ATGCCATTCC C31(59)SEQ ID NO58的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO58的序列描述GGTGGATCCA TAAACACGGG CGTTGAGGAC 30(60)SEQ ID NO59的資料(i)序列特征(A)長度960個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO59的序列描述atgccattcc caaactgctc agcccccagc actgtggtgg ccacagctgt gggtgtcttg 60ctggggctgg agtgtgggct gggtctgctg ggcaacgcgg tggcgctgtg gaccttcctg 120ttccgggtca gggtgtggaa gccgtacgct gtctacctgc tcaacctggc cctggctgac 180ctgctgttgg ctgcgtgcct gcctttcctg gccgccttct acctgagcct ccaggcttgg 240catctgggcc gtgtgggctg ctgggccctg cgcttcctgc tggacctcag ccgcagcgtg 300gggatggcct tcctggccgc cgtggctttg gaccggtacc tccgtgtggt ccaccctcgg 360cttaaggtca acctgctgtc tcctcaggcg gccctggggg tctcgggcct cgtctggctc 420ctgatggtcg ccctcacctg cccgggcttg ctcatctctg aggccgccca gaactccacc 480aggtgccaca gtttctactc cagggcagac ggctccttca gcatcatctg gcaggaagca 540ctctcctgcc ttcagtttgt cctccccttt ggcctcatcg tgttctgcaa tgcaggcatc 600atcagggctc tccagaaaag actccgggag cctgagaaac agcccaagct tcagcgggcc 660caggcactgg tcaccttggt ggtggtgctg tttgctctgt gctttctgcc ctgcttcctg 720gccagagtcc tgatgcacat cttccagaat ctggggagct gcagggccct ttgtgcagtg 780gctcatacct cggatgtcac gggcagcctc acctacctgc acagtgtcgt caaccccgtg 840gtatactgct tctccagccc caccttcagg agctcctatc ggagggtctt ccacaccctc 900cgaggcaaag ggcaggcagc agagccccca gatttcaacc ccagagactc ctattcctga 960(61)SEQ ID NO60的資料(i)序列特征(A)長度319個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO60的序列描述Met Pro Phe Pro Asn Cys Ser Ala Pro Ser Thr Val Val Ala Thr Ala1 5 10 15Val Gly Val Leu Leu Gly Leu Glu Cys Gly Leu Gly Leu Leu Gly Asn20 25 30Ala Val Ala Leu Trp Thr Phe Leu Phe Arg Val Arg Val Trp Lys Pro35 40 45Tyr Ala Val Tyr Leu Leu Asn Leu Ala Leu Ala Asp Leu Leu Leu Ala50 55 60Ala Cys Leu Pro Phe Leu Ala Ala Phe Tyr Leu Ser Leu Gln Ala Trp65 70 75 80HIs Leu Gly Arg Val Gly Cys Trp Ala Leu Arg Phe Leu Leu Asp Leu85 90 95Ser Arg Ser Val Gly Met Ala Phe Leu Ala Ala Val Ala Leu Asp Arg100 105 110Tyr Leu Arg Val Val His Pro Arg Leu Lys Val Asn Leu Leu Ser Pro115 120 125Gln Ala Ala Leu Gly Val Ser Gly Leu Val Trp Leu Leu Met Val Ala130 135 140Leu Thr Cys Pro Gly Leu Leu Ile Ser Glu Ala Ala Gln Asn Ser Thr145 150 155 160Arg Cys His Ser Phe Tyr Ser Arg Ala Asp Gly Ser Phe Ser Ile Ile165 170 175Trp Gln Glu Ala Leu Ser Cys Leu Gln Phe Val Leu Pro Phe Gly Leu180 185 190Ile Val Phe Cys Asn Ala Gly Ile Ile Arg Ala Leu Gln Lys Arg Leu195 200 205Arg Glu Pro Glu Lys Gln Pro Lys Leu Gln Arg Ala Gln Ala Leu Val210 215 220Thr Leu Val Val Val Leu Phe Ala Leu Cys Phe Leu Pro Cys Phe Leu225 230 235 240Ala Arg Val Leu Met His Ile Phe Gln Asn Leu Gly Ser Cys Arg Ala245 250 255Leu Cys Ala Val Ala His Thr Ser Asp Val Thr Gly Ser Leu Thr Tyr260 265 270Leu His Ser Val Val Asn Pro Val Val Tyr Cys Phe Ser Ser Pro Thr275 280 285Phe Arg Ser Ser Tyr Arg Arg Val Phe His Thr Leu Arg Gly Lys Gly290 295 300Gln Ala Ala Glu Pro Pro Asp Phe Asn Pro Arg Asp Ser Tyr Ser305 310 315(62)SEQ ID NO61的資料(i)序列特征(A)長度1143個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO61的序列描述ATGGAGGAAG GTGGTGATTT TGACAACTAC TATGGGGCAG ACAACCAGTC TGAGTGTGAG 60TACACAGACT GGAAATCCTC GGGGGCCCTC ATCCCTGCCA TCTACATGTT GGTCTTCCTC 120CTGGGCACCA CGGGAAACGG TCTGGTGCTC TGGACCGTGT TTCGGAGCAG CCGGGAGAAG 180AGGCGCTCAG CTGATATCTT CATTGCTAGC CTGGCGGTGG CTGACCTGAC CTTCGTGGTG 240ACGCTGCCCC TGTGGGCTAC CTACACGTAC CGGGACTATG ACTGGCCCTT TGGGACCTTC 300TTCTGCAAGC TCAGCAGCTA CCTCATCTTC GTCAACATGT ACGCCAGCGT CTTCTGCCTC 360ACCGGCCTCA GCTTCGACCG CTACCTGGCC ATCGTGAGGC CAGTGGCCAA TGCTCGGCTG 420AGGCTGCGGG TCAGCGGGGC CGTGGCCACG GCAGTTCTTT GGGTGCTGGC CGCCCTCCTG 480GCCATGCCTG TCATGGTGTT ACGCACCACC GGGGACTTGG AGAACACCAC TAAGGTGCAG 540TGCTACATGG ACTACTCCAT GGTGGCCACT GTGAGCTCAG AGTGGGCCTG GGAGGTGGGC 600CTTGGGGTCT CGTCCACCAC CGTGGGCTTT GTGGTGCCCT TCACCATCAT GCTGACCTGT 660TACTTCTTCA TCGCCCAAAC CATCGCTGGC CACTTCCGCA AGGAACGCAT CGAGGGCCTG 720CGGAAGCGGC GCCGGCTGCT CAGCATCATC GTGGTGCTGG TGGTGACCTT TGCCCTGTGC 780TGGATGCCCT ACCACCTGGT GAAGACGCTG TACATGCTGG GCAGCCTGCT GCACTGGCCC 840TGTGACTTTG ACCTCTTCCT CATGAACATC TTCCCCTACT GCACCTGCAT CAGCTACGTC 900AACAGCTGCC TCAACCCCTT CCTCTATGCC TTTTTCGACC CCCGCTTCCG CCAGGCCTGC 960ACCTCCATGC TCTGCTGTGG CCAGAGCAGG TGCGCAGGCA CCTCCCACAG CAGCAGTGGG 1020GAGAAGTCAG CCAGCTACTC TTCGGGGCAC AGCCAGGGGC CCGGCCCCAA CATGGGCAAG 1080GGTGGAGAAC AGATGCACGA GAAATCCATC CCCTACAGCC AGGAGACCCT TGTGGTTGAC 1140TAG 1143(63)SEQ ID NO62的資料(i)序列特征(A)長度380個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO62的序列描述Met Glu Glu Gly Gly Asp Phe Asp Asn Tyr Tyr Gly Ala Asp Asn Gln1 5 10 15Ser Glu Cys Glu Tyr Thr Asp Trp Lys Ser Ser Gly Ala Leu Ile Pro20 25 30Ala Ile Tyr Met Leu Val Phe Leu Leu Gly Thr Thr Gly Asn Gly Leu35 40 45Val Leu Trp Thr Val Phe Arg Ser Ser Arg Glu Lys Arg Arg Ser Ala50 55 60Asp Ile Phe Ile Ala Ser Leu Ala Val Ala Asp Leu Thr Phe Val Val65 70 75 80Thr Leu Pro Leu Trp Ala Thr Tyr Thr Tyr Arg Asp Tyr Asp Trp Pro85 90 95Phe Gly Thr Phe Phe Cys Lys Leu Ser Ser Tyr Leu Ile Phe Val Asn100 105 110Met Tyr Ala Ser Val Phe Cys Leu Thr Gly Leu Ser Phe Asp Arg Tyr115 120 125Leu Ala Ile Val Arg Pro Val Ala Asn Ala Arg Leu Arg Leu Arg Val130 135 140Ser Gly Ala Val Ala Thr Ala Val Leu Trp Val Leu Ala Ala Leu Leu145 150 155 160Ala Met Pro Val Met Val Leu Arg Thr Thr Gly Asp Leu Glu Asn Thr165 170 175Thr Lys Val Gln Cys Tyr Met Asp Tyr Ser Met Val Ala Thr Val Ser180 185 190Ser Glu Trp Ala Trp Glu Val Gly Leu Gly Val Ser Ser Thr Thr Val195 200 205Gly Phe Val Val Pro Phe Thr Ile Met Leu Thr Cys Tyr Phe Phe Ile210 215 220Ala Gln Thr Ile Ala Gly His Phe Arg Lys Glu Arg Ile Glu Gly Leu225 230 235 240Arg Lys Arg Arg Arg Leu Leu Ser Ile Ile Val Val Leu Val Val Thr245 250 255Phe Ala Leu Cys Trp Met Pro Tyr His Leu Val Lys Thr Leu Tyr Met260 265 270Leu Gly Ser Leu Leu His Trp Pro Cys Asp Phe Asp Leu Phe Leu Met275 280 285Asn Ile Phe Pro Tyr Cys Thr Cys Ile Ser Tyr Val Asn Ser Cys Leu290 295 300Asn Pro Phe Leu Tyr Ala Phe Phe Asp Pro Arg Phe Arg Gln Ala Cys305 310 315 320Thr Ser Met Leu Cys Cys Gly Gln Ser Arg Cys Ala Gly Thr Ser His325 330 335Ser Ser Ser Gly Glu Lys Ser Ala Ser Tyr Ser Ser Gly His Ser Gln340 345 350Gly Pro Gly Pro Asn Met Gly Lys Gly Gly Glu Gln Met His Glu Lys355 360 365Ser Ile Pro Tyr Ser Gln Glu Thr Leu Val Val Asp370 375 380(64)SEQ ID NO63的資料(i)序列特征(A)長度31個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO63的序列描述TGAGAATTCT GGTGACTCAC AGCCGGCACA G 31(65)SEQ ID NO64的資料(i)序列特征(A)長度31個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO64的序列描述GCCGGATCCA AGGAAAAGCA GCAATAAAAG G31(66)SEQ ID NO65的資料(i)序列特征(A)長度1119個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO65的序列描述ATGAACTACC CGCTAACGCT GGAAATGGAC CTCGAGAACC TGGAGGACCT GTTCTGGGAA 60CTGGACAGAT TGGACAACTA TAACGACACC TCCCTGGTGG AAAATCATCT CTGCCCTGCC 120ACAGAGGGTC CCCTCATGGC CTCCTTCAAG GCCGTGTTCG TGCCCGTGGC CTACAGCCTC 180ATCTTCCTCC TGGGCGTGAT CGGCAACGTC CTGGTGCTGG TGATCCTGGA GCGGCACCGG 240CAGACACGCA GTTCCACGGA GACCTTCCTG TTCCACCTGG CCGTGGCCGA CCTCCTGCTG 300GTCTTCATCT TGCCCTTTGC CGTGGCCGAG GGCTCTGTGG GCTGGGTCCT GGGGACCTTC 360CTCTGCAAAA CTGTGATTGC CCTGCACAAA GTCAACTTCT ACTGCAGCAG CCTGCTCCTG 420GCCTGCATCG CCGTGGACCG CTACCTGGCC ATTGTCCACG CCGTCCATGC CTACCGCCAC 480CGCCGCCTCC TCTCCATCCA CATCACCTGT GGGACCATCT GGCTGGTGGG CTTCCTCCTT 540GCCTTGCCAG AGATTCTCTT CGCCAAAGTC AGCCAAGGCC ATCACAACAA CTCCCTGCCA 600CGTTGCACCT TCTCCCAAGA GAACCAAGCA GAAACGCATG CCTGGTTCAC CTCCCGATTC 660CTCTACCATG TGGCGGGATT CCTGCTGCCC ATGCTGGTGA TGGGCTGGTG CTACGTGGGG 720GTAGTGCACA GGTTGCGCCA GGCCCAGCGG CGCCCTCAGC GGCAGAAGGC AGTCAGGGTG 780GCCATCCTGG TGACAAGCAT CTTCTTCCTC TGCTGGTCAC CCTACCACAT CGTCATCTTC 840CTGGACACCC TGGCGAGGCT GAAGGCCGTG GACAATACCT GCAAGCTGAA TGGCTCTCTC 900CCCGTGGCCA TCACCATGTG TGAGTTCCTG GGCCTGGCCC ACTGCTGCCT CAACCCCATG 960CTCTACACTT TCGCCGGCGT GAAGTTCCGC AGTGACCTGT CGCGGCTCCT GACCAAGCTG 1020GGCTGTACCG GCCCTGCCTC CCTGTGCCAG CTCTTCCCTA GCTGGCGCAG GAGCAGTCTC 1080TCTGAGTCAG AGAATGCCAC CTCTCTCACC ACGTTCTAG1119(67)SEQ ID NO66的資料(i)序列特征(A)長度372個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO66的序列描述Met Asn Tyr Pro Leu Thr Leu Glu Met Asp Leu Glu Asn Leu Glu Asp1 5 10 15Leu Phe Trp Glu Leu Asp Arg Leu Asp Asn Tyr Asn Asp Thr Ser Leu20 25 30Val Glu Asn His Leu Cys Pro Ala Thr Glu Gly Pro Leu Met Ala Ser35 40 45Phe Lys Ala Val Phe Val Pro Val Ala Tyr Ser Leu Ile Phe Leu Leu50 55 60Gly Val Ile Gly Asn Val Leu Val Leu Val Ile Leu Glu Arg His Arg65 70 75 80Gln Thr Arg Ser Ser Thr Glu Thr Phe Leu Phe His Leu Ala Val Ala85 90 95Asp Leu Leu Leu Val Phe Ile Leu Pro Phe Ala Val Ala Glu Gly Ser100 105 110Val Gly Trp Val Leu Gly Thr Phe Leu Cys Lys Thr Val Ile Ala Leu115 120 125His Lys Val Asn Phe Tyr Cys Ser Ser Leu Leu Leu Ala Cys Ile Ala130 135 140Val Asp Arg Tyr Leu Ala Ile Val His Ala Val His Ala Tyr Arg His145 150 155 160Arg Arg Leu Leu Ser Ile His Ile Thr Cys Gly Thr Ile Trp Leu Val165 170 175Gly Phe Leu Leu Ala Leu Pro Glu Ile Leu Phe Ala Lys Val Ser Gln180 185 190Gly His His Asn Asn Ser Leu Pro Arg Cys Thr Phe Ser Gln Glu Asn195 200 205Gln Ala Glu Thr His Ala Trp Phe Thr Ser Arg Phe Leu Tyr His Val
210 215 220Ala Gly Phe Leu Leu Pro Met Leu Val Met Gly Trp Cys Tyr Val Gly225 230 235 240Val Val His Arg Leu Arg Gln Ala Gln Arg Arg Pro Gln Arg Gln Lys245 250 255Ala Val Arg Val Ala Ile Leu Val Thr Ser Ile Phe Phe Leu Cys Trp260 265 270Ser Pro Tyr His Ile Val Ile Phe Leu Asp Thr Leu Ala Arg Leu Lys275 280 285Ala Val Asp Asn Thr Cys Lys Leu Asn Gly Ser Leu Pro Val Ala Ile290 295 300Thr Met Cys Glu Phe Leu Gly Leu Ala His Cys Cys Leu Asn Pro Met305 310 315 320Leu Tyr Thr Phe Ala Gly Val Lys Phe Arg Ser Asp Leu Ser Arg Leu325 330 335Leu Thr Lys Leu Gly Cys Thr Gly Pro Ala Ser Leu Cys Gln Leu Phe340 345 350Pro Ser Trp Arg Arg Ser Ser Leu Ser Glu Ser Glu Asn Ala Thr Ser355 360 365Leu Thr Thr Phe370(68)SEQ ID NO67的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO67的序列描述CAAAGCTTGA AAGCTGCACG GTGCAGAGAC 30(69)SEQ ID NO68的資料(i)序列特征(A)長度30個堿基對
(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組)(xi)SEQ ID NO68的序列描述GCGGATCCCG AGTCACACCC TGGCTGGGCC 30(70)SEQ ID NO69的資料(i)序列特征(A)長度1128個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO69的序列描述ATGGATGTGA CTTCCCAAGC CCGGGGCGTG GGCCTGGAGA TGTACCCAGG CACCGCGCAG 60CCTGCGGCCC CCAACACCAC CTCCCCCGAG CTCAACCTGT CCCACCCGCT CCTGGGCACC 120GCCCTGGCCA ATGGGACAGG TGAGCTCTCG GAGCACCAGC AGTACGTGAT CGGCCTGTTC 180CTCTCGTGCC TCTACACCAT CTTCCTCTTC CCCATCGGCT TTGTGGGCAA CATCCTGATC 240CTGGTGGTGA ACATCAGCTT CCGCGAGAAG ATGACCATCC CCGACCTGTA CTTCATCAAC 300CTGGCGGTGG CGGACCTCAT CCTGGTGGCC GACTCCCTCA TTGAGGTGTT CAACCTGCAC 360GAGCGGTACT ACGACATCGC CGTCCTGTGC ACCTTCATGT CGCTCTTCCT GCAGGTCAAC 420ATGTACAGCA GCGTCTTCTT CCTCACCTGG ATGAGCTTCG ACCGCTACAT CGCCCTGGCC 480AGGGCCATGC GCTGCAGCCT GTTCCGCACC AAGCACCACG CCCGGCTGAG CTGTGGCCTC 540ATCTGGATGG CATCCGTGTC AGCCACGCTG GTGCCCTTCA CCGCCGTGCA CCTGCAGCAC 600ACCGACGAGG CCTGCTTCTG TTTCGCGGAT GTCCGGGAGG TGCAGTGGCT CGAGGTCACG 660CTGGGCTTCA TCGTGCCCTT CGCCATCATC GGCCTGTGCT ACTCCCTCAT TGTCCGGGTG 720CTGGTCAGGG CGCACCGGCA CCGTGGGCTG CGGCCCCGGC GGCAGAAGGC GCTCCGCATG 780ATCCTCGCGG TGGTGCTGGT CTTCTTCGTC TGCTGGCTGC CGGAGAACGT CTTCATCAGC 840GTGCACCTCC TGCAGCGGAC GCAGCCTGGG GCCGCTCCCT GCAAGCAGTC TTTCCGCCAT 900GCCCACCCCC TCACGGGCCA CATTGTCAAC CTCACCGCCT TCTCCAACAG CTGCCTAAAC 960CCCCTCATCT ACAGCTTTCT CGGGGAGACC TTCAGGGACA AGCTGAGGCT GTACATTGAG 1020CAGAAAACAA ATTTGCCGGC CCTGAACCGC TTCTGTCACG CTGCCCTGAA GGCCGTCATT 1080CCAGACAGCA CCGAGCAGTC GGATGTGAGG TTCAGCAGTG CCGTGTAG 1128(71)SEQ ID NO70的資料(i)序列特征(A)長度375個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO70的序列描述Met Asp Val Thr Ser Gln Ala Arg Gly Val Gly Leu Glu Met Tyr Pro1 5 10 15Gly Thr Ala Gln Pro Ala Ala Pro Asn Thr Thr Ser Pro Glu Leu Asn20 25 30Leu Ser His Pro Leu Leu Gly Thr Ala Leu Ala Asn Gly Thr Gly Glu35 40 45Leu Ser Glu His Gln Gln Tyr Val Ile Gly Leu Phe Leu Ser Cys Leu50 55 60Tyr Thr Ile Phe Leu Phe Pro Ile Gly Phe Val Gly Asn Ile Leu Ile65 70 75 80Leu Val Val Asn Ile Ser Phe Arg Glu Lys Met Thr Ile Pro Asp Leu85 90 95Tyr Phe Ile Asn Leu Ala Val Ala Asp Leu Ile Leu Val Ala Asp Ser100 105 110Leu Ile Glu Val Phe Asn Leu His Glu Arg Tyr Tyr Asp Ile Ala Val115 120 125Leu Cys Thr Phe Met Ser Leu Phe Leu Gln Val Asn Met Tyr Ser Ser130 135 140Val Phe Phe Leu Thr Trp Met Ser Phe Asp Arg Tyr Ile Ala Leu Ala145 150 155 160Arg Ala Met Arg Cys Ser Leu Phe Arg Thr Lys His His Ala Arg Leu165 170 175Ser Cys Gly Leu Ile Trp Met Ala Ser Val Ser Ala Thr Leu Val Pro180 185 190Phe Thr Ala Val His Leu Gln His Thr Asp Glu Ala Cys Phe Cys Phe195 200 205Ala Asp Val Arg Glu Val Gln Trp Leu Glu Val Thr Leu Gly Phe Ile210 215 220Val Pro Phe Ala Ile Ile Gly Leu Cys Tyr Ser Leu Ile Val Arg Val225 230 235 240Leu Val Arg Ala His Arg His Arg Gly Leu Arg Pro Arg Arg Gln Lys245 250 255Ala Leu Arg Met Ile Leu Ala Val Val Leu Val Phe Phe Val Cys Trp
260 265 270Leu Pro Glu Asn Val Phe Ile Ser Val His Leu Leu Gln Arg Thr Gln275 280 285Pro Gly Ala Ala Pro Cys Lys Gln Ser Phe Arg His Ala His Pro Leu290 295 300Thr Gly His Ile Val Asn Leu Thr Ala Phe Ser Asn Ser Cys Leu Asn305 310 315 320Pro Leu Ile Tyr Ser Phe Leu Gly Glu Thr Phe Arg Asp Lys Leu Arg325 330 335Leu Tyr Ile Glu Gln Lys Thr Asn Leu Pro Ala Leu Asn Arg Phe Cys340 345 350His Ala Ala Leu Lys Ala Val Ile Pro Asp Ser Thr Glu Gln Ser Asp355 360 365Val Arg Phe Ser Ser Ala Val370 375(72)SEQ ID NO71的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO71的序列描述ACAGAATTCC TGTGTGGTTT TACCGCCCAG 30(73)SEQ ID NO72的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO72的序列描述CTCGGATCCA GGCAGAAGAG TCGCCTATGG 30(74)SEQ ID NO73的資料(i)序列特征(A)長度1137個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO73的序列描述ATGGACCTGG GGAAACCAAT GAAAAGCGTG CTGGTGGTGG CTCTCCTTGT CATTTTCCAG 60GTATGCCTGT GTCAAGATGA GGTCACGGAC GATTACATCG GAGACAACAC CACAGTGGAC 120TACACTTTGT TCGAGTCTTT GTGCTCCAAG AAGGACGTGC GGAACTTTAA AGCCTGGTTC 180CTCCCTATCA TGTACTCCAT CATTTGTTTC GTGGGCCTAC TGGGCAATGG GCTGGTCGTG 240TTGACCTATA TCTATTTCAA GAGGCTCAAG ACCATGACCG ATACCTACCT GCTCAACCTG 300GCGGTGGCAG ACATCCTCTT CCTCCTGACC CTTCCCTTCT GGGCCTACAG CGCGGCCAAG 360TCCTGGGTCT TCGGTGTCCA CTTTTGCAAG CTCATCTTTG CCATCTACAA GATGAGCTTC 420TTCAGTGGCA TGCTCCTACT TCTTTGCATC AGCATTGACC GCTACGTGGC CATCGTCCAG 480GCTGTCTCAG CTCACCGCCA CCGTGCCCGC GTCCTTCTCA TCAGCAAGCT GTCCTGTGTG 540GGCATCTGGA TACTAGCCAC AGTGCTCTCC ATCCCAGAGC TCCTGTACAG TGACCTCCAG 600AGGAGCAGCA GTGAGCAAGC GATGCGATGC TCTCTCATCA CAGAGCATGT GGAGGCCTTT 660ATCACCATCC AGGTGGCCCA GATGGTGATC GGCTTTCTGG TCCCCCTGCT GGCCATGAGC 720TTCTGTTACC TTGTCATCAT CCGCACCCTG CTCCAGGCAC GCAACTTTGA GCGCAACAAG 780GCCATCAAGG TGATCATCGC TGTGGTCGTG GTCTTCATAG TCTTCCAGCT GCCCTACAAT 840GGGGTGGTCC TGGCCCAGAC GGTGGCCAAC TTCAACATCA CCAGTAGCAC CTGTGAGCTC 900AGTAAGCAAC TCAACATCGC CTACGACGTC ACCTACAGCC TGGCCTGCGT CCGCTGCTGC 960GTCAACCCTT TCTTGTACGC CTTCATCGGC GTCAAGTTCC GCAACGATCT CTTCAAGCTC 1020TTCAAGGACC TGGGCTGCCT CAGCCAGGAG CAGCTCCGGC AGTGGTCTTC CTGTCGGCAC 1080ATCCGGCGCT CCTCCATGAG TGTGGAGGCC GAGACCACCA CCACCTTCTC CCCATAG1137(75)SEQ ID NO74的資料(i)序列特征(A)長度378個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO74的序列描述Met Asp Leu Gly Lys Pro Met Lys Ser Val Leu Val Val Ala Leu Leu1 5 10 15Val Ile Phe Gln Val Cys Leu Cys Gln Asp Glu Val Thr Asp Asp Tyr20 25 30Ile Gly Asp Asn Thr Thr Val Asp Tyr Thr Leu Phe Glu Ser Leu Cys
35 40 45Ser Lys Lys Asp Val Arg Asn Phe Lys Ala Trp Phe Leu Pro Ile Met50 55 60Tyr Ser Ile Ile Cys Phe Val Gly Leu Leu Gly Asn Gly Leu Val Val65 70 75 80Leu Thr Tyr Ile Tyr Phe Lys Arg Leu Lys Thr Met Thr Asp Thr Tyr85 90 95Leu Leu Asn Leu Ala Val Ala Asp Ile Leu Phe Leu Leu Thr Leu Pro100 105 110Phe Trp Ala Tyr Ser Ala Ala Lys Ser Trp Val Phe Gly Val His Phe115 120 125Cys Lys Leu Ile Phe Ala Ile Tyr Lys Met Ser Phe Phe Ser Gly Met130 135 140Leu Leu Leu Leu Cys Ile Ser Ile Asp Arg Tyr Val Ala Ile Val Gln145 150 155 160Ala Val Ser Ala His Arg His Arg Ala Arg Val Leu Leu Ile Ser Lys165 170 175Leu Ser Cys Val Gly Ile Trp Ile Leu Ala Thr Val Leu Ser Ile Pro180 185 190Glu Leu Leu Tyr Ser Asp Leu Gln Arg Ser Ser Ser Glu Gln Ala Met195 200 205Arg Cys Ser Leu Ile Thr Glu His Val Glu Ala Phe Ile Thr Ile Gln210 215 220Val Ala Gln Met Val Ile Gly Phe Leu Val Pro Leu Leu Ala Met Ser225 230 235 240Phe Cys Tyr Leu Val Ile Ile Arg Thr Leu Leu Gln Ala Arg Asn Phe245 250 255Glu Arg Asn Lys Ala Ile Lys Val Ile Ile Ala Val Val Val Val Phe260 265 270Ile Val Phe Gln Leu Pro Tyr Asn Gly Val Val Leu Ala Gln Thr Val275 280 285Ala Asn Phe Asn Ile Thr Ser Ser Thr Cys Glu Leu Ser Lys Gln Leu290 295 300Asn Ile Ala Tyr Asp Val Thr Tyr Ser Leu Ala Cys Val Arg Cys Cys305 310 315 320Val Asn Pro Phe Leu Tyr Ala Phe Ile Gly Val Lys Phe Arg Asn Asp325 330 335Leu Phe Lys Leu Phe Lys Asp Leu Gly Cys Leu Ser Gln Glu Gln Leu340 345 350Arg Gln Trp Ser Ser Cys Arg His Ile Arg Arg Ser Ser Met Ser Val355 360 365Glu Ala Glu Thr Thr Thr Thr Phe Ser Pro370 375(76)SEQ ID NO75的資料(i)序列特征(A)長度32個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO75的序列描述CTGGAATTCA CCTGGACCAC CACCAATGGA TA 32(77)SEQ ID NO76的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO76的序列描述CTCGGATCCT GCAAAGTTTG TCATACAGTT 30(78)SEQ ID NO77的資料(i)序列特征(A)長度1085個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO77的序列描述ATGGATATAC AAATGGCAAA CAATTTTACT CCGCCCTCTG CAACTCCTCA GGGAAATGAC 60TGTGACCTCT ATGCACATCA CAGCACGGCC AGGATAGTAA TGCCTCTGCA TTACAGCCTC 120GTCTTCATCA TTGGGCTCGT GGGAAACTTA CTAGCCTTGG TCGTCATTGT TCAAAACAGG 180AAAAAAATCA ACTCTACCAC CCTCTATTCA ACAAATTTGG TGATTTCTGA TATACTTTTT 240ACCACGGCTT TGCCTACACG AATAGCCTAC TATGCAATGG GCTTTGACTG GAGAATCGGA 300GATGCCTTGT GTAGGATAAC TGCGCTAGTG TTTTACATCA ACACATATGC AGGTGTGAAC 360TTTATGACCT GCCTGAGTAT TGACCGCTTC ATTGCTGTGG TGCACCCTCT ACGCTACAAC 420AAGATAAAAA GGATTGAACA TGCAAAAGGC GTGTGCATAT TTGTCTGGAT TCTAGTATTT 480GCTCAGACAC TCCCACTCCT CATCAACCCT ATGTCAAAGC AGGAGGCTGA AAGGATTACA 540TGCATGGAGT ATCCAAACTT TGAAGAAACT AAATCTCTTC CCTGGATTCT GCTTGGGGCA 600TGTTTCATAG GATATGTACT TCCACTTATA ATCATTCTCA TCTGCTATTC TCAGATCTGC 660TGCAAACTCT TCAGAACTGC CAAACAAAAC CCACTCACTG AGAAATCTGG TGTAAACAAA 720AAGGCTCTCA ACACAATTAT TCTTATTATT GTTGTGTTTG TTCTCTGTTT CACACCTTAC 780CATGTTGCAA TTATTCAACA TATGATTAAG AAGCTTCGTT TCTCTAATTT CCTGGAATGT 840AGCCAAAGAC ATTCGTTCCA GATTTCTCTG CACTTTACAG TATGCCTGAT GAACTTCAAT 900TGCTGCATGG ACCCTTTTAT CTACTTCTTT GCATGTAAAG GGTATAAGAG AAAGGTTATG 960AGGATGCTGA AACGGCAAGT CAGTGTATCG ATTTCTAGTG CTGTGAAGTC AGCCCCTGAA 1020GAAAATTCAC GTGAAATGAC AGAAACGCAG ATGATGATAC ATTCCAAGTC TTCAAATGGA 1080AAGTGA1086(79)SEQ ID NO78的資料(i)序列特征(A)長度361個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO78的序列描述Met Asp Ile Gln Met Ala Asn Asn Phe Thr Pro Pro Ser Ala Thr Pro1 5 10 15Gln Gly Asn Asp Cys Asp Leu Tyr Ala His His Ser Thr Ala Arg Ile20 25 30Val Met Pro Leu His Tyr Ser Leu Val Phe Ile Ile Gly Leu Val Gly35 40 45Asn Leu Leu Ala Leu Val Val Ile Val Gln Asn Arg Lys Lys Ile Asn50 55 60Ser Thr Thr Leu Tyr Ser Thr Asn Leu Val Ile Ser Asp Ile Leu Phe65 70 75 80Thr Thr Ala Leu Pro Thr Arg Ile Ala Tyr Tyr Ala Met Gly Phe Asp
85 90 95Trp Arg Ile Gly Asp Ala Leu Cys Arg Ile Thr Ala Leu Val Phe Tyr100 105 110Ile Asn Thr Tyr Ala Gly Val Asn Phe Met Thr Cys Leu Ser Ile Asp115 120 125Arg Phe Ile Ala Val Val His Pro Leu Arg Tyr Asn Lys Ile Lys Argl30 135 140Ile Glu His Ala Lys Gly Val Cys Ile Phe Val Trp Ile Leu Val Phe145 150 155 160Ala Gln Thr Leu Pro Leu Leu Ile Asn Pro Met Ser Lys Gln Glu Ala165 170 175Glu Arg Ile Thr Cys Met Glu Tyr Pro Asn Phe Glu Glu Thr Lys Ser180 185 190Leu Pro Trp Ile Leu Leu Gly Ala Cys Phe Ile Gly Tyr Val Leu Pro195 200 205Leu Ile Ile Ile Leu Ile Cys Tyr Ser Gln Ile Cys Cys Lys Leu Phe210 215 220Arg Thr Ala Lys Gln Asn Pro Leu Thr Glu Lys Ser Gly Val Asn Lys225 230 235 240Lys Ala Leu Asn Thr Ile Ile Leu Ile Ile Val Val Phe Val Leu Cys245 250 255Phe Thr Pro Tyr His Val Ala Ile Ile Gln His Met Ile Lys Lys Leu260 265 270Arg Phe Ser Asn Phe Leu Glu Cys Ser Gln Arg His Ser Phe Gln Ile275 280 285Ser Leu His Phe Thr Val Cys Leu Met Asn Phe Asn Cys Cys Met Asp290 295 300Pro Phe Ile Tyr Phe Phe Ala Cys Lys Gly Tyr Lys Arg Lys Val Met305 310 315 320Arg Met Leu Lys Arg Gln Val Ser Val Ser Ile Ser Ser Ala Val Lys325 330 335Ser Ala Pro Glu Glu Asn Ser Arg Glu Met Thr Glu Thr Gln Met Met340 345 350Ile His Ser Lys Ser Ser Asn Gly Lys
355 360(80)SEQ ID NO79的資料(i)序列特征(A)長度31個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO79的序列描述CTGGAATTCT CCTGCTCATC CAGCCATGCG G 31(81) SEQ ID NO80的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO80的序列描述CCTGGATCCC CACCCCTACT GGGGCCTCAG 30(82)SEQ ID NO81的資料(i)序列特征(A)長度1446個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO81的序列描述ATGCGGTGGC TGTGGCCCCT GGCTGTCTCT CTTGCTGTGA TTTTGGCTGT GGGGCTAAGC 60AGGGTCTCTG GGGGTGCCCC CCTGCACCTG GGCAGGCACA GAGCCGAGAC CCAGGAGCAG 120CAGAGCCGAT CCAAGAGGGG CACCGAGGAT GAGGAGGCCA AGGGCGTGCA GCAGTATGTG 180CCTGAGGAGT GGGCGGAGTA CCCCCGGCCC ATTCACCCTG CTGGCCTGCA GCCAACCAAG 240CCCTTGGTGG CCACCAGCCC TAACCCCGAC AAGGATGGGG GCACCCCAGA CAGTGGGCAG 300GAACTGAGGG GCAATCTGAC AGGGGCACCA GGGCAGAGGC TACAGATCCA GAACCCCCTG 360TATCCGGTGA CCGAGAGCTC CTACAGTGCC TATGCCATCA TGCTTCTGGC GCTGGTGGTG 420TTTGCGGTGG GCATTGTGGG CAACCTGTCG GTCATGTGCA TCGTGTGGCA CAGCTACTAC 480CTGAAGAGCG CCTGGAACTC CATCCTTGCC AGCCTGGCCC TCTGGGATTT TCTGGTCCTC 540TTTTTCTGCC TCCCTATTGT CATCTTCAAC GAGATCACCA AGCAGAGGCT ACTGGGTGAC 600GTTTCTTGTC GTGCCGTGCC CTTCATGGAG GTCTCCTCTC TGGGAGTCAC GACTTTCAGC 660CTCTGTGCCC TGGGCATTGA CCGCTTCCAC GTGGCCACCA GCACCCTGCC CAAGGTGAGG 720CCCATCGAGC GGTGCCAATC CATCCTGGCC AAGTTGGCTG TCATCTGGGT GGGCTCCATG 780ACGCTGGCTG TGCCTGAGCT CCTGCTGTGG CAGCTGGCAC AGGAGCCTGC CCCCACCATG 840GGCACCCTGG ACTCATGCAT CATGAAACCC TCAGCCAGCC TGCCCGAGTC CCTGTATTCA 900CTGGTGATGA CCTACCAGAA CGCCCGCATG TGGTGGTACT TTGGCTGCTA CTTCTGCCTG 960CCCATCCTCT TCACAGTCAC CTGCCAGCTG GTGACATGGC GGGTGCGAGG CCCTCCAGGG 1020AGGAAGTCAG AGTGCAGGGC CAGCAAGCAC GAGCAGTGTG AGAGCCAGCT CAACAGCACC 1080GTGGTGGGCC TGACCGTGGT CTACGCCTTC TGCACCCTCC CAGAGAACGT CTGCAACATC 1140GTGGTGGCCT ACCTCTCCAC CGAGCTGACC CGCCAGACCC TGGACCTCCT GGGCCTCATC 1200AACCAGTTCT CCACCTTCTT CAAGGGCGCC ATCACCCCAG TGCTGCTCCT TTGCATCTGC 1260AGGCCGCTGG GCCAGGCCTT CCTGGACTGC TGCTGCTGCT GCTGCTGTGA GGAGTGCGGC 1320GGGGCTTCGG AGGCCTCTGC TGCCAATGGG TCGGACAACA AGCTCAAGAC CGAGGTGTCC 1380TCTTCCATCT ACTTCCACAA GCCCAGGGAG TCACCCCCAC TCCTGCCCCT GGGCACACCT 1440TGCTGA1446(83)SEQ ID NO82的資料(i)序列特征(A)長度481個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO82的序列描述Met Arg Trp Leu Trp Pro Leu Ala Val Ser Leu Ala Val Ile Leu Ala1 5 10 15Val Gly Leu Ser Arg Val Ser Gly Gly Ala Pro Leu His Leu Gly Arg20 25 30His Arg Ala Glu Thr Gln Glu Gln Gln Ser Arg Ser Lys Arg Gly Thr35 40 45Glu Asp Glu Glu Ala Lys Gly Val Gln Gln Tyr Val Pro Glu Glu Trp50 55 60Ala Glu Tyr Pro Arg Pro Ile His Pro Ala Gly Leu Gln Pro Thr Lys65 70 75 80Pro Leu Val Ala Thr Ser Pro Asn Pro Asp Lys Asp Gly Gly Thr Pro85 90 95Asp Ser Gly Gln Glu Leu Arg Gly Asn Leu Thr Gly Ala Pro Gly Gln100 105 110Arg Leu Gln Ile Gln Asn Pro Leu Tyr Pro Val Thr Glu Ser Ser Tyrl15 120 125Ser Ala Tyr Ala Ile Met Leu Leu Ala Leu Val Val Phe Ala Val Gly130 135 140Ile Val Gly Asn Leu Ser Val Met Cys Ile Val Trp His Ser Tyr Tyr145 150 155 160Leu Lys Ser Ala Trp Asn Ser Ile Leu Ala Ser Leu Ala Leu Trp Aspl65 170 175Phe Leu Val Leu Phe Phe Cys Leu Pro Ile Val Ile Phe Asn Glu Ile180 185 190Thr Lys Gln Arg Leu Leu Gly Asp Val Ser Cys Arg Ala Val Pro Phe195 200 205Met Glu Va1 Ser Ser Leu Gly Val Thr Thr Phe Ser Leu Cys Ala Leu210 215 220Gly Ile Asp Arg Phe His Val Ala Thr Ser Thr Leu Pro Lys Val Arg225 230 235 240Pro Ile Glu Arg Cys Gln Ser Ile Leu Ala Lys Leu Ala Val Ile Trp245 250 255Val Gly Ser Met Thr Leu Ala Val Pro Glu Leu Leu Leu Trp Gln Leu260 265 270Ala Gln Glu Pro Ala Pro Thr Met Gly Thr Leu Asp Ser Cys Ile Met275 280 285Lys Pro Ser Ala Ser Leu Pro Glu Ser Leu Tyr Ser Leu Val Met Thr290 295 300Tyr Gln Asn Ala Arg Met Trp Trp Tyr Phe Gly Cys Tyr Phe Cys Leu305 310 315 320Pro Ile Leu Phe Thr Val Thr Cys Gln Leu Val Thr Trp Arg Val Arg325 330 335Gly Pro Pro Gly Arg Lys Ser Glu Cys Arg Ala Ser Lys His Glu Gln340 345 350Cys Glu Ser Gln Leu Asn Ser Thr Val Val Gly Leu Thr Val Val Tyr355 360 365Ala Phe Cys Thr Leu Pro Glu Asn Val Cys Asn Ile Val Val Ala Tyr370 375 380Leu Ser Thr Glu Leu Thr Arg Gln Thr Leu Asp Leu Leu Gly Leu Ile385 390 395 400Asn Gln Phe Ser Thr Phe Phe Lys Gly Ala Ile Thr Pro Val Leu Leu405 410 415Leu Cys Ile Cys Arg Pro Leu Gly Gln Ala Phe Leu Asp Cys Cys Cys420 425 430Cys Cys Cys Cys Glu Glu Cys Gly Gly Ala Ser Glu Ala Ser Ala Ala435 440 445Asn Gly Ser Asp Asn Lys Leu Lys Thr Glu Val Ser Ser Ser Ile Tyr450 455 460Phe His Lys Pro Arg Glu Ser Pro Pro Leu Leu Pro Leu Gly Thr Pro465 470 475 480Cys(84)SEQ ID NO83的資料(i)序列特征(A)長度22個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO83的序列描述ATGTGGAACG CGACGCCCAG CG 22(85)SEQ ID NO84的資料(i)序列特征(A)長度22個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO84的序列描述TCATGTATTA ATACTAGATT CT 22(86)SEQ ID NO85的資料(i)序列特征(A)長度38個堿基對
(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO85的序列描述TACCATGTGG AACGCGACGC CCAGCGAAGA GCCGGGGT38(87)SEQ ID NO86的資料(i)序列特征(A)長度39個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO86的序列描述CGGAATTCAT GTATTAATAC TAGATTCTGT CCAGGCCCG 39(88)SEQ ID NO87的資料(i)序列特征(A)長度1101個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi) SEQ ID NO87的序列描述ATGTGGAACG CGACGCCCAG CGAAGAGCCG GGGTTCAACC TCACACTGGC CGACCTGGAC 60TGGGATGCTT CCCCCGGCAA CGACTCGCTG GGCGACGAGC TGCTGCAGCT CTTCCCCGCG 120CCGCTGCTGG CGGGCGTCAC AGCCACCTGC GTGGCACTCT TCGTGGTGGG TATCGCTGGC 180AACCTGCTCA CCATGCTGGT GGTGTCGCGC TTCCGCGAGC TGCGCACCAC CACCAACCTC 240TACCTGTCCA GCATGGCCTT CTCCGATCTG CTCATCTTCC TCTGCATGCC CCTGGACCTC 300GTTCGCCTCT GGCAGTACCG GCCCTGGAAC TTCGGCGACC TCCTCTGCAA ACTCTTCCAA 360TTCGTCAGTG AGAGCTGCAC CTACGCCACG GTGCTCACCA TCACAGCGCT GAGCGTCGAG 420CGCTACTTCG CCATCTGCTT CCCACTCCGG GCCAAGGTGG TGGTCACCAA GGGGCGGGTG 480AAGCTGGTCA TCTTCGTCAT CTGGGCCGTG GCCTTCTGCA GCGCCGGGCC CATCTTCGTG 540CTAGTCGGGG TGGAGCACGA GAACGGCACC GACCCTTGGG ACACCAACGA GTGCCGCCCC 600ACCGAGTTTG CGGTGCGCTC TGGACTGCTC ACGGTCATGG TGTGGGTGTC CAGCATCTTC 660TTCTTCCTTC CTGTCTTCTG TCTCACGGTC CTCTACAGTC TCATCGGCAG GAAGCTGTGG 720CGGAGGAGGC GCGGCGATGC TGTCGTGGGT GCCTCGCTCA GGGACCAGAA CCACAAGCAA 780ACCGTGAAAA TGCTGGCTGT AGTGGTGTTT GCCTTCATCC TCTGCTGGCT CCCCTTCCAC 840GTAGGGCGAT ATTTATTTTC CAAATCCTTT GAGCCTGGCT CCTTGGAGAT TGCTCAGATC 900AGCCAGTACT GCAACCTCGT GTCCTTTGTC CTCTTCTACC TCAGTGCTGC CATCAACCCC 960ATTCTGTACA ACATCATGTC CAAGAAGTAC CGGGTGGCAG TGTTCAGACT TCTGGGATTC 1020GAACCCTTCT CCCAGAGAAA GCTCTCCACT CTGAAAGATG AAAGTTCTCG GGCCTGGACA 1080GAATCTAGTA TTAATACATG A 1101(89)SEQ ID NO88的資料(i)序列特征(A)長度366個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO88的序列描述Met Trp Asn Ala Thr Pro Ser Glu Glu Pro Gly Phe Asn Leu Thr Leu1 5 10 15Ala Asp Leu Asp Trp Asp Ala Ser Pro Gly Asn Asp Ser Leu Gly Asp20 25 30Glu Leu Leu Gln Leu Phe Pro Ala Pro Leu Leu Ala Gly Val Thr Ala35 40 45Thr Cys Val Ala Leu Phe Val Val Gly Ile Ala Gly Asn Leu Leu Thr50 55 60Met Leu Val Val Ser Arg Phe Arg Glu Leu Arg Thr Thr Thr Asn Leu65 70 75 80Tyr Leu Ser Ser Met Ala Phe Ser Asp Leu Leu Ile Phe Leu Cys Met85 90 95Pro Leu Asp Leu Val Arg Leu Trp Gln Tyr Arg Pro Trp Asn Phe Gly100 105 110Asp Leu Leu Cys Lys Leu Phe Gln Phe Val Ser Glu Ser Cys Thr Tyr115 120 125Ala Thr Val Leu Thr Ile Thr Ala Leu Ser Val Glu Arg Tyr Phe Ala130 135 140Ile Cys Phe Pro Leu Arg Ala Lys Val Val Val Thr Lys Gly Arg Val145 150 155 160Lys Leu Val Ile Phe Val Ile Trp Ala Val Ala Phe Cys Ser Ala Gly165 170 175Pro Ile Phe Val Leu Val Gly Val Glu His Glu Asn Gly Thr Asp Pro180 185 190Trp Asp Thr Asn Glu Cys Arg Pro Thr Glu Phe Ala Val Arg Ser Gly195 200 205Leu Leu Thr Val Met Val Trp Val Ser Ser Ile Phe Phe Phe Leu Pro210 215 220Val Phe Cys Leu Thr Val Leu Tyr Ser Leu Ile Gly Arg Lys Leu Trp225 230 235 240Arg Arg Arg Arg Gly Asp Ala Val Val Gly Ala Ser Leu Arg Asp Gln245 250 255Asn His Lys Gln Thr Val Lys Met Leu Ala Val Val Val Phe Ala Phe260 265 270Ile Leu Cys Trp Leu Pro Phe His Val Gly Arg Tyr Leu Phe Ser Lys275 280 285Ser Phe Glu Pro Gly Ser Leu Glu Ile Ala Gln Ile Ser Gln Tyr Cys290 295 300Asn Leu Val Ser Phe Val Leu Phe Tyr Leu Ser Ala Ala Ile Asn Pro305 310 315 320Ile Leu Tyr Asn Ile Met Ser Lys Lys Tyr Arg Val Ala Val Phe Arg325 330 335Leu Leu Gly Phe Glu Pro Phe Ser Gln Arg Lys Leu Ser Thr Leu Lys340 345 350Asp Glu Ser Ser Arg Ala Trp Thr Glu Ser Ser Ile Asn Thr355 360 365(90) SEQ ID NO89的資料(i)序列特征(A)長度33個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO89的序列描述GCAAGCTTGT GCCCTCACCA AGCCATGCGA GCC 33(91)SEQ ID NO90的資料(i)序列特征
(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi) SEQ ID NO90的序列描述CGGAATTCAG CAATGAGTTC CGACAGAAGC 30(92)SEQ ID NO91的資料(i)序列特征(A)長度1842個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO91的序列描述ATGCGAGCCC CGGGCGCGCT TCTCGCCCGC ATGTCGCGGC TACTGCTTCT GCTACTGCTC 60AAGGTGTCTG CCTCTTCTGC CCTCGGGGTC GCCCCTGCGT CCAGAAACGA AACTTGTCTG 120GGGGAGAGCT GTGCACCTAC AGTGATCCAG CGCCGCGGCA GGGACGCCTG GGGACCGGGA 180AATTCTGCAA GAGACGTTCT GCGAGCCCGA GCACCCAGGG AGGAGCAGGG GGCAGCGTTT 240CTTGCGGGAC CCTCCTGGGA CCTGCCGGCG GCCCCGGGCC GTGACCCGGC TGCAGGCAGA 300GGGGCGGAGG CGTCGGCAGC CGGACCCCCG GGACCTCCAA CCAGGCCACC TGGCCCCTGG 360AGGTGGAAAG GTGCTCGGGG TCAGGAGCCT TCTGAAACTT TGGGGAGAGG GAACCCCACG 420GCCCTCCAGC TCTTCCTTCA GATCTCAGAG GAGGAAGAGA AGGGTCCCAG AGGCGCTGGC 480ATTTCCGGGC GTAGCCAGGA GCAGAGTGTG AAGACAGTCC CCGGAGCCAG CGATCTTTTT 540TACTGGCCAA GGAGAGCCGG GAAACTCCAG GGTTCCCACC ACAAGCCCCT GTCCAAGACG 600GCCAATGGAC TGGCGGGGCA CGAAGGGTGG ACAATTGCAC TCCCGGGCCG GGCGCTGGCC 660CAGAATGGAT CCTTGGGTGA AGGAATCCAT GAGCCTGGGG GTCCCCGCCG GGGAAACAGC 720ACGAACCGGC GTGTGAGACT GAAGAACCCC TTCTACCCGC TGACCCAGGA GTCCTATGGA 780GCCTACGCGG TCATGTGTCT GTCCGTGGTG ATCTTCGGGA CCGGCATCAT TGGCAACCTG 840GCGGTGATGA GCATCGTGTG CCACAACTAC TACATGCGGA GCATCTCCAA CTCCCTCTTG 900GCCAACCTGG CCTTCTGGGA CTTTCTCATC ATCTTCTTCT GCCTTCCGCT GGTCATCTTC 960CACGAGCTGA CCAAGAAGTG GCTGCTGGAG GACTTCTCCT GCAAGATCGT GCCCTATATA 1020GAGGTCGCTT CTCTGGGAGT CACCACTTTC ACCTTATGTG CTCTGTGCAT AGACCGCTTC 1080CGTGCTGCCA CCAACGTACA GATGTACTAC GAAATGATCG AAAACTGTTC CTCAACAACT 1140GCCAAACTTG CTGTTATATG GGTGGGAGCT CTATTGTTAG CACTTCCAGA AGTTGTTCTC 1200CGCCAGCTGA GCAAGGAGGA TTTGGGGTTT AGTGGCCGAG CTCCGGCAGA AAGGTGCATT 1260ATTAAGATCT CTCCTGATTT ACCAGACACC ATCTATGTTC TAGCCCTCAC CTACGACAGT 1320GCGAGACTGT GGTGGTATTT TGGCTGTTAC TTTTGTTTGC CCACGCTTTT CACCATCACC 1380TGCTCTCTAG TGACTGCGAG GAAAATCCGC AAAGCAGAGA AAGCCTGTAC CCGAGGGAAT 1440AAACGGCAGA TTCAACTAGA GAGTCAGATG AACTGTACAG TAGTGGCACT GACCATTTTA 1500TATGGATTTT GCATTATTCC TGAAAATATC TGCAACATTG TTACTGCCTA CATGGCTACA 1560GGGGTTTCAC AGCAGACAAT GGACCTCCTT AATATCATCA GCCAGTTCCT TTTGTTCTTT 1620AAGTCCTGTG TCACCCCAGT CCTCCTTTTC TGTCTCTGCA AACCCTTCAG TCGGGCCTTC 1680ATGGAGTGCT GCTGCTGTTG CTGTGAGGAA TGCATTCAGA AGTCTTCAAC GGTGACCAGT 1740GATGACAATG ACAACGAGTA CACCACGGAA CTCGAACTCT CGCCTTTCAG TACCATACGC 1800CGTGAAATGT CCACTTTTGC TTCTGTCGGA ACTCATTGCT GA1842(93) SEQ ID NO92的資料(i)序列特征(A)長度613個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO92的序列描述Met Arg Ala Pro Gly Ala Leu Leu Ala Arg Met Ser Arg Leu Leu Leu1 5 10 15Leu Leu Leu Leu Lys Val Ser Ala Ser Ser Ala Leu Gly Val Ala Pro20 25 30Ala Ser Arg Asn Glu Thr Cys Leu Gly Glu Ser Cys Ala Pro Thr Val35 40 45Ile Gln Arg Arg Gly Arg Asp Ala Trp Gly Pro Gly Asn Ser Ala Arg50 55 60Asp Val Leu Arg Ala Arg Ala Pro Arg Glu Glu Gln Gly Ala Ala Phe65 70 75 80Leu Ala Gly Pro Ser Trp Asp Leu Pro Ala Ala Pro Gly Arg Asp Pro85 90 95Ala Ala Gly Arg Gly Ala Glu Ala Ser Ala Ala Gly Pro Pro Gly Pro100 105 110Pro Thr Arg Pro Pro Gly Pro Trp Arg Trp Lys Gly Ala Arg Gly Gln115 120 125Glu Pro Ser Glu Thr Leu Gly Arg Gly Asn Pro Thr Ala Leu Gln Leu130 135 140Phe Leu Gln Ile Ser Glu Glu Glu Glu Lys Gly Pro Arg Gly Ala Gly145 150 155 160Ile Ser Gly Arg Ser Gln Glu Gln Ser Val Lys Thr Val Pro Gly Ala165 170 175Ser Asp Leu Phe Tyr Trp Pro Arg Arg Ala Gly Lys Leu Gln Gly Ser180 185 190His His Lys Pro Leu Ser Lys Thr Ala Asn Gly Leu Ala Gly His Glu195 200 205Gly Trp Thr Ile Ala Leu Pro Gly Arg Ala Leu Ala Gln Asn Gly Ser210 215 220Leu Gly Glu Gly Ile His Glu Pro Gly Gly Pro Arg Arg Gly Asn Ser225 230 235 240Thr Asn Arg Arg Val Arg Leu Lys Asn Pro Phe Tyr Pro Leu Thr Gln245 250 255Glu Ser Tyr Gly Ala Tyr Ala Val Met Cys Leu Ser Val Val Ile Phe260 265 270Gly Thr Gly Ile Ile Gly Asn Leu Ala Val Met Ser Ile Val Cys His275 280 285Asn Tyr Tyr Met Arg Ser Ile Ser Asn Ser Leu Leu Ala Asn Leu Ala290 295 300Phe Trp Asp Phe Leu Ile Ile Phe Phe Cys Leu Pro Leu Val Ile Phe305 310 315 320His Glu Leu Thr Lys Lys Trp Leu Leu Glu Asp Phe Ser Cys Lys Ile325 330 335Val Pro Tyr Ile Glu Val Ala Ser Leu Gly Val Thr Thr Phe Thr Leu340 345 350Cys Ala Leu Cys Ile Asp Arg Phe Arg Ala Ala Thr Asn Val Gln Met355 360 365Tyr Tyr Glu Met Ile Glu Asn Cys Ser Ser Thr Thr Ala Lys Leu Ala370 375 380Val Ile Trp Val Gly Ala Leu Leu Leu Ala Leu Pro Glu Val Val Leu385 390 395 400Arg Gln Leu Ser Lys Glu Asp Leu Gly Phe Ser Gly Arg Ala Pro Ala405 410 415Glu Arg Cys Ile Ile Lys Ile Ser Pro Asp Leu Pro Asp Thr Ile Tyr420 425 430Val Leu Ala Leu Thr Tyr Asp Ser Ala Arg Leu Trp Trp Tyr Phe Gly435 440 445Cys Tyr Phe Cys Leu Pro Thr Leu Phe Thr Ile Thr Cys Ser Leu Val450 455 460Thr Ala Arg Lys Ile Arg Lys Ala Glu Lys Ala Cys Thr Arg Gly Asn465 470 475 480Lys Arg Gln Ile Gln Leu Glu Ser Gln Met Asn Cys Thr Val Val Ala485 490 495Leu Thr Ile Leu Tyr Gly Phe Cys Ile Ile Pro Glu Asn Ile Cys Asn500 505 510Ile Val Thr Ala Tyr Met Ala Thr Gly Val Ser Gln Gln Thr Met Asp515 520 525Leu Leu Asn Ile Ile Ser Gln Phe Leu Leu Phe Phe Lys Ser Cys Val530 535 540Thr Pro Val Leu Leu Phe Cys Leu Cys Lys Pro Phe Ser Arg Ala Phe545 550 555 560Met Glu Cys Cys Cys Cys Cys Cys Glu Glu Cys Ile Gln Lys Ser Ser565 570 575Thr Val Thr Ser Asp Asp Asn Asp Asn Glu Tyr Thr Thr Glu Leu Glu580 585 590Leu Ser Pro Phe Ser Thr Ile Arg Arg Glu Met Ser Thr Phe Ala Ser595 600 605Val Gly Thr His Cys610(94)SEQ ID NO93的資料(i)序列特征(A)長度34個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO93的序列描述CAGAATTCAG AGAAAAAAAG TGAATATGGT TTTT34(95)SEQ ID NO94的資料(i)序列特征(A)長度32個堿基對(B)類型核酸
(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO94的序列描述TTGGATCCCT GGTGCATAAC AATTGAAAGA AT 32(96)SEQ ID NO95的資料(i)序列特征(A)長度1248個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO95的序列描述ATGGTTTTTG CTCACAGAAT GGATAACAGC AAGCCACATT TGATTATTCC TACACTTCTG 60GTGCCCCTCC AAAACCGCAG CTGCACTGAA ACAGCCACAC CTCTGCCAAG CCAATACCTG 120ATGGAATTAA GTGAGGAGCA CAGTTGGATG AGCAACCAAA CAGACCTTCA CTATGTGCTG 180AAACCCGGGG AAGTGGCCAC AGCCAGCATC TTCTTTGGGA TTCTGTGGTT GTTTTCTATC 240TTCGGCAATT CCCTGGTTTG TTTGGTCATC CATAGGAGTA GGAGGACTCA GTCTACCACC 300AACTACTTTG TGGTCTCCAT GGCATGTGCT GACCTTCTCA TCAGCGTTGC CAGCACGCCT 360TTCGTCCTGC TCCAGTTCAC CACTGGAAGG TGGACGCTGG GTAGTGCAAC GTGCAAGGTT 420GTGCGATATT TTCAATATCT CACTCCAGGT GTCCAGATCT ACGTTCTCCT CTCCATCTGC 480ATAGACCGGT TCTACACCAT CGTCTATCCT CTGAGCTTCA AGGTGTCCAG AGAAAAAGCC 540AAGAAAATGA TTGCGGCATC GTGGATCTTT GATGCAGGCT TTGTGACCCC TGTGCTCTTT 600TTCTATGGCT CCAACTGGGA CAGTCATTGT AACTATTTCC TCCCCTCCTC TTGGGAAGGC 660ACTGCCTACA CTGTCATCCA CTTCTTGGTG GGCTTTGTGA TTCCATCTGT CCTCATAATT 720TTATTTTACC AAAAGGTCAT AAAATATATT TGGAGAATAG GCACAGATGG CCGAACGGTG 780AGGAGGACAA TGAACATTGT CCCTCGGACA AAAGTGAAAA CTATCAAGAT GTTCCTCATT 840TTAAATCTGT TGTTTTTGCT CTCCTGGCTG CCTTTTCATG TAGCTCAGCT ATGGCACCCC 900CATGAACAAG ACTATAAGAA AAGTTCCCTT GTTTTCACAG CTATCACATG GATATCCTTT 960AGTTCTTCAG CCTCTAAACC TACTCTGTAT TCAATTTATA ATGCCAATTT TCGGAGAGGG 1020ATGAAAGAGA CTTTTTGCAT GTCCTCTATG AAATGTTACC GAAGCAATGC CTATACTATC 1080ACAACAAGTT CAAGGATGGC CAAAAAAAAC TACGTTGGCA TTTCAGAAAT CCCTTCCATG 1140GCCAAAACTA TTACCAAAGA CTCGATCTAT GACTCATTTG ACAGAGAAGC CAAGGAAAAA 1200AAGCTTGCTT GGCCCATTAA CTCAAATCCA CCAAATACTT TTGTCTAA 1248(97)SEQ ID NO96的資料(i)序列特征(A)長度415個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO96的序列描述Met Val Phe Ala His Arg Met Asp Asn Ser Lys Pro His Leu Ile Ile1 5 10 15Pro Thr Leu Leu Val Pro Leu Gln Asn Arg Ser Cys Thr Glu Thr Ala20 25 30Thr Pro Leu Pro Ser Gln Tyr Leu Met Glu Leu Ser Glu Glu His Ser35 40 45Trp Met Ser Asn Gln Thr Asp Leu His Tyr Val Leu Lys Pro Gly Glu50 55 60Val Ala Thr Ala Ser Ile Phe Phe Gly Ile Leu Trp Leu Phe Ser Ile65 70 75 80Phe Gly Asn Ser Leu Val Cys Leu Val Ile His Arg Ser Arg Arg Thr85 90 95Gln Ser Thr Thr Asn Tyr Phe Val Val Ser Met Ala Cys Ala Asp Leu100 105 110Leu Ile Ser Val Ala Ser Thr Pro Phe Val Leu Leu Gln Phe Thr Thr115 120 125Gly Arg Trp Thr Leu Gly Ser Ala Thr Cys Lys Val Val Arg Tyr Phe130 135 140Gln Tyr Leu Thr Pro Gly Val Gln Ile Tyr Val Leu Leu Ser Ile Cys145 150 155 160Ile Asp Arg Phe Tyr Thr Ile Val Tyr Pro Leu Ser Phe Lys Val Ser165 170 175Arg Glu Lys Ala Lys Lys Met Ile Ala Ala Ser Trp Ile Phe Asp Ala180 185 190Gly Phe Val Thr Pro Val Leu Phe Phe Tyr Gly Ser Asn Trp Asp Ser195 200 205His Cys Asn Tyr Phe Leu Pro Ser Ser Trp Glu Gly Thr Ala Tyr Thr210 215 220Val Ile His Phe Leu Val Gly Phe Val Ile Pro Ser Val Leu Ile Ile225 230 235 240Leu Phe Tyr Gln Lys Val Ile Lys Tyr Ile Trp Arg Ile Gly Thr Asp245 250 255Gly Arg Thr Val Arg Arg Thr Met Asn Ile Val Pro Arg Thr Lys Val
260 265 270Lys Thr Ile Lys Met Phe Leu Ile Leu Asn Leu Leu Phe Leu Leu Ser275 280 285Trp Leu Pro Phe His Val Ala Gln Leu Trp His Pro His Glu Gln Asp290 295 300Tyr Lys Lys Ser Ser Leu Val Phe Thr Ala Ile Thr Trp Ile Ser Phe305 310 315 320Ser Ser Ser Ala Ser Lys Pro Thr Leu Tyr Ser Ile Tyr Asn Ala Asn325 330 335Phe Arg Arg Gly Met Lys Glu Thr Phe Cys Met Ser Ser Met Lys Cys340 345 350Tyr Arg Ser Asn Ala Tyr Thr Ile Thr Thr Ser Ser Arg Met Ala Lys355 360 365Lys Asn Tyr Val Gly Ile Ser Glu Ile Pro Ser Met Ala Lys Thr Ile370 375 380Thr Lys Asp Ser Ile Tyr Asp Ser Phe Asp Arg Glu Ala Lys Glu Lys385 390 395 400Lys Leu Ala Trp Pro Ile Asn Ser Asn Pro Pro Asn Thr Phe Val405 410 415(98)SEQ ID NO97的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO97的序列描述GGAAAGCTTA ACGATCCCCA GGAGCAACAT 30(99)SEQ ID NO98的資料(i)序列特征(A)長度31個堿基對(B)類型核酸(C)鏈型單鏈
(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO98的序列描述CTGGGATCCT ACGAGAGCAT TTTTCACACA G31(100)SEQ ID NO99的資料(i)序列特征(A)長度1842個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO99的序列描述ATGGGGCCCA CCCTAGCGGT TCCCACCCCC TATGGCTGTA TTGGCTGTAA GCTACCCCAG 60CCAGAATACC CACCGGCTCT AATCATCTTT ATGTTCTGCG CGATGGTTAT CACCATCGTT 120GTAGACCTAA TCGGCAACTC CATGGTCATT TTGGCTGTGA CGAAGAACAA GAAGCTCCGG 180AATTCTGGCA ACATCTTCGT GGTCAGTCTC TCTGTGGCCG ATATGCTGGT GGCCATCTAC 240CCATACCCTT TGATGCTGCA TGCCATGTCC ATTGGGGGCT GGGATCTGAG CCAGTTACAG 300TGCCAGATGG TCGGGTTCAT CACAGGGCTG AGTGTGGTCG GCTCCATCTT CAACATCGTG 360GCAATCGCTA TCAACCGTTA CTGCTACATC TGCCACAGCC TCCAGTACGA ACGGATCTTC 420AGTGTGCGCA ATACCTGCAT CTACCTGGTC ATCACCTGGA TCATGACCGT CCTGGCTGTC 480CTGCCCAACA TGTACATTGG CACCATCGAG TACGATCCTC GCACCTACAC CTGCATCTTC 540AACTATCTGA ACAACCCTGT CTTCACTGTT ACCATCGTCT GCATCCACTT CGTCCTCCCT 600CTCCTCATCG TGGGTTTCTG CTACGTGAGG ATCTGGACCA AAGTGCTGGC GGCCCGTGAC 660CCTGCAGGGC AGAATCCTGA CAACCAACTT GCTGAGGTTC GCAATTTTCT AACCATGTTT 720GTGATCTTCC TCCTCTTTGC AGTGTGCTGG TGCCCTATCA ACGTGCTCAC TGTCTTGGTG 780GCTGTCAGTC CGAAGGAGAT GGCAGGCAAG ATCCCCAACT GGCTTTATCT TGCAGCCTAC 840TTCATAGCCT ACTTCAACAG CTGCCTCAAC GCTGTGATCT ACGGGCTCCT CAATGAGAAT 900TTCCGAAGAG AATACTGGAC CATCTTCCAT GCTATGCGGC ACCCTATCAT ATTCTTCCCT 960GGCCTCATCA GTGATATTCG TGAGATGCAG GAGGCCCGTA CCCTGGCCCG CGCCCGTGCC 1020CATGCTCGCG ACCAAGCTCG TGAACAAGAC CGTGCCCATG CCTGTCCTGC TGTGGAGGAA 1080ACCCCGATGA ATGTCCGGAA TGTTCCATTA CCTGGTGATG CTGCAGCTGG CCACCCCGAC 1140CGTGCCTCTG GCCACCCTAA GCCCCATTCC AGATCCTCCT CTGCCTATCG CAAATCTGCC 1200TCTACCCACC ACAAGTCTGT CTTTAGCCAC TCCAAGGCTG CCTCTGGTCA CCTCAAGCCT 1260GTCTCTGGCC ACTCCAAGCC TGCCTCTGGT CACCCCAAGT CTGCCACTGT CTACCCTAAG 1320CCTGCCTCTG TCCATTTCAA GGGTGACTCT GTCCATTTCA AGGGTGACTC TGTCCATTTC 1380AAGCCTGACT CTGTTCATTT CAAGCCTGCT TCCAGCAACC CCAAGCCCAT CACTGGCCAC 1440CATGTCTCTG CTGGCAGCCA CTCCAAGTCT GCCTTCAGTG CTGCCACCAG CCACCCTAAA 1500CCCATCAAGC CAGCTACCAG CCATGCTGAG CCCACCACTG CTGACTATCC CAAGCCTGCC 1560ACTACCAGCC ACCCTAAGCC CGCTGCTGCT GACAACCCTG AGCTCTCTGC CTCCCATTGC 1620CCCGAGATCC CTGCCATTGC CCACCCTGTG TCTGACGACA GTGACCTCCC TGAGTCGGCC 1680TCTAGCCCTG CCGCTGGGCC CACCAAGCCT GCTGCCAGCC AGCTGGAGTC TGACACCATC 1740GCTGACCTTC CTGACCCTAC TGTAGTCACT ACCAGTACCA ATGATTACCA TGATGTCGTG 1800GTTGTTGATG TTGAAGATGA TCCTGATGAA ATGGCTGTGT GA1842(101)SEQ ID NO100的資料(i)序列特征(A)長度613個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO100的序列描述Met Gly Pro Thr Leu Ala Val Pro Thr Pro Tyr Gly Cys Ile Gly Cys1 5 10 15Lys Leu Pro Gln Pro Glu Tyr Pro Pro Ala Leu Ile Ile Phe Met Phe20 25 30Cys Ala Met Val Ile Thr Ile Val Val Asp Leu Ile Gly Asn Ser Met35 40 45Val Ile Leu Ala Val Thr Lys Asn Lys Lys Leu Arg Asn Ser Gly Asn50 55 60Ile Phe Val Val Ser Leu Ser Val Ala Asp Met Leu Val Ala Ile Tyr65 70 75 80Pro Tyr Pro Leu Met Leu His Ala Met Ser Ile Gly Gly Trp Asp Leu85 90 95Ser Gln Leu Gln Cys Gln Met Val Gly Phe Ile Thr Gly Leu Ser Val100 105 110Val Gly Ser Ile Phe Asn Ile Val Ala Ile Ala Ile Asn Arg Tyr Cys115 120 125Tyr Ile Cys His Ser Leu Gln Tyr Glu Arg Ile Phe Ser Val Arg Asn130 135 140Thr Cys Ile Tyr Leu Val Ile Thr Trp Ile Met Thr Val Leu Ala Val145 150 155 160Leu Pro Asn Met Tyr Ile Gly Thr Ile Glu Tyr Asp Pro Arg Thr Tyr165 170 175Thr Cys Ile Phe Asn Tyr Leu Asn Asn Pro Val Phe Thr Val Thr Ile180 185 190Val Cys Ile His Phe Val Leu Pro Leu Leu Ile Val Gly Phe Cys Tyr195 200 205Val Arg Ile Trp Thr Lys Val Leu Ala Ala Arg Asp Pro Ala Gly Gln
210 215 220Asn Pro Asp Asn Gln Leu Ala Glu Val Arg Asn Phe Leu Thr Met Phe225 230 235 240Val Ile Phe Leu Leu phe Ala Val Cys Trp Cys Pro Ile Asn Val Leu245 250 255Thr Val Leu Val Ala Val Ser Pro Lys Glu Met Ala Gly Lys Ile Pro260 265 270Asn Trp Leu Tyr Leu Ala Ala Tyr Phe Ile Ala Tyr Phe Asn Ser Cys275 280 285Leu Asn Ala Val Ile Tyr Gly Leu Leu Asn Glu Asn Phe Arg Arg Glu290 295 300Tyr Trp Thr Ile Phe His Ala Met Arg His Pro Ile Ile Phe Phe Pro305 310 315 320Gly Leu Ile Ser Asp Ile Arg Glu Met Gln Glu Ala Arg Thr Leu Ala325 330 335Arg Ala Arg Ala His Ala Arg Asp Gln Ala Arg Glu Gln Asp Arg Ala340 345 350His Ala Cys Pro Ala Val Glu Glu Thr Pro Met Asn Val Arg Asn Val355 360 365Pro Leu Pro Gly Asp Ala Ala Ala Gly His Pro Asp Arg Ala Ser Gly370 375 380His Pro Lys Pro His Ser Arg Ser Ser Ser Ala Tyr Arg Lys Ser Ala385 390 395 400Ser Thr His His Lys Ser Val Phe Ser His Ser Lys Ala Ala Ser Gly405 410 415His Leu Lys Pro Val Ser Gly His Ser Lys Pro Ala Ser Gly His Pro420 425 430Lys Ser Ala Thr Val Tyr Pro Lys Pro Ala Ser Val His Phe Lys Gly435 440 445Asp Ser Val His Phe Lys Gly Asp Ser Val His Phe Lys Pro Asp Ser450 455 460Val His Phe Lys Pro Ala Ser Ser Asn Pro Lys Pro Ile Thr Gly His465 470 475 480His Val Ser Ala Gly Ser His Ser Lys Ser Ala Phe Ser Ala Ala Thr
485 490 495Ser His Pro Lys Pro Ile Lys Pro Ala Thr Ser His Ala Glu Pro Thr500 505 510Thr Ala Asp Tyr Pro Lys Pro Ala Thr Thr Ser His Pro Lys Pro Ala515 520 525Ala Ala Asp Asn Pro Glu Leu Ser Ala Ser His Cys Pro Glu Ile Pro530 535 540Ala Ile Ala His Pro Val Ser Asp Asp Ser Asp Leu Pro Glu Ser Ala545 550 555560Ser Ser Pro Ala Ala Gly Pro Thr Lys Pro Ala Ala Ser Gln Leu Glu565 570 575Ser Asp Thr Ile Ala Asp Leu Pro Asp Pro Thr Val Val Thr Thr Ser580 585 590Thr Asn Asp Tyr His Asp Val Val Val Val Asp Val Glu Asp Asp Pro595 600 605Asp Glu Met Ala Val610(102)SEQ ID NO101的資料(i)序列特征(A)長度32個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO101的序列描述TCCAAGCTTC GCCATGGGAC ATAACGGGAG CT 32(103)SEQ ID NO102的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO102的序列描述CGTGAATTCC AAGAATTTAC AATCCTTGCT 30(104)SEQ ID NO103的資料(i)序列特征(A)長度1548個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO103的序列描述ATGGGACATA ACGGGAGCTG GATCTCTCCA AATGCCAGCG AGCCGCACAA CGCGTCCGGC 60GCCGAGGCTG CGGGTGTGAA CCGCAGCGCG CTCGGGGAGT TCGGCGAGGC GCAGCTGTAC 120CGCCAGTTCA CCACCACCGT GCAGGTCGTC ATCTTCATAG GCTCGCTGCT CGGAAACTTC 180ATGGTGTTAT GGTCAACTTG CCGCACAACC GTGTTCAAAT CTGTCACCAA CAGGTTCATT 240AAAAACCTGG CCTGCTCGGG GATTTGTGCC AGCCTGGTCT GTGTGCCCTT CGACATCATC 300CTCAGCACCA GTCCTCACTG TTGCTGGTGG ATCTACACCA TGCTCTTCTG CAAGGTCGTC 360AAATTTTTGC ACAAAGTATT CTGCTCTGTG ACCATCCTCA GCTTCCCTGC TATTGCTTTG 420GACAGGTACT ACTCAGTCCT CTATCCACTG GAGAGGAAAA TATCTGATGC CAAGTCCCGT 480GAACTGGTGA TGTACATCTG GGCCCATGCA GTGGTGGCCA GTGTCCCTGT GTTTGCAGTA 540ACCAATGTGG CTGACATCTA TGCCACGTCC ACCTGCACGG AAGTCTGGAG CAACTCCTTG 600GGCCACCTGG TGTACGTTCT GGTGTATAAC ATCACCACGG TCATTGTGCC TGTGGTGGTG 660GTGTTCCTCT TCTTGATACT GATCCGACGG GCCCTGAGTG CCAGCCAGAA GAAGAAGGTC 720ATCATAGCAG CGCTCCGGAC CCCACAGAAC ACCATCTCTA TTCCCTATGC CTCCCAGCGG 780GAGGCCGAGC TGCACGCCAC CCTGCTCTCC ATGGTGATGG TCTTCATCTT GTGTAGCGTG 840CCCTATGCCA CCCTGGTCGT CTACCAGACT GTGCTCAATG TCCCTGACAC TTCCGTCTTC 900TTGCTGCTCA CTGCTGTTTG GCTGCCCAAA GTCTCCCTGC TGGCAAACCC TGTTCTCTTT 960CTTACTGTGA ACAAATCTGT CCGCAAGTGC TTGATAGGGA CCCTGGTGCA ACTACACCAC 1020CGGTACAGTC GCCGTAATGT GGTCAGTACA GGGAGTGGCA TGGCTGAGGC CAGCCTGGAA 1080CCCAGCATAC GCTCGGGTAG CCAGCTCCTG GAGATGTTCC ACATTGGGCA GCAGCAGATC 1140TTTAAGCCCA CAGAGGATGA GGAAGAGAGT GAGGCCAAGT ACATTGGCTC AGCTGACTTC 1200CAGGCCAAGG AGATATTTAG CACCTGCCTG GAGGGAGAGC AGGGGCCACA GTTTGCGCCC 1260TCTGCCCCAC CCCTGAGCAC AGTGGACTCT GTATCCCAGG TGGCACCGGC AGCCCCTGTG 1320GAACCTGAAA CATTCCCTGA TAAGTATTCC CTGCAGTTTG GCTTTGGGCC TTTTGAGTTG 1380CCTCCTCAGT GGCTCTCAGA GACCCGAAAC AGCAAGAAGC GGCTGCTTCC CCCCTTGGGC 1440AACACCCCAG AAGAGCTGAT CCAGACAAAG GTGCCCAAGG TAGGCAGGGT GGAGCGGAAG 1500ATGAGCAGAA ACAATAAAGT GAGCATTTTT CCAAAGGTGG ATTCCTAG 1548(105)SEQ ID NO104的資料(i)序列特征(A)長度515個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO104的序列描述Met Gly His Asn Gly Ser Trp Ile Ser Pro Asn Ala Ser Glu Pro His1 5 10 15Asn Ala Ser Gly Ala Glu Ala Ala Gly Val Asn Arg Ser Ala Leu Gly20 25 30Glu Phe Gly Glu Ala Gln Leu Tyr Arg Gln Phe Thr Thr Thr Val Gln35 40 45Val Val Ile Phe Ile Gly Ser Leu Leu Gly Asn Phe Met Val Leu Trp50 55 60Ser Thr Cys Arg Thr Thr Val Phe Lys Ser Val Thr Asn Arg Phe Ile65 70 75 80Lys Asn Leu Ala Cys Ser Gly Ile Cys Ala Ser Leu Val Cys Val Pro85 90 95Phe Asp Ile Ile Leu Ser Thr Ser Pro His Cys Cys Trp Trp Ile Tyr100 105 110Thr Met Leu Phe Cys Lys Val Val Lys Phe Leu His Lys Val Phe Cys115 120 125Ser Val Thr Ile Leu Ser Phe Pro Ala Ile Ala Leu Asp Arg Tyr Tyr130 135 140Ser Val Leu Tyr Pro Leu Glu Arg Lys Ile Ser Asp Ala Lys Ser Arg145 150 155 160Glu Leu Val Met Tyr Ile Trp Ala His Ala Val Val Ala Ser Val Pro165 170 175Val Phe Ala Val Thr Asn Val Ala Asp Ile Tyr Ala Thr Ser Thr Cys180 185 190Thr Glu Val Trp Ser Asn Ser Leu Gly His Leu Val Tyr Val Leu Val195 200 205Tyr Asn Ile Thr Thr Val Ile Val Pro Val Val Val Val Phe Leu Phe210 215 220Leu Ile Leu Ile Arg Arg Ala Leu Ser Ala Ser Gln Lys Lys Lys Val225 230 235 240Ile Ile Ala Ala Leu Arg Thr Pro Gln Asn Thr Ile Ser Ile Pro Tyr245 250 255Ala Ser Gln Arg Glu Ala Glu Leu His Ala Thr Leu Leu Ser Met Val260 265 270Met Val Phe Ile Leu Cys Ser Val Pro Tyr Ala Thr Leu Val Val Tyr275 280 285Gln Thr Val Leu Asn Val Pro Asp Thr Ser Val Phe Leu Leu Leu Thr290 295 300Ala Val Trp Leu Pro Lys Val Ser Leu Leu Ala Asn Pro Val Leu Phe305 310 315 320Leu Thr Val Asn Lys Ser Val Arg Lys Cys Leu Ile Gly Thr Leu Val325 330 335Gln Leu His His Arg Tyr Ser Arg Arg Asn Val Val Ser Thr Gly Ser340 345 350Gly Met Ala Glu Ala Ser Leu Glu Pro Ser Ile Arg Ser Gly Ser Gln355 360 365Leu Leu Glu Met Phe His Ile Gly Gln Gln Gln Ile Phe Lys Pro Thr370 375 380Glu Asp Glu Glu Glu Ser Glu Ala Lys Tyr Ile Gly Ser Ala Asp Phe385 390 395 400Gln Ala Lys Glu Ile Phe Ser Thr Cys Leu Glu Gly Glu Gln Gly Pro405 410 415Gln Phe Ala Pro Ser Ala Pro Pro Leu Ser Thr Val Asp Ser Val Ser420 425 430Gln Val Ala Pro Ala Ala Pro Val Glu Pro Glu Thr Phe Pro Asp Lys435 440 445Tyr Ser Leu Gln Phe Gly Phe Gly Pro Phe Glu Leu Pro Pro Gln Trp450 455 460Leu Ser Glu Thr Arg Asn Ser Lys Lys Arg Leu Leu Pro Pro Leu Gly465 470 475 480Asn Thr Pro Glu Glu Leu Ile Gln Thr Lys Val Pro Lys Val Gly Arg485 490 495Val Glu Arg Lys Met Ser Arg Asn Asn Lys Val Ser Ile Phe Pro Lys500 505 510Val Asp Ser515(106)SEQ ID NO105的資料(i)序列特征(A)長度29個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO105的序列描述GGAGAATTCA CTAGGCGAGG CGCTCCATC 29(107)SEQ ID NO106的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO106的序列描述GGAGGATCCA GGAAACCTTA GGCCGAGTCC 30(108)SEQ ID NO107的資料(i)序列特征(A)長度1164個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO107的序列描述ATGAATCGGC ACCATCTGCA GGATCACTTT CTGGAAATAG ACAAGAAGAA CTGCTGTGTG 60TTCCGAGATG ACTTCATTGC CAAGGTGTTG CCGCCGGTGT TGGGGCTGGA GTTTATCTTT 120GGGCTTCTGG GCAATGGCCT TGCCCTGTGG ATTTTCTGTT TCCACCTCAA GTCCTGGAAA 180TCCAGCCGGA TTTTCCTGTT CAACCTGGCA GTAGCTGACT TTCTACTGAT CATCTGCCTG 240CCGTTCGTGA TGGACTACTA TGTGCGGCGT TCAGACTGGA ACTTTGGGGA CATCCCTTGC 300CGGCTGGTGC TCTTCATGTT TGCCATGAAC CGCCAGGGCA GCATCATCTT CCTCACGGTG 360GTGGCGGTAG ACAGGTATTT CCGGGTGGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 420AATTGGACAG CAGCCATCAT CTCTTGCCTT CTGTGGGGCA TCACTGTTGG CCTAACAGTC 480CACCTCCTGA AGAAGAAGTT GCTGATCCAG AATGGCCCTG CAAATGTGTG CATCAGCTTC 540AGCATCTGCC ATACCTTCCG GTGGCACGAA GCTATGTTCC TCCTGGAGTT CCTCCTGCCC 600CTGGGCATCA TCCTGTTCTG CTCAGCCAGA ATTATCTGGA GCCTGCGGCA GAGACAAATG 660GACCGGCATG CCAAGATCAA GAGAGCCATC ACCTTCATCA TGGTGGTGGC CATCGTCTTT 720GTCATCTGCT TCCTTCCCAG CGTGGTTGTG CGGATCCGCA TCTTCTGGCT CCTGCACACT 780TCGGGCACGC AGAATTGTGA AGTGTACCGC TCGGTGGACC TGGCGTTCTT TATCACTCTC 840AGCTTCACCT ACATGAACAG CATGCTGGAC CCCGTGGTGT ACTACTTCTC CAGCCCATCC 900TTTCCCAACT TCTTCTCCAC TTTGATCAAC CGCTGCCTCC AGAGGAAGAT GACAGGTGAG 960CCAGATAATA ACCGCAGCAC GAGCGTCGAG CTCACAGGGG ACCCCAACAA AACCAGAGGC 1020GCTCCAGAGG CGTTAATGGC CAACTCCGGT GAGCCATGGA GCCCCTCTTA TCTGGGCCCA 1080ACCTCAAATA ACCATTCCAA GAAGGGACAT TGTCACCAAG AACCAGCATC TCTGGAGAAA 1140CAGTTGGGCT GTTGCATCGA GTAA1164(109)SEQ ID NO108的資料(i)序列特征(A)長度387個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO108的序列描述Met Asn Arg His His Leu Gln Asp His Phe Leu Glu Ile Asp Lys Lys1 5 10 15Asn Cys Cys Val Phe Arg Asp Asp Phe Ile Ala Lys Val Leu Pro Pro20 25 30Val Leu Gly Leu Glu Phe Ile Phe Gly Leu Leu Gly Asn Gly Leu Ala35 40 45Leu Trp Ile Phe Cys Phe His Leu Lys Ser Trp Lys Ser Ser Arg Ile50 55 60Phe Leu Phe Asn Leu Ala Val Ala Asp Phe Leu Leu Ile Ile Cys Leu65 70 75 80Pro Phe Val Met Asp Tyr Tyr Val Arg Arg Ser Asp Trp Asn Phe Gly85 90 95Asp Ile Pro Cys Arg Leu Val Leu Phe Met Phe Ala Met Asn Arg Gln100 105 110Gly Ser Ile Ile Phe Leu Thr Val Val Ala Val Asp Arg Tyr Phe Arg115 120 125Val Val His Pro His His Ala Leu Asn Lys Ile Ser Asn Trp Thr Ala130 135 140Ala Ile Ile Ser Cys Leu Leu Trp Gly Ile Thr Val Gly Leu Thr Val145 150 155 160His Leu Leu Lys Lys Lys Leu Leu Ile Gln Asn Gly Pro Ala Asn Val165 170 175Cys Ile Ser Phe Ser Ile Cys His Thr Phe Arg Trp His Glu Ala Met180 185 190Phe Leu Leu Glu Phe Leu Leu Pro Leu Gly Ile Ile Leu Phe Cys Ser195 200 205Ala Arg Ile Ile Trp Ser Leu Arg Gln Arg Gln Met Asp Arg His Ala210 215 220Lys Ile Lys Arg Ala Ile Thr Phe Ile Met Val Val Ala Ile Val Phe225 230 235 240Val Ile Cys Phe Leu Pro Ser Val Val Val Arg Ile Arg Ile Phe Trp245 250 255Leu Leu His Thr Ser Gly Thr Gln Asn Cys Glu Val Tyr Arg Ser Val260 265 270Asp Leu Ala Phe Phe Ile Thr Leu Ser Phe Thr Tyr Met Asn Ser Met275 280 285Leu Asp Pro Val Val Tyr Tyr Phe Ser Ser Pro Ser Phe Pro Asn Phe290 295 300Phe Ser Thr Leu Ile Asn Arg Cys Leu Gln Arg Lys Met Thr Gly Glu305 310 315 320Pro Asp Asn Asn Arg Ser Thr Ser Val Glu Leu Thr Gly Asp Pro Asn325 330 335Lys Thr Arg Gly Ala Pro Glu Ala Leu Met Ala Asn Ser Gly Glu Pro340 345 350Trp Ser Pro Ser Tyr Leu Gly Pro Thr Ser Asn Asn His Ser Lys Lys355 360 365Gly His Cys His Gln Glu Pro Ala Ser Leu Glu Lys Gln Leu Gly Cys370 375 380Cys Ile Glu385(110)SEQ ID NO109的資料(i)序列特征(A)長度37堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(iv)反義否(xi)SEQ ID NO109的序列描述ACCATGGCTT GCAATGGCAG TGCGGCCAGG GGGCACT 37(111)SEQ ID NO110的資料(i)序列特征(A)長度39個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(iv)反義是(xi)SEQ ID NO110的序列描述CGACCAGGAC AAACAGCATC TTGGTCACTT GTCTCCGGC 39(112)SEQ ID NO111的資料(i)序列特征(A)長度39個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(iv)反義否(xi)SEQ ID NO111的序列描述GACCAAGATG CTGTTTGTCC TGGTCGTGGT GTTTGGCAT39(113)SEQ ID NO112的資料(i)序列特征(A)長度35個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(iv)反義是(xi)SEQ ID NO112的序列描述CGGAATTCAG GATGGATCGG TCTCTTCCTC CGCCT35(114)SEQ ID NO113的資料(i)序列特征(A)長度1212個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO113的序列描述ATGGCTTGCA ATGGCAGTGC GGCCAGGGGG CACTTTGACC CTGAGGACTT GAACCTGACT 60GACGAGGCAC TGAGACTCAA GTACCTGGGG CCCCAGCAGA CAGAGCTGTT CATGCCCATC 120TGTGCCACAT ACCTGCTGAT CTTCGTGGTG GGCGCTGTGG GCAATGGGCT GACCTGTCTG 180GTCATCCTGC GCCACAAGGC CATGCGCACG CCTACCAACT ACTACCTCTT CAGCCTGGCC 240GTGTCGGACC TGCTGGTGCT GCTGGTGGGC CTGCCCCTGG AGCTCTATGA GATGTGGCAC 300AACTACCCCT TCCTGCTGGG CGTTGGTGGC TGCTATTTCC GCACGCTACT GTTTGAGATG 360GTCTGCCTGG CCTCAGTGCT CAACGTCACT GCCCTGAGCG TGGAACGCTA TGTGGCCGTG 420GTGCACCCAC TCCAGGCCAG GTCCATGGTG ACGCGGGCCC ATGTGCGCCG AGTGCTTGGG 480GCCGTCTGGG GTCTTGCCAT GCTCTGCTCC CTGCCCAACA CCAGCCTGCA CGGCATCCGG 540CAGCTGCACG TGCCCTGCCG GGGCCCAGTG CCAGACTCAG CTGTTTGCAT GCTGGTCCGC 600CCACGGGCCC TCTACAACAT GGTAGTGCAG ACCACCGCGC TGCTCTTCTT CTGCCTGCCC 660ATGGCCATCA TGAGCGTGCT CTACCTGCTC ATTGGGCTGC GACTGCGGCG GGAGAGGCTG 720CTGCTCATGC AGGAGGCCAA GGGCAGGGGC TCTGCAGCAG CCAGGTCCAG ATACACCTGC 780AGGCTCCAGC AGCACGATCG GGGCCGGAGA CAAGTGACCA AGATGCTGTT TGTCCTGGTC 840GTGGTGTTTG GCATCTGCTG GGCCCCGTTC CACGCCGACC GCGTCATGTG GAGCGTCGTG 900TCACAGTGGA CAGATGGCCT GCACCTGGCC TTCCAGCACG TGCACGTCAT CTCCGGCATC 960TTCTTCTACC TGGGCTCGGC GGCCAACCCC GTGCTCTATA GCCTCATGTC CAGCCGCTTC 1020CGAGAGACCT TCCAGGAGGC CCTGTGCCTC GGGGCCTGCT GCCATCGCCT CAGACCCCGC 1080CACAGCTCCC ACAGCCTCAG CAGGATGACC ACAGGCAGCA CCCTGTGTGA TGTGGGCTCC 1140CTGGGCAGCT GGGTCCACCC CCTGGCTGGG AACGATGGCC CAGAGGCGCA GCAAGAGACC 1200GATCCATCCT GA 1212(115)SEQ ID NO114的資料(i)序列特征(A)長度403個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO114的序列描述Met Ala Cys Asn Gly Ser Ala Ala Arg Gly His Phe Asp Pro Glu Asp1 5 10 15Leu Asn Leu Thr Asp Glu Ala Leu Arg Leu Lys Tyr Leu Gly Pro Gln
20 25 30Gln Thr Glu Leu Phe Met Pro Ile Cys Ala Thr Tyr Leu Leu Ile Phe35 40 45Val Val Gly Ala Val Gly Asn Gly Leu Thr Cys Leu Val Ile Leu Arg50 55 60His Lys Ala Met Arg Thr Pro Thr Asn Tyr Tyr Leu Phe Ser Leu Ala65 70 75 80Val Ser Asp Leu Leu Val Leu Leu Val Gly Leu Pro Leu Glu Leu Tyr85 90 95Glu Met Trp His Asn Tyr Pro Phe Leu Leu Gly Val Gly Gly Cys Tyr100 105 110Phe Arg Thr Leu Leu Phe Glu Met Val Cys Leu Ala Ser Val Leu Asn115 120 125Val Thr Ala Leu Ser Val Glu Arg Tyr Val Ala Val Val His Pro Leu130 135140Gln Ala Arg Ser Met Val Thr Arg Ala His Val Arg Arg Val Leu Gly145 150 155 160Ala Val Trp Gly Leu Ala Met Leu Cys Ser Leu Pro Asn Thr Ser Leu165 170 175His Gly Ile Arg Gln Leu His Val Pro Cys Arg Gly Pro Val Pro Asp180 185 190Ser Ala Val Cys Met Leu Val Arg Pro Arg Ala Leu Tyr Asn Met Val195 200 205Val Gln Thr Thr Ala Leu Leu Phe Phe Cys Leu Pro Met Ala Ile Met210 215 220Ser Val Leu Tyr Leu Leu Ile Gly Leu Arg Leu Arg Arg Glu Arg Leu225 230 235 240Leu Leu Met Gln Glu Ala Lys Gly Arg Gly Ser Ala Ala Ala Arg Ser245 250 255Arg Tyr Thr Cys Arg Leu Gln Gln His Asp Arg Gly Arg Arg Gln Val260 265 270Thr Lys Met Leu Phe Val Leu Val Val Val Phe Gly Ile Cys Trp Ala275 280 285Pro Phe His Ala Asp Arg Val Met Trp Ser Val Val Ser Gln Trp Thr
290 295 300Asp Gly Leu His Leu Ala Phe Gln His Val His Val Ile Ser Gly Ile305 310 315 320Phe Phe Tyr Leu Gly Ser Ala Ala Asn Pro Val Leu Tyr Ser Leu Met325 330 335Ser Ser Arg Phe Arg Glu Thr Phe Gln Glu Ala Leu Cys Leu Gly Ala340 345 350Cys Cys His Arg Leu Arg Pro Arg His Ser Ser His Ser Leu Ser Arg355 360 365Met Thr Thr Gly Ser Thr Leu Cys Asp Val Gly Ser Leu Gly Ser Trp370 375 380Val His Pro Leu Ala Gly Asn Asp Gly Pro Glu Ala Gln Gln Glu Thr385 390 395 400Asp Pro Ser(116)SEQ ID NO115的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO115的序列描述GGAAGCTTCA GGCCCAAAGA TGGGGAACAT 30(117)SEQ ID NO116的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO116的序列描述GTGGATCCAC CCGCGGAGGA CCCAGGCTAG30(118)SEQ ID NO117的資料(i)序列特征(A)長度1098個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組)(xi)SEQ ID NO117的序列描述ATGGGGAACA TCACTGCAGA CAACTCCTCG ATGAGCTGTA CCATCGACCA TACCATCCAC 60CAGACGCTGG CCCCGGTGGT CTATGTTACC GTGCTGGTGG TGGGCTTCCC GGCCAACTGC 120CTGTCCCTCT ACTTCGGCTA CCTGCAGATC AAGGCCCGGA ACGAGCTGGG CGTGTACCTG 180TGCAACCTGA CGGTGGCCGA CCTCTTCTAC ATCTGCTCGC TGCCCTTCTG GCTGCAGTAC 240GTGCTGCAGC ACGACAACTG GTCTCACGGC GACCTGTCCT GCCAGGTGTG CGGCATCCTC 300CTGTACGAGA ACATCTACAT CAGCGTGGGC TTCCTCTGCT GCATCTCCGT GGACCGCTAC 360CTGGCTGTGG CCCATCCCTT CCGCTTCCAC CAGTTCCGGA CCCTGAAGGC GGCCGTCGGC 420GTCAGCGTGG TCATCTGGGC CAAGGAGCTG CTGACCAGCA TCTACTTCCT GATGCACGAG 480GAGGTCATCG AGGACGAGAA CCAGCACCGC GTGTGCTTTG AGCACTACCC CATCCAGGCA 540TGGCAGCGCG CCATCAACTA CTACCGCTTC CTGGTGGGCT TCCTCTTCCC CATCTGCCTG 600CTGCTGGCGT CCTACCAGGG CATCCTGCGC GCCGTGCGCC GGAGCCACGG CACCCAGAAG 660AGCCGCAAGG ACCAGATCCA GCGGCTGGTG CTCAGCACCG TGGTCATCTT CCTGGCCTGC 720TTCCTGCCCT ACCACGTGTT GCTGCTGGTG CGCAGCGTCT GGGAGGCCAG CTGCGACTTC 780GCCAAGGGCG TTTTCAACGC CTACCACTTC TCCCTCCTGC TCACCAGCTT CAACTGCGTC 840GCCGACCCCG TGCTCTACTG CTTCGTCAGC GAGACCACCC ACCGGGACCT GGCCCGCCTC 900CGCGGGGCCT GCCTGGCCTT CCTCACCTGC TCCAGGACCG GCCGGGCCAG GGAGGCCTAC 960CCGCTGGGTG CCCCCGAGGC CTCCGGGAAA AGCGGGGCCC AGGGTGAGGA GCCCGAGCTG 1020TTGACCAAGC TCCACCCGGC CTTCCAGACC CCTAACTCGC CAGGGTCGGG CGGGTTCCCC 1080ACGGGCAGGT TGGCCTAG 1098(119)SEQ ID NO118的資料(i)序列特征(A)長度365個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO118的序列描述Met Gly Asn Ile Thr Ala Asp Asn Ser Ser Met Ser Cys Thr Ile Asp1 5 10 15His Thr Ile His Gln Thr Leu Ala Pro Val Val Tyr Val Thr Val Leu20 25 30Val Val Gly Phe Pro Ala Asn Cys Leu Ser Leu Tyr Phe Gly Tyr Leu35 40 45Gln Ile Lys Ala Arg Asn Glu Leu Gly Val Tyr Leu Cys Asn Leu Thr50 55 60Val Ala Asp Leu Phe Tyr Ile Cys Ser Leu Pro Phe Trp Leu Gln Tyr65 70 75 80Val Leu Gln His Asp Asn Trp Ser His Gly Asp Leu Ser Cys Gln Val85 90 95Cys Gly Ile Leu Leu Tyr Glu Asn Ile Tyr Ile Ser Val Gly Phe Leu100 105 110Cys Cys Ile Ser Val Asp Arg Tyr Leu Ala Val Ala His Pro Phe Arg115 120 125Phe His Gln Phe Arg Thr Leu Lys Ala Ala Val Gly Val Ser Val Val130 135 140Ile Trp Ala Lys Glu Leu Leu Thr Ser Ile Tyr Phe Leu Met His Glu145 150 155 160Glu Val Ile Glu Asp Glu Asn Gln His Arg Val Cys Phe Glu His Tyr165 170 175Pro Ile Gln Ala Trp Gln Arg Ala Ile Asn Tyr Tyr Arg Phe Leu Val180 185 190Gly Phe Leu Phe Pro Ile Cys Leu Leu Leu Ala Ser Tyr Gln Gly Ile195 200 205Leu Arg Ala Val Arg Arg Ser His Gly Thr Gln Lys Ser Arg Lys Asp210 215 220Gln Ile Gln Arg Leu Val Leu Ser Thr Val Val Ile Phe Leu Ala Cys225 230 235 240Phe Leu Pro Tyr His Val Leu Leu Leu Val Arg Ser Val Trp Glu Ala245 250 255Ser Cys Asp Phe Ala Lys Gly Val Phe Asn Ala Tyr His Phe Ser Leu260 265 270Leu Leu Thr Ser Phe Asn Cys Val Ala Asp Pro Val Leu Tyr Cys Phe275 280 285Val Ser Glu Thr Thr His Arg Asp Leu Ala Arg Leu Arg Gly Ala Cys290 295 300Leu Ala Phe Leu Thr Cys Ser Arg Thr Gly Arg Ala Arg Glu Ala Tyr305 310 315 320Pro Leu Gly Ala Pro Glu Ala Ser Gly Lys Ser Gly Ala Gln Gly Glu325 330 335Glu Pro Glu Leu Leu Thr Lys Leu His Pro Ala Phe Gln Thr Pro Asn340 345 350Ser Pro Gly Ser Gly Gly Phe Pro Thr Gly Arg Leu Ala355 360 365(120)SEQ ID NO119的資料(i)序列特征(A)長度26個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組)(xi) SEQ ID NO119的序列描述GACCTCGAGT CCTTCTACAC CTCATC 26(121)SEQ ID NO120的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO120的序列描述TGCTCTAGAT TCCAGATAGG TGAAAACTTG 30(122)SEQ ID NO121的資料(i)序列特征(A)長度1416個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi) SEQ ID NO121的序列描述ATGGATATTC TTTGTGAAGA AAATACTTCT TTGAGCTCAA CTACGAACTC CCTAATGCAA 60TTAAATGATG ACAACAGGCT CTACAGTAAT GACTTTAACT CCGGAGAAGC TAACACTTCT 120GATGCATTTA ACTGGACAGT CGACTCTGAA AATCGAACCA ACCTTTCCTG TGAAGGGTGC 180CTCTCACCGT CGTGTCTCTC CTTACTTCAT CTCCAGGAAA AAAACTGGTC TGCTTTACTG 240ACAGCCGTAG TGATTATTCT AACTATTGCT GGAAACATAC TCGTCATCAT GGCAGTGTCC 300CTAGAGAAAA AGCTGCAGAA TGCCACCAAC TATTTCCTGA TGTCACTTGC CATAGCTGAT 360ATGCTGCTGG GTTTCCTTGT CATGCCCGTG TCCATGTTAA CCATCCTGTA TGGGTACCGG 420TGGCCTCTGC CGAGCAAGCT TTGTGCAGTC TGGATTTACC TGGACGTGCT CTTCTCCACG 480GCCTCCATCA TGCACCTCTG CGCCATCTCG CTGGACCGCT ACGTCGCCAT CCAGAATCCC 540ATCCACCACA GCCGCTTCAA CTCCAGAACT AAGGCATTTC TGAAAATCAT TGCTGTTTGG 600ACCATATCAG TAGGTATATC CATGCCAATA CCAGTCTTTG GGCTACAGGA CGATTCGAAG 660GTCTTTAAGG AGGGGAGTTG CTTACTCGCC GATGATAACT TTGTCCTGAT CGGCTCTTTT 720GTGTCATTTT TCATTCCCTT AACCATCATG GTGATCACCT ACTTTCTAAC TATCAAGTCA 780CTCCAGAAAG AAGCTACTTT GTGTGTAAGT GATCTTGGCA CACGGGCCAA ATTAGCTTCT 840TTCAGCTTCC TCCCTCAGAG TTCTTTGTCT TCAGAAAAGC TCTTCCAGCG GTCGATCCAT 900AGGGAGCCAG GGTCCTACAC AGGCAGGAGG ACTATGCAGT CCATCAGCAA TGAGCAAAAG 960GCATGCAAGG TGCTGGGCAT CGTCTTCTTC CTGTTTGTGG TGATGTGGTG CCCTTTCTTC 1020ATCACAAACA TCATGGCCGT CATCTGCAAA GAGTCCTGCA ATGAGGATGT CATTGGGGCC 1080CTGCTCAATG TGTTTGTTTG GATCGGTTAT CTCTCTTCAG CAGTCAACCC ACTAGTCTAC 1140ACACTGTTCA ACAAGACCTA TAGGTCAGCC TTTTCACGGT ATATTCAGTG TCAGTACAAG 1200GAAAACAAAA AACCATTGCA GTTAATTTTA GTGAACACAA TACCGGCTTT GGCCTACAAG 1260TCTAGCCAAC TTCAAATGGG ACAAAAAAAG AATTCAAAGC AAGATGCCAA GACAACAGAT 1320AATGACTGCT CAATGGTTGC TCTAGGAAAG CAGTATTCTG AAGAGGCTTC TAAAGACAAT 1380AGCGACGGAG TGAATGAAAA GGTGAGCTGT GTGTGA 1416(123)SEQ ID NO122的資料(i)序列特征(A)長度471個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO122的序列描述Met Asp Ile Leu Cys Glu Glu Asn Thr Ser Leu Ser Ser Thr Thr Asn1 5 10 15Ser Leu Met Gln Leu Asn Asp Asp Asn Arg Leu Tyr Ser Asn Asp Phe20 25 30Asn Ser Gly Glu Ala Asn Thr Ser Asp Ala Phe Asn Trp Thr Val Asp35 40 45Ser Glu Asn Arg Thr Asn Leu Ser Cys Glu Gly Cys Leu Ser Pro Ser50 55 60Cys Leu Ser Leu Leu His Leu Gln Glu Lys Asn Trp Ser Ala Leu Leu65 70 75 80Thr Ala Val Val Ile Ile Leu Thr Ile Ala Gly Asn Ile Leu Val Ile
85 90 95Met Ala Val Ser Leu Glu Lys Lys Leu Gln Asn Ala Thr Asn Tyr Phe100 105 110Leu Met Ser Leu Ala Ile Ala Asp Met Leu Leu Gly Phe Leu Val Met115 120 125Pro Val Ser Met Leu Thr Ile Leu Tyr Gly Tyr Arg Trp Pro Leu Pro130 135 140Ser Lys Leu Cys Ala Val Trp Ile Tyr Leu Asp Val Leu Phe Ser Thr145 150 155 160Ala Ser lle Met His Leu Cys Ala Ile Ser Leu Asp Arg Tyr Val Ala165 170 175Ile Gln Asn Pro Ile His His Ser Arg Phe Asn Ser Arg Thr Lys Ala180 185 190Phe Leu Lys Ile Ile Ala Val Trp Thr Ile Ser Val Gly Ile Ser Met195 200 205Pro Ile Pro Val Phe Gly Leu Gln Asp Asp Ser Lys Val Phe Lys Glu210 215 220Gly Ser Cys Leu Leu Ala Asp Asp Asn Phe Val Leu Ile Gly Ser Phe225 230 235 240Val Ser Phe Phe Ile Pro Leu Thr Ile Met Val Ile Thr Tyr Phe Leu245 250 255Thr Ile Lys Ser Leu Gln Lys Glu Ala Thr Leu Cys Val Ser Asp Leu260 265 270Gly Thr Arg Ala Lys Leu Ala Ser Phe Ser Phe Leu Pro Gln Ser Ser275 280 285Leu Ser Ser Glu Lys Leu Phe Gln Arg Ser Ile His Arg Glu Pro Gly290 295 300Ser Tyr Thr Gly Arg Arg Thr Met Gln Ser Ile Ser Asn Glu Gln Lys305 310 315 320Ala Cys Lys Val Leu Gly Ile Val Phe Phe Leu Phe Val Val Met Trp325 330 335Cys Pro Phe Phe Ile Thr Asn Ile Met Ala Val Ile Cys Lys Glu Ser340 345 350Cys Asn Glu Asp Val Ile Gly Ala Leu Leu Asn Val Phe Val Trp Ile
355 360 365Gly Tyr Leu Ser Ser Ala Val Asn Pro Leu Val Tyr Thr Leu Phe Asn370 375 380Lys Thr Tyr Arg Ser Ala Phe Ser Arg Tyr Ile Gln Cys Gln Tyr Lys385 390 395 400Glu Asn Lys Lys Pro Leu Gln Leu Ile Leu Val Asn Thr Ile Pro Ala405 410 415Leu Ala Tyr Lys Ser Ser Gln Leu Gln Met Gly Gln Lys Lys Asn Ser420 425 430Lys Gln Asp Ala Lys Thr Thr Asp Asn Asp Cys Ser Met Val Ala Leu435 440 445Gly Lys Gln Tyr Ser Glu Glu Ala Ser Lys Asp Asn Ser Asp Gly Val450 455 460Asn Glu Lys Val Ser Cys Val465 470(124)SEQ ID NO123的資料(i)序列特征(A)長度27個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO123的序列描述GACCTCGAGG TTGCTTAAGA CTGAAGC27(125)SEQ ID NO124的資料(i)序列特征(A)長度27個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO124的序列描述ATTTCTAGAC ATATGTAGCT TGTACCG 27(126)SEQ ID NO125的資料(i)序列特征(A)長度1377個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO125的序列描述ATGGTGAACC TGAGGAATGC GGTGCATTCA TTCCTTGTGC ACCTAATTGG CCTATTGGTT 60TGGCAATGTG ATATTTCTGT GAGCCCAGTA GCAGCTATAG TAACTGACAT TTTCAATACC 120TCCGATGGTG GACGCTTCAA ATTCCCAGAC GGGGTACAAA ACTGGCCAGC ACTTTCAATC 180GTCATCATAA TAATCATGAC AATAGGTGGC AACATCCTTG TGATCATGGC AGTAAGCATG 240GAAAAGAAAC TGCACAATGC CACCAATTAC TTCTTAATGT CCCTAGCCAT TGCTGATATG 300CTAGTGGGAC TACTTGTCAT GCCCCTGTCT CTCCTGGCAA TCCTTTATGA TTATGTCTGG 360CCACTACCTA GATATTTGTG CCCCGTCTGG ATTTCTTTAG ATGTTTTATT TTCAACAGCG 420TCCATCATGC ACCTCTGCGC TATATCGCTG GATCGGTATG TAGCAATACG TAATCCTATT 480GAGCATAGCC GTTTCAATTC GCGGACTAAG GCCATCATGA AGATTGCTAT TGTTTGGGCA 540ATTTCTATAG GTGTATCAGT TCCTATCCCT GTGATTGGAC TGAGGGACGA AGAAAAGGTG 600TTCGTGAACA ACACGACGTG CGTGCTCAAC GACCCAAATT TCGTTCTTAT TGGGTCCTTC 660GTAGCTTTCT TCATACCGCT GACGATTATG GTGATTACGT ATTGCCTGAC CATCTACGTT 720CTGCGCCGAC AAGCTTTGAT GTTACTGCAC GGCCACACCG AGGAACCGCC TGGACTAAGT 780CTGGATTTCC TGAAGTGCTG CAAGAGGAAT ACGGCCGAGG AAGAGAACTC TGCAAACCCT 840AACCAAGACC AGAACGCACG CCGAAGAAAG AAGAAGGAGA GACGTCCTAG GGGCACCATG 900CAGGCTATCA ACAATGAAAG AAAAGCTTCG AAAGTCCTTG GGATTGTTTT CTTTGTGTTT 960CTGATCATGT GGTGCCCATT TTTCATTACC AATATTCTGT CTGTTCTTTG TGAGAAGTCC 1020TGTAACCAAA AGCTCATGGA AAAGCTTCTG AATGTGTTTG TTTGGATTGG CTATGTTTGT 1080TCAGGAATCA ATCCTCTGGT GTATACTCTG TTCAACAAAA TTTACCGAAG GGCATTCTCC 1140AACTATTTGC GTTGCAATTA TAAGGTAGAG AAAAAGCCTC CTGTCAGGCA GATTCCAAGA 1200GTTGCCGCCA CTGCTTTGTC TGGGAGGGAG CTTAATGTTA ACATTTATCG GCATACCAAT 1260GAACCGGTGA TCGAGAAAGC CAGTGACAAT GAGCCCGGTA TAGAGATGCA AGTTGAGAAT 1320TTAGAGTTAC CAGTAAATCC CTCCAGTGTG GTTAGCGAAA GGATTAGCAG TGTGTGA1377(127)SEQ ID NO126的資料(i)序列特征(A)長度458個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO126的序列描述Met Val Asn Leu Arg Asn Ala Val His Ser Phe Leu Val His Leu Ile1 5 10 15Gly Leu Leu Val Trp Gln Cys Asp Ile Ser Val Ser Pro Val Ala Ala20 25 30Ile Val Thr Asp Ile Phe Asn Thr Ser Asp Gly Gly Arg Phe Lys Phe35 40 45Pro Asp Gly Val Gln Asn Trp Pro Ala Leu Ser Ile Val Ile Ile Ile50 55 60Ile Met Thr Ile Gly Gly Asn Ile Leu Val Ile Met Ala Val Ser Met65 70 75 80Glu Lys Lys Leu His Asn Ala Thr Asn Tyr Phe Leu Met Ser Leu Ala85 90 95Ile A1a Asp Met Leu Val Gly Leu Leu Val Met Pro Leu Ser Leu Leu100 105 110Ala Ile Leu Tyr Asp Tyr Val Trp Pro Leu Pro Arg Tyr Leu Cys Pro115 120 125Val Trp Ile Ser Leu Asp Val Leu Phe Ser Thr Ala Ser Ile Met His130 135 140Leu Cys Ala Ile Ser Leu Asp Arg Tyr Val Ala Ile Arg Asn Pro Ile145 150 155 160Glu His Ser Arg Phe Asn Ser Arg Thr Lys Ala Ile Met Lys Ile Ala165 170 175Ile Val Trp Ala Ile Ser Ile Gly Val Ser Val Pro Ile Pro Val Ile180 185 190Gly Leu Arg Asp Glu Glu Lys Val Phe Val Asn Asn Thr Thr Cys Val195 200 205Leu Asn Asp Pro Asn Phe Val Leu Ile Gly Ser Phe Val Ala Phe Phe210 215 220Ile Pro Leu Thr Ile Met Val Ile Thr Tyr Cys Leu Thr Ile Tyr Val225 230 235 240Leu Arg Arg Gln Ala Leu Met Leu Leu His Gly His Thr Glu Glu Pro245 250 255Pro Gly Leu Ser Leu Asp Phe Leu Lys Cys Cys Lys Arg Asn Thr Ala260 265 270Glu Glu Glu Asn Ser Ala Asn Pro Asn Gln Asp Gln Asn Ala Arg Arg275 280 285Arg Lys Lys Lys Glu Arg Arg Pro Arg Gly Thr Met Gln Ala Ile Asn290 295 300Asn Glu Arg Lys Ala Ser Lys Val Leu Gly Ile Val Phe Phe Val Phe305 310 315 320Leu Ile Met Trp Cys Pro Phe Phe Ile Thr Asn Ile Leu Ser Val Leu325 330 335Cys Glu Lys Ser Cys Asn Gln Lys Leu Met Glu Lys Leu Leu Asn Val340 345 350Phe Val Trp Ile Gly Tyr Val Cys Ser Gly Ile Asn Pro Leu Val Tyr355 360 365Thr Leu Phe Asn Lys Ile Tyr Arg Arg Ala Phe Ser Asn Tyr Leu Arg370 375 380Cys Asn Tyr Lys Val Glu Lys Lys Pro Pro Val Arg Gln Ile Pro Arg385 390 395 400Val Ala Ala Thr Ala Leu Ser Gly Arg Glu Leu Asn Val Asn Ile Tyr405 410 415Arg His Thr Asn Glu Pro Val Ile Glu Lys Ala Ser Asp Asn Glu Pro420 425 430Gly Ile Glu Met Gln Val Glu Asn Leu Glu Leu Pro Val Asn Pro Ser435 440 445Ser Val Val Ser Glu Arg Ile Ser Ser Val450 455(128)SEQ ID NO127的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO127的序列描述GGTAAGCTTG GCAGTCCACG CCAGGCCTTC 30(129)SEQ ID NO128的資料(i)序列特征
(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO128的序列描述TCCGAATTCT CTGTAGACAC AAGGCTTTGG 30(130) SEQ ID NO129的資料(i)序列特征(A)長度1068個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO129的序列描述ATGGATCAGT TCCCTGAATC AGTGACAGAA AACTTTGAGT ACGATGATTT GGCTGAGGCC 60TGTTATATTG GGGACATCGT GGTCTTTGGG ACTGTGTTCC TGTCCATATT CTACTCCGTC 120ATCTTTGCCA TTGGCCTGGT GGGAAATTTG TTGGTAGTGT TTGCCCTCAC CAACAGCAAG 180AAGCCCAAGA GTGTCACCGA CATTTACCTC CTGAACCTGG CCTTGTCTGA TCTGCTGTTT 240GTAGCCACTT TGCCCTTCTG GACTCACTAT TTGATAAATG AAAAGGGCCT CCACAATGCC 300ATGTGCAAAT TCACTACCGC CTTCTTCTTC ATCGGCTTTT TTGGAAGCAT ATTCTTCATC 360ACCGTCATCA GCATTGATAG GTACCTGGCC ATCGTCCTGG CCGCCAACTC CATGAACAAC 420CGGACCGTGC AGCATGGCGT CACCATCAGC CTAGGCGTCT GGGCAGCAGC CATTTTGGTG 480GCAGCACCCC AGTTCATGTT CACAAAGCAG AAAGAAAATG AATGCCTTGG TGACTACCCC 540GAGGTCCTCC AGGAAATCTG GCCCGTGCTC CGCAATGTGG AAACAAATTT TCTTGGCTTC 600CTACTCCCCC TGCTCATTAT GAGTTATTGC TACTTCAGAA TCATCCAGAC GCTGTTTTCC 660TGCAAGAACC ACAAGAAAGC CAAAGCCATT AAACTGATCC TTCTGGTGGT CATCGTGTTT 720TTCCTCTTCT GGACACCCTA CAACGTTATG ATTTTCCTGG AGACGCTTAA GCTCTATGAC 780TTCTTTCCCA GTTGTGACAT GAGGAAGGAT CTGAGGCTGG CCCTCAGTGT GACTGAGACG 840GTTGCATTTA GCCATTGTTG CCTGAATCCT CTCATCTATG CATTTGCTGG GGAGAAGTTC 900AGAAGATACC TTTACCACCT GTATGGGAAA TGCCTGGCTG TCCTGTGTGG GCGCTCAGTC 960CACGTTGATT TCTCCTCATC TGAATCACAA AGGAGCAGGC ATGGAAGTGT TCTGAGCAGC 1020AATTTTACTT ACCACACGAG TGATGGAGAT GCATTGCTCC TTCTCTGA 1068(131)SEQ ID NO130的資料(i)序列特征(A)長度355個氨基酸(B)類型氨基酸(C)鏈型
(D)拓撲學不相關(ii)分子類蛋白質(zhì)(xi)SEQ ID NO130的序列描述Met Asp Gln Phe Pro Glu Ser Val Thr Glu Asn Phe Glu Tyr Asp Asp1 5 10 15Leu Ala Glu Ala Cys Tyr Ile Gly Asp Ile Val Val Phe Gly Thr Val20 25 30Phe Leu Ser Ile Phe Tyr Ser Val Ile Phe Ala Ile Gly Leu Val Gly35 40 45Asn Leu Leu Val Val Phe Ala Leu Thr Asn Ser Lys Lys Pro Lys Ser50 55 60Val Thr Asp Ile Tyr Leu Leu Asn Leu Ala Leu Ser Asp Leu Leu Phe65 70 75 80Val Ala Thr Leu Pro Phe Trp Thr His Tyr Leu Ile Asn Glu Lys Gly85 90 95Leu His Asn Ala Met Cys Lys Phe Thr Thr Ala Phe Phe Phe Ile Gly100 105 110Phe Phe Gly Ser Ile Phe Phe Ile Thr Val Ile Ser Ile Asp Arg Tyr115 120 125Leu Ala Ile Val Leu Ala Ala Asn Ser Met Asn Asn Arg Thr Val Gln130 135 140His Gly Val Thr Ile Ser Leu Gly Val Trp Ala Ala Ala Ile Leu Val145 150 155 160Ala Ala Pro Gln Phe Met Phe Thr Lys Gln Lys Glu Asn Glu Cys Leu165 170 175Gly Asp Tyr Pro Glu Val Leu Gln Glu Ile Trp Pro Val Leu Arg Asn180 185 190Val Glu Thr Asn Phe Leu Gly Phe Leu Leu Pro Leu Leu Ile Met Ser195 200 205Tyr Cys Tyr Phe Arg Ile I1e Gln Thr Leu Phe Ser Cys Lys Asn His210 215 220Lys Lys Ala Lys Ala Ile Lys Leu Ile Leu Leu Val Val Ile Val Phe225 230 235 240Phe Leu Phe Trp Thr Pro Tyr Asn Val Met Ile Phe Leu Glu Thr Leu245 250 255Lys Leu Tyr Asp Phe Phe Pro Ser Cys Asp Met Arg Lys Asp Leu Arg260 265 270Leu Ala Leu Ser Val Thr Glu Thr Val Ala Phe Ser His Cys Cys Leu275 280 285Asn Pro Leu Ile Tyr Ala Phe Ala Gly Glu Lys Phe Arg Arg Tyr Leu290 295 300Tyr His Leu Tyr Gly Lys Cys Leu Ala Val Leu Cys Gly Arg Ser Val305 310 315 320His Val Asp Phe Ser Ser Ser Glu Ser Gln Arg Ser Arg His Gly Ser325 330 335Val Leu Ser Ser Asn Phe Thr Tyr His Thr Ser Asp Gly Asp Ala Leu340 345 350Leu Leu Leu355(132)SEQ ID NO131的資料(i)序列特征(A)長度32個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO131的序列描述GATCTCCAGT AGGCATAAGT GGACAATTCT GG 32(133)SEQ ID NO132的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO132的序列描述CTCCTTCGGT CCTCCTATCG TTGTCAGAAG 30(134)SEQ ID NO133的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO133的序列描述AGAAGGCCAA GATCGCGCGG CTGGCCCTCA 30(135)SEQ ID NO134的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO134的序列描述CGGCGCCACC GCACGAAAAA GCTCATCTTC 30(136)SEQ ID NO135的資料(i)序列特征(A)長度33個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO135的序列描述GCCAAGAAGC GGGTGAAGTT CCTGGTGGTG GCA 33(137)SEQ ID NO136的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO136的序列描述CAGGCGGAAG GTGAAAGTCC TGGTCCTCGT 30(138)SEQ ID NO137的資料(i)序列特征(A)長度33個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO137的序列描述CGGCGCCTGC GGGCCAAGCG GCTGGTGGTG GTG 33(139)SEQ ID NO138的資料(i)序列特征(A)長度31個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO138的序列描述CCAAGCACAA AGCCAAGAAA GTGACCATCA C31(140)SEQ ID NO139的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO139的序列描述GCGCCGGCGC ACCAAATGCT TGCTGGTGGT 30(141)SEQ ID NO140的資料(i)序列特征
(A)長度41個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO140的序列描述CAAAAAGCTG AAGAAATCTA AGAAGATCAT CTTTATTGTC G 41(142)SEQ ID NO141的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO141的序列描述CAAGACCAAG GCAAAACGCA TGATCGCCAT 30(143)SEQ ID NO142的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO142的序列描述GTCAAGGAGA AGTCCAAAAG GATCATCATC 30(144)SEQ ID NO143的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO143的序列描述CGCCGCGTGC GGGCCAAGCA GCTCCTGCTC 30(145)SEQ ID NO144的資料(i)序列特征(A)長度33個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO144的序列描述CCTGATAAGC GCTATAAAAT GGTCCTGTTT CGA 33(146)SEQ ID NO145的資料(i)序列特征(A)長度36個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO145的序列描述GAAAGACAAA AGAGAGTCAA GAGGATGTCT TTATTG 36(147)SEQ ID NO146的資料(i)序列特征(A)長度33個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO146的序列描述CGGAGAAAGA GGGTGAAACG CACAGCCATC GCC 33(148)SEQ ID NO147的資料(i)序列特征(A)長度30個堿基對(B)類型核酸
(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO147的序列描述AAGCTTCAGC GGGCCAAGGC ACTGGTCACC 30(149)SEQ ID NO148的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO148的序列描述CAGCGGCAGA AGGCAAAAAG GGTGGCCATC 30(150)SEQ ID NO149的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO149的序列描述CGGCAGAAGG CGAAGCGCAT GATCCTCGCG 30(151)SEQ ID NO150的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO150的序列描述GAGCGCAACA AGGCCAAAAA GGTGATCATC 30(152)SEQ ID NO151的資料(i)序列特征(A)長度39個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO151的序列描述GGTGTAAACA AAAAGGCTAA AAACACAATT ATTCTTATT39(153) SEQ ID NO152的資料(i)序列特征(A)長度27個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO152的序列描述GAGAGCCAGC TCAAGAGCAC CGTGGTG 27(154)SEQ ID NO153的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO153的序列描述CCACAAGCAA ACCAAGAAAA TGCTGGCTGT 30(155)SEQ ID NO154的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO154的序列描述CATCAAGTGT ATCATGTGCC AAGTACGCCC 30(156)SEQ ID NO155的資料(i)序列特征(A)長度34個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO155的序列描述CTAGAGAGTC AGATGAAGTG TACAGTAGTG GCAC 34(157)SEQ ID NO156的資料(i)序列特征(A)長度36個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO156的序列描述CTAGAGAGTC AGATGAAGTG TACAGTAGTG GCAC 34(158)SEQ ID NO157的資料(i)序列特征(A)長度33個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO157的序列描述GCTGAGGTTC GCAATAAACT AACCATGTTT GTG 33(159)SEQ ID NO158的資料(i)序列特征
(A)長度29個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO158的序列描述GGGAGGCCGA GCTGAAAGCC ACCCTGCTC 29(160)SEQ ID NO159的資料(i)序列特征(A)長度31個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO159的序列描述CAAGATCAAG AGAGCCAAAA CCTTCATCAT G 31(161)SEQ ID NO160的資料(i)序列特征(A)長度31個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO160的序列描述CCGGAGACAA GTGAAGAAGA TGCTGTTTGT C 31(162)SEQ ID NO161的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO161的序列描述GCAAGGACCA GATCAAGCGG CTGGTGCTCA 30(163)SEQ ID NO162的資料(i)序列特征(A)長度34個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO162的序列描述CAAGAAAGCC AAAGCCAAGA AACTGATCCT TCTG34(164)SEQ ID NO163的資料(i)序列特征(A)長度1068個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO163的序列描述ATGGAAGATT TGGAGGAAAC ATTATTTGAA GAATTTGAAA ACTATTCCTA TGACCTAGAC 60TATTACTCTC TGGAGTCTGA TTTGGAGGAG AAAGTCCAGC TGGGAGTTGT TCACTGGGTC 120TCCCTGGTGT TATATTGTTT GGCTTTTGTT CTGGGAATTC CAGGAAATGC CATCGTCATT 180TGGTTCACGG GGCTCAAGTG GAAGAAGACA GTCACCACTC TGTGGTTCCT CAATCTAGCC 240ATTGCGGATT TCATTTTTCT TCTCTTTCTG CCCCTGTACA TCTCCTATGT GGCCATGAAT 300TTCCACTGGC CCTTTGGCAT CTGGCTGTGC AAAGCCAATT CCTTCACTGC CCAGTTGAAC 360ATGTTTGCCA GTGTTTTTTT CCTGACAGTG ATCAGCCTGG ACCACTATAT CCACTTGATC 420CATCCTGTCT TATCTCATCG GCATCGAACC CTCAAGAACT CTCTGATTGT CATTATATTC 480ATCTGGCTTT TGGCTTCTCT AATTGGCGGT CCTGCCCTGT ACTTCCGGGA CACTGTGGAG 540TTCAATAATC ATACTCTTTG CTATAACAAT TTTCAGAAGC ATGATCCTGA CCTCACTTTG 600ATCAGGCACC ATGTTCTGAC TTGGGTGAAA TTTATCATTG GCTATCTCTT CCCTTTGCTA 660ACAATGAGTA TTTGCTACTT GTGTCTCATC TTCAAGGTGA AGAAGCGAAC AGTCCTGATC 720TCCAGTAGGC ATAAGTGGAC AATTCTGGTT GTGGTTGTGG CCTTTGTGGT TTGCTGGACT 780CCTTATCACC TGTTTAGCAT TTGGGAGCTC ACCATTCACC ACAATAGCTA TTCCCACCAT 840GTGATGCAGG CTGGAATCCC CCTCTCCACT GGTTTGGCAT TCCTCAATAG TTGCTTGAAC 900CCCATCCTTT ATGTCCTAAT TAGTAAGAAG TTCCAAGCTC GCTTCCGGTC CTCAGTTGCT 960GAGATACTCA AGTACACACT GTGGGAAGTC AGCTGTTCTG GCACAGTGAG TGAACAGCTC 1020AGGAACTCAG AAACCAAGAA TCTGTGTCTC CTGGAAACAG CTCAATAA 1068(165)SEQ ID NO164的資料(i)序列特征
(A)長度355個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO164的序列描述Met Glu Asp Leu Glu Glu Thr Leu Phe Glu Glu Phe Glu Asn Tyr Ser1 5 10 15Tyr Asp Leu Asp Tyr Tyr Ser Leu Glu Ser Asp Leu Glu Glu Lys Val20 25 30Gln Leu Gly Val Val His Trp Val Ser Leu Val Leu Tyr Cys Leu Ala35 40 45Phe Val Leu Gly Ile Pro Gly Asn Ala Ile Val Ile Trp Phe Thr Gly50 55 60Leu Lys Trp Lys Lys Thr Val Thr Thr Leu Trp Phe Leu Asn Leu Ala65 70 75 80Ile Ala Asp Phe Ile Phe Leu Leu Phe Leu Pro Leu Tyr Ile Ser Tyr85 90 95Val Ala Met Asn Phe His Trp Pro Phe Gly Ile Trp Leu Cys Lys Ala100 105 110Asn Ser Phe Thr Ala Gln Leu Asn Met Phe Ala Ser Val Phe Phe Leu115 120 125Thr Val Ile Ser Leu Asp His Tyr Ile His Leu Ile His Pro Val Leu130 135 140Ser His Arg His Arg Thr Leu Lys Asn Ser Leu Ile Val Ile Ile Phe145 150 155 160Ile Trp Leu Leu Ala Ser Leu Ile Gly Gly Pro Ala Leu Tyr Phe Arg165 170 175Asp Thr Val Glu Phe Asn Asn His Thr Leu Cys Tyr Asn Asn Phe Gln180 185 190Lys His Asp Pro Asp Leu Thr Leu Ile Arg His His Val Leu Thr Trp195 200 205Val Lys Phe Ile Ile Gly Tyr Leu Phe Pro Leu Leu Thr Met Ser Ile210 215 220Cys Tyr Leu Cys Leu Ile Phe Lys Val Lys Lys Arg Thr Val Leu Ile225 230 235 240Ser Ser Arg His Lys Trp Thr Ile Leu Val Val Val Val Ala Phe Val245 250 255Val Cys Trp Thr Pro Tyr His Leu Phe Ser Ile Trp Glu Leu Thr Ile260 265 270His His Asn Ser Tyr Ser His His Val Met Gln Ala Gly Ile Pro Leu275 280 285Ser Thr Gly Leu Ala Phe Leu Asn Ser Cys Leu Asn Pro Ile Leu Tyr290 295 300Val Leu Ile Ser Lys Lys Phe Gln Ala Arg Phe Arg Ser Ser Val Ala305 310 315 320Glu Ile Leu Lys Tyr Thr Leu Trp Glu Val Ser Cys Ser Gly Thr Val325 330 335Ser Glu Gln Leu Arg Asn Ser Glu Thr Lys Asn Leu Cys Leu Leu Glu340 345 350Thr Ala Gln355(166)SEQ ID NO165的資料(i)序列特征(A)長度1089個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO165的序列描述ATGGGCAACC ACACGTGGGA GGGCTGCCAC GTGGACTCGC GCGTGGACCA CCTCTTTCCG 60CCATCCCTCT ACATCTTTGT CATCGGCGTG GGGCTGCCCA CCAACTGCCT GGCTCTGTGG 120GCGGCCTACC GCCAGGTGCA ACAGCGCAAC GAGCTGGGCG TCTACCTGAT GAACCTCAGC 180ATCGCCGACC TGCTGTACAT CTGCACGCTG CCGCTGTGGG TGGACTACTT CCTGCACCAC 240GACAACTGGA TCCACGGCCC CGGGTCCTGC AAGCTCTTTG GGTTCATCTT CTACACCAAT 300ATCTACATCA GCATCGCCTT CCTGTGCTGC ATCTCGGTGG ACCGCTACCT GGCTGTGGCC 360CACCCACTCC GCTTCGCCCG CCTGCGCCGC GTCAAGACCG CCGTGGCCGT GAGCTCCGTG 420GTCTGGGCCA CGGAGCTGGG CGCCAACTCG GCGCCCCTGT TCCATGACGA GCTCTTCCGA 480GACCGCTACA ACCACACCTT CTGCTTTGAG AAGTTCCCCA TGGAAGGCTG GGTGGCCTGG 540ATGAACCTCT ATCGGGTGTT CGTGGGCTTC CTCTTCCCGT GGGCGCTCAT GCTGCTGTCG 600TACCGGGGCA TCCTGCGGGC CGTGCGGGGC AGCGTGTCCA CCGAGCGCCA GGAGAAGGCC 660AAGATCGCGC GGCTGGCCCT CAGCCTCATC GCCATCGTGC TGGTCTGCTT TGCGCCCTAT 720CACGTGCTCT TGCTGTCCCG CAGCGCCATC TACCTGGGCC GCCCCTGGGA CTGCGGCTTC 780GAGGAGCGCG TCTTTTCTGC ATACCACAGC TCACTGGCTT TCACCAGCCT CAACTGTGTG 840GCGGACCCCA TCCTCTACTG CCTGGTCAAC GAGGGCGCCC GCAGCGATGT GGCCAAGGCC 900CTGCACAACC TGCTCCGCTT TCTGGCCAGC GACAAGCCCC AGGAGATGGC CAATGCCTCG 960CTCACCCTGG AGACCCCACT CACCTCCAAG AGGAACAGCA CAGCCAAAGC CATGACTGGC 1020AGCTGGGCGG CCACTCCGCC TTCCCAGGGG GACCAGGTGC AGCTGAAGAT GCTGCCGCCA 1080GCACAATGA 1089(167)SEQ ID NO166的資料(i)序列特征(A)長度362個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO166的序列描述Met Gly Asn His Thr Trp Glu Gly Cys His Val Asp Ser Arg Val Asp1 5 10 15His Leu Phe Pro Pro Ser Leu Tyr Ile Phe Val Ile Gly Val Gly Leu20 25 30Pro Thr Asn Cys Leu Ala Leu Trp Ala Ala Tyr Arg Gln Val Gln Gln35 40 45Arg Asn Glu Leu Gly Val Tyr Leu Met Asn Leu Ser Ile Ala Asp Leu50 55 60Leu Tyr Ile Cys Thr Leu Pro Leu Trp Val Asp Tyr Phe Leu His His65 70 75 80Asp Asn Trp Ile His Gly Pro Gly Ser Cys Lys Leu Phe Gly Phe Ile85 90 95Phe Tyr Thr Asn Ile Tyr Ile Ser Ile Ala Phe Leu Cys Cys Ile Ser100 105 110Val Asp Arg Tyr Leu Ala Val Ala His Pro Leu Arg Phe Ala Arg Leu115 120 125Arg Arg Val Lys Thr Ala Val Ala Val Ser Ser Val Val Trp Ala Thr130 135 140Glu Leu Gly Ala Asn Ser Ala Pro Leu Phe His Asp Glu Leu Phe Arg145 150 155 160Asp Arg Tyr Asn His Thr Phe Cys Phe Glu Lys Phe Pro Met Glu Gly165 170 175Trp Val Ala Trp Met Asn Leu Tyr Arg Val Phe Val Gly Phe Leu Phe
180 185 190Pro Trp Ala Leu Met Leu Leu Ser Tyr Arg Gly Ile Leu Arg Ala Val195 200 205Arg Gly Ser Val Ser Thr Glu Arg Gln Glu Lys Ala Lys Ile Ala Arg210 215 220Leu Ala Leu Ser Leu Ile Ala Ile Val Leu Val Cys Phe Ala Pro Tyr225 230 235 240His Val Leu Leu Leu Ser Arg Ser Ala Ile Tyr Leu Gly Arg Pro Trp245 250 255Asp Cys Gly Phe Glu Glu Arg Val Phe Ser Ala Tyr His Ser Ser Leu260 265 270Ala Phe Thr Ser Leu Asn Cys Val Ala Asp Pro Ile Leu Tyr Cys Leu275 280 285Val Asn Glu Gly Ala Arg Ser Asp Val Ala Lys Ala Leu His Asn Leu290 295 300Leu Arg Phe Leu Ala Ser Asp Lys Pro Gln Glu Met Ala Asn Ala Ser305 310 315 320Leu Thr Leu Glu Thr Pro Leu Thr Ser Lys Arg Asn Ser Thr Ala Lys325 330 335Ala Met Thr Gly Ser Trp Ala Ala Thr Pro Pro Ser Gln Gly Asp Gln340 345 350Val Gln Leu Lys Met Leu Pro Pro Ala Gln355 360(168)SEQ ID NO167的資料(i)序列特征(A)長度1002個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO167的序列描述ATGGAGTCCT CAGGCAACCC AGAGAGCACC ACCTTTTTTT ACTATGACCT TCAGAGCCAG 60CCGTGTGAGA ACCAGGCCTG GGTCTTTGCT ACCCTCGCCA CCACTGTCCT GTACTGCCTG 120GTGTTTCTCC TCAGCCTAGT GGGCAACAGC CTGGTCCTGT GGGTCCTGGT GAAGTATGAG 180AGCCTGGAGT CCCTCACCAA CATCTTCATC CTCAACCTGT GCCTCTCAGA CCTGGTGTTC 240GCCTGCTTGT TGCCTGTGTG GATCTCCCCA TACCACTGGG GCTGGGTGCT GGGAGACTTC 300CTCTGCAAAC TCCTCAATAT GATCTTCTCC ATCAGCCTCT ACAGCAGCAT CTTCTTCCTG 360ACCATCATGA CCATCCACCG CTACCTGTCG GTAGTGAGCC CCCTCTCCAC CCTGCGCGTC 420CCCACCCTCC GCTGCCGGGT GCTGGTGACC ATGGCTGTGT GGGTAGCCAG CATCCTGTCC 480TCCATCCTCG ACACCATCTT CCACAAGGTG CTTTCTTCGG GCTGTGATTA TTCCGAACTC 540ACGTGGTACC TCACCTCCGT CTACCAGCAC AACCTCTTCT TCCTGCTGTC CCTGGGGATT 600ATCCTGTTCT GCTACGTGGA GATCCTCAGG ACCCTGTTCC GCTCACGCTC CAAGCGGCGC 660CACCGCACGA AAAAGCTCAT CTTCGCCATC GTGGTGGCCT ACTTCCTCAG CTGGGGTCCC 720TACAACTTCA CCCTGTTTCT GCAGACGCTG TTTCGGACCC AGATCATCCG GAGCTGCGAG 780GCCAAACAGC AGCTAGAATA CGCCCTGCTC ATCTGCCGCA ACCTCGCCTT CTCCCACTGC 840TGCTTTAACC CGGTGCTCTA TGTCTTCGTG GGGGTCAAGT TCCGCACACA CCTGAAACAT 900GTTCTCCGGC AGTTCTGGTT CTGCCGGCTG CAGGCACCCA GCCCAGCCTC GATCCCCCAC 960TCCCCTGGTG CCTTCGCCTA TGAGGGCGCC TCCTTCTACT GA1002(169)SEQ ID NO168的資料(i)序列特征(A)長度333個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO168的序列描述Met Glu Ser Ser Gly Asn Pro Glu Ser Thr Thr Phe Phe Tyr Tyr Asp1 5 10 15Leu Gln Ser Gln Pro Cys Glu Asn Gln Ala Trp Val Phe Ala Thr Leu20 25 30Ala Thr Thr Val Leu Tyr Cys Leu Val Phe Leu Leu Ser Leu Val Gly35 40 45Asn Ser Leu Val Leu Trp Val Leu Val Lys Tyr Glu Ser Leu Glu Ser50 55 60Leu Thr Asn Ile Phe Ile Leu Asn Leu Cys Leu Ser Asp Leu Val Phe65 70 75 80Ala Cys Leu Leu Pro Val Trp Ile Ser Pro Tyr His Trp Gly Trp Val85 90 95Leu Gly Asp Phe Leu Cys Lys Leu Leu Asn Met Ile Phe Ser Ile Ser100 l05 110Leu Tyr Ser Ser Ile Phe Phe Leu Thr Ile Met Thr Ile His Arg Tyr115 120 125Leu Ser Val Val Ser Pro Leu Ser Thr Leu Arg Val Pro Thr Leu Arg
130 135 140Cys Arg Val Leu Val Thr Met Ala Val Trp Val Ala Ser Ile Leu Ser145 150 155 160Ser Ile Leu Asp Thr Ile Phe His Lys Val Leu Ser Ser Gly Cys Asp165 170 175Tyr Ser Glu Leu Thr Trp Tyr Leu Thr Ser Val Tyr Gln His Asn Leu180 185 190Phe Phe Leu Leu Ser Leu Gly Ile Ile Leu Phe Cys Tyr Val Glu Ile195 200 205Leu Arg Thr Leu Phe Arg Ser Arg Ser Lys Arg Arg His Arg Thr Lys210 215 220Lys Leu Ile Phe Ala Ile Val Val Ala Tyr Phe Leu Ser Trp Gly Pro225 230 235 240Tyr Asn Phe Thr Leu Phe Leu Gln Thr Leu Phe Arg Thr Gln Ile Ile245 250 255Arg Ser Cys Glu Ala Lys Gln Gln Leu Glu Tyr Ala Leu Leu Ile Cys260 265 270Arg Asn Leu Ala Phe Ser His Cys Cys Phe Asn Pro Val Leu Tyr Val275 280 285Phe Val Gly Val Lys Phe Arg Thr His Leu Lys His Val Leu Arg Gln290 295 300Phe Trp Phe Cys Arg Leu Gln Ala Pro Ser Pro Ala Ser Ile Pro His305 310 315 320Ser Pro Gly Ala Phe Ala Tyr Glu Gly Ala Ser Phe Tyr325 330(170)SEQ ID NO169的資料(i)序列特征(A)長度987個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO169的序列描述ATGGACAACG CCTCGTTCTC GGAGCCCTGG CCCGCCAACG CATCGGGCCC GGACCCGGCG 60CTGAGCTGCT CCAACGCGTC GACTCTGGCG CCGCTGCCGG CGCCGCTGGC GGTGGCTGTA 120CCAGTTGTCT ACGCGGTGAT CTGCGCCGTG GGTCTGGCGG GCAACTCCGC CGTGCTGTAC 180GTGTTGCTGC GGGCGCCCCG CATGAAGACC GTCACCAACC TGTTCATCCT CAACCTGGCC 240ATCGCCGACG AGCTCTTCAC GCTGGTGCTG CCCATCAACA TCGCCGACTT CCTGCTGCGG 300CAGTGGCCCT TCGGGGAGCT CATGTGCAAG CTCATCGTGG CTATCGACCA GTACAACACC 360TTCTCCAGCC TCTACTTCCT CACCGTCATG AGCGCCGACC GCTACCTGGT GGTGTTGGCC 420ACTGCGGAGT CGCGCCGGGT GGCCGGCCGC ACCTACAGCG CCGCGCGCGC GGTGAGCCTG 480GCCGTGTGGG GGATCGTCAC ACTCGTCGTG CTGCCCTTCG CAGTCTTCGC CCGGCTAGAC 540GACGAGCAGG GCCGGCGCCA GTGCGTGCTA GTCTTTCCGC AGCCCGAGGC CTTCTGGTGG 600CGCGCGAGCC GCCTCTACAC GCTCGTGCTG GGCTTCGCCA TCCCCGTGTC CACCATCTGT 660GTCCTCTATA CCACCCTGCT GTGCCGGCTG CATGCCATGC GGCTGGACAG CCACGCCAAG 720GCCCTGGAGC GCGCCAAGAA GCGGGTGAAG TTCCTGGTGG TGGCAATCCT GGCGGTGTGC 780CTCCTCTGCT GGACGCCCTA CCACCTGAGC ACCGTGGTGG CGCTCACCAC CGACCTCCCG 840CAGACGCCGC TGGTCATCGC TATCTCCTAC TTCATCACCA GCCTGACGTA CGCCAACAGC 900TGCCTCAACC CCTTCCTCTA CGCCTTCCTG GACGCCAGCT TCCGCAGGAA CCTCCGCCAG 960CTGATAACTT GCCGCGCGGC AGCCTGA 987(171)SEQ ID NO170的資料(i)序列特征(A)長度328個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO170的序列描述Met Asp Asn Ala Ser Phe Ser Glu Pro Trp Pro Ala Asn Ala Ser Gly1 5 10 15Pro Asp Pro Ala Leu Ser Cys Ser Asn Ala Ser Thr Leu Ala Pro Leu20 25 30Pro Ala Pro Leu Ala Val Ala Val Pro Val Val Tyr Ala Val Ile Cys35 40 45Ala Val Gly Leu Ala Gly Asn Ser Ala Val Leu Tyr Val Leu Leu Arg50 55 60Ala Pro Arg Met Lys Thr Val Thr Asn Leu Phe Ile Leu Asn Leu Ala65 70 75 80Ile Ala Asp Glu Leu Phe Thr Leu Val Leu Pro Ile Asn Ile Ala Asp85 90 95Phe Leu Leu Arg Gln Trp Pro Phe Gly Glu Leu Met Cys Lys Leu Ile100 105 110Val Ala Ile Asp Gln Tyr Asn Thr Phe Ser Ser Leu Tyr Phe Leu Thr
115 120 125Val Met Ser Ala Asp Arg Tyr Leu Val Val Leu Ala Thr Ala Glu Ser130 135 140Arg Arg Val Ala Gly Arg Thr Tyr Ser Ala Ala Arg Ala Val Ser Leu145 150 155 160Ala Val Trp Gly Ile Val Thr Leu Val Val Leu Pro Phe Ala Val Phe165 170 175Ala Arg Leu Asp Asp Glu Gln Gly Arg Arg Gln Cys Val Leu Val Phe180 185 190Pro Gln Pro Glu Ala Phe Trp Trp Arg Ala Ser Arg Leu Tyr Thr Leu195 200 205Val Leu Gly Phe Ala Ile Pro Val Ser Thr Ile Cys Val Leu Tyr Thr210 215 220Thr Leu Leu Cys Arg Leu His Ala Met Arg Leu Asp Ser His Ala Lys225 230 235 240Ala Leu Glu Arg Ala Lys Lys Arg Val Lys Phe Leu Val Val Ala Ile245 250 255Leu Ala Val Cys Leu Leu Cys Trp Thr Pro Tyr His Leu Ser Thr Val260 265 270Val Ala Leu Thr Thr Asp Leu Pro Gln Thr Pro Leu Val Ile Ala Ile275 280 285Ser Tyr Phe Ile Thr Ser Leu Thr Tyr Ala Asn Ser Cys Leu Ash Pro290 295 300Phe Leu Tyr Ala Phe Leu Asp Ala Ser Phe Arg Arg Asn Leu Arg Gln305 310 315 320Leu Ile Thr Cys Arg Ala Ala Ala325(172)SEQ ID NO171的資料(i)序列特征(A)長度1002個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO171的序列描述ATGCAGGCCG CTGGGCACCC AGAGCCCCTT GACAGCAGGG GCTCCTTCTC CCTCCCCACG 60ATGGGTGCCA ACGTCTCTCA GGACAATGGC ACTGGCCACA ATGCCACCTT CTCCGAGCCA 120CTGCCGTTCC TCTATGTGCT CCTGCCCGCC GTGTACTCCG GGATCTGTGC TGTGGGGCTG 180ACTGGCAACA CGGCCGTCAT CCTTGTAATC CTAAGGGCGC CCAAGATGAA GACGGTGACC 240AACGTGTTCA TCCTGAACCT GGCCGTCGCC GACGGGCTCT TCACGCTGGT ACTGCCTGTC 300AACATCGCGG AGCACCTGCT GCAGTACTGG CCCTTCGGGG AGCTGCTCTG CAAGCTGGTG 360CTGGCCGTCG ACCACTACAA CATCTTCTCC AGCATCTACT TCCTAGCCGT GATGAGCGTG 420GACCGATACC TGGTGGTGCT GGCCACCGTG AGGTCCCGCC ACATGCCCTG GCGCACCTAC 480CGGGGGGCGA AGGTCGCCAG CCTGTGTGTC TGGCTGGGCG TCACGGTCCT GGTTCTGCCC 540TTCTTCTCTT TCGCTGGCGT CTACAGCAAC GAGCTGCAGG TCCCAAGCTG TGGGCTGAGC 600TTCCCGTGGC CCGAGCAGGT CTGGTTCAAG GCCAGCCGTG TCTACACGTT GGTCCTGGGC 660TTCGTGCTGC CCGTGTGCAC CATCTGTGTG CTCTACACAG ACCTCCTGCG CAGGCTGCGG 720GCCGTGCGGC TCCGCTCTGG AGCCAAGGCT CTAGGCAAGG CCAGGCGGAA GGTGAAAGTC 780CTGGTCCTCG TCGTGCTGGC CGTGTGCCTC CTCTGCTGGA CGCCCTTCCA CCTGGCCTCT 840GTCGTGGCCC TGACCACGGA CCTGCCCCAG ACCCCACTGG TCATCAGTAT GTCCTACGTC 900ATCACCAGCC TCACGTACGC CAACTCGTGC CTGAACCCCT TCCTCTACGC CTTTCTAGAT 960GACAACTTCC GGAAGAACTT CCGCAGCATA TTGCGGTGCT GA1002(173)SEQ ID NO172的資料(i)序列特征(A)長度333個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO172的序列描述Met Gln Ala Ala Gly His Pro Glu Pro Leu Asp Ser Arg Gly Ser Phe1 5 10 15Ser Leu Pro Thr Met Gly Ala Asn Val Ser Gln Asp Asn Gly Thr Gly20 25 30His Asn Ala Thr Phe Ser Glu Pro Leu Pro Phe Leu Tyr Val Leu Leu35 40 45Pro Ala Val Tyr Ser Gly Ile Cys Ala Val Gly Leu Thr Gly Asn Thr50 55 60Ala Val Ile Leu Val Ile Leu Arg Ala Pro Lys Met Lys Thr Val Thr65 70 75 80Asn Val Phe Ile Leu Asn Leu Ala Val Ala Asp Gly Leu Phe Thr Leu85 90 95Val Leu Pro Val Asn Ile Ala Glu His Leu Leu Gln Tyr Trp Pro Phe100 105 110Gly Glu Leu Leu Cys Lys Leu Val Leu Ala Val Asp His Tyr Asn Ile115 120 125Phe Ser Ser Ile Tyr Phe Leu Ala Val Met Ser Val Asp Arg Tyr Leu130 135 140Val Val Leu Ala Thr Val Arg Ser Arg His Met Pro Trp Arg Thr Tyr145 150 155 160Arg Gly Ala Lys Val Ala Ser Leu Cys Val Trp Leu Gly Val Thr Val165 170 175Leu Val Leu Pro Phe Phe Ser Phe Ala Gly Val Tyr Ser Asn Glu Leu180 185 190Gln Val Pro Ser Cys Gly Leu Ser Phe Pro Trp Pro Glu Gln Val Trp195 200 205Phe Lys Ala Ser Arg Val Tyr Thr Leu Val Leu Gly Phe Val Leu Pro210 215 220Val Cys Thr Ile Cys Val Leu Tyr Thr Asp Leu Leu Arg Arg Leu Arg225 230 235 240Ala Val Arg Leu Arg Ser Gly Ala Lys Ala Leu Gly Lys Ala Arg Arg245 250 255Lys Val Lys Val Leu Val Leu Val Val Leu Ala Val Cys Leu Leu Cys260 265 270Trp Thr Pro Phe His Leu Ala Ser Val Val Ala Leu Thr Thr Asp Leu275 280 285Pro Gln Thr Pro Leu Val Ile Ser Met Ser Tyr Val Ile Thr Ser Leu290 295 300Thr Tyr Ala Asn Ser Cys Leu Asn Pro Phe Leu Tyr Ala Phe Leu Asp305 310 315 320Asp Asn Phe Arg Lys Asn Phe Arg Ser Ile Leu Arg Cys325 330(174)SEQ ID NO173的資料(i)序列特征(A)長度1107個堿基對(B)類型核酸(C)鏈型單鏈
(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO173的序列描述ATGGTCCTTG AGGTGAGTGA CCACCAAGTG CTAAATGACG CCGAGGTTGC CGCCCTCCTG 60GAGAACTTCA GCTCTTCCTA TGACTATGGA GAAAACGAGA GTGACTCGTG CTGTACCTCC 120CCGCCCTGCC CACAGGACTT CAGCCTGAAC TTCGACCGGG CCTTCCTGCC AGCCCTCTAC 180AGCCTCCTCT TTCTGCTGGG GCTGCTGGGC AACGGCGCGG TGGCAGCCGT GCTGCTGAGC 240CGGCGGACAG CCCTGAGCAG CACCGACACC TTCCTGCTCC ACCTAGCTGT AGCAGACACG 300CTGCTGGTGC TGACACTGCC GCTCTGGGCA GTGGACGCTG CCGTCCAGTG GGTCTTTGGC 360TCTGGCCTCT GCAAAGTGGC AGGTGCCCTC TTCAACATCA ACTTCTACGC AGGAGCCCTC 420CTGCTGGCCT GCATCAGCTT TGACCGCTAC CTGAACATAG TTCATGCCAC CCAGCTCTAC 480CGCCGGGGGC CCCCGGCCCG CGTGACCCTC ACCTGCCTGG CTGTCTGGGG GCTCTGCCTG 540CTTTTCGCCC TCCCAGACTT CATCTTCCTG TCGGCCCACC ACGACGAGCG CCTCAACGCC 600ACCCACTGCC AATACAACTT CCCACAGGTG GGCCGCACGG CTCTGCGGGT GCTGCAGCTG 660GTGGCTGGCT TTCTGCTGCC CCTGCTGGTC ATGGCCTACT GCTATGCCCA CATCCTGGCC 720GTGCTGCTGG TTTCCAGGGG CCAGCGGCGC CTGCGGGCCA AGCGGCTGGT GGTGGTGGTC 780GTGGTGGCCT TTGCCCTCTG CTGGACCCCC TATCACCTGG TGGTGCTGGT GGACATCCTC 840ATGGACCTGG GCGCTTTGGC CCGCAACTGT GGCCGAGAAA GCAGGGTAGA CGTGGCCAAG 900TCGGTCACCT CAGGCCTGGG CTACATGCAC TGCTGCCTCA ACCCGCTGCT CTATGCCTTT 960GTAGGGGTCA AGTTCCGGGA GCGGATGTGG ATGCTGCTCT TGCGCCTGGG CTGCCCCAAC 1020CAGAGAGGGC TCCAGAGGCA GCCATCGTCT TCCCGCCGGG ATTCATCCTG GTCTGAGACC 1080TCAGAGGCCT CCTACTCGGG CTTGTGA 1107(175)SEQ ID NO174的資料(i)序列特征(A)長度368個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO174的序列描述Met Val Leu Glu Val Ser Asp His Gln Val Leu Asn Asp Ala Glu Val1 5 10 15Ala Ala Leu Leu Glu Asn Phe Ser Ser Ser Tyr Asp Tyr Gly Glu Asn20 25 30Glu Ser Asp Ser Cys Cys Thr Ser Pro Pro Cys Pro Gln Asp Phe Ser35 40 45Leu Asn Phe Asp Arg Ala Phe Leu Pro Ala Leu Tyr Ser Leu Leu Phe50 55 60Leu Leu Gly Leu Leu Gly Asn Gly Ala Val Ala Ala Val Leu Leu Ser65 70 75 80Arg Arg Thr Ala Leu Ser Ser Thr Asp Thr Phe Leu Leu His Leu Ala
85 90 95Val Ala Asp Thr Leu Leu Val Leu Thr Leu Pro Leu Trp Ala Val Asp100 105 110Ala Ala Val Gln Trp Val Phe Gly Ser Gly Leu Cys Lys Val Ala Gly115 120 125Ala Leu Phe Asn Ile Asn Phe Tyr Ala Gly Ala Leu Leu Leu Ala Cys130 135 140Ile Ser Phe Asp Arg Tyr Leu Asn Ile Val His Ala Thr Gln Leu Tyr145 150 155 160Arg Arg Gly Pro Pro Ala Arg Val Thr Leu Thr Cys Leu Ala Val Trp165 170 175Gly Leu Cys Leu Leu Phe Ala Leu Pro Asp Phe Ile Phe Leu Ser Ala180 185 190His His Asp Glu Arg Leu Asn Ala Thr His Cys Gln Tyr Asn Phe Pro195 200 205Gln Val Gly Arg Thr Ala Leu Arg Val Leu Gln Leu Val Ala Gly Phe210 215 220Leu Leu Pro Leu Leu Val Met Ala Tyr Cys Tyr Ala His Ile Leu Ala225 230 235 240Val Leu Leu Val Ser Arg Gly Gln Arg Arg Leu Arg Ala Lys Arg Leu245 250 255Val Val Val Val Val Val Ala Phe Ala Leu Cys Trp Thr Pro Tyr His260 265 270Leu Val Val Leu Val Asp Ile Leu Met Asp Leu Gly Ala Leu Ala Arg275 280 285Asn Cys Gly Arg Glu Ser Arg Val Asp Val Ala Lys Ser Val Thr Ser290 295 300Gly Leu Gly Tyr Met His Cys Cys Leu Asn Pro Leu Leu Tyr Ala Phe305 310 315 320Val Gly Val Lys Phe Arg Glu Arg Met Trp Met Leu Leu Leu Arg Leu325 330 335Gly Cys Pro Asn Gln Arg Gly Leu Gln Arg Gln Pro Ser Ser Ser Arg340 345 350Arg Asp Ser Ser Trp Ser Glu Thr Ser Glu Ala Ser Tyr Ser Gly Leu
355 360 365(176)SEQ ID NO175的資料(i)序列特征(A)長度1074個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO175的序列描述ATGGCTGATG ACTATGGCTC TGAATCCACA TCTTCCATGG AAGACTACGT TAACTTCAAC 60TTCACTGACT TCTACTGTGA GAAAAACAAT GTCAGGCAGT TTGCGAGCCA TTTCCTCCCA 120CCCTTGTACT GGCTCGTGTT CATCGTGGGT GCCTTGGGCA ACAGTCTTGT TATCCTTGTC 180TACTGGTACT GCACAAGAGT GAAGACCATG ACCGACATGT TCCTTTTGAA TTTGGCAATT 240GCTGACCTCC TCTTTCTTGT CACTCTTCCC TTCTGGGCCA TTGCTGCTGC TGACCAGTGG 300AAGTTCCAGA CCTTCATGTG CAAGGTGGTC AACAGCATGT ACAAGATGAA CTTCTACAGC 360TGTGTGTTGC TGATCATGTG CATCAGCGTG GACAGGTACA TTGCCATTGC CCAGGCCATG 420AGAGCACATA CTTGGAGGGA GAAAAGGCTT TTGTACAGCA AAATGGTTTG CTTTACCATC 480TGGGTATTGG CAGCTGCTCT CTGCATCCCA GAAATCTTAT ACAGCCAAAT CAAGGAGGAA 540TCCGGCATTG CTATCTGCAC CATGGTTTAC CCTAGCGATG AGAGCACCAA ACTGAAGTCA 600GCTGTCTTGA CCCTGAAGGT CATTCTGGGG TTCTTCCTTC CCTTCGTGGT CATGGCTTGC 660TGCTATACCA TCATCATTCA CACCCTGATA CAAGCCAAGA AGTCTTCCAA GCACAAAGCC 720AAGAAAGTGA CCATCACTGT CCTGACCGTC TTTGTCTTGT CTCAGTTTCC CTACAACTGC 780ATTTTGTTGG TGCAGACCAT TGACGCCTAT GCCATGTTCA TCTCCAACTG TGCCGTTTCC 840ACCAACATTG ACATCTGCTT CCAGGTCACC CAGACCATCG CCTTCTTCCA CAGTTGCCTG 900AACCCTGTTC TCTATGTTTT TGTGGGTGAG AGATTCCGCC GGGATCTCGT GAAAACCCTG 960AAGAACTTGG GTTGCATCAG CCAGGCCCAG TGGGTTTCAT TTACAAGGAG AGAGGGAAGC 1020TTGAAGCTGT CGTCTATGTT GCTGGAGACA ACCTCAGGAG CACTCTCCCT CTGA 1074(177)SEQ ID NO176的資料(i)序列特征(A)長度357個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO176的序列描述Met Ala Asp Asp Tyr Gly Ser Glu Ser Thr Ser Ser Met Glu Asp Tyr1 5 10 15Val Asn Phe Asn Phe Thr Asp Phe Tyr Cys Glu Lys Asn Asn Val Arg20 25 30Gln Phe Ala Ser His Phe Leu Pro Pro Leu Tyr Trp Leu Val Phe Ile35 40 45Val Gly Ala Leu Gly Asn Ser Leu Val Ile Leu Val Tyr Trp Tyr Cys50 55 60Thr Arg Val Lys Thr Met Thr Asp Met Phe Leu Leu Asn Leu Ala Ile65 70 75 80Ala Asp Leu Leu Phe Leu Val Thr Leu Pro Phe Trp Ala Ile Ala Ala85 90 95Ala Asp Gln Trp Lys Phe Gln Thr Phe Met Cys Lys Val Val Asn Ser100 105 110Met Tyr Lys Met Asn Phe Tyr Ser Cys Val Leu Leu Ile Met Cys Ile115 120 125Ser Val Asp Arg Tyr Ile Ala Ile Ala Gln Ala Met Arg Ala His Thr130 135 140Trp Arg Glu Lys Arg Leu Leu Tyr Ser Lys Met Val Cys Phe Thr Ile145 150 155 160Trp Val Leu Ala Ala Ala Leu Cys Ile Pro Glu Ile Leu Tyr Ser Gln165 170 175Ile Lys Glu Glu Ser Gly Ile Ala Ile Cys Thr Met Val Tyr Pro Ser180 185 190Asp Glu Ser Thr Lys Leu Lys Ser Ala Val Leu Thr Leu Lys Val Ile195 200 205Leu Gly Phe Phe Leu Pro Phe Val Val Met Ala Cys Cys Tyr Thr Ile210 215 220Ile Ile His Thr Leu Ile Gln Ala Lys Lys Ser Ser Lys His Lys Ala225 230 235 240Lys Lys Val Thr Ile Thr Val Leu Thr Val Phe Val Leu Ser Gln Phe245 250 255Pro Tyr Asn Cys Ile Leu Leu Val Gln Thr Ile Asp Ala Tyr Ala Met260 265 270Phe Ile Ser Asn Cys Ala Val Ser Thr Asn Ile Asp Ile Cys Phe Gln275 280 285Val Thr Gln Thr Ile Ala Phe Phe His Ser Cys Leu Asn Pro Val Leu290 295 300Tyr Val Phe Val Gly Glu Arg Phe Arg Arg Asp Leu Val Lys Thr Leu305 310 315 320Lys Asn Leu Gly Cys Ile Ser Gln Ala Gln Trp Val Ser Phe Thr Arg325 330 335Arg Glu Gly Ser Leu Lys Leu Ser Ser Met Leu Leu Glu Thr Thr Ser340 345 350Gly Ala Leu Ser Leu355(178)SEQ ID NO177的資料(i)序列特征(A)長度1110個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO177的序列描述ATGGCCTCAT CGACCACTCG GGGCCCCAGG GTTTCTGACT TATTTTCTGG GCTGCCGCCG 60GCGGTCACAA CTCCCGCCAA CCAGAGCGCA GAGGCCTCGG CGGGCAACGG GTCGGTGGCT 120GGCGCGGACG CTCCAGCCGT CACGCCCTTC CAGAGCCTGC AGCTGGTGCA TCAGCTGAAG 180GGGCTGATCG TGCTGCTCTA CAGCGTCGTG GTGGTCGTGG GGCTGGTGGG CAACTGCCTG 240CTGGTGCTGG TGATCGCGCG GGTGCCGCGG CTGCACAACG TGACGAACTT CCTCATCGGC 300AACCTGGCCT TGTCCGACGT GCTCATGTGC ACCGCCTGCG TGCCGCTCAC GCTGGCCTAT 360GCCTTCGAGC CACGCGGCTG GGTGTTCGGC GGCGGCCTGT GCCACCTGGT CTTCTTCCTG 420CAGCCGGTCA CCGTCTATGT GTCGGTGTTC ACGCTCACCA CCATCGCAGT GGACCGCTAC 480GTCGTGCTGG TGCACCCGCT GAGGCGCGCA TCTCGCTGCG CCTCAGCCTA CGCTGTGCTG 540GCCATCTGGG CGCTGTCCGC GGTGCTGGCG CTGCCGCCCG CCGTGCACAC CTATCACGTG 600GAGCTCAAGC CGCACGACGT GCGCCTCTGC GAGGAGTTCT GGGGCTCCCA GGAGCGCCAG 660CGCCAGCTCT ACGCCTGGGG GCTGCTGCTG GTCACCTACC TGCTCCCTCT GCTGGTCATC 720CTCCTGTCTT ACGTCCGGGT GTCAGTGAAG CTCCGCAACC GCGTGGTGCC GGGCTGCGTG 780ACCCAGAGCC AGGCCGACTG GGACCGCGCT CGGCGCCGGC GCACCAAATG CTTGCTGGTG 840GTGGTCGTGG TGGTGTTCGC CGTCTGCTGG CTGCCGCTGC ACGTCTTCAA CCTGCTGCGG 900GACCTCGACC CCCACGCCAT CGACCCTTAC GCCTTTGGGC TGGTGCAGCT GCTCTGCCAC 960TGGCTCGCCA TGAGTTCGGC CTGCTACAAC CCCTTCATCT ACGCCTGGCT GCACGACAGC 1020TTCCGCGAGG AGCTGCGCAA ACTGTTGGTC GCTTGGCCCC GCAAGATAGC CCCCCATGGC 1080CAGAATATGA CCGTCAGCGT GGTCATCTGA 1110(179)SEQ ID NO178的資料(i)序列特征(A)長度369個氨基酸(B)類型氨基酸
(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO178的序列描述Met Ala Ser Ser Thr Thr Arg Gly Pro Arg Val Ser Asp Leu Phe Ser1 5 10 15Gly Leu Pro Pro Ala Val Thr Thr Pro Ala Asn Gln Ser Ala Glu Ala20 25 30Ser Ala Gly Asn Gly Ser Val Ala Gly Ala Asp Ala Pro Ala Val Thr35 40 45Pro Phe Gln Ser Leu Gln Leu Val His Gln Leu Lys Gly Leu Ile Val50 55 60Leu Leu Tyr Ser Val Val Val Val Val Gly Leu Val Gly Asn Cys Leu65 70 75 80Leu Val Leu Val Ile Ala Arg Val Pro Arg Leu His Asn Val Thr Asn85 90 95Phe Leu Ile Gly Asn Leu Ala Leu Ser Asp Val Leu Met Cys Thr Ala100 105 110Cys Val Pro Leu Thr Leu Ala Tyr Ala Phe Glu Pro Arg Gly Trp Val115 120 125Phe Gly Gly Gly Leu Cys His Leu Val Phe Phe Leu Gln Pro Val Thr130 135 140Val Tyr Val Ser Val Phe Thr Leu Thr Thr Ile Ala Val Asp Arg Tyr145 150 155 160Val Val Leu Val His Pro Leu Arg Arg Ala Ser Arg Cys Ala Ser Ala165 170 175Tyr Ala Val Leu Ala Ile Trp Ala Leu Ser Ala Val Leu Ala Leu Pro180 185 190Pro Ala Val His Thr Tyr His Val Glu Leu Lys Pro His Asp Val Arg195 200 205Leu Cys Glu Glu Phe Trp Gly Ser Gln Glu Arg Gln Arg Gln Leu Tyr210 215 220Ala Trp Gly Leu Leu Leu Val Thr Tyr Leu Leu Pro Leu Leu Val Ile225 230 235 240Leu Leu Ser Tyr Val Arg Val Ser Val Lys Leu Arg Asn Arg Val Val
245 250 255Pro Gly Cys Val Thr Gln Ser Gln Ala Asp Trp Asp Arg Ala Arg Arg260 265 270Arg Arg Thr Lys Cys Leu Leu Val Val Val Val Val Val Phe Ala Val275 280 285Cys Trp Leu Pro Leu His Val Phe Asn Leu Leu Arg Asp Leu Asp Pro290 295 300His Ala Ile Asp Pro Tyr Ala Phe Gly Leu Val Gln Leu Leu Cys His305 310 315 320Trp Leu Ala Met Ser Ser Ala Cys Tyr Asn Pro Phe Ile Tyr Ala Trp325 330 335Leu His Asp Ser Phe Arg Glu Glu Leu Arg Lys Leu Leu Val Ala Trp340 345 350Pro Arg Lys Ile Ala Pro His Gly Gln Asn Met Thr Val Ser Val Val355 360 365(180)SEQ ID NO179的資料(i)序列特征(A)長度1083個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO179的序列描述ATGGACCCAG AAGAAACTTC AGTTTATTTG GATTATTACT ATGCTACGAG CCCAAACTCT 60GACATCAGGG AGACCCACTC CCATGTTCCT TACACCTCTG TCTTCCTTCC AGTCTTTTAC 120ACAGCTGTGT TCCTGACTGG AGTGCTGGGG AACCTTGTTC TCATGGGAGC GTTGCATTTC 180AAACCCGGCA GCCGAAGACT GATCGACATC TTTATCATCA ATCTGGCTGC CTCTGACTTC 240ATTTTTCTTG TCACATTGCC TCTCTGGGTG GATAAAGAAG CATCTCTAGG ACTGTGGAGG 300ACGGGCTCCT TCCTGTGCAA AGGGAGCTCC TACATGATCT CCGTCAATAT GCACTGCAGT 360GTCCTCCTGC TCACTTGCAT GAGTGTTGAC CGCTACCTGG CCATTGTGTG GCCAGTCGTA 420TCCAGGAAAT TCAGAAGGAC AGACTGTGCA TATGTAGTCT GTGCCAGCAT CTGGTTTATC 480TCCTGCCTGC TGGGGTTGCC TACTCTTCTG TCCAGGGAGC TCACGCTGAT TGATGATAAG 540CCATACTGTG CAGAGAAAAA GGCAACTCCA ATTAAACTCA TATGGTCCCT GGTGGCCTTA 600ATTTTCACCT TTTTTGTCCC TTTGTTGAGC ATTGTGACCT GCTACTGTTG CATTGCAAGG 660AAGCTGTGTG CCCATTACCA GCAATCAGGA AAGCACAACA AAAAGCTGAA GAAATCTAAG 720AAGATCATCT TTATTGTCGT GGCAGCCTTT CTTGTCTCCT GGCTGCCCTT CAATACTTTC 780AAGTTCCTGG CCATTGTCTC TGGGTTGCGG CAAGAACACT ATTTACCCTC AGCTATTCTT 840CAGCTTGGTA TGGAGGTGAG TGGACCCTTG GCATTTGCCA ACAGCTGTGT CAACCCTTTC 900ATTTACTATA TCTTCGACAG CTACATCCGC CGGGCCATTG TCCACTGCTT GTGCCCTTGC 960CTGAAAAACT ATGACTTTGG GAGTAGCACT GAGACATCAG ATAGTCACCT CACTAAGGCT 1020CTCTCCACCT TCATTCATGC AGAAGATTTT GCCAGGAGGA GGAAGAGGTC TGTGTCACTC 1080TAA 1083(181)SEQ ID NO180的資料(i)序列特征(A)長度360個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO180的序列描述Met Asp Pro Glu Glu Thr Ser Val Tyr Leu Asp Tyr Tyr Tyr Ala Thr1 5 10 15Ser Pro Asn Ser Asp Ile Arg Glu Thr His Ser His Val Pro Tyr Thr20 25 30Ser Val Phe Leu Pro Val Phe Tyr Thr Ala Val Phe Leu Thr Gly Val35 40 45Leu Gly Asn Leu Val Leu Met Gly Ala Leu His Phe Lys Pro Gly Ser50 55 60Arg Arg Leu Ile Asp Ile Phe Ile Ile Asn Leu Ala Ala Ser Asp Phe65 70 75 80Ile Phe Leu Val Thr Leu Pro Leu Trp Val Asp Lys Glu Ala Ser Leu85 90 95Gly Leu Trp Arg Thr Gly Ser Phe Leu Cys Lys Gly Ser Ser Tyr Met100 105 110Ile Ser Val Asn Met His Cys Ser Val Leu Leu Leu Thr Cys Met Ser115 120 125Val Asp Arg Tyr Leu Ala Ile Val Trp Pro Val Val Ser Arg Lys Phe130 135 140Arg Arg Thr Asp Cys Ala Tyr Val Val Cys Ala Ser Ile Trp Phe Ile145 150 155 160Ser Cys Leu Leu Gly Leu Pro Thr Leu Leu Ser Arg Glu Leu Thr Leu165 170 175Ile Asp Asp Lys Pro Tyr Cys Ala Glu Lys Lys Ala Thr Pro Ile Lys180 185 190Leu Ile Trp Ser Leu Val Ala Leu Ile Phe Thr Phe Phe Val Pro Leu195 200 205Leu Ser Ile Val Thr Cys Tyr Cys Cys Ile Ala Arg Lys Leu Cys Ala210 215 220His Tyr Gln Gln Ser Gly Lys His Asn Lys Lys Leu Lys Lys Ser Lys225 230 235 240Lys Ile Ile Phe Ile Val Val Ala Ala Phe Leu Val Ser Trp Leu Pro245 250 255Phe Asn Thr Phe Lys Phe Leu Ala Ile Val Ser Gly Leu Arg Gln Glu260 265 270His Tyr Leu Pro Ser Ala Ile Leu Gln Leu Gly Met Glu Val Ser Gly275 280 285Pro Leu Ala Phe Ala Asn Ser Cys Val Asn Pro Phe Ile Tyr Tyr Ile290 295 300Phe Asp Ser Tyr Ile Arg Arg Ala Ile Val His Cys Leu Cys Pro Cys305 310 315 320Leu Lys Asn Tyr Asp Phe Gly Ser Ser Thr Glu Thr Ser Asp Ser His325 330 335Leu Thr Lys Ala Leu Ser Thr Phe Ile His Ala Glu Asp Phe Ala Arg340 345 350Arg Arg Lys Arg Ser Val Ser Leu355 360(182)SEQ ID NO181的資料(i)序列特征(A)長度1020個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO181的序列描述ATGAATGGCC TTGAAGTGGC TCCCCCAGGT CTGATCACCA ACTTCTCCCT GGCCACGGCA 60GAGCAATGTG GCCAGGAGAC GCCACTGGAG AACATGCTGT TCGCCTCCTT CTACCTTCTG 120GATTTTATCC TGGCTTTAGT TGGCAATACC CTGGCTCTGT GGCTTTTCAT CCGAGACCAC 180AAGTCCGGGA CCCCGGCCAA CGTGTTCCTG ATGCATCTGG CCGTGGCCGA CTTGTCGTGC 240GTGCTGGTCC TGCCCACCCG CCTGGTCTAC CACTTCTCTG GGAACCACTG GCCATTTGGG 300GAAATCGCAT GCCGTCTCAC CGGCTTCCTC TTCTACCTCA ACATGTACGC CAGCATCTAC 360TTCCTCACCT GCATCAGCGC CGACCGTTTC CTGGCCATTG TGCACCCGGT CAAGTCCCTC 420AAGCTCCGCA GGCCCCTCTA CGCACACCTG GCCTGTGCCT TCCTGTGGGT GGTGGTGGCT 480GTGGCCATGG CCCCGCTGCT GGTGAGCCCA CAGACCGTGC AGACCAACCA CACGGTGGTC 540TGCCTGCAGC TGTACCGGGA GAAGGCCTCC CACCATGCCC TGGTGTCCCT GGCAGTGGCC 600TTCACCTTCC CGTTCATCAC CACGGTCACC TGCTACCTGC TGATCATCCG CAGCCTGCGG 660CAGGGCCTGC GTGTGGAGAA GCGCCTCAAG ACCAAGGCAA AACGCATGAT CGCCATAGTG 720CTGGCCATCT TCCTGGTCTG CTTCGTGCCC TACCACGTCA ACCGCTCCGT CTACGTGCTG 780CACTACCGCA GCCATGGGGC CTCCTGCGCC ACCCAGCGCA TCCTGGCCCT GGCAAACCGC 840ATCACCTCCT GCCTCACCAG CCTCAACGGG GCACTCGACC CCATCATGTA TTTCTTCGTG 900GCTGAGAAGT TCCGCCACGC CCTGTGCAAC TTGCTCTGTG GCAAAAGGCT CAAGGGCCCG 960CCCCCCAGCT TCGAAGGGAA AACCAACGAG AGCTCGCTGA GTGCCAAGTC AGAGCTGTGA 1020(183)SEQ ID NO182的資料(i)序列特征(A)長度339個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO182的序列描述Met Asn Gly Leu Glu Val Ala Pro Pro Gly Leu Ile Thr Asn Phe Ser1 5 10 15Leu Ala Thr Ala Glu Gln Cys Gly Gln Glu Thr Pro Leu Glu Asn Met20 25 30Leu Phe Ala Ser Phe Tyr Leu Leu Asp Phe Ile Leu Ala Leu Val Gly35 40 45Asn Thr Leu Ala Leu Trp Leu Phe Ile Arg Asp His Lys Ser Gly Thr50 55 60Pro Ala Asn Val Phe Leu Met His Leu Ala Val Ala Asp Leu Ser Cys65 70 75 80Val Leu Val Leu Pro Thr Arg Leu Val Tyr His Phe Ser Gly Asn His85 90 95Trp Pro Phe Gly Glu Ile Ala Cys Arg Leu Thr Gly Phe Leu Phe Tyr100 105 110Leu Asn Met Tyr Ala Ser Ile Tyr Phe Leu Thr Cys Ile Ser Ala Asp115 120 125Arg Phe Leu Ala Ile Val His Pro Val Lys Ser Leu Lys Leu Arg Arg130 135 140Pro Leu Tyr Ala His Leu Ala Cys Ala Phe Leu Trp Val Val Val Ala145 150 l55 160Val Ala Met Ala Pro Leu Leu Val Ser Pro Gln Thr Val Gln Thr Asn165 170 175His Thr Val Val Cys Leu Gln Leu Tyr Arg Glu Lys Ala Ser His His180 185 190Ala Leu Val Ser Leu Ala Val Ala Phe Thr Phe Pro Phe Ile Thr Thr195 200 205Val Thr Cys Tyr Leu Leu Ile Ile Arg Ser Leu Arg Gln Gly Leu Arg210 215 220Val Glu Lys Arg Leu Lys Thr Lys Ala Lys Arg Met Ile Ala Ile Val225 230 235 240Leu Ala Ile Phe Leu Val Cys Phe Val Pro Tyr His Val Asn Arg Ser245 250 255Val Tyr Val Leu His Tyr Arg Ser His Gly Ala Ser Cys Ala Thr Gln260 265 270Arg Ile Leu Ala Leu Ala Asn Arg Ile Thr Ser Cys Leu Thr Ser Leu275 280 285Asn Gly Ala Leu Asp Pro Ile Met Tyr Phe Phe Val Ala Glu Lys Phe290 295 300Arg His Ala Leu Cys Asn Leu Leu Cys Gly Lys Arg Leu Lys Gly Pro305 310 315 320Pro Pro Ser Phe Glu Gly Lys Thr Asn Glu Ser Ser Leu Ser Ala Lys325 330 335Ser Glu Leu(184)SEQ ID NO183的資料(i)序列特征(A)長度996個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO183的序列描述ATGATCACCC TGAACAATCA AGATCAACCT GTCCCTTTTA ACAGCTCACA TCCAGATGAA 60TACAAAATTG CAGCCCTTGT CTTCTATAGC TGTATCTTCA TAATTGGATT ATTTGTTAAC 120ATCACTGCAT TATGGGTTTT CAGTTGTACC ACCAAGAAGA GAACCACGGT AACCATCTAT 180ATGATGAATG TGGCATTAGT GGACTTGATA TTTATAATGA CTTTACCCTT TCGAATGTTT 240TATTATGCAA AAGATGAATG GCCATTTGGA GAGTACTTCT GCCAGATTCT TGGAGCTCTC 300ACAGTGTTTT ACCCAAGCAT TGCTTTATGG CTTCTTGCCT TTATTAGTGC TGACAGATAC 360ATGGCCATTG TACAGCCGAA GTACGCCAAA GAACTTAAAA ACACGTGCAA AGCCGTGCTG 420GCGTGTGTGG GAGTCTGGAT AATGACCCTG ACCACGACCA CCCCTCTGCT ACTGCTCTAT 480AAAGACCCAG ATAAAGACTC CACTCCCGCC ACCTGCCTCA AGATTTCTGA CATCATCTAT 540CTAAAAGCTG TGAACGTGCT GAACCTCACT CGACTGACAT TTTTTTTCTT GATTCCTTTG 600TTCATCATGA TTGGGTGCTA CTTGGTCATT ATTCATAATC TCCTTCACGG CAGGACGTCT 660AAGCTGAAAC CCAAAGTCAA GGAGAAGTCC AAAAGGATCA TCATCACGCT GCTGGTGCAG 720GTGCTCGTCT GCTTTATGCC CTTCCACATC TGTTTCGCTT TCCTGATGCT GGGAACGGGG 780GAGAATAGTT ACAATCCCTG GGGAGCCTTT ACCACCTTCC TCATGAACCT CAGCACGTGT 840CTGGATGTGA TTCTCTACTA CATCGTTTCA AAACAATTTC AGGCTCGAGT CATTAGTGTC 900ATGCTATACC GTAATTACCT TCGAAGCATG CGCAGAAAAA GTTTCCGATC TGGTAGTCTA 960AGGTCACTAA GCAATATAAA CAGTGAAATG TTATGA 996(185)SEQ ID NO184的資料(i)序列特征(A)長度331個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO184的序列描述Met Ile Thr Leu Asn Asn Gln Asp Gln Pro Val Pro Phe Asn Ser Ser1 5 10 15His Pro Asp Glu Tyr Lys Ile Ala Ala Leu Val Phe Tyr Ser Cys Ile20 25 30Phe Ile Ile Gly Leu Phe Val Asn Ile Thr Ala Leu Trp Val Phe Ser35 40 45Cys Thr Thr Lys Lys Arg Thr Thr Val Thr Ile Tyr Met Met Asn Val50 55 60Ala Leu Val Asp Leu Ile Phe Ile Met Thr Leu Pro Phe Arg Met Phe65 70 75 80Tyr Tyr Ala Lys Asp Glu Trp Pro Phe Gly Glu Tyr Phe Cys Gln Ile85 90 95Leu Gly Ala Leu Thr Val Phe Tyr Pro Ser Ile Ala Leu Trp Leu Leu100 105 110Ala Phe Ile Ser Ala Asp Arg Tyr Met Ala Ile Val Gln Pro Lys Tyr115 120 125Ala Lys Glu Leu Lys Asn Thr Cys Lys Ala Val Leu Ala Cys Val Gly130 135 140Val Trp Ile Met Thr Leu Thr Thr Thr Thr Pro Leu Leu Leu Leu Tyr145 150 155 160Lys Asp Pro Asp Lys Asp Ser Thr Pro Ala Thr Cys Leu Lys Ile Ser165 170 175Asp Ile Ile Tyr Leu Lys Ala Val Asn Val Leu Asn Leu Thr Arg Leu180 185 190Thr Phe Phe Phe Leu Ile Pro Leu Phe Ile Met Ile Gly Cys Tyr Leu195 200 205Val Ile Ile His Asn Leu Leu His Gly Arg Thr Ser Lys Leu Lys Pro210 215 220Lys Val Lys Glu Lys Ser Lys Arg Ile Ile Ile Thr Leu Leu Val Gln225 230 235 240Val Leu Val Cys Phe Met Pro Phe His Ile Cys Phe Ala Phe Leu Met245 250 255Leu Gly Thr Gly Glu Asn Ser Tyr Asn Pro Trp Gly Ala Phe Thr Thr260 265 270Phe Leu Met Asn Leu Ser Thr Cys Leu Asp Val Ile Leu Tyr Tyr Ile275 280 285Val Ser Lys Gln Phe Gln Ala Arg Val Ile Ser Val Met Leu Tyr Arg290 295 300Asn Tyr Leu Arg Ser Met Arg Arg Lys Ser Phe Arg Ser Gly Ser Leu305 310 315 320Arg Ser Leu Ser Asn Ile Asn Ser Glu Met Leu325 330(186)SEQ ID NO185的資料(i)序列特征(A)長度1077個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO185的序列描述ATGCCCTCTG TGTCTCCAGC GGGGCCCTCG GCCGGGGCAG TCCCCAATGC CACCGCAGTG 60ACAACAGTGC GGACCAATGC CAGCGGGCTG GAGGTGCCCC TGTTCCACCT GTTTGCCCGG 120CTGGACGAGG AGCTGCATGG CACCTTCCCA GGCCTGTGCG TGGCGCTGAT GGCGGTGCAC 180GGAGCCATCT TCCTGGCAGG GCTGGTGCTC AACGGGCTGG CGCTGTACGT CTTCTGCTGC 240CGCACCCGGG CCAAGACACC CTCAGTCATC TACACCATCA ACCTGGTGGT GACCGATCTA 300CTGGTAGGGC TGTCCCTGCC CACGCGCTTC GCTGTGTACT ACGGCGCCAG GGGCTGCCTG 360CGCTGTGCCT TCCCGCACGT CCTCGGTTAC TTCCTCAACA TGCACTGCTC CATCCTCTTC 420CTCACCTGCA TCTGCGTGGA CCGCTACCTG GCCATCGTGC GGCCCGAAGG CTCCCGCCGC 480TGCCGCCAGC CTGCCTGTGC CAGGGCCGTG TGCGCCTTCG TGTGGCTGGC CGCCGGTGCC 540GTCACCCTGT CGGTGCTGGG CGTGACAGGC AGCCGGCCCT GCTGCCGTGT CTTTGCGCTG 600ACTGTCCTGG AGTTCCTGCT GCCCCTGCTG GTCATCAGCG TGTTTACCGG CCGCATCATG 660TGTGCACTGT CGCGGCCGGG TCTGCTCCAC CAGGGTCGCC AGCGCCGCGT GCGGGCCAAG 720CAGCTCCTGC TCACGGTGCT CATCATCTTT CTCGTCTGCT TCACGCCCTT CCACGCCCGC 780CAAGTGGCCG TGGCGCTGTG GCCCGACATG CCACACCACA CGAGCCTCGT GGTCTACCAC 840GTGGCCGTGA CCCTCAGCAG CCTCAACAGC TGCATGGACC CCATCGTCTA CTGCTTCGTC 900ACCAGTGGCT TCCAGGCCAC CGTCCGAGGC CTCTTCGGCC AGCACGGAGA GCGTGAGCCC 960AGCAGCGGTG ACGTGGTCAG CATGCACAGG AGCTCCAAGG GCTCAGGCCG TCATCACATC 1020CTCAGTGCCG GCCCTCACGC CCTCACCCAG GCCCTGGCTA ATGGGCCCGA GGCTTAG1077(187)SEQ ID NO186的資料(i)序列特征(A)長度358個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO186的序列描述Met Pro Ser Val Ser Pro Ala Gly Pro Ser Ala Gly Ala Val Pro Asn1 5 10 15Ala Thr Ala Val Thr Thr Val Arg Thr Asn Ala Ser Gly Leu Glu Val20 25 30Pro Leu Phe His Leu Phe Ala Arg Leu Asp Glu Glu Leu His Gly Thr35 40 45Phe Pro Gly Leu Cys Val Ala Leu Met Ala Val His Gly Ala Ile Phe50 55 60Leu Ala Gly Leu Val Leu Asn Gly Leu Ala Leu Tyr Val Phe Cys Cys65 70 75 80Arg Thr Arg Ala Lys Thr Pro Ser Val Ile Tyr Thr Ile Asn Leu Val85 90 95Val Thr Asp Leu Leu Val Gly Leu Ser Leu Pro Thr Arg Phe Ala Val100 105 110Tyr Tyr Gly Ala Arg Gly Cys Leu Arg Cys Ala Phe Pro His Val Leu
115 120 125Gly Tyr Phe Leu Asn Met His Cys Ser Ile Leu Phe Leu Thr Cys Ile130 135 140Cys Val Asp Arg Tyr Leu Ala Ile Val Arg Pro Glu Gly Ser Arg Ala145 150 155 160Cys Arg Gln Pro Ala Cys Ala Arg Ala Val Cys Ala Phe Val Trp Leu165 170 175Ala Ala Gly Ala Val Thr Leu Ser Val Leu Gly Val Thr Gly Ser Arg180 185 190Pro Cys Cys Arg Val Phe Ala Leu Thr Val Leu Glu Phe Leu Leu Pro195 200 205Leu Leu Val Ile Ser Val Phe Thr Gly Arg Ile Met Cys Ala Leu Ser210 215 220Arg Pro Gly Leu Leu His Gln Gly Arg Gln Arg Arg Val Arg Ala Lys225 230 235 240Gln Leu Leu Leu Thr Val Leu Ile Ile Phe Leu Val Cys Phe Thr Pro245 250 255Phe His Ala Arg Gln Val Ala Val Ala Leu Trp Pro Asp Met Pro His260 265 270His Thr Ser Leu Val Val Tyr His Val Ala Val Thr Leu Ser Ser Leu275 280 285Asn Ser Cys Met Asp Pro Ile Val Tyr Cys Phe Val Thr Ser Gly Phe290 295 300Gln Ala Thr Val Arg Gly Leu Phe Gly Gln His Gly Glu Arg Glu Pro305 310 315 320Ser Ser Gly Asp Val Val Ser Met His Arg Ser Ser Lys Gly Ser Gly325 330 335Arg His His Ile Leu Ser Ala Gly Pro His Ala Leu Thr Gln Ala Leu340 345 350Ala Asn Gly Pro Glu Ala355(188)SEQ ID NO187的資料(i)序列特征(A)長度1050個堿基對
(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO187的序列描述ATGAACTCCA CCTTGGATGG TAATCAGAGC AGCCACCCTT TTTGCCTCTT GGCATTTGGC 60TATTTGGAAA CTGTCAATTT TTGCCTTTTG GAAGTATTGA TTATTGTCTT TCTAACTGTA 120TTGATTATTT CTGGCAACAT CATTGTGATT TTTGTATTTC ACTGTGCACC TTTGTTGAAC 180CATCACACTA CAAGTTATTT TATCCAGACT ATGGCATATG CTGACCTTTT TGTTGGGGTG 240AGCTGCGTGG TCCCTTCTTT ATCACTCCTC CATCACCCCC TTCCAGTAGA GGAGTCCTTG 300ACTTGCCAGA TATTTGGTTT TGTAGTATCA GTTCTGAAGA GCGTCTCCAT GGCTTCTCTG 360GCCTGTATCA GCATTGATAG ATACATTGCC ATTACTAAAC CTTTAACCTA TAATACTCTG 420GTTACACCCT GGAGACTACG CCTGTGTATT TTCCTGATTT GGCTATACTC GACCCTGGTC 480TTCCTGCCTT CCTTTTTCCA CTGGGGCAAA CCTGGATATC ATGGAGATGT GTTTCAGTGG 540TGTGCGGAGT CCTGGCACAC CGACTCCTAC TTCACCCTGT TCATCGTGAT GATGTTATAT 600GCCCCAGCAG CCCTTATTGT CTGCTTCACC TATTTCAACA TCTTCCGCAT CTGCCAACAG 660CACACAAAGG ATATCAGCGA AAGGCAAGCC CGCTTCAGCA GCCAGAGTGG GGAGACTGGG 720GAAGTGCAGG CCTGTCCTGA TAAGCGCTAT AAAATGGTCC TGTTTCGAAT CACTAGTGTA 780TTTTACATCC TCTGGTTGCC ATATATCATC TACTTCTTGT TGGAAAGCTC CACTGGCCAC 840AGCAACCGCT TCGCATCCTT CTTGACCACC TGGCTTGCTA TTAGTAACAG TTTCTGCAAC 900TGTGTAATTT ATAGTCTCTC CAACAGTGTA TTCCAAAGAG GACTAAAGCG CCTCTCAGGG 960GCTATGTGTA CTTCTTGTGC AAGTCAGACT ACAGCCAACG ACCCTTACAC AGTTAGAAGC 1020AAAGGCCCTC TTAATGGATG TCATATCTGA 1050(189)SEQ ID NO188的資料(i)序列特征(A)長度349個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO188的序列描述Met Asn Ser Thr Leu Asp Gly Asn Gln Ser Ser Hi s Pro Phe Cys Leu1 5 10 15Leu Ala Phe Gly Tyr Leu Glu Thr Val Asn Phe Cys Leu Leu Glu Val20 25 30Leu Ile Ile Val Phe Leu Thr Val Leu Ile Ile Ser Gly Asn Ile Ile35 40 45Val Ile Phe Val Phe His Cys Ala Pro Leu Leu Asn His His Thr Thr50 55 60Ser Tyr Phe Ile Gln Thr Met Ala Tyr Ala Asp Leu Phe Val Gly Val65 70 75 80Ser Cys Val Val Pro Ser Leu Ser Leu Leu His His Pro Leu Pro Val85 90 95Glu Glu Ser Leu Thr Cys Gln Ile Phe Gly Phe Val Val Ser Val Leu100 105 110Lys Ser Val Ser Met Ala Ser Leu Ala Cys Ile Ser Ile Asp Arg Tyr115 120 125Ile Ala Ile Thr Lys Pro Leu Thr Tyr Asn Thr Leu Val Thr Pro Trp130 135 140Arg Leu Arg Leu Cys Ile Phe Leu Ile Trp Leu Tyr Ser Thr Leu Val145 150 155 160Phe Leu Pro Ser Phe Phe His Trp Gly Lys Pro Gly Tyr His Gly Asp165 170 175Val Phe Gln Trp Cys Ala Glu Ser Trp His Thr Asp Ser Tyr Phe Thr180 185 190Leu Phe Ile Val Met Met Leu Tyr Ala Pro Ala Ala Leu Ile Val Cys195 200 205Phe Thr Tyr Phe Asn Ile Phe Arg Ile Cys Gln Gln His Thr Lys Asp210 215 220Ile Ser Glu Arg Gln Ala Arg Phe Ser Ser Gln Ser Gly Glu Thr Gly225 230 235 240Glu Val Gln Ala Cys Pro Asp Lys Arg Tyr Lys Met Val Leu Phe Arg245 250 255Ile Thr Ser Val Phe Tyr Ile Leu Trp Leu Pro Tyr Ile Ile Tyr Phe260 265 270Leu Leu Glu Ser Ser Thr Gly His Ser Asn Arg Phe Ala Ser Phe Leu275 280 285Thr Thr Trp Leu Ala Ile Ser Asn Ser Phe Cys Asn Cys Val Ile Tyr290 295 300Ser Leu Ser Asn Ser Val Phe Gln Arg Gly Leu Lys Arg Leu Ser Gly305 310 315 320Ala Met Cys Thr Ser Cys Ala Ser Gln Thr Thr Ala Asn Asp Pro Tyr325 330 335Thr Val Arg Ser Lys Gly Pro Leu Asn Gly Cys His Ile
340 345(190)SEQ ID NO189的資料(i)序列特征(A)長度1302個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO189的序列描述ATGTGTTTTT CTCCCATTCT GGAAATCAAC ATGCAGTCTG AATCTAACAT TACAGTGCGA 60GATGACATTG ATGACATCAA CACCAATATG TACCAACCAC TATCATATCC GTTAAGCTTT 120CAAGTGTCTC TCACCGGATT TCTTATGTTA GAAATTGTGT TGGGACTTGG CAGCAACCTC 180ACTGTATTGG TACTTTACTG CATGAAATCC AACTTAATCA ACTCTGTCAG TAACATTATT 240ACAATGAATC TTCATGTACT TGATGTAATA ATTTGTGTGG GATGTATTCC TCTAACTATA 300GTTATCCTTC TGCTTTCACT GGAGAGTAAC ACTGCTCTCA TTTGCTGTTT CCATGAGGCT 360TGTGTATCTT TTGCAAGTGT CTCAACAGCA ATCAACGTTT TTGCTATCAC TTTGGACAGA 420TATGACATCT CTGTAAAACC TGCAAACCGA ATTCTGACAA TGGGCAGAGC TGTAATGTTA 480ATGATATCCA TTTGGATTTT TTCTTTTTTC TCTTTCCTGA TTCCTTTTAT TGAGGTAAAT 540TTTTTCAGTC TTCAAAGTGG AAATACCTGG GAAAACAAGA CACTTTTATG TGTCAGTACA 600AATGAATACT ACACTGAACT GGGAATGTAT TATCACCTGT TAGTACAGAT CCCAATATTC 660TTTTTCACTG TTGTAGTAAT GTTAATCACA TACACCAAAA TACTTCAGGC TCTTAATATT 720CGAATAGGCA CAAGATTTTC AACAGGGCAG AAGAAGAAAG CAAGAAAGAA AAAGACAATT 780TCTCTAACCA CACAACATGA GGCTACAGAC ATGTCACAAA GCAGTGGTGG GAGAAATGTA 840GTCTTTGGTG TAAGAACTTC AGTTTCTGTA ATAATTGCCC TCCGGCGAGC TGTGAAACGA 900CACCGTGAAC GACGAGAAAG ACAAAAGAGA GTCAAGAGGA TGTCTTTATT GATTATTTCT 960ACATTTCTTC TCTGCTGGAC ACCAATTTCT GTTTTAAATA CCACCATTTT ATGTTTAGGC 1020CCAAGTGACC TTTTAGTAAA ATTAAGATTG TGTTTTTTAG TCATGGCTTA TGGAACAACT 1080ATATTTCACC CTCTATTATA TGCATTCACT AGACAAAAAT TTCAAAAGGT CTTGAAAAGT 1140AAAATGAAAA AGCGAGTTGT TTCTATAGTA GAAGCTGATC CCCTGCCTAA TAATGCTGTA 1200ATACACAACT CTTGGATAGA TCCCAAAAGA AACAAAAAAA TTACCTTTGA AGATAGTGAA 1260ATAAGAGAAA AACGTTTAGT GCCTCAGGTT GTCACAGACT AG1302(191)SEQ ID NO190的資料(i)序列特征(A)長度433個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO190的序列描述Met Cys Phe Ser Pro Ile Leu Glu Ile Asn Met Gln Ser Glu Ser Asn1 5 10 15Ile Thr Val Vrg Asp Asp Ile Asp Asp Ile Asn Thr Asn Met Tyr Gln20 25 30Pro Leu Ser Tyr Pro Leu Ser Phe Gln Val Ser Leu Thr Gly Phe Leu35 40 45Met Leu Glu Ile Val Leu Gly Leu Gly Ser Asn Leu Thr Val Leu Val50 55 60Leu Tyr Cys Met Lys Ser Asn Leu Ile Asn Ser Val Ser Asn Ile Ile65 70 75 80Thr Met Asn Leu His Val Leu Asp Val Ile Ile Cys Val Gly Cys Ile85 90 95Pro Leu Thr Ile Val Ile Leu Leu Leu Ser Leu Glu Ser Asn Thr Ala100 l05 110Leu Ile Cys Cys Phe His Glu Ala Cys Val Ser Phe Ala Ser Val Ser115 120 125Thr Ala Ile Asn Val Phe Ala Ile Thr Leu Asp Arg Tyr Asp Ile Ser130 135 140Val Lys Pro Ala Asn Arg Ile Leu Thr Met Gly Arg Ala Val Met Leu145 150 155 160Met Ile Ser Ile Trp Ile Phe Ser Phe Phe Ser Phe Leu Ile Pro Phe165 170 175Ile Glu Val Asn Phe Phe Ser Leu Gln Ser Gly Asn Thr Trp Glu Asn180 185 190Lys Thr Leu Leu Cys Val Ser Thr Asn Glu Tyr Tyr Thr Glu Leu Gly195 200 205Met Tyr Tyr His Leu Leu Val Gln Ile Pro Ile Phe Phe Phe Thr Val210 215 220Val Val Met Leu Ile Thr Tyr Thr Lys Ile Leu Gln Ala Leu Asn Ile225 230 235 240Arg Ile Gly Thr Arg Phe Ser Thr Gly Gln Lys Lys Lys Ala Arg Lys245 250 255Lys Lys Thr Ile Ser Leu Thr Thr Gln His Glu Ala Thr Asp Met Ser260 265 270Gln Ser Ser Gly Gly Arg Asn Val Val Phe Gly Val Arg Thr Ser Val275 280 285Ser Val Ile Ile Ala Leu Arg Arg Ala Val Lys Arg His Arg Glu Arg290 295 300Arg Glu Arg Gln Lys Arg Val Lys Arg Met Ser Leu Leu Ile Ile Ser305 310 315 320Thr Phe Leu Leu Cys Trp Thr Pro Ile Ser Val Leu Asn Thr Thr Ile325 330 335Leu Cys Leu Gly Pro Ser Asp Leu Leu Val Lys Leu Arg Leu Cys Phe340 345 350Leu Val Met Ala Tyr Gly Thr Thr Ile Phe His Pro Leu Leu Tyr Ala355 360 365Phe Thr Arg Gln Lys Phe Gln Lys Val Leu Lys Ser Lys Met Lys Lys370 375 380Arg Val Val Ser Ile Val Glu Ala Asp Pro Leu Pro Asn Asn Ala Val385 390 395 400Ile His Asn Ser Trp Ile Asp Pro Lys Arg Asn Lys Lys Ile Thr Phe405 410 415Glu Asp Ser Glu Ile Arg Glu Lys Arg Leu Val Pro Gln Val Val Thr420 425 430Asp(192)SEQ ID NO191的資料(i)序列特征(A)長度1209個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO191的序列描述ATGTTGTGTC CTTCCAAGAC AGATGGCTCA GGGCACTCTG GTAGGATTCA CCAGGAAACT 60CATGGAGAAG GGAAAAGGGA CAAGATTAGC AACAGTGAAG GGAGGGAGAA TGGTGGGAGA 120GGATTCCAGA TGAACGGTGG GTCGCTGGAG GCTGAGCATG CCAGCAGGAT GTCAGTTCTC 180AGAGCAAAGC CCATGTCAAA CAGCCAACGC TTGCTCCTTC TGTCCCCAGG ATCACCTCCT 240CGCACGGGGA GCATCTCCTA CATCAACATC ATCATGCCTT CGGTGTTCGG CACCATCTGC 300CTCCTGGGCA TCATCGGGAA CTCCACGGTC ATCTTCGCGG TCGTGAAGAA GTCCAAGCTG 360CACTGGTGCA ACAACGTCCC CGACATCTTC ATCATCAACC TCTCGGTAGT AGATCTCCTC 420TTTCTCCTGG GCATGCCCTT CATGATCCAC CAGCTCATGG GCAATGGGGT GTGGCACTTT 480GGGGAGACCA TGTGCACCCT CATCACGGCC ATGGATGCCA ATAGTCAGTT CACCAGCACC 540TACATCCTGA CCGCCATGGC CATTGACCGC TACCTGGCCA CTGTCCACCC CATCTCTTCC 600ACGAAGTTCC GGAAGCCCTC TGTGGCCACC CTGGTGATCT GCCTCCTGTG GGCCCTCTCC 660TTCATCAGCA TCACCCCTGT GTGGCTGTAT GCCAGACTCA TCCCCTTCCC AGGAGGTGCA 720GTGGGCTGCG GCATACGCCT GCCCAACCCA GACACTGACC TCTACTGGTT CACCCTGTAC 780CAGTTTTTCC TGGCCTTTGC CCTGCCTTTT GTGGTCATCA CAGCCGCATA CGTGAGGATC 840CTGCAGCGCA TGACGTCCTC AGTGGCCCCC GCCTCCCAGC GCAGCATCCG GCTGCGGACA 900AAGAGGGTGA AACGCACAGC CATCGCCATC TGTCTGGTCT TCTTTGTGTG CTGGGCACCC 960TACTATGTGC TACAGCTGAC CCAGTTGTCC ATCAGCCGCC CGACCCTCAC CTTTGTCTAC 1020TTATACAATG CGGCCATCAG CTTGGGCTAT GCCAACAGCT GCCTCAACCC CTTTGTGTAC 1080ATCGTGCTCT GTGAGACGTT CCGCAAACGC TTGGTCCTGT CGGTGAAGCC TGCAGCCCAG 1140GGGCAGCTTC GCGCTGTCAG CAACGCTCAG ACGGCTGACG AGGAGAGGAC AGAAAGCAAA 1200GGCACCTGA 1209(193)SEQ ID NO192的資料(i)序列特征(A)長度402個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO192的序列描述Met Leu Cys Pro Ser Lys Thr Asp Gly Ser Gly His Ser Gly Arg Ile1 5 10 15His Gln Glu Thr His Gly Glu Gly Lys Arg Asp Lys Ile Ser Asn Ser20 25 30Glu Gly Arg Glu Asn Gly Gly Arg Gly Phe Gln Met Asn Gly Gly Ser35 40 45Leu Glu Ala Glu His Ala Ser Arg Met Ser Val Leu Arg Ala Lys Pro50 55 60Met Ser Asn Ser Gln Arg Leu Leu Leu Leu Ser Pro Gly Ser Pro Pro65 70 75 80Arg Thr Gly Ser Ile Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe85 90 95Gly Thr Ile Cys Leu Leu Gly Ile Ile Gly Asn Ser Thr Val Ile Phe100 105 110Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp115 120 125Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly130 135 140Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe145 150 155 160Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln165 170 175Phe Thr Ser Thr Tyr lle Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu180 185 190Ala Thr Val His Pro Ile Ser Ser Thr Lys Phe Arg Lys Pro Ser Val195 200 205Ala Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser Ile210 215 220Thr Pro Val Trp Leu Tyr Ala Arg Leu Ile Pro Phe Pro Gly Gly Ala225 230 235 240Val Gly Cys Gly Ile Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp245 250 255Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val260 265 270Ile Thr Ala Ala Tyr Val Arg Ile Leu Gln Arg Met Thr Ser Ser Val275 280 285Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Lys290 295 300Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro305 310 315 320Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu325 330 335Thr Phe Val Tyr Leu Tyr Asn Ala Ala Ile Ser Leu Gly Tyr Ala Asn340 345 350Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg355 360 365Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg370 375 380Ala Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys385 390 395 400Gly Thr(194)SEQ ID NO193的資料(i)序列特征
(A)長度1128個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO193的序列描述ATGGATGTGA CTTCCCAAGC CCGGGGCGTG GGCCTGGAGA TGTACCCAGG CACCGCGCAC 60GCTGCGGCCC CCAACACCAC CTCCCCCGAG CTCAACCTGT CCCACCCGCT CCTGGGCACC 120GCCCTGGCCA ATGGGACAGG TGAGCTCTCG GAGCACCAGC AGTACGTGAT CGGCCTGTTC 180CTCTCGTGCC TCTACACCAT CTTCCTCTTC CCCATCGGCT TTGTGGGCAA CATCCTGATC 240CTGGTGGTGA ACATCAGCTT CCGCGAGAAG ATGACCATCC CCGACCTGTA CTTCATCAAC 300CTGGCGGTGG CGGACCTCAT CCTGGTGGCC GACTCCCTCA TTGAGGTGTT CAACCTGCAC 360GAGCGGTACT ACGACATCGC CGTCCTGTGC ACCTTCATGT CGCTCTTCCT GCAGGTCAAC 420ATGTACAGCA GCGTCTTCTT CCTCACCTGG ATGAGCTTCG ACCGCTACAT CGCCCTGGCC 480AGGGCCATGC GCTGCAGCCT GTTCCGCACC AAGCACCACG CCCGGCTGAG CTGTGGCCTC 540ATCTGGATGG CATCCGTGTC AGCCACGCTG GTGCCCTTCA CCGCCGTGCA CCTGCAGCAC 600ACCGACGAGG CCTGCTTCTG TTTCGCGGAT GTCCGGGAGG TGCAGTGGCT CGAGGTCACG 660CTGGGCTTCA TCGTGCCCTT CGCCATCATC GGCCTGTGCT ACTCCCTCAT TGTCCGGGTG 720CTGGTCAGGG CGCACCGGCA CCGTGGGCTG CGGCCCCGGC GGCAGAAGGC GAAACGCATG 780ATCCTCGCGG TGGTGCTGGT CTTCTTCGTC TGCTGGCTGC CGGAGAACGT CTTCATCAGC 840GTGCACCTCC TGCAGCGGAC GCAGCCTGGG GCCGCTCCCT GCAAGCAGTC TTTCCGCCAT 900GCCCACCCCC TCACGGGCCA CATTGTCAAC CTCGCCGCCT TCTCCAACAG CTGCCTAAAC 960CCCCTCATCT ACAGCTTTCT CGGGGAGACC TTCAGGGACA AGCTGAGGCT GTACATTGAG 1020CAGAAAACAA ATTTGCCGGC CCTGAACCGC TTCTGTCACG CTGCCCTGAA GGCCGTCATT 1080CCAGACAGCA CCGAGCAGTC GGATGTGAGG TTCAGCAGTG CCGTGTGA 1128(195)SEQ ID NO194的資料(i)序列特征(A)長度375個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO194的序列描述Met Asp Val Thr Ser Gln Ala Arg Gly Val Gly Leu Glu Met Tyr Pro1 5 10 15Gly Thr Ala His Ala Ala Ala Pro Asn Thr Thr Ser Pro Glu Leu Asn20 25 30Leu Ser His Pro Leu Leu Gly Thr Ala Leu Ala Asn Gly Thr Gly Glu35 40 45Leu Ser Glu His Gln Gln Tyr Val Ile Gly Leu Phe Leu Ser Cys Leu50 55 60Tyr Thr Ile Phe Leu Phe Pro Ile Gly Phe Val Gly Asn Ile Leu Ile65 70 75 80Leu Val Val Asn Ile Ser Phe Arg Glu Lys Met Thr Ile Pro Asp Leu85 90 95Tyr Phe Ile Asn Leu Ala Val Ala Asp Leu Ile Leu Val Ala Asp Ser100 105 110Leu Ile Glu Val Phe Asn Leu His Glu Arg Tyr Tyr Asp Ile Ala Val115 120 125Leu Cys Thr Phe Met Ser Leu Phe Leu Gln Val Asn Met Tyr Ser Ser130 135 140Val Phe Phe Leu Thr Trp Met Ser Phe Asp Arg Tyr Ile Ala Leu Ala145 150 155 160Arg Ala Met Arg Cys Ser Leu Phe Arg Thr Lys His His Ala Arg Leu165 170 175Ser Cys Gly Leu Ile Trp Met Ala Ser Val Ser Ala Thr Leu Val Pro180 185 190Phe Thr Ala Val His Leu Gln His Thr Asp Glu Ala Cys Phe Cys Phe195 200 205Ala Asp Val Arg Glu Val Gln Trp Leu Glu Val Thr Leu Gly Phe Ile210 215 220Val Pro Phe Ala Ile Ile Gly Leu Cys Tyr Ser Leu Ile Val Arg Val225 230 235 240Leu Val Arg Ala His Arg His Arg Gly Leu Arg Pro Arg Arg Gln Lys245 250 255Ala Lys Arg Met Ile Leu Ala Val Val Leu Val Phe Phe Val Cys Trp260 265 270Leu Pro Glu Asn Val Phe Ile Ser Val His Leu Leu Gln Arg Thr Gln275 280 285Pro Gly Ala Ala Pro Cys Lys Gln Ser Phe Arg His Ala His Pro Leu290 295 300Thr Gly His Ile Val Asn Leu Ala Ala Phe Ser Asn Ser Cys Leu Asn305 310 315 320Pro Leu Ile Tyr Ser Phe Leu Gly Glu Thr Phe Arg Asp Lys Leu Arg325 330 335Leu Tyr Ile Glu Gln Lys Thr Asn Leu Pro Ala Leu Asn Arg Phe Cys340 345 350His Ala Ala Leu Lys Ala Val Ile Pro Asp Ser Thr Glu Gln Ser Asp355 360 365Val Arg Phe Ser Ser Ala Val370 375(196)SEQ ID NO195的資料(i)序列特征(A)長度960個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO195的序列描述ATGCCATTCC CAAACTGCTC AGCCCCCAGC ACTGTGGTGG CCACAGCTGT GGGTGTCTTG 60CTGGGGCTGG AGTGTGGGCT GGGTCTGCTG GGCAACGCGG TGGCGCTGTG GACCTTCCTG 120TTCCGGGTCA GGGTGTGGAA GCCGTACGCT GTCTACCTGC TCAACCTGGC CCTGGCTGAC 180CTGCTGTTGG CTGCGTGCCT GCCTTTCCTG GCCGCCTTCT ACCTGAGCCT CCAGGCTTGG 240CATCTGGGCC GTGTGGGCTG CTGGGCCCTG CGCTTCCTGC TGGACCTCAG CCGCAGCGTG 300GGGATGGCCT TCCTGGCCGC CGTGGCTTTG GACCGGTACC TCCGTGTGGT CCACCCTCGG 360CTTAAGGTCA ACCTGCTGTC TCCTCAGGCG GCCCTGGGGG TCTCGGGCCT CGTCTGGCTC 420CTGATGGTCG CCCTCACCTG CCCGGGCTTG CTCATCTCTG AGGCCGCCCA GAACTCCACC 480AGGTGCCACA GTTTCTACTC CAGGGCAGAC GGCTCCTTCA GCATCATCTG GCAGGAAGCA 540CTCTCCTGCC TTCAGTTTGT CCTCCCCTTT GGCCTCATCG TGTTCTGCAA TGCAGGCATC 600ATCAGGGCTC TCCAGAAAAG ACTCCGGGAG CCTGAGAAAC AGCCCAAGCT TCAGCGGGCC 660AAGGCACTGG TCACCTTGGT GGTGGTGCTG TTTGCTCTGT GCTTTCTGCC CTGCTTCCTG 720GCCAGAGTCC TGATGCACAT CTTCCAGAAT CTGGGGAGCT GCAGGGCCCT TTGTGCAGTG 780GCTCATACCT CGGATGTCAC GGGCAGCCTC ACCTACCTGC ACAGTGTCGT CAACCCCGTG 840GTATACTGCT TCTCCAGCCC CACCTTCAGG AGCTCCTATC GGAGGGTCTT CCACACCCTC 900CGAGGCAAAG GGCAGGCAGC AGAGCCCCCA GATTTCAACC CCAGAGACTC CTATTCCTGA 960(197)SEQ ID NO196的資料(i)序列特征(A)長度319個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO196的序列描述Met Pro Phe Pro Asn Cys Ser Ala Pro Ser Thr Val Val Ala Thr Ala1 5 10 15Val Gly Val Leu Leu Gly Leu Glu Cys Gly Leu Gly Leu Leu Gly Asn20 25 30Ala Val Ala Leu Trp Thr Phe Leu Phe Arg Val Arg Val Trp Lys Pro35 40 45Tyr Ala Val Tyr Leu Leu Asn Leu Ala Leu Ala Asp Leu Leu Leu Ala50 55 60Ala Cys Leu Pro Phe Leu Ala Ala Phe Tyr Leu Ser Leu Gln Ala Trp65 70 75 80His Leu Gly Arg Val Gly Cys Trp Ala Leu Arg Phe Leu Leu Asp Leu85 90 95Ser Arg Ser Val Gly Met Ala Phe Leu Ala Ala Val Ala Leu Asp Arg100 105 110Tyr Leu Arg Val Val His Pro Arg Leu Lys Val Asn Leu Leu Ser Pro115 120 125Gln Ala Ala Leu Gly Val Ser Gly Leu Val Trp Leu Leu Met Val Ala130 135 140Leu Thr Cys Pro Gly Leu Leu Ile Ser Glu Ala Ala Gln Asn Ser Thr145 150 155 160Arg Cys His Ser Phe Tyr Ser Arg Ala Asp Gly Ser Phe Ser Ile Ile165 170 175Trp Gln Glu Ala Leu Ser Cys Leu Gln Phe Val Leu Pro Phe Gly Leu180 185 190Ile Val Phe Cys Asn Ala Gly Ile Ile Arg Ala Leu Gln Lys Arg Leu195 200 205Arg Glu Pro Glu Lys Gln Pro Lys Leu Gln Arg Ala Lys Ala Leu Val210 215 220Thr Leu Val Val Val Leu Phe Ala Leu Cys Phe Leu Pro Cys Phe Leu225 230 235 240Ala Arg Val Leu Met His Ile Phe Gln Asn Leu Gly Ser Cys Arg Ala245 250 255Leu Cys Ala Val Ala His Thr Ser Asp Val Thr Gly Ser Leu Thr Tyr260 265 270Leu His Ser Val Val Asn Pro Val Val Tyr Cys Phe Ser Ser Pro Thr
275 280 285Phe Arg Ser Ser Tyr Arg Arg Val Phe His Thr Leu Arg Gly Lys Gly290 295 300Gln Ala Ala Glu Pro Pro Asp Phe Asn Pro Arg Asp Ser Tyr Ser305 310 315(198)SEQ ID NO197的資料(i)序列特征(A)長度1143個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO197的序列描述ATGGAGGAAG GTGGTGATTT TGACAACTAC TATGGGGCAG ACAACCAGTC TGAGTGTGAG 60TACACAGACT GGAAATCCTC GGGGGCCCTC ATCCCTGCCA TCTACATGTT GGTCTTCCTC 120CTGGGCACCA CGGGAAACGG TCTGGTGCTC TGGACCGTGT TTCGGAGCAG CCGGGAGAAG 180AGGCGCTCAG CTGATATCTT CATTGCTAGC CTGGCGGTGG CTGACCTGAC CTTCGTGGTG 240ACGCTGCCCC TGTGGGCTAC CTACACGTAC CGGGACTATG ACTGGCCCTT TGGGACCTTC 300TTCTGCAAGC TCAGCAGCTA CCTCATCTTC GTCAACATGT ACGCCAGCGT CTTCTGCCTC 360ACCGGCCTCA GCTTCGACCG CTACCTGGCC ATCGTGAGGC CAGTGGCCAA TGCTCGGCTG 420AGGCTGCGGG TCAGCGGGGC CGTGGCCACG GCAGTTCTTT GGGTGCTGGC CGCCCTCCTG 480GCCATGCCTG TCATGGTGTT ACGCACCACC GGGGACTTGG AGAACACCAC TAAGGTGCAG 540TGCTACATGG ACTACTCCAT GGTGGCCACT GTGAGCTCAG AGTGGGCCTG GGAGGTGGGC 600CTTGGGGTCT CGTCCACCAC CGTGGGCTTT GTGGTGCCCT TCACCATCAT GCTGACCTGT 660TACTTCTTCA TCGCCCAAAC CATCGCTGGC CACTTCCGCA AGGAACGCAT CGAGGGCCTG 720CGGAAGCGGC GCCGGCTTAA GAGCATCATC GTGGTGCTGG TGGTGACCTT TGCCCTGTGC 780TGGATGCCCT ACCACCTGGT GAAGACGCTG TACATGCTGG GCAGCCTGCT GCACTGGCCC 840TGTGACTTTG ACCTCTTCCT CATGAACATC TTCCCCTACT GCACCTGCAT CAGCTACGTC 900AACAGCTGCC TCAACCCCTT CCTCTATGCC TTTTTCGACC CCCGCTTCCG CCAGGCCTGC 960ACCTCCATGC TCTGCTGTGG CCAGAGCAGG TGCGCAGGCA CCTCCCACAG CAGCAGTGGG 1020GAGAAGTCAG CCAGCTACTC TTCGGGGCAC AGCCAGGGGC CCGGCCCCAA CATGGGCAAG 1080GGTGGAGAAC AGATGCACGA GAAATCCATC CCCTACAGCC AGGAGACCCT TGTGGTTGAC 1140TAG 1143(199)SEQ ID NO198的資料(i)序列特征(A)長度380個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO198的序列描述Met Glu Glu Gly Gly Asp Phe Asp Asn Tyr Tyr Gly Ala Asp Asn Gln1 5 10 15Ser Glu Cys Glu Tyr Thr Asp Trp Lys Ser Ser Gly Ala Leu Ile Pro20 25 30Ala Ile Tyr Met Leu Val Phe Leu Leu Gly Thr Thr Gly Asn Gly Leu35 40 45Val Leu Trp Thr Val Phe Arg Ser Ser Arg Glu Lys Arg Arg Ser Ala50 55 60Asp Ile Phe Ile Ala Ser Leu Ala Val Ala Asp Leu Thr Phe Val Val65 70 75 80Thr Leu Pro Leu Trp Ala Thr Tyr Thr Tyr Arg Asp Tyr Asp Trp Pro85 90 95Phe Gly Thr Phe Phe Cys Lys Leu Ser Ser Tyr Leu Ile Phe Val Asn100 105 110Met Tyr Ala Ser Val Phe Cys Leu Thr Gly Leu Ser Phe Asp Arg Tyr115 120 125Leu Ala Ile Val Arg Pro Val Ala Asn Ala Arg Leu Arg Leu Arg Val130 135 140Ser Gly Ala Val Ala Thr Ala Val Leu Trp Val Leu Ala Ala Leu Leu145 150 155 160Ala Met Pro Val Met Val Leu Arg Thr Thr Gly Asp Leu Glu Asn Thr165 170 175Thr Lys Val Gln Cys Tyr Met Asp Tyr Ser Met Val Ala Thr Val Ser180 185 190Ser Glu Trp Ala Trp Glu Val Gly Leu Gly Val Ser Ser Thr Thr Val195 200 205Gly Phe Val Val Pro Phe Thr Ile Met Leu Thr Cys Tyr Phe Phe Ile210 215 220Ala Gln Thr Ile Ala Gly His Phe Arg Lys Glu Arg Ile Glu Gly Leu225 230 235 240Arg Lys Arg Arg Arg Leu Lys Ser Ile Ile Val Val Leu Val Val Thr245 250 255Phe Ala Leu Cys Trp Met Pro Tyr His Leu Val Lys Thr Leu Tyr Met260 265 270Leu Gly Ser Leu Leu His Trp Pro Cys Asp Phe Asp Leu Phe Leu Met275 280 285Asn Ile Phe Pro Tyr Cys Thr Cys Ile Ser Tyr Val Asn Ser Cys Leu290 295 300Asn Pro Phe Leu Tyr Ala Phe Phe Asp Pro Arg Phe Arg Gln Ala Cys305 310 315 320Thr Ser Met Leu Cys Cys Gly Gln Ser Arg Cys Ala Gly Thr Ser His325 330 335Ser Ser Ser Gly Glu Lys Ser Ala Ser Tyr Ser Ser Gly His Ser Gln340 345 350Gly Pro Gly Pro Asn Met Gly Lys Gly Gly Glu Gln Met His Glu Lys355 360 365Ser Ile Pro Tyr Ser Gln Glu Thr Leu Val Val Asp370 375 380(200)SEQ ID NO199的資料(i)序列特征(A)長度1119個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO199的序列描述ATGAACTACC CGCTAACGCT GGAAATGGAC CTCGAGAACC TGGAGGACCT GTTCTGGGAA 60CTGGACAGAT TGGACAACTA TAACGACACC TCCCTGGTGG AAAATCATCT CTGCCCTGCC 120ACAGAGGGTC CCCTCATGGC CTCCTTCAAG GCCGTGTTCG TGCCCGTGGC CTACAGCCTC 180ATCTTCCTCC TGGGCGTGAT CGGCAACGTC CTGGTGCTGG TGATCCTGGA GCGGCACCGG 240CAGACACGCA GTTCCACGGA GACCTTCCTG TTCCACCTGG CCGTGGCCGA CCTCCTGCTG 300GTCTTCATCT TGCCCTTTGC CGTGGCCGAG GGCTCTGTGG GCTGGGTCCT GGGGACCTTC 360CTCTGCAAAA CTGTGATTGC CCTGCACAAA GTCAACTTCT ACTGCAGCAG CCTGCTCCTG 420GCCTGCATCG CCGTGGACCG CTACCTGGCC ATTGTCCACG CCGTCCATGC CTACCGCCAC 480CGCCGCCTCC TCTCCATCCA CATCACCTGT GGGACCATCT GGCTGGTGGG CTTCCTCCTT 540GCCTTGCCAG AGATTCTCTT CGCCAAAGTC AGCCAAGGCC ATCACAACAA CTCCCTGCCA 600CGTTGCACCT TCTCCCAAGA GAACCAAGCA GAAACGCATG CCTGGTTCAC CTCCCGATTC 660CTCTACCATG TGGCGGGATT CCTGCTGCCC ATGCTGGTGA TGGGCTGGTG CTACGTGGGG 720GTAGTGCACA GGTTGCGCCA GGCCCAGCGG CGCCCTCAGC GGCAGAAGGC AAAAAGGGTG 780GCCATCCTGG TGACAAGCAT CTTCTTCCTC TGCTGGTCAC CCTACCACAT CGTCATCTTC 840CTGGACACCC TGGCGAGGCT GAAGGCCGTG GACAATACCT GCAAGCTGAA TGGCTCTCTC 900CCCGTGGCCA TCACCATGTG TGAGTTCCTG GGCCTGGCCC ACTGCTGCCT CAACCCCATG 960CTCTACACTT TCGCCGGCGT GAAGTTCCGC AGTGACCTGT CGCGGCTCCT GACCAAGCTG 1020GGCTGTACCG GCCCTGCCTC CCTGTGCCAG CTCTTCCCTA GCTGGCGCAG GAGCAGTCTC 1080TCTGAGTCAG AGAATGCCAC CTCTCTCACC ACGTTCTAG1119(201)SEQ ID NO200的資料(i)序列特征(A)長度372個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO200的序列描述Met Asn Tyr Pro Leu Thr Leu Glu Met Asp Leu Glu Asn Leu Glu Asp1 5 10 15Leu Phe Trp Glu Leu Asp Arg Leu Asp Asn Tyr Asn Asp Thr Ser Leu20 25 30Val Glu Asn His Leu Cys Pro Ala Thr Glu Gly Pro Leu Met Ala Ser35 40 45Phe Lys Ala Val Phe Val Pro Val Ala Tyr Ser Leu Ile Phe Leu Leu50 55 60Gly Val Ile Gly Asn Val Leu Val Leu Val Ile Leu Glu Arg His Arg65 70 75 80Gln Thr Arg Ser Ser Thr Glu Thr Phe Leu Phe His Leu Ala Val Ala85 90 95Asp Leu Leu Leu Val Phe Ile Leu Pro Phe Ala Val Ala Glu Gly Ser100 105 110Val Gly Trp Val Leu Gly Thr Phe Leu Cys Lys Thr Val Ile Ala Leu115 120 125His Lys Val Asn Phe Tyr Cys Ser Ser Leu Leu Leu Ala Cys Ile Ala130 135 140Val Asp Arg Tyr Leu Ala Ile Val His Ala Val His Ala Tyr Arg His145 150 155 160Arg Arg Leu Leu Ser Ile His Ile Thr Cys Gly Thr Ile Trp Leu Val165 170 175Gly Phe Leu Leu Ala Leu Pro Glu Ile Leu Phe Ala Lys Val Ser Gln180 185 190Gly His His Asn Asn Ser Leu Pro Arg Cys Thr Phe Ser Gln Glu Asn195 200 205Gln Ala Glu Thr His Ala Trp Phe Thr Ser Arg Phe Leu Tyr His Val210 215 220Ala Gly Phe Leu Leu Pro Met Leu Val Met Gly Trp Cys Tyr Val Gly225 230 235 240Val Val His Arg Leu Arg Gln Ala Gln Arg Arg Pro Gln Arg Gln Lys245 250 255Ala Lys Arg Val Ala Ile Leu Val Thr Ser Ile Phe Phe Leu Cys Trp260 265 270Ser Pro Tyr His Ile Val Ile Phe Leu Asp Thr Leu Ala Arg Leu Lys275 280 285Ala Val Asp Asn Thr Cys Lys Leu Asn Gly Ser Leu Pro Val Ala Ile290 295 300Thr Met Cys Glu Phe Leu Gly Leu Ala His Cys Cys Leu Asn Pro Met305 310 315 320Leu Tyr Thr Phe Ala Gly Val Lys Phe Arg Ser Asp Leu Ser Arg Leu325 330 335Leu Thr Lys Leu Gly Cys Thr Gly Pro Ala Ser Leu Cys Gln Leu Phe340 345 350Pro Ser Trp Arg Arg Ser Ser Leu Ser Glu Ser Glu Asn Ala Thr Ser355 360 365Leu Thr Thr Phe370(202)SEQ ID NO201的資料(i)序列特征(A)長度1128個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO201的序列描述ATGGATGTGA CTTCCCAAGC CCGGGGCGTG GGCCTGGAGA TGTACCCAGG CACCGCGCAG 60CCTGCGGCCC CCAACACCAC CTCCCCCGAG CTCAACCTGT CCCACCCGCT CCTGGGCACC 120GCCCTGGCCA ATGGGACAGG TGAGCTCTCG GAGCACCAGC AGTACGTGAT CGGCCTGTTC 180CTCTCGTGCC TCTACACCAT CTTCCTCTTC CCCATCGGCT TTGTGGGCAA CATCCTGATC 240CTGGTGGTGA ACATCAGCTT CCGCGAGAAG ATGACCATCC CCGACCTGTA CTTCATCAAC 300CTGGCGGTGG CGGACCTCAT CCTGGTGGCC GACTCCCTCA TTGAGGTGTT CAACCTGCAC 360GAGCGGTACT ACGACATCGC CGTCCTGTGC ACCTTCATGT CGCTCTTCCT GCAGGTCAAC 420ATGTACAGCA GCGTCTTCTT CCTCACCTGG ATGAGCTTCG ACCGCTACAT CGCCCTGGCC 480AGGGCCATGC GCTGCAGCCT GTTCCGCACC AAGCACCACG CCCGGCTGAG CTGTGGCCTC 540ATCTGGATGG CATCCGTGTC AGCCACGCTG GTGCCCTTCA CCGCCGTGCA CCTGCAGCAC 600ACCGACGAGG CCTGCTTCTG TTTCGCGGAT GTCCGGGAGG TGCAGTGGCT CGAGGTCACG 660CTGGGCTTCA TCGTGCCCTT CGCCATCATC GGCCTGTGCT ACTCCCTCAT TGTCCGGGTG 720CTGGTCAGGG CGCACCGGCA CCGTGGGCTG CGGCCCCGGC GGCAGAAGGC GAAGCGCATG 780ATCCTCGCGG TGGTGCTGGT CTTCTTCGTC TGCTGGCTGC CGGAGAACGT CTTCATCAGC 840GTGCACCTCC TGCAGCGGAC GCAGCCTGGG GCCGCTCCCT GCAAGCAGTC TTTCCGCCAT 900GCCCACCCCC TCACGGGCCA CATTGTCAAC CTCACCGCCT TCTCCAACAG CTGCCTAAAC 960CCCCTCATCT ACAGCTTTCT CGGGGAGACC TTCAGGGACA AGCTGAGGCT GTACATTGAG 1020CAGAAAACAA ATTTGCCGGC CCTGAACCGC TTCTGTCACG CTGCCCTGAA GGCCGTCATT 1080CCAGACAGCA CCGAGCAGTC GGATGTGAGG TTCAGCAGTG CCGTGTAG 1128(203)SEQ ID NO202的資料(i)序列特征(A)長度375個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO202的序列描述Met Asp Val Thr Ser Gln Ala Arg Gly Val Gly Leu Glu Met Tyr Pro1 5 10 15Gly Thr Ala Gln Pro Ala Ala Pro Asn Thr Thr Ser Pro Glu Leu Asn20 25 30Leu Ser His Pro Leu Leu Gly Thr Ala Leu Ala Asn Gly Thr Gly Glu35 40 45Leu Ser Glu His Gln Gln Tyr Val Ile Gly Leu Phe Leu Ser Cys Leu50 55 60Tyr Thr Ile Phe Leu Phe Pro Ile Gly Phe Val Gly Asn Ile Leu Ile65 70 75 80Leu Val Val Asn Ile Ser Phe Arg Glu Lys Met Thr Ile Pro Asp Leu85 90 95Tyr Phe Ile Asn Leu Ala Val Ala Asp Leu Ile Leu Val Ala Asp Ser100 105 110Leu Ile Glu Val Phe Asn Leu His Glu Arg Tyr Tyr Asp Ile Ala Val115 120 125Leu Cys Thr Phe Met Ser Leu Phe Leu Gln Val Ash Met Tyr Ser Ser130 135 140Val Phe Phe Leu Thr Trp Met Ser Phe Asp Arg Tyr Ile Ala Leu Ala145 150 155 160Arg Ala Met Arg Cys Ser Leu Phe Arg Thr Lys His His Ala Arg Leu165 170 175Ser Cys Gly Leu Ile Trp Met Ala Ser Val Ser Ala Thr Leu Val Pro180 185 190Phe Thr Ala Val His Leu Gln His Thr Asp Glu Ala Cys Phe Cys Phe195 200 205Ala Asp Val Arg Glu Val Gln Trp Leu Glu Val Thr Leu Gly Phe Ile210 215 220Val Pro Phe Ala Ile Ile Gly Leu Cys Tyr Ser Leu Ile Val Arg Val225 230 235 240Leu Val Arg Ala His Arg His Arg Gly Leu Arg Pro Arg Arg Gln Lys245 250 255Ala Lys Arg Met Ile Leu Ala Val Val Leu Val Phe Phe Val Cys Trp260 265 270Leu Pro Glu Asn Val Phe Ile Ser Val His Leu Leu Gln Arg Thr Gln275 280 285Pro Gly Ala Ala Pro Cys Lys Gln Ser Phe Arg His Ala His Pro Leu290 295 300Thr Gly His Ile Val Asn Leu Thr Ala Phe Ser Asn Ser Cys Leu Asn305 310 315 320Pro Leu Ile Tyr Ser Phe Leu Gly Glu Thr Phe Arg Asp Lys Leu Arg325 330 335Leu Tyr Ile Glu Gln Lys Thr Asn Leu Pro Ala Leu Asn Arg Phe Cys340 345 350His Ala Ala Leu Lys Ala Val Ile Pro Asp Ser Thr Glu Gln Ser Asp355 360 365Val Arg Phe Ser Ser Ala Val370 375(204)SEQ ID NO203的資料(i)序列特征(A)長度1137個堿基對(B)類型核酸
(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO203的序列描述ATGGACCTGG GGAAACCAAT GAAAAGCGTG CTGGTGGTGG CTCTCCTTGT CATTTTCCAG 60GTATGCCTGT GTCAAGATGA GGTCACGGAC GATTACATCG GAGACAACAC CACAGTGGAC 120TACACTTTGT TCGAGTCTTT GTGCTCCAAG AAGGACGTGC GGAACTTTAA AGCCTGGTTC 180CTCCCTATCA TGTACTCCAT CATTTGTTTC GTGGGCCTAC TGGGCAATGG GCTGGTCGTG 240TTGACCTATA TCTATTTCAA GAGGCTCAAG ACCATGACCG ATACCTACCT GCTCAACCTG 300GCGGTGGCAG ACATCCTCTT CCTCCTGACC CTTCCCTTCT GGGCCTACAG CGCGGCCAAG 360TCCTGGGTCT TCGGTGTCCA CTTTTGCAAG CTCATCTTTG CCATCTACAA GATGAGCTTC 420TTCAGTGGCA TGCTCCTACT TCTTTGCATC AGCATTGACC GCTACGTGGC CATCGTCCAG 480GCTGTCTCAG CTCACCGCCA CCGTGCCCGC GTCCTTCTCA TCAGCAAGCT GTCCTGTGTG 540GGCATCTGGA TACTAGCCAC AGTGCTCTCC ATCCCAGAGC TCCTGTACAG TGACCTCCAG 600AGGAGCAGCA GTGAGCAAGC GATGCGATGC TCTCTCATCA CAGAGCATGT GGAGGCCTTT 660ATCACCATCC AGGTGGCCCA GATGGTGATC GGCTTTCTGG TCCCCCTGCT GGCCATGAGC 720TTCTGTTACC TTGTCATCAT CCGCACCCTG CTCCAGGCAC GCAACTTTGA GCGCAACAAG 780GCCAAAAAGG TGATCATCGC TGTGGTCGTG GTCTTCATAG TCTTCCAGCT GCCCTACAAT 840GGGGTGGTCC TGGCCCAGAC GGTGGCCAAC TTCAACATCA CCAGTAGCAC CTGTGAGCTC 900AGTAAGCAAC TCAACATCGC CTACGACGTC ACCTACAGCC TGGCCTGCGT CCGCTGCTGC 960GTCAACCCTT TCTTGTACGC CTTCATCGGC GTCAAGTTCC GCAACGATCT CTTCAAGCTC 1020TTCAAGGACC TGGGCTGCCT CAGCCAGGAG CAGCTCCGGC AGTGGTCTTC CTGTCGGCAC 1080ATCCGGCGCT CCTCCATGAG TGTGGAGGCC GAGACCACCA CCACCTTCTC CCCATAG1137(205)SEQ ID NO204的資料(i)序列特征(A)長度378個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO204的序列描述Met Asp Leu Gly Lys Pro Met Lys Ser Val Leu Val Val Ala Leu Leu1 5 10 15Val Ile Phe Gln Val Cys Leu Cys Gln Asp Glu Val Thr Asp Asp Tyr20 25 30Ile Gly Asp Asn Thr Thr Val Asp Tyr Thr Leu Phe Glu Ser Leu Cys35 40 45Ser Lys Lys Asp Val Arg Asn Phe Lys Ala Trp Phe Leu Pro Ile Met50 55 60Tyr Ser Ile Ile Cys Phe Val Gly Leu Leu Gly Asn Gly Leu Val Val65 70 75 80Leu Thr Tyr Ile Tyr Phe Lys Arg Leu Lys Thr Met Thr Asp Thr Tyr85 90 95Leu Leu Asn Leu Ala Val Ala Asp Ile Leu Phe Leu Leu Thr Leu Pro100 105 110Phe Trp Ala Tyr Ser Ala Ala Lys Ser Trp Val Phe Gly Val His Phe115 120 125Cys Lys Leu Ile Phe Ala Ile Tyr Lys Met Ser Phe Phe Ser Gly Met130 135 140Leu Leu Leu Leu Cys Ile Ser Ile Asp Arg Tyr Val Ala Ile Val Gln145 150 155 160Ala Val Ser Ala His Arg His Arg Ala Arg Val Leu Leu Ile Ser Lys165 170 175Leu Ser Cys Val Gly Ile Trp Ile Leu Ala Thr Val Leu Ser Ile Pro180 185 190Glu Leu Leu Tyr Ser Asp Leu Gln Arg Ser Ser Ser Glu Gln Ala Met195 200 205Arg Cys Ser Leu Ile Thr Glu His Val Glu Ala Phe Ile Thr Ile Gln210 215 220Val Ala Gln Met Val Ile Gly Phe Leu Val Pro Leu Leu Ala Met Ser225 230 235 240Phe Cys Tyr Leu Val Ile Ile Arg Thr Leu Leu Gln Ala Arg Asn Phe245 250 255Glu Arg Asn Lys Ala Lys Lys Val Ile Ile Ala Val Val Val Val Phe260 265 270Ile Val Phe Gln Leu Pro Tyr Asn Gly Val Val Leu Ala Gln Thr Val275 280 285Ala Asn Phe Asn Ile Thr Ser Ser Thr Cys Glu Leu Ser Lys Gln Leu290 295 300Asn Ile Ala Tyr Asp Val Thr Tyr Ser Leu Ala Cys Val Arg Cys Cys305 310 315 320Val Asn Pro Phe Leu Tyr Ala Phe Ile Gly Val Lys Phe Arg Asn Asp325 330 335Leu Phe Lys Leu Phe Lys Asp Leu Gly Cys Leu Ser Gln Glu Gln Leu340 345 350Arg Gln Trp Ser Ser Cys Arg His Ile Arg Arg Ser Ser Met Ser Val355 360 365Glu Ala Glu Thr Thr Thr Thr Phe Ser Pro370 375(206)SEQ ID NO205的資料(i)序列特征(A)長度1086個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO205的序列描述ATGGATATAC AAATGGCAAA CAATTTTACT CCGCCCTCTG CAACTCCTCA GGGAAATGAC 60TGTGACCTCT ATGCACATCA CAGCACGGCC AGGATAGTAA TGCCTCTGCA TTACAGCCTC 120GTCTTCATCA TTGGGCTCGT GGGAAACTTA CTAGCCTTGG TCGTCATTGT TCAAAACAGG 180AAAAAAATCA ACTCTACCAC CCTCTATTCA ACAAATTTGG TGATTTCTGA TATACTTTTT 240ACCACGGCTT TGCCTACACG AATAGCCTAC TATGCAATGG GCTTTGACTG GAGAATCGGA 300GATGCCTTGT GTAGGATAAC TGCGCTAGTG TTTTACATCA ACACATATGC AGGTGTGAAC 360TTTATGACCT GCCTGAGTAT TGACCGCTTC ATTGCTGTGG TGCACCCTCT ACGCTACAAC 420AAGATAAAAA GGATTGAACA TGCAAAAGGC GTGTGCATAT TTGTCTGGAT TCTAGTATTT 480GCTCAGACAC TCCCACTCCT CATCAACCCT ATGTCAAAGC AGGAGGCTGA AAGGATTACA 540TGCATGGAGT ATCCAAACTT TGAAGAAACT AAATCTCTTC CCTGGATTCT GCTTGGGGCA 600TGTTTCATAG GATATGTACT TCCACTTATA ATCATTCTCA TCTGCTATTC TCAGATCTGC 660TGCAAACTCT TCAGAACTGC CAAACAAAAC CCACTCACTG AGAAATCTGG TGTAAACAAA 720AAGGCTAAAA ACACAATTAT TCTTATTATT GTTGTGTTTG TTCTCTGTTT CACACCTTAC 780CATGTTGCAA TTATTCAACA TATGATTAAG AAGCTTCGTT TCTCTAATTT CCTGGAATGT 840AGCCAAAGAC ATTCGTTCCA GATTTCTCTG CACTTTACAG TATGCCTGAT GAACTTCAAT 900TGCTGCATGG ACCCTTTTAT CTACTTCTTT GCATGTAAAG GGTATAAGAG AAAGGTTATG 960AGGATGCTGA AACGGCAAGT CAGTGTATCG ATTTCTAGTG CTGTGAAGTC AGCCCCTGAA 1020GAAAATTCAC GTGAAATGAC AGAAACGCAG ATGATGATAC ATTCCAAGTC TTCAAATGGA 1080AAGTGA 1086(207)SEQ ID NO206的資料(i)序列特征(A)長度361個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO206的序列描述Met Asp Ile Gln Met Ala Asn Asn Phe Thr Pro Pro Ser Ala Thr Pro1 5 10 15Gln Gly Asn Asp Cys Asp Leu Tyr Ala His His Ser Thr Ala Arg Ile20 25 30Val Met Pro Leu His Tyr Ser Leu Val Phe Ile Ile Gly Leu Val Gly35 40 45Asn Leu Leu Ala Leu Val Val Ile Val Gln Asn Arg Lys Lys Ile Asn50 55 60Ser Thr Thr Leu Tyr Ser Thr Asn Leu Val Ile Ser Asp Ile Leu Phe65 70 75 80Thr Thr Ala Leu Pro Thr Arg Ile Ala Tyr Tyr Ala Met Gly Phe Asp85 90 95Trp Arg Ile Gly Asp Ala Leu Cys Arg Ile Thr Ala Leu Val Phe Tyr100 105 110Ile Asn Thr Tyr Ala Gly Val Asn Phe Met Thr Cys Leu Ser Ile Asp115 120 125Arg Phe Ile Ala Val Val His Pro Leu Arg Tyr Asn Lys Ile Lys Arg130 135 140Ile Glu His Ala Lys Gly Val Cys Ile Phe Val Trp Ile Leu Val Phe145 150 155 160Ala Gln Thr Leu Pro Leu Leu Ile Asn Pro Met Ser Lys Gln Glu Ala165 170 175Glu Arg Ile Thr Cys Met Glu Tyr Pro Asn Phe Glu Glu Thr Lys Ser180 185 190Leu Pro Trp Ile Leu Leu Gly Ala Cys Phe Ile Gly Tyr Val Leu Pro195 200 205Leu Ile Ile Ile Leu Ile Cys Tyr Ser Gln Ile Cys Cys Lys Leu Phe210 215 220Arg Thr Ala Lys Gln Asn Pro Leu Thr Glu Lys Ser Gly Val Asn Lys225 230 235 240Lys Ala Lys Asn Thr Ile Ile Leu Ile Ile Val Val Phe Val Leu Cys245 250 255Phe Thr Pro Tyr His Val Ala Ile Ile Gln His Met Ile Lys Lys Leu260 265 270Arg Phe Ser Asn Phe Leu Glu Cys Ser Gln Arg His Ser Phe Gln Ile
275 280 285Ser Leu His Phe Thr Val Cys Leu Met Asn Phe Asn Cys Cys Met Asp290 295 300Pro Phe Ile Tyr Phe Phe Ala Cys Lys Gly Tyr Lys Arg Lys Val Met305 310 315 320Arg Met Leu Lys Arg Gln Val Ser Val Ser Ile Ser Ser Ala Val Lys325 330 335Ser Ala Pro Glu Glu Asn Ser Arg Glu Met Thr Glu Thr Gln Met Met340 345 350Ile His Ser Lys Ser Ser Asn Gly Lys355 360(208)SEQ ID NO207的資料(i)序列特征(A)長度1446個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO207的序列描述ATGCGGTGGC TGTGGCCCCT GGCTGTCTCT CTTGCTGTGA TTTTGGCTGT GGGGCTAAGC 60AGGGTCTCTG GGGGTGCCCC CCTGCACCTG GGCAGGCACA GAGCCGAGAC CCAGGAGCAG 120CAGAGCCGAT CCAAGAGGGG CACCGAGGAT GAGGAGGCCA AGGGCGTGCA GCAGTATGTG 180CCTGAGGAGT GGGCGGAGTA CCCCCGGCCC ATTCACCCTG CTGGCCTGCA GCCAACCAAG 240CCCTTGGTGG CCACCAGCCC TAACCCCGAC AAGGATGGGG GCACCCCAGA CAGTGGGCAG 300GAACTGAGGG GCAATCTGAC AGGGGCACCA GGGCAGAGGC TACAGATCCA GAACCCCCTG 360TATCCGGTGA CCGAGAGCTC CTACAGTGCC TATGCCATCA TGCTTCTGGC GCTGGTGGTG 420TTTGCGGTGG GCATTGTGGG CAACCTGTCG GTCATGTGCA TCGTGTGGCA CAGCTACTAC 480CTGAAGAGCG CCTGGAACTC CATCCTTGCC AGCCTGGCCC TCTGGGATTT TCTGGTCCTC 540TTTTTCTGCC TCCCTATTGT CATCTTCAAC GAGATCACCA AGCAGAGGCT ACTGGGTGAC 600GTTTCTTGTC GTGCCGTGCC CTTCATGGAG GTCTCCTCTC TGGGAGTCAC GACTTTCAGC 660CTCTGTGCCC TGGGCATTGA CCGCTTCCAC GTGGCCACCA GCACCCTGCC CAAGGTGAGG 720CCCATCGAGC GGTGCCAATC CATCCTGGCC AAGTTGGCTG TCATCTGGGT GGGCTCCATG 780ACGCTGGCTG TGCCTGAGCT CCTGCTGTGG CAGCTGGCAC AGGAGCCTGC CCCCACCATG 840GGCACCCTGG ACTCATGCAT CATGAAACCC TCAGCCAGCC TGCCCGAGTC CCTGTATTCA 900CTGGTGATGA CCTACCAGAA CGCCCGCATG TGGTGGTACT TTGGCTGCTA CTTCTGCCTG 960CCCATCCTCT TCACAGTCAC CTGCCAGCTG GTGACATGGC GGGTGCGAGG CCCTCCAGGG 1020AGGAAGTCAG AGTGCAGGGC CAGCAAGCAC GAGCAGTGTG AGAGCCAGCT CAAGAGCACC 1080GTGGTGGGCC TGACCGTGGT CTACGCCTTC TGCACCCTCC CAGAGAACGT CTGCAACATC 1140GTGGTGGCCT ACCTCTCCAC CGAGCTGACC CGCCAGACCC TGGACCTCCT GGGCCTCATC 1200AACCAGTTCT CCACCTTCTT CAAGGGCGCC ATCACCCCAG TGCTGCTCCT TTGCATCTGC 1260AGGCCGCTGG GCCAGGCCTT CCTGGACTGC TGCTGCTGCT GCTGCTGTGA GGAGTGCGGC 1320GGGGCTTCGG AGGCCTCTGC TGCCAATGGG TCGGACAACA AGCTCAAGAC CGAGGTGTCC 1380TCTTCCATCT ACTTCCACAA GCCCAGGGAG TCACCCCCAC TCCTGCCCCT GGGCACACCT 1440TGCTGA 1446(209)SEQ ID NO208的資料(i)序列特征(A)長度481個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO208的序列描述Met Arg Trp Leu Trp Pro Leu Ala Val Ser Leu Ala Val Ile Leu Ala1 5 10 15Val Gly Leu Ser Arg Val Ser Gly Gly Ala Pro Leu His Leu Gly Arg20 25 30His Arg Ala Glu Thr Gln Glu Gln Gln Ser Arg Ser Lys Arg Gly Thr35 40 45Glu Asp Glu Glu Ala Lys Gly Val Gln Gln Tyr Val Pro Glu Glu Trp50 55 60Ala Glu Tyr Pro Arg Pro Ile His Pro Ala Gly Leu Gln Pro Thr Lys65 70 75 80Pro Leu Val Ala Thr Ser Pro Asn Pro Asp Lys Asp Gly Gly Thr Pro85 90 95Asp Ser Gly Gln Glu Leu Arg Gly Asn Leu Thr Gly Ala Pro Gly Gln100 105 110Arg Leu Gln Ile Gln Asn Pro Leu Tyr Pro Val Thr Glu Ser Ser Tyr115 120 125Ser Ala Tyr Ala Ile Met Leu Leu Ala Leu Val Val Phe Ala Val Gly130 135 140Ile Val Gly Asn Leu Ser Val Met Cys Ile Val Trp His Ser Tyr Tyr145 150 155 160Leu Lys Ser Ala Trp Asn Ser Ile Leu Ala Ser Leu Ala Leu Trp Asp165 170 175Phe Leu Val Leu Phe Phe Cys Leu Pro Ile Val Ile Phe Asn Glu Ile180 185 190Thr Lys Gln Arg Leu Leu Gly Asp Val Ser Cys Arg Ala Val Pro Phe
195 200 205Met Glu Val Ser Ser Leu Gly Val Thr Thr Phe Ser Leu Cys Ala Leu210 215 220Gly Ile Asp Arg Phe His Val Ala Thr Ser Thr Leu Pro Lys Val Arg225 230 235 240Pro Ile Glu Arg Cys Gln Ser Ile Leu Ala Lys Leu Ala Val Ile Trp245 250 255Val Gly Ser Met Thr Leu Ala Val Pro Glu Leu Leu Leu Trp Gln Leu260 265 270Ala Gln Glu Pro Ala Pro Thr Met Gly Thr Leu Asp Ser Cys Ile Met275 280 285Lys Pro Ser Ala Ser Leu Pro Glu Ser Leu Tyr Ser Leu Val Met Thr290 295 300Tyr Gln Asn Ala Arg Met Trp Trp Tyr Phe Gly Cys Tyr Phe Cys Leu305 310 315 320Pro Ile Leu Phe Thr Val Thr Cys Gln Leu Val Thr Trp Arg Val Arg325 330 335Gly Pro Pro Gly Arg Lys Ser Glu Cys Arg Ala Ser Lys His Glu Gln340 345 350Cys Glu Ser Gln Leu Lys Ser Thr Val Val Gly Leu Thr Val Val Tyr355 360 365Ala Phe Cys Thr Leu Pro Glu Asn Val Cys Asn Ile Val Val Ala Tyr370 375 380Leu Ser Thr Glu Leu Thr Arg Gln Thr Leu Asp Leu Leu Gly Leu Ile385 390 395 400Asn Gln Phe Ser Thr Phe Phe Lys Gly Ala Ile Thr Pro Val Leu Leu405 410 415Leu Cys Ile Cys Arg Pro Leu Gly Gln Ala Phe Leu Asp Cys Cys Cys420 425 430Cys Cys Cys Cys Glu Glu Cys Gly Gly Ala Ser Glu Ala Ser Ala Ala435 440 445Asn Gly Ser Asp Asn Lys Leu Lys Thr Glu Val Ser Ser Ser Ile Tyr450 455 460Phe His Lys Pro Arg Glu Ser Pro Pro Leu Leu Pro Leu Gly Thr Pro465 470 475 480Cys(210)SEQ ID NO209的資料(i)序列特征(A)長度1101個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO209的序列描述ATGTGGAACG CGACGCCCAG CGAAGAGCCG GGGTTCAACC TCACACTGGC CGACCTGGAC 60TGGGATGCTT CCCCCGGCAA CGACTCGCTG GGCGACGAGC TGCTGCAGCT CTTCCCCGCG 120CCGCTGCTGG CGGGCGTCAC AGCCACCTGC GTGGCACTCT TCGTGGTGGG TATCGCTGGC 180AACCTGCTCA CCATGCTGGT GGTGTCGCGC TTCCGCGAGC TGCGCACCAC CACCAACCTC 240TACCTGTCCA GCATGGCCTT CTCCGATCTG CTCATCTTCC TCTGCATGCC CCTGGACCTC 300GTTCGCCTCT GGCAGTACCG GCCCTGGAAC TTCGGCGACC TCCTCTGCAA ACTCTTCCAA 360TTCGTCAGTG AGAGCTGCAC CTACGCCACG GTGCTCACCA TCACAGCGCT GAGCGTCGAG 420CGCTACTTCG CCATCTGCTT CCCACTCCGG GCCAAGGTGG TGGTCACCAA GGGGCGGGTG 480AAGCTGGTCA TCTTCGTCAT CTGGGCCGTG GCCTTCTGCA GCGCCGGGCC CATCTTCGTG 540CTAGTCGGGG TGGAGCACGA GAACGGCACC GACCCTTGGG ACACCAACGA GTGCCGCCCC 600ACCGAGTTTG CGGTGCGCTC TGGACTGCTC ACGGTCATGG TGTGGGTGTC CAGCATCTTC 660TTCTTCCTTC CTGTCTTCTG TCTCACGGTC CTCTACAGTC TCATCGGCAG GAAGCTGTGG 720CGGAGGAGGC GCGGCGATGC TGTCGTGGGT GCCTCGCTCA GGGACCAGAA CCACAAGCAA 780ACCAAGAAAA TGCTGGCTGT AGTGGTGTTT GCCTTCATCC TCTGCTGGCT CCCCTTCCAC 840GTAGGGCGAT ATTTATTTTC CAAATCCTTT GAGCCTGGCT CCTTGGAGAT TGCTCAGATC 900AGCCAGTACT GCAACCTCGT GTCCTTTGTC CTCTTCTACC TCAGTGCTGC CATCAACCCC 960ATTCTGTACA ACATCATGTC CAAGAAGTAC CGGGTGGCAG TGTTCAGACT TCTGGGATTC 1020GAACCCTTCT CCCAGAGAAA GCTCTCCACT CTGAAAGATG AAAGTTCTCG GGCCTGGACA 1080GAATCTAGTA TTAATACATG A 1101(211)SEQ ID NO210的資料(i)序列特征(A)長度366個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO210的序列描述Met Trp Asn Ala Thr Pro Ser Glu Glu Pro Gly Phe Asn Leu Thr Leu1 5 10 15Ala Asp Leu Asp Trp Asp Ala Ser Pro Gly Asn Asp Ser Leu Gly Asp
20 25 30Glu Leu Leu Gln Leu Phe Pro Ala Pro Leu Leu Ala Gly Val Thr Ala35 40 45Thr Cys Val Ala Leu Phe Val Val Gly Ile Ala Gly Asn Leu Leu Thr50 55 60Met Leu Val Val Ser Arg Phe Arg Glu Leu Arg Thr Thr Thr Asn Leu65 70 75 80Tyr Leu Ser Ser Met Ala Phe Ser Asp Leu Leu Ile Phe Leu Cys Met85 90 95Pro Leu Asp Leu Val Arg Leu Trp Gln Tyr Arg Pro Trp Asn Phe Gly100 105 110Asp Leu Leu Cys Lys Leu Phe Gln Phe Val Ser Glu Ser Cys Thr Tyr115 120 125Ala Thr Val Leu Thr Ile Thr Ala Leu Ser Val Glu Arg Tyr Phe Ala130 135 140Ile Cys Phe Pro Leu Arg Ala Lys Val Val Val Thr Lys Gly Arg Val145 150 155 160Lys Leu Val Ile Phe Val Ile Trp Ala Val Ala Phe Cys Ser Ala Gly165 170 175Pro Ile Phe Val Leu Val Gly Val Glu His Glu Asn Gly Thr Asp Pro180 185 190Trp Asp Thr Asn Glu Cys Arg Pro Thr Glu Phe Ala Val Arg Ser Gly195 200 205Leu Leu Thr Val Met Val Trp Val Ser Ser Ile Phe Phe Phe Leu Pro210 215 220Val Phe Cys Leu Thr Val Leu Tyr Ser Leu Ile Gly Arg Lys Leu Trp225 230 235 240Arg Arg Arg Arg Gly Asp Ala Val Val Gly Ala Ser Leu Arg Asp Gln245 250 255Asn His Lys Gln Thr Lys Lys Met Leu Ala Val Val Val Phe Ala Phe260 265 270Ile Leu Cys Trp Leu Pro Phe His Val Gly Arg Tyr Leu Phe Ser Lys275 280 285Ser Phe Glu Pro Gly Ser Leu Glu Ile Ala Gln Ile Ser Gln Tyr Cys
290 295 300Asn Leu Val Ser Phe Val Leu Phe Tyr Leu Ser Ala Ala Ile Asn Pro305 310 315 320Ile Leu Tyr Asn Ile Met Ser Lys Lys Tyr Arg Val Ala Val Phe Arg325 330 335Leu Leu Gly Phe Glu Pro Phe Ser Gln Arg Lys Leu Ser Thr Leu Lys340 345 350Asp Glu Ser Ser Arg Ala Trp Thr Glu Ser Ser Ile Asn Thr355 360 365(212)SEQ ID NO211的資料(i)序列特征(A)長度1842個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO211的序列描述ATGCGAGCCC CGGGCGCGCT TCTCGCCCGC ATGTCGCGGC TACTGCTTCT GCTACTGCTC 60AAGGTGTCTG CCTCTTCTGC CCTCGGGGTC GCCCCTGCGT CCAGAAACGA AACTTGTCTG 120GGGGAGAGCT GTGCACCTAC AGTGATCCAG CGCCGCGGCA GGGACGCCTG GGGACCGGGA 180AATTCTGCAA GAGACGTTCT GCGAGCCCGA GCACCCAGGG AGGAGCAGGG GGCAGCGTTT 240CTTGCGGGAC CCTCCTGGGA CCTGCCGGCG GCCCCGGGCC GTGACCCGGC TGCAGGCAGA 300GGGGCGGAGG CGTCGGCAGC CGGACCCCCG GGACCTCCAA CCAGGCCACC TGGCCCCTGG 360AGGTGGAAAG GTGCTCGGGG TCAGGAGCCT TCTGAAACTT TGGGGAGAGG GAACCCCACG 420GCCCTCCAGC TCTTCCTTCA GATCTCAGAG GAGGAAGAGA AGGGTCCCAG AGGCGCTGGC 480ATTTCCGGGC GTAGCCAGGA GCAGAGTGTG AAGACAGTCC CCGGAGCCAG CGATCTTTTT 540TACTGGCCAA GGAGAGCCGG GAAACTCCAG GGTTCCCACC ACAAGCCCCT GTCCAAGACG 600GCCAATGGAC TGGCGGGGCA CGAAGGGTGG ACAATTGCAC TCCCGGGCCG GGCGCTGGCC 660CAGAATGGAT CCTTGGGTGA AGGAATCCAT GAGCCTGGGG GTCCCCGCCG GGGAAACAGC 720ACGAACCGGC GTGTGAGACT GAAGAACCCC TTCTACCCGC TGACCCAGGA GTCCTATGGA 780GCCTACGCGG TCATGTGTCT GTCCGTGGTG ATCTTCGGGA CCGGCATCAT TGGCAACCTG 840GCGGTGATGT GCATCGTGTG CCACAACTAC TACATGCGGA GCATCTCCAA CTCCCTCTTG 900GCCAACCTGG CCTTCTGGGA CTTTCTCATC ATCTTCTTCT GCCTTCCGCT GGTCATCTTC 960CACGAGCTGA CCAAGAAGTG GCTGCTGGAG GACTTCTCCT GCAAGATCGT GCCCTATATA 1020GAGGTCGCCT CTCTGGGAGT CACCACTTTC ACCTTATGTG CTCTGTGCAT AGACCGCTTC 1080CGTGCTGCCA CCAACGTACA GATGTACTAC GAAATGATCG AAAATTGTTC CTCAACAACT 1140GCCAAACTTG CTGTTATATG GGTGGGAGCT CTATTGTTAG CACTTCCAGA AGTTGTTCTC 1200CGCCAGCTGA GCAAGGAGGA TTTGGGGTTT AGTGGCCGAG CTCCGGCAGA AAGGTGCATT 1260ATTAAGATCT CTCCTGATTT ACCAGACACC ATCTATGTTC TAGCCCTCAC CTACGACAGT 1320GCGAGACTGT GGTGGTATTT TGGCTGTTAC TTTTGTTTGC CCACGCTTTT CACCATCACC 1380TGCTCTCTAG TGACTGCGAG GAAAATCCGC AAAGCAGAGA AAGCCTGTAC CCGAGGGAAT 1440AAACGGCAGA TTCAACTAGA GAGTCAGATG AAGTGTACAG TAGTGGCACT GACCATTTTA 1500TATGGATTTT GCATTATTCC TGAAAATATC TGCAACATTG TTACTGCCTA CATGGCTACA 1560GGGGTTTCAC AGCAGACAAT GGACCTCCTT AATATCATCA GCCAGTTCCT TTTGTTCTTT 1620AAGTCCTGTG TCACCCCAGT CCTCCTTTTC TGTCTCTGCA AACCCTTCAG TCGGGCCTTC 1680ATGGAGTGCT GCTGCTGTTG CTGTGAGGAA TGCATTCAGA AGTCTTCAAC GGTGACCAGT 1740GATGACAATG ACAACGAGTA CACCACGGAA CTCGAACTCT CGCCTTTCAG TACCATACGC 1800CGTGAAATGT CCACTTTTGC TTCTGTCGGA ACTCATTGCT GA 1842(213)SEQ ID NO212的資料(i)序列特征(A)長度613個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO212的序列描述Met Arg Ala Pro Gly Ala Leu Leu Ala Arg Met Ser Arg Leu Leu Leul 5 10 15Leu Leu Leu Leu Lys Val Ser Ala Ser Ser Ala Leu Gly Val Ala Pro20 25 30Ala Ser Arg Asn Glu Thr Cys Leu Gly Glu Ser Cys Ala Pro Thr Val35 40 45Ile Gln Arg Arg Gly Arg Asp Ala Trp Gly Pro Gly Asn Ser Ala Arg50 55 60Asp Val Leu Arg Ala Arg Ala Pro Arg Glu Glu Gln Gly Ala Ala Phe65 70 75 80Leu Ala Gly Pro Ser Trp Asp Leu Pro Ala Ala Pro Gly Arg Asp Pro85 90 95Ala Ala Gly Arg Gly Ala Glu Ala Ser Ala Ala Gly Pro Pro Gly Pro100 105 1l0Pro Thr Arg Pro Pro Gly Pro Trp Arg Trp Lys Gly Ala Arg Gly Gln115 120 125Glu Pro Ser Glu Thr Leu Gly Arg Gly Asn Pro Thr Ala Leu Gln Leu130 135 140Phe Leu Gln Ile Ser Glu Glu Glu Glu Lys Gly Pro Arg Gly Ala Gly145 150 155 160Ile Ser Gly Arg Ser Gln Glu Gln Ser Val Lys Thr Val Pro Gly Ala165 170 175Ser Asp Leu Phe Tyr Trp Pro Arg Arg Ala Gly Lys Leu Gln Gly Ser
180 185 190His His Lys Pro Leu Ser Lys Thr Ala Asn Gly Leu Ala Gly His Glu195 200 205Gly Trp Thr Ile Ala Leu Pro Gly Arg Ala Leu Ala Gln Asn Gly Ser210 215 220Leu Gly Glu Gly Ile His Glu Pro Gly Gly Pro Arg Arg Gly Asn Ser225 230 235 240Thr Asn Arg Arg Val Arg Leu Lys Asn Pro Phe Tyr Pro Leu Thr Gln245 250 255Glu Ser Tyr Gly Ala Tyr Ala Val Met Cys Leu Ser Val Val Ile Phe260 265 270Gly Thr Gly Ile Ile Gly Asn Leu Ala Val Met Cys Ile Val Cys His275 280 285Asn Tyr Tyr Met Arg Ser Ile Ser Asn Ser Leu Leu Ala Asn Leu Ala290 295 300Phe Trp Asp Phe Leu Ile Ile Phe Phe Cys Leu Pro Leu Val Ile Phe305 310 315 320His Glu Leu Thr Lys Lys Trp Leu Leu Glu Asp Phe Ser Cys Lys Ile325 330 335Val Pro Tyr Ile Glu Val Ala Ser Leu Gly Val Thr Thr Phe Thr Leu340 345 350Cys Ala Leu Cys Ile Asp Arg Phe Arg Ala Ala Thr Asn Val Gln Met355 360 365Tyr Tyr Glu Met Ile Glu Asn Cys Ser Ser Thr Thr Ala Lys Leu Ala370 375 380Val Ile Trp Val Gly Ala Leu Leu Leu Ala Leu Pro Glu Val Val Leu385 390 395 400Arg Gln Leu Ser Lys Glu Asp Leu Gly Phe Ser Gly Arg Ala Pro Ala405 410 415Glu Arg Cys Ile Ile Lys Ile Ser Pro Asp Leu Pro Asp Thr Ile Tyr420 425 430Val Leu Ala Leu Thr Tyr Asp Ser Ala Arg Leu Trp Trp Tyr Phe Gly435 440 445Cys Tyr Phe Cys Leu Pro Thr Leu Phe Thr Ile Thr Cys Ser Leu Val
450 455 460Thr Ala Arg Lys Ile Arg Lys Ala Glu Lys Ala Cys Thr Arg Gly Asn465 470 475 480Lys Arg Gln Ile Gln Leu Glu Ser Gln Met Lys Cys Thr Val Val Ala485 490 495Leu Thr Ile Leu Tyr Gly Phe Cys Ile Ile Pro Glu Asn Ile Cys Asn500 505 510Ile Val Thr Ala Tyr Met Ala Thr Gly Val Ser Gln Gln Thr Met Asp515 520 525Leu Leu Asn Ile Ile Ser Gln Phe Leu Leu Phe Phe Lys Ser Cys Val530 535 540Thr Pro Val Leu Leu Phe Cys Leu Cys Lys Pro Phe Ser Arg Ala Phe545 550 555 560Met Glu Cys Cys Cys Cys Cys Cys Glu Glu Cys Ile Gln Lys Ser Ser565 570 575Thr Val Thr Ser Asp Asp Asn Asp Asn Glu Tyr Thr Thr Glu Leu Glu580 585 590Leu Ser Pro Phe Ser Thr Ile Arg Arg Glu Met Ser Thr Phe Ala Ser595 600 605Val Gly Thr His Cys610(214)SEQ ID NO213的資料(i)序列特征(A)長度1248個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO213的序列描述ATGGTTTTTG CTCACAGAAT GGATAACAGC AAGCCACATT TGATTATTCC TACACTTCTG 60GTGCCCCTCC AAAACCGCAG CTGCACTGAA ACAGCCACAC CTCTGCCAAG CCAATACCTG 120ATGGAATTAA GTGAGGAGCA CAGTTGGATG AGCAACCAAA CAGACCTTCA CTATGTGCTG 180AAACCCGGGG AAGTGGCCAC AGCCAGCATC TTCTTTGGGA TTCTGTGGTT GTTTTCTATC 240TTCGGCAATT CCCTGGTTTG TTTGGTCATC CATAGGAGTA GGAGGACTCA GTCTACCACC 300AACTACTTTG TGGTCTCCAT GGCATGTGCT GACCTTCTCA TCAGCGTTGC CAGCACGCCT 360TTCGTCCTGC TCCAGTTCAC CACTGGAAGG TGGACGCTGG GTAGTGCAAC GTGCAAGGTT 420GTGCGATATT TTCAATATCT CACTCCAGGT GTCCAGATCT ACGTTCTCCT CTCCATCTGC 480ATAGACCGGT TCTACACCAT CGTCTATCCT CTGAGCTTCA AGGTGTCCAG AGAAAAAGCC 540AAGAAAATGA TTGCGGCATC GTGGATCTTT GATGCAGGCT TTGTGACCCC TGTGCTCTTT 600TTCTATGGCT CCAACTGGGA CAGTCATTGT AACTATTTCC TCCCCTCCTC TTGGGAAGGC 660ACTGCCTACA CTGTCATCCA CTTCTTGGTG GGCTTTGTGA TTCCATCTGT CCTCATAATT 720TTATTTTACC AAAAGGTCAT AAAATATATT TGGAGAATAG GCACAGATGG CCGAACGGTG 780AGGAGGACAA TGAACATTGT CCCTCGGACA AAAGTGAAAA CTAAAAAGAT GTTCCTCATT 840TTAAATCTGT TGTTTTTGCT CTCCTGGCTG CCTTTTCATG TAGCTCAGCT ATGGCACCCC 900CATGAACAAG ACTATAAGAA AAGTTCCCTT GTTTTCACAG CTATCACATG GATATCCTTT 960AGTTCTTCAG CCTCTAAACC TACTCTGTAT TCAATTTATA ATGCCAATTT TCGGAGAGGG 1020ATGAAAGAGA CTTTTTGCAT GTCCTCTATG AAATGTTACC GAAGCAATGC CTATACTATC 1080ACAACAAGTT CAAGGATGGC CAAAAAAAAC TACGTTGGCA TTTCAGAAAT CCCTTCCATG 1140GCCAAAACTA TTACCAAAGA CTCGATCTAT GACTCATTTG ACAGAGAAGC CAAGGAAAAA 1200AAGCTTGCTT GGCCCATTAA CTCAAATCCA CCAAATACTT TTGTCTAA1248(215)SEQ ID NO214的資料(i)序列特征(A)長度415個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO214的序列描述Met Val Phe Ala His Arg Met Asp Asn Ser Lys Pro His Leu Ile Ile1 5 10 15Pro Thr Leu Leu Val Pro Leu Gln Asn Arg Ser Cys Thr Glu Thr Ala20 25 30Thr Pro Leu Pro Ser Gln Tyr Leu Met Glu Leu Ser Glu Glu His Ser35 40 45Trp Met Ser Asn Gln Thr Asp Leu His Tyr Val Leu Lys Pro Gly Glu50 55 60Val Ala Thr Ala Ser Ile Phe Phe Gly Ile Leu Trp Leu Phe Ser Ile65 70 75 80Phe Gly Asn Ser Leu Val Cys Leu Val Ile His Arg Ser Arg Arg Thr85 90 95Gln Ser Thr Thr Ash Tyr Phe Val Val Ser Met Ala Cys Ala Asp Leu100 105 110Leu Ile Ser Val Ala Ser Thr Pro Phe Val Leu Leu Gln Phe Thr Thr115 120 125Gly Arg Trp Thr Leu Gly Ser Ala Thr Cys Lys Val Val Arg Tyr Phe130 135 140Gln Tyr Leu Thr Pro Gly Val Gln Ile Tyr Val Leu Leu Ser Ile Cys145 150 155 160Ile Asp Arg Phe Tyr Thr Ile Val Tyr Pro Leu Ser Phe Lys Val Ser165 170 175Arg Glu Lys Ala Lys Lys Met Ile Ala Ala Ser Trp Ile Phe Asp Ala180 185 190Gly Phe Val Thr Pro Val Leu Phe Phe Tyr Gly Ser Asn Trp Asp Ser195 200 205His Cys Asn Tyr Phe Leu Pro Ser Ser Trp Glu Gly Thr Ala Tyr Thr210 215 220Val Ile His Phe Leu Val Gly Phe Val Ile Pro Ser Val Leu Ile Ile225 230 235 240Leu Phe Tyr Gln Lys Val Ile Lys Tyr Ile Trp Arg Ile Gly Thr Asp245 250 255Gly Arg Thr Val Arg Arg Thr Met Asn Ile Val Pro Arg Thr Lys Val260 265 270Lys Thr Lys Lys Met Phe Leu Ile Leu Asn Leu Leu Phe Leu Leu Ser275 280 285Trp Leu Pro Phe His Val Ala Gln Leu Trp His Pro His Glu Gln Asp290 295 300Tyr Lys Lys Ser Ser Leu Val Phe Thr Ala Ile Thr Trp Ile Ser Phe305 310 315 320Ser Ser Ser Ala Ser Lys Pro Thr Leu Tyr Ser Ile Tyr Asn Ala Asn325 330 335Phe Arg Arg Gly Met Lys Glu Thr Phe Cys Met Ser Ser Met Lys Cys340 345 350Tyr Arg Ser Asn Ala Tyr Thr Ile Thr Thr Ser Ser Arg Met Ala Lys355 360 365Lys Asn Tyr Val Gly Ile Ser Glu Ile Pro Ser Met Ala Lys Thr Ile370 375 380Thr Lys Asp Ser Ile Tyr Asp Ser Phe Asp Arg Glu Ala Lys Glu Lys385 390 395 400Lys Leu Ala Trp Pro Ile Asn Ser Asn Pro Pro Asn Thr Phe Val405 410 415(216) SEQ ID NO215的資料(i)序列特征(A)長度1842個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO215的序列描述ATGGGGCCCA CCCTAGCGGT TCCCACCCCC TATGGCTGTA TTGGCTGTAA GCTACCCCAG 60CCAGAATACC CACCGGCTCT AATCATCTTT ATGTTCTGCG CGATGGTTAT CACCATCGTT 120GTAGACCTAA TCGGCAACTC CATGGTCATT TTGGCTGTGA CGAAGAACAA GAAGCTCCGG 180AATTCTGGCA ACATCTTCGT GGTCAGTCTC TCTGTGGCCG ATATGCTGGT GGCCATCTAC 240CCATACCCTT TGATGCTGCA TGCCATGTCC ATTGGGGGCT GGGATCTGAG CCAGTTACAG 300TGCCAGATGG TCGGGTTCAT CACAGGGCTG AGTGTGGTCG GCTCCATCTT CAACATCGTG 360GCAATCGCTA TCAACCGTTA CTGCTACATC TGCCACAGCC TCCAGTACGA ACGGATCTTC 420AGTGTGCGCA ATACCTGCAT CTACCTGGTC ATCACCTGGA TCATGACCGT CCTGGCTGTC 480CTGCCCAACA TGTACATTGG CACCATCGAG TACGATCCTC GCACCTACAC CTGCATCTTC 540AACTATCTGA ACAACCCTGT CTTCACTGTT ACCATCGTCT GCATCCACTT CGTCCTCCCT 600CTCCTCATCG TGGGTTTCTG CTACGTGAGG ATCTGGACCA AAGTGCTGGC GGCCCGTGAC 660CCTGCAGGGC AGAATCCTGA CAACCAACTT GCTGAGGTTC GCAATAAACT AACCATGTTT 720GTGATCTTCC TCCTCTTTGC AGTGTGCTGG TGCCCTATCA ACGTGCTCAC TGTCTTGGTG 780GCTGTCAGTC CGAAGGAGAT GGCAGGCAAG ATCCCCAACT GGCTTTATCT TGCAGCCTAC 840TTCATAGCCT ACTTCAACAG CTGCCTCAAC GCTGTGATCT ACGGGCTCCT CAATGAGAAT 900TTCCGAAGAG AATACTGGAC CATCTTCCAT GCTATGCGGC ACCCTATCAT ATTCTTCTCT 960GGCCTCATCA GTGATATTCG TGAGATGCAG GAGGCCCGTA CCCTGGCCCG CGCCCGTGCC 1020CATGCTCGCG ACCAAGCTCG TGAACAAGAC CGTGCCCATG CCTGTCCTGC TGTGGAGGAA 1080ACCCCGATGA ATGTCCGGAA TGTTCCATTA CCTGGTGATG CTGCAGCTGG CCACCCCGAC 1140CGTGCCTCTG GCCACCCTAA GCCCCATTCC AGATCCTCCT CTGCCTATCG CAAATCTGCC 1200TCTACCCACC ACAAGTCTGT CTTTAGCCAC TCCAAGGCTG CCTCTGGTCA CCTCAAGCCT 1260GTCTCTGGCC ACTCCAAGCC TGCCTCTGGT CACCCCAAGT CTGCCACTGT CTACCCTAAG 1320CCTGCCTCTG TCCATTTCAA GGCTGACTCT GTCCATTTCA AGGGTGACTC TGTCCATTTC 1380AAGCCTGACT CTGTTCATTT CAAGCCTGCT TCCAGCAACC CCAAGCCCAT CACTGGCCAC 1440CATGTCTCTG CTGGCAGCCA CTCCAAGTCT GCCTTCAATG CTGCCACCAG CCACCCTAAA 1500CCCATCAAGC CAGCTACCAG CCATGCTGAG CCCACCACTG CTGACTATCC CAAGCCTGCC 1560ACTACCAGCC ACCCTAAGCC CGCTGCTGCT GACAACCCTG AGCTCTCTGC CTCCCATTGC 1620CCCGAGATCC CTGCCATTGC CCACCCTGTG TCTGACGACA GTGACCTCCC TGAGTCGGCC 1680TCTAGCCCTG CCGCTGGGCC CACCAAGCCT GCTGCCAGCC AGCTGGAGTC TGACACCATC 1740GCTGACCTTC CTGACCCTAC TGTAGTCACT ACCAGTACCA ATGATTACCA TGATGTCGTG 1800GTTGTTGATG TTGAAGATGA TCCTGATGAA ATGGCTGTGT GA 1842(217)SEQ ID NO216的資料(i)序列特征(A)長度613個氨基酸(B)類型氨基酸
(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO216的序列描述Met Gly Pro Thr Leu Ala Val Pro Thr Pro Tyr Gly Cys Ile Gly Cys1 5 10 15Lys Leu Pro Gln Pro Glu Tyr Pro Pro Ala Leu Ile Ile Phe Met Phe20 25 30Cys Ala Met Val Ile Thr Ile Val Val Asp Leu Ile Gly Asn Ser Met35 40 45Val Ile Leu Ala Val Thr Lys Asn Lys Lys Leu Arg Asn Ser Gly Asn50 55 60Ile Phe Val Val Ser Leu Ser Val Ala Asp Met Leu Val Ala Ile Tyr65 70 75 80Pro Tyr Pro Leu Met Leu His Ala Met Ser Ile Gly Gly Trp Asp Leu85 90 95Ser Gln Leu Gln Cys Gln Met Val Gly Phe Ile Thr Gly Leu Ser Val100 105 110Val Gly Ser Ile Phe Asn Ile Val Ala Ile Ala Ile Asn Arg Tyr Cys115 120 125Tyr Ile Cys His Ser Leu Gln Tyr Glu Arg Ile Phe Ser Val Arg Asn130 135 140Thr Cys Ile Tyr Leu Val Ile Thr Trp Ile Met Thr Val Leu Ala Val145 150 155 160Leu Pro Asn Met Tyr Ile Gly Thr Ile Glu Tyr Asp Pro Arg Thr Tyr165 170 175Thr Cys Ile Phe Asn Tyr Leu Asn Asn Pro Val Phe Thr Val Thr Ile180 185 190Val Cys Ile His Phe Val Leu Pro Leu Leu Ile Val Gly Phe Cys Tyr195 200 205Val Arg Ile Trp Thr Lys Val Leu Ala Ala Arg Asp Pro Ala Gly Gln210 215 220Asn Pro Asp Asn Gln Leu Ala Glu Val Arg Asn Lys Leu Thr Met Phe225 230 235 240Val Ile Phe Leu Leu Phe Ala Val Cys Trp Cys Pro Ile Asn Val Leu
245 250 255Thr Val Leu Val Ala Val Ser Pro Lys Glu Met Ala Gly Lys Ile Pro260 265 270Asn Trp Leu Tyr Leu Ala Ala Tyr Phe Ile Ala Tyr Phe Asn Ser Cys275 280 285Leu Asn Ala Val Ile Tyr Gly Leu Leu Asn Glu Asn Phe Arg Arg Glu290 295 300Tyr Trp Thr Ile Phe His Ala Met Arg His Pro Ile Ile Phe Phe Ser305 310 315 320Gly Leu Ile Ser Asp Ile Arg Glu Met Gln Glu Ala Arg Thr Leu Ala325 330 335Arg Ala Arg Ala His Ala Arg Asp Gln Ala Arg Glu Gln Asp Arg Ala340 345 350His Ala Cys Pro Ala Val Glu Glu Thr Pro Met Asn Val Arg Asn Val355 360 365Pro Leu Pro Gly Asp Ala Ala Ala Gly His Pro Asp Arg Ala Ser Gly370 375 380His Pro Lys Pro His Ser Arg Ser Ser Ser Ala Tyr Arg Lys Ser Ala385 390 395 400Ser Thr His His Lys Ser Val Phe Ser His Ser Lys Ala Ala Ser Gly405 410 415His Leu Lys Pro Val Ser Gly His Ser Lys Pro Ala Ser Gly His Pro420 425 430Lys Ser Ala Thr Val Tyr Pro Lys Pro Ala Ser Val His Phe Lys Ala435 440 445Asp Ser Val His Phe Lys Gly Asp Ser Val His Phe Lys Pro Asp Ser450 455 460Val His Phe Lys Pro Ala Ser Ser Asn Pro Lys Pro Ile Thr Gly His465 470 475 480His Val Ser Ala Gly Ser His Ser Lys Ser Ala Phe Asn Ala Ala Thr485 490 495Ser His Pro Lys Pro Ile Lys Pro Ala Thr Ser His Ala Glu Pro Thr500 505 510Thr Ala Asp Tyr Pro Lys Pro Ala Thr Thr Ser His Pro Lys Pro Ala
515 520 525Ala Ala Asp Asn Pro Glu Leu Ser Ala Ser His Cys Pro Glu Ile Pro530 535 540Ala Ile Ala His Pro Val Ser Asp Asp Ser Asp Leu Pro Glu Ser Ala545 550 555 560Ser Ser Pro Ala Ala Gly Pro Thr Lys Pro Ala Ala Ser Gln Leu Glu565 570 575Ser Asp Thr Ile Ala Asp Leu Pro Asp Pro Thr Val Val Thr Thr Ser580 585 590Thr Asn Asp Tyr His Asp Val Val Val Val Asp Val Glu Asp Asp Pro595 600 605Asp Glu Met Ala Val610(218)SEQ ID NO217的資料(i)序列特征(A)長度1854個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組)(xi)SEQ ID NO217的序列描述ATGGGGCCCA CCCTAGCGGT TCCCACCCCC TATGGCTGTA TTGGCTGTAA GCTACCCCAG 60CCAGAATACC CACCGGCTCT AATCATCTTT ATGTTCTGCG CGATGGTTAT CACCATCGTT 120GTAGACCTAA TCGGCAACTC CATGGTCATT TTGGCTGTGA CGAAGAACAA GAAGCTCCGG 180AATTCTGGCA ACATCTTCGT GGTCAGTCTC TCTGTGGCCG ATATGCTGGT GGCCATCTAC 240CCATACCCTT TGATGCTGCA TGCCATGTCC ATTGGGGGCT GGGATCTGAG CCAGTTACAG 300TGCCAGATGG TCGGGTTCAT CACAGGGCTG AGTGTGGTCG GCTCCATCTT CAACATCGTG 360GCAATCGCTA TCAACCGTTA CTGCTACATC TGCCACAGCC TCCAGTACGA ACGGATCTTC 420AGTGTGCGCA ATACCTGCAT CTACCTGGTC ATCACCTGGA TCATGACCGT CCTGGCTGTC 480CTGCCCAACA TGTACATTGG CACCATCGAG TACGATCCTC GCACCTACAC CTGCATCTTC 540AACTATCTGA ACAACCCTGT CTTCACTGTT ACCATCGTCT GCATCCACTT CGTCCTCCCT 600CTCCTCATCG TGGGTTTCTG CTACGTGAGG ATCTGGACCA AAGTGCTGGC GGCCCGTGAC 660CCTGCAGGGC AGAATCCTGA CAACCAACTT GCTGAGGTTC GCAATAAACT AACCATGTTT 720GTGATCTTCC TCCTCTTTGC AGTGTGCTGG TGCCCTATCA ACGTGCTCAC TGTCTTGGTG 780GCTGTCAGTC CGAAGGAGAT GGCAGGCAAG ATCCCCAACT GGCTTTATCT TGCAGCCTAC 840TTCATAGCCT ACTTCAACAG CTGCCTCAAC GCTGTGATCT ACGGGCTCCT CAATGAGAAT 900TTCCGAAGAG AATACTGGAC CATCTTCCAT GCTATGCGGC ACCCTATCAT ATTCTTCTCT 960GGCCTCATCA GTGATATTCG TGAGATGCAG GAGGCCCGTA CCCTGGCCCG CGCCCGTGCC 1020CATGCTCGCG ACCAAGCTCG TGAACAAGAC CGTGCCCATG CCTGTCCTGC TGTGGAGGAA 1080ACCCCGATGA ATGTCCGGAA TGTTCCATTA CCTGGTGATG CTGCAGCTGG CCACCCCGAC 1140CGTGCCTCTG GCCACCCTAA GCCCCATTCC AGATCCTCCT CTGCCTATCG CAAATCTGCC 1200TCTACCCACC ACAAGTCTGT CTTTAGCCAC TCCAAGGCTG CCTCTGGTCA CCTCAAGCCT 1260GTCTCTGGCC ACTCCAAGCC TGCCTCTGGT CACCCCAAGT CTGCCACTGT CTACCCTAAG 1320CCTGCCTCTG TCCATTTCAA GGCTGACTCT GTCCATTTCA AGGGTGACTC TGTCCATTTC 1380AAGCCTGACT CTGTTCATTT CAAGCCTGCT TCCAGCAACC CCAAGCCCAT CACTGGCCAC 1440CATGTCTCTG CTGGCAGCCA CTCCAAGTCT GCCTTCAGTG CTGCCACCAG CCACCCTAAA 1500CCCACCACTG GCCACATCAA GCCAGCTACC AGCCATGCTG AGCCCACCAC TGCTGACTAT 1560CCCAAGCCTG CCACTACCAG CCACCCTAAG CCCACTGCTG CTGACAACCC TGAGCTCTCT 1620GCCTCCCATT GCCCCGAGAT CCCTGCCATT GCCCACCCTG TGTCTGACGA CAGTGACCTC 1680CCTGAGTCGG CCTCTAGCCC TGCCGCTGGG CCCACCAAGC CTGCTGCCAG CCAGCTGGAG 1740TCTGACACCA TCGCTGACCT TCCTGACCCT ACTGTAGTCA CTACCAGTAC CAATGATTAC 1800CATGATGTCG TGGTTGTTGA TGTTGAAGAT GATCCTGATG AAATGGCTGT GTGA1854(219)SEQ ID NO218的資料(i)序列特征(A)長度617個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi) SEQ ID NO218的序列描述Met Gly Pro Thr Leu Ala Val Pro Thr Pro Tyr Gly Cys Ile Gly Cys1 5 10 15Lys Leu Pro Gln Pro Glu Tyr Pro Pro Ala Leu Ile Ile Phe Met Phe20 25 30Cys Ala Met Val Ile Thr Ile Val Val Asp Leu Ile Gly Asn Ser Met35 40 45Val Ile Leu Ala Val Thr Lys Asn Lys Lys Leu Arg Asn Ser Gly Asn50 55 60Ile Phe Val Val Ser Leu Ser Val Ala Asp Met Leu Val Ala Ile Tyr65 70 75 80Pro Tyr Pro Leu Met Leu His Ala Met Ser Ile Gly Gly Trp Asp Leu85 90 95Ser Gln Leu Gln Cys Gln Met Val Gly Phe Ile Thr Gly Leu Ser Val100 105 110Val Gly Ser Ile Phe Asn Ile Val Ala Ile Ala Ile Asn Arg Tyr Cys115 120 125Tyr Ile Cys His Ser Leu Gln Tyr Glu Arg Ile Phe Ser Val Arg Asn130 135 140Thr Cys Ile Tyr Leu Val Ile Thr Trp Ile Met Thr Val Leu Ala Val145 150 155 160Leu Pro Asn Met Tyr Ile Gly Thr Ile Glu Tyr Asp Pro Arg Thr Tyr165 170 175Thr Cys Ile Phe Asn Tyr Leu Asn Asn Pro Val Phe Thr Val Thr Ile180 185 190Val Cys Ile His Phe Val Leu Pro Leu Leu Ile Val Gly Phe Cys Tyr195 200 205Val Arg Ile Trp Thr Lys Val Leu Ala Ala Arg Asp Pro Ala Gly Gln210 215 220Asn Pro Asp Asn Gln Leu Ala Glu Val Arg Asn Lys Leu Thr Met Phe225 230 235 240Val Ile Phe Leu Leu Phe Ala Val Cys Trp Cys Pro Ile Asn Val Leu245 250 255Thr Val Leu Val Ala Val Ser Pro Lys Glu Met Ala Gly Lys Ile Pro260 265 270Asn Trp Leu Tyr Leu Ala Ala Tyr Phe Ile Ala Tyr Phe Asn Ser Cys275 280 285Leu Asn Ala Val Ile Tyr Gly Leu Leu Asn Glu Asn Phe Arg Arg Glu290 295 300Tyr Trp Thr Ile Phe His Ala Met Arg His Pro Ile Ile Phe Phe Ser305 310 315 320Gly Leu Ile Ser Asp Ile Arg Glu Met Gln Glu Ala Arg Thr Leu Ala325 330 335Arg Ala Arg Ala His Ala Arg Asp Gln Ala Arg Glu Gln Asp Arg Ala340 345 350His Ala Cys Pro Ala Val Glu Glu Thr Pro Met Asn Val Arg Asn Val355 360 365Pro Leu Pro Gly Asp Ala Ala Ala Gly His Pro Asp Arg Ala Ser Gly370 375 380His Pro Lys Pro His Ser Arg Ser Ser Ser Ala Tyr Arg Lys Ser Ala385 390 395 400Ser Thr His His Lys Ser Val Phe Ser His Ser Lys Ala Ala Ser Gly405 410 415His Leu Lys Pro Val Ser Gly His Ser Lys Pro Ala Ser Gly His Pro
420 425 430Lys Ser Ala Thr Val Tyr Pro Lys Pro Ala Ser Val His Phe Lys Ala435 440 445Asp Ser Val His Phe Lys Gly Asp Ser Val His Phe Lys Pro Asp Ser450 455 460Val His Phe Lys Pro Ala Ser Ser Asn Pro Lys Pro Ile Thr Gly His465 470 475 480His Val Ser Ala Gly Ser His Ser Lys Ser Ala Phe Ser Ala Ala Thr485 490 495Ser His Pro Lys Pro Thr Thr Gly His Ile Lys Pro Ala Thr Ser His500 505 510Ala Glu Pro Thr Thr Ala Asp Tyr Pro Lys Pro Ala Thr Thr Ser His515 520 525Pro Lys Pro Thr Ala Ala Asp Asn Pro Glu Leu Ser Ala Ser His Cys530 535 540Pro Glu Ile Pro Ala Ile Ala His Pro Val Ser Asp Asp Ser Asp Leu545 550 555 560Pro Glu Ser Ala Ser Ser Pro Ala Ala Gly Pro Thr Lys Pro Ala Ala565 570 575Ser Gln Leu Glu Ser Asp Thr Ile Ala Asp Leu Pro Asp Pro Thr Val580 585 590Val Thr Thr Ser Thr Asn Asp Tyr His Asp Val Val Val Val Asp Val595 600 605Glu Asp Asp Pro Asp Glu Met Ala Val610 615(220)SEQ ID NO219的資料(i)序列特征(A)長度1548個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組)(xi)SEQ ID NO219的序列描述ATGGGACATA ACGGGAGCTG GATCTCTCCA AATGCCAGCG AGCCGCACAA CGCGTCCGGC 60GCCGAGGCTG CGGGTGTGAA CCGCAGCGCG CTCGGGGAGT TCGGCGAGGC GCAGCTGTAC 120CGCCAGTTCA CCACCACCGT GCAGGTCGTC ATCTTCATAG GCTCGCTGCT CGGAAACTTC 180ATGGTGTTAT GGTCAACTTG CCGCACAACC GTGTTCAAAT CTGTCACCAA CAGGTTCATT 240AAAAACCTGG CCTGCTCGGG GATTTGTGCC AGCCTGGTCT GTGTGCCCTT CGACATCATC 300CTCAGCACCA GTCCTCACTG TTGCTGGTGG ATCTACACCA TGCTCTTCTG CAAGGTCGTC 360AAATTTTTGC ACAAAGTATT CTGCTCTGTG ACCATCCTCA GCTTCCCTGC TATTGCTTTG 420GACAGGTACT ACTCAGTCCT CTATCCACTG GAGAGGAAAA TATCTGATGC CAAGTCCCGT 480GAACTGGTGA TGTACATCTG GGCCCATGCA GTGGTGGCCA GTGTCCCTGT GTTTGCAGTA 540ACCAATGTGG CTGACATCTA TGCCACGTCC ACCTGCACGG AAGTCTGGAG CAACTCCTTG 600GGCCACCTGG TGTACGTTCT GGTGTATAAC ATCACCACGG TCATTGTGCC TGTGGTGGTG 660GTGTTCCTCT TCTTGATACT GATCCGACGG GCCCTGAGTG CCAGCCAGAA GAAGAAGGTC 720ATCATAGCAG CGCTCCGGAC CCCACAGAAC ACCATCTCTA TTCCCTATGC CTCCCAGCGG 780GAGGCCGAGC TGAAAGCCAC CCTGCTCTCC ATGGTGATGG TCTTCATCTT GTGTAGCGTG 840CCCTATGCCA CCCTGGTCGT CTACCAGACT GTGCTCAATG TCCCTGACAC TTCCGTCTTC 900TTGCTGCTCA CTGCTGTTTG GCTGCCCAAA GTCTCCCTGC TGGCAAACCC TGTTCTCTTT 960CTTACTGTGA ACAAATCTGT CCGCAAGTGC TTGATAGGGA CCCTGGTGCA ACTACACCAC 1020CGGTACAGTC GCCGTAATGT GGTCAGTACA GGGAGTGGCA TGGCTGAGGC CAGCCTGGAA 1080CCCAGCATAC GCTCGGGTAG CCAGCTCCTG GAGATGTTCC ACATTGGGCA GCAGCAGATC 1140TTTAAGCCCA CAGAGGATGA GGAAGAGAGT GAGGCCAAGT ACATTGGCTC AGCTGACTTC 1200CAGGCCAAGG AGATATTTAG CACCTGCCTG GAGGGAGAGC AGGGGCCACA GTTTGCGCCC 1260TCTGCCCCAC CCCTGAGCAC AGTGGACTCT GTATCCCAGG TGGCACCGGC AGCCCCTGTG 1320GAACCTGAAA CATTCCCTGA TAAGTATTCC CTGCAGTTTG GCTTTGGGCC TTTTGAGTTG 1380CCTCCTCAGT GGCTCTCAGA GACCCGAAAC AGCAAGAAGC GGCTGCTTCC CCCCTTGGGC 1440AACACCCCAG AAGAGCTGAT CCAGACAAAG GTGCCCAAGG TAGGCAGGGT GGAGCGGAAG 1500ATGAGCAGAA ACAATAAAGT GAGCATTTTT CCAAAGGTGG ATTCCTAG1548(221)SEQ ID NO220的資料(i)序列特征(A)長度515個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO220的序列描述Met Gly His Asn Gly Ser Trp Ile Ser Pro Asn Ala Ser Glu Pro His1 5 10 15Asn Ala Ser Gly Ala Glu Ala Ala Gly Val Asn Arg Ser Ala Leu Gly20 25 30Glu Phe Gly Glu Ala Gln Leu Tyr Arg Gln Phe Thr Thr Thr Val Gln35 40 45Val Val Ile Phe Ile Gly Ser Leu Leu Gly Asn Phe Met Val Leu Trp50 55 60Ser Thr Cys Arg Thr Thr Val Phe Lys Ser Val Thr Asn Arg Phe Ile65 70 75 80Lys Asn Leu Ala Cys Ser Gly Ile Cys Ala Ser Leu Val Cys Val Pro
85 90 95Phe Asp Ile Ile Leu Ser Thr Ser Pro His Cys Cys Trp Trp Ile Tyr100 105 110Thr Met Leu Phe Cys Lys Val Val Lys Phe Leu His Lys Val Phe Cys115 120 125Ser Val Thr Ile Leu Ser Phe Pro Ala Ile Ala Leu Asp Arg Tyr Tyr130 135 140Ser Val Leu Tyr Pro Leu Glu Arg Lys Ile Ser Asp Ala Lys Ser Arg145 150 155 160Glu Leu Val Met Tyr Ile Trp Ala His Ala Val Val Ala Ser Val Pro165 170 175Val Phe Ala Val Thr Asn Val Ala Asp Ile Tyr Ala Thr Ser Thr Cys180 185 190Thr Glu Val Trp Ser Asn Ser Leu Gly His Leu Val Tyr Val Leu Val195 200 205Tyr Asn Ile Thr Thr Val Ile Val Pro Val Val Val Val Phe Leu Phe210 215 220Leu Ile Leu Ile Arg Arg Ala Leu Ser Ala Ser Gln Lys Lys Lys Val225 230 235 240Ile Ile Ala Ala Leu Arg Thr Pro Gln Asn Thr Ile Ser Ile Pro Tyr245 250 255Ala Ser Gln Arg Glu Ala Gh Leu Lys Ala Thr Leu Leu Ser Met Val260265 270Met Val Phe Ile Leu Cys Ser Val Pro Tyr Ala Thr Leu Val Val Tyr275 280 285Gln Thr Val Leu Asn Val Pro Asp Thr Ser Val Phe Leu Leu Leu Thr290 295 300Ala Val Trp Leu Pro Lys Val Ser Leu Leu Ala Asn Pro Val Leu Phe305 310 315 320Leu Thr Val Asn Lys Ser Val Arg Lys Cys Leu Ile Gly Thr Leu Val325 330 335Gln Leu His His Arg Tyr Ser Arg Arg Asn Val Val Ser Thr Gly Ser340 345 350Gly Met Ala Glu Ala Ser Leu Glu Pro Ser Ile Arg Ser Gly Ser Gln
355 360 365Leu Leu Glu Met Phe His Ile Gly Gln Gln Gln Ile Phe Lys Pro Thr370 375 380Glu Asp Glu Glu Glu Ser Glu Ala Lys Tyr Ile Gly Ser Ala Asp Phe385 390 395 400Gln Ala Lys Glu Ile Phe Ser Thr Cys Leu Glu Gly Glu Gln Gly Pro405 410 415Gln Phe Ala Pro Ser Ala Pro Pro Leu Ser Thr Val Asp Ser Val Ser420 425 430Gln Val Ala Pro Ala Ala Pro Val Glu Pro Glu Thr Phe Pro Asp Lys435 440 445Tyr Ser Leu Gln Phe Gly Phe Gly Pro Phe Glu Leu Pro Pro Gln Trp450 455 460Leu Ser Glu Thr Arg Asn Ser Lys Lys Arg Leu Leu Pro Pro Leu Gly465 470 475 480Asn Thr Pro Glu Glu Leu Ile Gln Thr Lys Val Pro Lys Val Gly Arg485 490 495Val Glu Arg Lys Met SerArg Asn Asn Lys Val Ser Ile Phe Pro Lys500505 510Val Asp Ser515(222)SEQ ID NO221的資料(i)序列特征(A)長度1164個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO221的序列描述ATGAATCGGC ACCATCTGCA GGATCACTTT CTGGAAATAG ACAAGAAGAA CTGCTGTGTG 60TTCCGAGATG ACTTCATTGC CAAGGTGTTG CCGCCGGTGT TGGGGCTGGA GTTTATCTTT 120GGGCTTCTGG GCAATGGCCT TGCCCTGTGG ATTTTCTGTT TCCACCTCAA GTCCTGGAAA 180TCCAGCCGGA TTTTCCTGTT CAACCTGGCA GTAGCTGACT TTCTACTGAT CATCTGCCTG 240CCGTTCGTGA TGGACTACTA TGTGCGGCGT TCAGACTGGA AGTTTGGGGA CATCCCTTGC 300CGGCTGGTGC TCTTCATGTT TGCCATGAAC CGCCAGGGCA GCATCATCTT CCTCACGGTG 360GTGGCGGTAG ACAGGTATTT CCGGGTGGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 420AATTGGACAG CAGCCATCAT CTCTTGCCTT CTGTGGGGCA TCACTGTTGG CCTAACAGTC 480CACCTCCTGA AGAAGAAGTT GCTGATCCAG AATGGCCCTG CAAATGTGTG CATCAGCTTC 540AGCATCTGCC ATACCTTCCG GTGGCACGAA GCTATGTTCC TCCTGGAGTT CCTCCTGCCC 600CTGGGCATCA TCCTGTTCTG CTCAGCCAGA ATTATCTGGA GCCTGCGGCA GAGACAAATG 660GACCGGCATG CCAAGATCAA GAGAGCCAAA ACCTTCATCA TGGTGGTGGC CATCGTCTTT 720GTCATCTGCT TCCTTCCCAG CGTGGTTGTG CGGATCCGCA TCTTCTGGCT CCTGCACACT 780TCGGGCACGC AGAATTGTGA AGTGTACCGC TCGGTGGACC TGGCGTTCTT TATCACTCTC 840AGCTTCACCT ACATGAACAG CATGCTGGAC CCCGTGGTGT ACTACTTCTC CAGCCCATCC 900TTTCCCAACT TCTTCTCCAC TTTGATCAAC CGCTGCCTCC AGAGGAAGAT GACAGGTGAG 960CCAGATAATA ACCGCAGCAC GAGCGTCGAG CTCACAGGGG ACCCCAACAA AACCAGAGGC 1020GCTCCAGAGG CGTTAATGGC CAACTCCGGT GAGCCATGGA GCCCCTCTTA TCTGGGCCCA 1080ACCTCAAATA ACCATTCCAA GAAGGGACAT TGTCACCAAG AACCAGCATC TCTGGAGAAA 1140CAGTTGGGCT GTTGCATCGA GTAA1164(223)SEQ ID NO222的資料(i)序列特征(A)長度387個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO222的序列描述Met Asn Arg His His Leu Gln Asp His Phe Leu Glu Ile Asp Lys Lys1 5 10 15Asn Cys Cys Val Phe Arg Asp Asp Phe Ile Ala Lys Val Leu Pro Pro20 25 30Val Leu Gly Leu Glu Phe Ile Phe Gly Leu Leu Gly Asn Gly Leu Ala35 40 45Leu Trp Ile Phe Cys Phe His Leu Lys Ser Trp Lys Ser Ser Arg Ile50 55 60Phe Leu Phe Asn Leu Ala Val Ala Asp Phe Leu Leu Ile Ile Cys Leu65 70 75 80Pro Phe Val Met Asp Tyr Tyr Val Arg Arg Ser Asp Trp Lys Phe Gly85 90 95Asp Ile Pro Cys Arg Leu Val Leu Phe Met Phe Ala Met Asn Arg Gln100 105 110Gly Ser Ile Ile Phe Leu Thr Val Val Ala Val Asp Arg Tyr Phe Arg115 120 125Val Val His Pro His His Ala Leu Asn Lys Ile Ser Asn Trp Thr Ala130 135 140Ala Ile Ile Ser Cys Leu Leu Trp Gly Ile Thr Val Gly Leu Thr Val145 150 155 160His Leu Leu Lys Lys Lys Leu Leu Ile Gln Asn Gly Pro Ala Asn Val165 170 175Cys Ile Ser Phe Ser Ile Cys His Thr Phe Arg Trp His Glu Ala Met180 185 190Phe Leu Leu Glu Phe Leu Leu Pro Leu Gly Ile Ile Leu Phe Cys Ser195 200 205Ala Arg Ile Ile Trp Ser Leu Arg Gln Arg Gln Met Asp Arg His Ala210 215 220Lys Ile Lys Arg Ala Lys Thr Phe Ile Met Val Val Ala Ile Val Phe225 230 235 240Val Ile Cys Phe Leu Pro Ser Val Val Val Arg Ile Arg Ile Phe Trp245 250 255Leu Leu His Thr Ser Gly Thr Gln Asn Cys Glu Val Tyr Arg Ser Val260 265 270Asp Leu Ala Phe Phe Ile Thr Leu Ser Phe Thr Tyr Met Asn Ser Met275 280 285Leu Asp Pro Val Val Tyr Tyr Phe Ser Ser Pro Ser Phe Pro Asn Phe290 295 300Phe Ser Thr Leu Ile Asn Arg Cys Leu Gln Arg Lys Met Thr Gly Glu305 310 315 320Pro Asp Asn Asn Arg Ser Thr Ser Val Glu Leu Thr Gly Asp Pro Asn325 330 335Lys Thr Arg Gly Ala Pro Glu Ala Leu Met Ala Asn Ser Gly Glu Pro340 345 350Trp Ser Pro Ser Tyr Leu Gly Pro Thr Ser Asn Asn His Ser Lys Lys355 360 365Gly His Cys His Gln Glu Pro Ala Ser Leu Glu Lys Gln Leu Gly Cys370 375 380Cys Ile Glu385(224)SEQ ID NO223的資料(i)序列特征
(A) 長度1212個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學不相關(ii)分子類型DNA(基因組的)(xi)SEQ ID NO223的序列描述ATGGCTTGCA ATGGCAGTGC GGCCAGGGGG CACTTTGACC CTGAGGACTT GAACCTGACT 60GACGAGGCAC TGAGACTCAA GTACCTGGGG CCCCAGCAGA CAGAGCTGTT CATGCCCATC 120TGTGCCACAT ACCTGCTGAT CTTCGTGGTG GGCGCTGTGG GCAATGGGCT GACCTGTCTG 180GTCATCCTGC GCCACAAGGC CATGCGCACG CCTACCAACT ACTACCTCTT CAGCCTGGCC 240GTGTCGGACC TGCTGGTGCT GCTGGTGGGC CTGCCCCTGG AGCTCTATGA GATGTGGCAC 300AACTACCCCT TCCTGCTGGG CGTTGGTGGC TGCTATTTCC GCACGCTACT GTTTGAGATG 360GTCTGCCTGG CCTCAGTGCT CAACGTCACT GCCCTGAGCG TGGAACGCTA TGTGGCCGTG 420GTGCACCCAC TCCAGGCCAG GTCCATGGTG ACGCGGGCCC ATGTGCGCCG AGTGCTTGGG 480GCCGTCTGGG GTCTTGCCAT GCTCTGCTCC CTGCCCAACA CCAGCCTGCA CGGCATCCGG 540CAGCTGCACG TGCCCTGCCG GGGCCCAGTG CCAGACTCAG CTGTTTGCAT GCTGGTCCGC 600CCACGGGCCC TCTACAACAT GGTAGTGCAG ACCACCGCGC TGCTCTTCTT CTGCCTGCCC 660ATGGCCATCA TGAGCGTGCT CTACCTGCTC ATTGGGCTGC GACTGCGGCG GGAGAGGCTG 720CTGCTCATGC AGGAGGCCAA GGGCAGGGGC TCTGCAGCAG CCAGGTCCAG ATACACCTGC 780AGGCTCCAGC AGCACGATCG GGGCCGGAGA CAAGTGAAGA AGATGCTGTT TGTCCTGGTC 840GTGGTGTTTG GCATCTGCTG GGCCCCGTTC CACGCCGACC GCGTCATGTG GAGCGTCGTG 900TCACAGTGGA CAGATGGCCT GCACCTGGCC TTCCAGCACG TGCACGTCAT CTCCGGCATC 960TTCTTCTACC TGGGCTCGGC GGCCAACCCC GTGCTCTATA GCCTCATGTC CAGCCGCTTC 1020CGAGAGACCT TCCAGGAGGC CCTGTGCCTC GGGGCCTGCT GCCATCGCCT CAGACCCCGC 1080CACAGCTCCC ACAGCCTCAG CAGGATGACC ACAGGCAGCA CCCTGTGTGA TGTGGGCTCC 1140CTGGGCAGCT GGGTCCACCC CCTGGCTGGG AACGATGGCC CAGAGGCGCA GCAAGAGACC 1200GATCCATCCT GA 1212(225)SEQ ID NO224的資料(i)序列特征(A)長度403個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi) SEQ ID NO224的序列描述Met Ala Cys Asn Gly Ser Ala Ala Arg Gly His Phe Asp Pro Glu Asp1 5 10 15Leu Asn Leu Thr Asp Glu Ala Leu Arg Leu Lys Tyr Leu Gly Pro Gln20 25 30Gln Thr Glu Leu Phe Met Pro Ile Cys Ala Thr Tyr Leu Leu Ile Phe35 40 45Val Val Gly Ala Val Gly Asn Gly Leu Thr Cys Leu Val Ile Leu Arg50 55 60His Lys Ala Met Arg Thr Pro Thr Asn Tyr Tyr Leu Phe Ser Leu Ala65 70 75 80Val Ser Asp Leu Leu Val Leu Leu Val Gly Leu Pro Leu Glu Leu Tyr85 90 95Glu Met Trp His Asn Tyr Pro Phe Leu Leu Gly Val Gly Gly Cys Tyr100 105 110Phe Arg Thr Leu Leu Phe Glu Met Val Cys Leu Ala Ser Val Leu Asn115 120 125Val Thr Ala Leu Ser Val Glu Arg Tyr Val Ala Val Val His Pro Leu130 135 140Gln Ala Arg Ser Met Val Thr Arg Ala His Val Arg Arg Val Leu Gly145 150 155 160Ala Val Trp Gly Leu Ala Met Leu Cys Ser Leu Pro Asn Thr Ser Leu165 170 175His Gly Ile Arg Gln Leu His Val Pro Cys Arg Gly Pro Val Pro Asp180 185 190Ser Ala Val Cys Met Leu Val Arg Pro Arg Ala Leu Tyr Asn Met Val195 200 205Val Gln Thr Thr Ala Leu Leu Phe Phe Cys Leu Pro Met Ala Ile Met210 215 220Ser Val Leu Tyr Leu Leu Ile Gly Leu Arg Leu Arg Arg Glu Arg Leu225 230 235 240Leu Leu Met Gln Glu Ala Lys Gly Arg Gly Ser Ala Ala Ala Arg Ser245 250 255Arg Tyr Thr Cys Arg Leu Gln Gln His Asp Arg Gly Arg Arg Gln Val260 265 270Lys Lys Met Leu Phe Val Leu Val Val Val Phe Gly Ile Cys Trp Ala275 280 285Pro Phe His Ala Asp Arg Val Met Trp Ser Val Val Ser Gln Trp Thr290 295 300Asp Gly Leu His Leu Ala Phe Gln His Val His Val Ile Ser Gly Ile305 310 315 320Phe Phe Tyr Leu Gly Ser Ala Ala Asn Pro Val Leu Tyr Ser Leu Met325 330 335Ser Ser Arg Phe Arg Glu Thr Phe Gln Glu Ala Leu Cys Leu Gly Ala340 345 350Cys Cys His Arg Leu Arg Pro Arg His Ser Ser His Ser Leu Ser Arg355 360 365Met Thr Thr Gly Ser Thr Leu Cys Asp Val Gly Ser Leu Gly Ser Trp370 375 380Val His Pro Leu Ala Gly Asn Asp Gly Pro Glu Ala Gln Gln Glu Thr385 390 395 400Asp Pro Ser(226)SEQ ID NO225的資料(i)序列特征(A)長度1098個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO225的序列描述ATGGGGAACA TCACTGCAGA CAACTCCTCG ATGAGCTGTA CCATCGACCA TACCATCCAC 60CAGACGCTGG CCCCGGTGGT CTATGTTACC GTGCTGGTGG TGGGCTTCCC GGCCAACTGC 120CTGTCCCTCT ACTTCGGCTA CCTGCAGATC AAGGCCCGGA ACGAGCTGGG CGTGTACCTG 180TGCAACCTGA CGGTGGCCGA CCTCTTCTAC ATCTGCTCGC TGCCCTTCTG GCTGCAGTAC 240GTGCTGCAGC ACGACAACTG GTCTCACGGC GACCTGTCCT GCCAGGTGTG CGGCATCCTC 300CTGTACGAGA ACATCTACAT CAGCGTGGGC TTCCTCTGCT GCATCTCCGT GGACCGCTAC 360CTGGCTGTGG CCCATCCCTT CCGCTTCCAC CAGTTCCGGA CCCTGAAGGC GGCCGTCGGC 420GTCAGCGTGG TCATCTGGGC CAAGGAGCTG CTGACCAGCA TCTACTTCCT GATGCACGAG 480GAGGTCATCG AGGACGAGAA CCAGCACCGC GTGTGCTTTG AGCACTACCC CATCCAGGCA 540TGGCAGCGCG CCATCAACTA CTACCGCTTC CTGGTGGGCT TCCTCTTCCC CATCTGCCTG 600CTGCTGGCGT CCTACCAGGG CATCCTGCGC GCCGTGCGCC GGAGCCACGG CACCCAGAAG 660AGCCGCAAGG ACCAGATCAA GCGGCTGGTG CTCAGCACCG TGGTCATCTT CCTGGCCTGC 720TTCCTGCCCT ACCACGTGTT GCTGCTGGTG CGCAGCGTCT GGGAGGCCAG CTGCGACTTC 780GCCAAGGGCG TTTTCAACGC CTACCACTTC TCCCTCCTGC TCACCAGCTT CAACTGCGTC 840GCCGACCCCG TGCTCTACTG CTTCGTCAGC GAGACCACCC ACCGGGACCT GGCCCGCCTC 900CGCGGGGCCT GCCTGGCCTT CCTCACCTGC TCCAGGACCG GCCGGGCCAG GGAGGCCTAC 960CCGCTGGGTG CCCCCGAGGC CTCCGGGAAA AGCGGGGCCC AGGGTGAGGA GCCCGAGCTG 1020TTGACCAAGC TCCACCCGGC CTTCCAGACC CCTAACTCGC CAGGGTCGGG CGGGTTCCCC 1080ACGGGCAGGT TGGCCTAG1098(227)SEQ ID NO226的資料(i)序列特征
(A)長度365個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO226的序列描述Met Gly Asn Ile Thr Ala Asp Asn Ser Ser Met Ser Cys Thr Ile Asp1 5 10 15His Thr Ile His Gln Thr Leu Ala Pro Val Val Tyr Val Thr Val Leu20 25 30Val Val Gly Phe Pro Ala Asn Cys Leu Ser Leu Tyr Phe Gly Tyr Leu35 40 45Gln Ile Lys Ala Arg Asn Glu Leu Gly Val Tyr Leu Cys Asn Leu Thr50 55 60Val Ala Asp Leu Phe Tyr Ile Cys Ser Leu Pro Phe Trp Leu Gln Tyr65 70 75 80Val Leu Gln His Asp Asn Trp Ser His Gly Asp Leu Ser Cys Gln Val85 90 95Cys Gly Ile Leu Leu Tyr Glu Asn Ile Tyr Ile Ser Val Gly Phe Leu100 105 110Cys Cys Ile Ser Val Asp Arg Tyr Leu Ala Val Ala His Pro Phe Arg115 120 125Phe His Gln Phe Arg Thr Leu Lys Ala Ala Val Gly Val Ser Val Val130 135 140Ile Trp Ala Lys Glu Leu Leu Thr Ser Ile Tyr Phe Leu Met His Glu145 150 155 160Glu Val Ile Glu Asp Glu Asn Gln His Arg Val Cys Phe Glu His Tyr165 170 175Pro Ile Gln Ala Trp Gln Arg Ala Ile Asn Tyr Tyr Arg Phe Leu Val180 185 190Gly Phe Leu Phe Pro Ile Cys Leu Leu Leu Ala Ser Tyr Gln Gly Ile195 200 205Leu Arg Ala Val Arg Arg Ser His Gly Thr Gln Lys Ser Arg Lys Asp210 215 220Gln Ile Lys Arg Leu Val Leu Ser Thr Val Val Ile Phe Leu Ala Cys225 230 235 240Phe Leu Pro Tyr His Val Leu Leu Leu Val Arg Ser Val Trp Glu Ala245 250 255Ser Cys Asp Phe Ala Lys Gly Val Phe Asn Ala Tyr His Phe Ser Leu260 265 270Leu Leu Thr Ser Phe Asn Cys Val Ala Asp Pro Val Leu Tyr Cys Phe275 280 285Val Ser Glu Thr Thr His Arg Asp Leu Ala Arg Leu Arg Gly Ala Cys290 295 300Leu Ala Phe Leu Thr Cys Ser Arg Thr Gly Arg Ala Arg Glu Ala Tyr305 310 315 320Pro Leu Gly Ala Pro Glu Ala Ser Gly Lys Ser Gly Ala Gln Gly Glu325 330 335Glu Pro Glu Leu Leu Thr Lys Leu His Pro Ala Phe Gln Thr Pro Asn340 345 350Ser Pro Gly Ser Gly Gly Phe Pro Thr Gly Arg Leu Ala355 360 365(228)SEQ ID NO227的資料(i)序列特征(A)長度1416個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO227的序列描述ATGGATATTC TTTGTGAAGA AAATACTTCT TTGAGCTCAA CTACGAACTC CCTAATGCAA 60TTAAATGATG ACAACAGGCT CTACAGTAAT GACTTTAACT CCGGAGAAGC TAACACTTCT 120GATGCATTTA ACTGGACAGT CGACTCTGAA AATCGAACCA ACCTTTCCTG TGAAGGGTGC 180CTCTCACCGT CGTGTCTCTC CTTACTTCAT CTCCAGGAAA AAAACTGGTC TGCTTTACTG 240ACAGCCGTAG TGATTATTCT AACTATTGCT GGAAACATAC TCGTCATCAT GGCAGTGTCC 300CTAGAGAAAA AGCTGCAGAA TGCCACCAAC TATTTCCTGA TGTCACTTGC CATAGCTGAT 360ATGCTGCTGG GTTTCCTTGT CATGCCCGTG TCCATGTTAA CCATCCTGTA TGGGTACCGG 420TGGCCTCTGC CGAGCAAGCT TTGTGCAGTC TGGATTTACC TGGACGTGCT CTTCTCCACG 480GCCTCCATCA TGCACCTCTG CGCCATCTCG CTGGACCGCT ACGTCGCCAT CCAGAATCCC 540ATCCACCACA GCCGCTTCAA CTCCAGAACT AAGGCATTTC TGAAAATCAT TGCTGTTTGG 600ACCATATCAG TAGGTATATC CATGCCAATA CCAGTCTTTG GGCTACAGGA CGATTCGAAG 660GTCTTTAAGG AGGGGAGTTG CTTACTCGCC GATGATAACT TTGTCCTGAT CGGCTCTTTT 720GTGTCATTTT TCATTCCCTT AACCATCATG GTGATCACCT ACTTTCTAAC TATCAAGTCA 780CTCCAGAAAG AAGCTACTTT GTGTGTAAGT GATCTTGGCA CACGGGCCAA ATTAGCTTCT 840TTCAGCTTCC TCCCTCAGAG TTCTTTGTCT TCAGAAAAGC TCTTCCAGCG GTCGATCCAT 900AGGGAGCCAG GGTCCTACAC AGGCAGGAGG ACTATGCAGT CCATCAGCAA TGAGCAAAAG 960GCAAAGAAGG TGCTGGGCAT CGTCTTCTTC CTGTTTGTGG TGATGTGGTG CCCTTTCTTC 1020ATCACAAACA TCATGGCCGT CATCTGCAAA GAGTCCTGCA ATGAGGATGT CATTGGGGCC 1080CTGCTCAATG TGTTTGTTTG GATCGGTTAT CTCTCTTCAG CAGTCAACCC ACTAGTCTAC 1140ACACTGTTCA ACAAGACCTA TAGGTCAGCC TTTTCACGGT ATATTCAGTG TCAGTACAAG 1200GAAAACAAAA AACCATTGCA GTTAATTTTA GTGAACACAA TACCGGCTTT GGCCTACAAG 1260TCTAGCCAAC TTCAAATGGG ACAAAAAAAG AATTCAAAGC AAGATGCCAA GACAACAGAT 1320AATGACTGCT CAATGGTTGC TCTAGGAAAG CAGTATTCTG AAGAGGCTTC TAAAGACAAT 1380AGCGACGGAG TGAATGAAAA GGTGAGCTGT GTGTGA 1416(229)SEQ ID NO228的資料(i)序列特征(A)長度470個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO228的序列描述Met Asp Ile Leu Cys Glu Glu Asn Thr Ser Leu Ser Ser Thr Thr Asn1 5 10 15Ser Leu Met Gln Leu Asn Asp Asp Asn Arg Leu Tyr Ser Asn Asp Phe20 25 30Asn Ser Gly Glu Ala Asn Thr Ser Asp Ala Phe Asn Trp Thr Val Asp35 40 45Ser Glu Asn Arg Thr Asn Leu Ser Cys Glu Gly Cys Leu Ser Pro Ser50 55 60Cys Leu Ser Leu Leu His Leu Gln Glu Lys Asn Trp Ser Ala Leu Leu65 70 75 80Thr Ala Val Val Ile Ile Leu Thr Ile Ala Gly Asn Ile Leu Val Ile85 90 95Met Ala Val Ser Leu Glu Lys Lys Leu Gln Asn Ala Thr Asn Tyr Phe100 105 110Leu Met Ser Leu Ala Ile Ala Asp Met Leu Leu Gly Phe Leu Val Met115 120 125Pro Val Ser Met Leu Thr Ile Leu Tyr Gly Tyr Arg Trp Pro Leu Pro130 135 140Ser Lys Leu Cys Ala Val Trp Ile Tyr Leu Asp Val Leu Phe Ser Thr145 150 155 160Ala Ser Ile Met His Leu Cys Ala Ile Ser Leu Asp Arg Tyr Val Ala165 170 175Ile Gln Asn Pro Ile His His Ser Arg Phe Asn Ser Arg Thr Lys Ala180 185 190Phe Leu Lys Ile Ile Ala Val Trp Thr Ile Ser Val Gly Ile Ser Met195 200 205Pro Ile Pro Val Phe Gly Leu Gln Asp Asp Ser Lys Val Phe Lys Glu210 215 220Gly Ser Cys Leu Leu Ala Asp Asp Asn Phe Val Leu Ile Gly Ser Phe225 230 235 240Val Ser Phe Phe Ile Pro Leu Thr Ile Met Val Ile Thr Tyr Phe Leu245 250 255Thr Ile Lys Ser Leu Gln Lys Glu Ala Thr Leu Cys Val Ser Asp Leu260 265 270Gly Thr Arg Ala Lys Leu Ala Ser Phe Ser Phe Leu Pro Gln Ser Ser275 280 285Leu Ser Ser Glu Lys Leu Phe Gln Arg Ser Ile His Arg Glu Pro Gly290 295 300Ser Tyr Thr Gly Arg Arg Thr Met Gln Ser Ile Ser Asn Glu Gln Lys305 310 315 320Ala Lys Lys Val Leu Gly Ile Val Phe Phe Leu Phe Val Val Met Trp325 330 335Cys Pro Phe Phe Ile Thr Asn Ile Met Ala Val Ile Cys Lys Glu Ser340 345 350Cys Asn Glu Asp Val Ile Gly Ala Leu Leu Asn Val Phe Val Trp Ile355 360 365Gly Tyr Leu Ser Ser Ala Val Asn Pro Leu Val Tyr Thr Leu Phe Asn370 375 380Lys Thr Tyr Arg Ser Ala Phe Ser Arg Tyr Ile Gln Cys Gln Tyr Lys385 390 395 400Glu Asn Lys Lys Pro Leu Gln Leu Ile Leu Val Asn Thr Ile Pro Ala405 410 415Leu Ala Tyr Lys Ser Ser Gln Leu Gln Met Gly Gln Lys Lys Asn Ser420 425 430Lys Gln Asp Ala Lys Thr Thr Asp Asn Asp Cys Ser Met Val Ala Leu435 440 445Gly Lys Gln Tyr Ser Glu Glu Ala Ser Lys Asp Asn Ser Asp Gly Val450 455 460Asn Glu Lys Val Ser Cys Val465 470(230)SEQ ID NO229的資料(i)序列特征(A)長度1377個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO229的序列描述ATGGTGAACC TGAGGAATGC GGTGCATTCA TTCCTTGTGC ACCTAATTGG CCTATTGGTT 60TGGCAATGTG ATATTTCTGT GAGCCCAGTA GCAGCTATAG TAACTGACAT TTTCAATACC 120TCCGATGGTG GACGCTTCAA ATTCCCAGAC GGGGTACAAA ACTGGCCAGC ACTTTCAATC 180GTCATCATAA TAATCATGAC AATAGGTGGC AACATCCTTG TGATCATGGC AGTAAGCATG 240GAAAAGAAAC TGCACAATGC CACCAATTAC TTCTTAATGT CCCTAGCCAT TGCTGATATG 300CTAGTGGGAC TACTTGTCAT GCCCCTGTCT CTCCTGGCAA TCCTTTATGA TTATGTCTGG 360CCACTACCTA GATATTTGTG CCCCGTCTGG ATTTCTTTAG ATGTTTTATT TTCAACAGCG 420TCCATCATGC ACCTCTGCGC TATATCGCTG GATCGGTATG TAGCAATACG TAATCCTATT 480GAGCATAGCC GTTTCAATTC GCGGACTAAG GCCATCATGA AGATTGCTAT TGTTTGGGCA 540ATTTCTATAG GTGTATCAGT TCCTATCCCT GTGATTGGAC TGAGGGACGA AGAAAAGGTG 600TTCGTGAACA ACACGACGTG CGTGCTCAAC GACCCAAATT TCGTTCTTAT TGGGTCCTTC 660GTAGCTTTCT TCATACCGCT GACGATTATG GTGATTACGT ATTGCCTGAC CATCTACGTT 720CTGCGCCGAC AAGCTTTGAT GTTACTGCAC GGCCACACCG AGGAACCGCC TGGACTAAGT 780CTGGATTTCC TGAAGTGCTG CAAGAGGAAT ACGGCCGAGG AAGAGAACTC TGCAAACCCT 840AACCAAGACC AGAACGCACG CCGAAGAAAG AAGAAGGAGA GACGTCCTAG GGGCACCATG 900CAGGCTATCA ACAATGAAAG AAAAGCTAAG AAAGTCCTTG GGATTGTTTT CTTTGTGTTT 960CTGATCATGT GGTGCCCATT TTTCATTACC AATATTCTGT CTGTTCTTTG TGAGAAGTCC 1020TGTAACCAAA AGCTCATGGA AAAGCTTCTG AATGTGTTTG TTTGGATTGG CTATGTTTGT 1080TCAGGAATCA ATCCTCTGGT GTATACTCTG TTCAACAAAA TTTACCGAAG GGCATTCTCC 1140AACTATTTGC GTTGCAATTA TAAGGTAGAG AAAAAGCCTC CTGTCAGGCA GATTCCAAGA 1200GTTGCCGCCA CTGCTTTGTC TGGGAGGGAG CTTAATGTTA ACATTTATCG GCATACCAAT 1260GAACCGGTGA TCGAGAAAGC CAGTGACAAT GAGCCCGGTA TAGAGATGCA AGTTGAGAAT 1320TTAGAGTTAC CAGTAAATCC CTCCAGTGTG GTTAGCGAAA GGATTAGCAG TGTGTGA 1377(231)SEQ ID NO230的資料(i)序列特征(A)長度458個氨基酸(B)類型氨基酸
(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO230 的序列描述Met Val Asn Leu Arg Asn Ala Val His Ser Phe Leu Val His Leu Ile1 5 10 15Gly Leu Leu Val Trp Gln Cys Asp Ile Ser Val Ser Pro Val Ala Ala20 25 30Ile Val Thr Asp Ile Phe Asn Thr Ser Asp Gly Gly Arg Phe Lys Phe35 40 45Pro Asp Gly Val Gln Asn Trp Pro Ala Leu Ser Ile Val Ile Ile Ile50 55 60Ile Met Thr Ile Gly Gly Asn Ile Leu Val Ile Met Ala Val Ser Met65 70 75 80Glu Lys Lys Leu His Asn Ala Thr Asn Tyr Phe Leu Met Ser Leu Ala85 90 95Ile Ala Asp Met Leu Val Gly Leu Leu Val Met Pro Leu Ser Leu Leu100 105 110Ala Ile Leu Tyr Asp Tyr Val Trp Pro Leu Pro Arg Tyr Leu Cys Pro115 120 125Val Trp Ile Ser Leu Asp Val Leu Phe Ser Thr Ala Ser Ile Met His130 135 140Leu Cys Ala Ile Ser Leu Asp Arg Tyr Val Ala Ile Arg Asn Pro Ile145 150 155 160Glu His Ser Arg Phe Asn Ser Arg Thr Lys Ala Ile Met Lys Ile Ala165 170 175Ile Val Trp Ala Ile Ser Ile Gly Val Ser Val Pro Ile Pro Val Ile180 185 190Gly Leu Arg Asp Glu Glu Lys Val Phe Val Asn Asn Thr Thr Cys Val195 200 205Leu Asn Asp Pro Asn Phe Val Leu Ile Gly Ser Phe Val Ala Phe Phe210 215 220Ile Pro Leu Thr Ile Met Val Ile Thr Tyr Cys Leu Thr Ile Tyr Val225 230 235 240Leu Arg Arg Gln Ala Leu Met Leu Leu His Gly His Thr Glu Glu Pro
245 250 255Pro Gly Leu Ser Leu Asp Phe Leu Lys Cys Cys Lys Arg Asn Thr Ala260 265 270Glu Glu Glu Asn Ser Ala Asn Pro Asn Gln Asp Gln Asn Ala Arg Arg275 280 285Arg Lys Lys Lys Glu Arg Arg Pro Arg Gly Thr Met Gln Ala Ile Asn290 295 300Asn Glu Arg Lys Ala Lys Lys Val Leu Gly Ile Val Phe Phe Val Phe305 310 315 320Leu Ile Met Trp Cys Pro Phe Phe Ile Thr Asn Ile Leu Ser Val Leu325 330 335Cys Glu Lys Ser Cys Asn Gln Lys Leu Met Glu Lys Leu Leu Asn Val340 345 350Phe Val Trp Ile Gly Tyr Val Cys Ser Gly Ile Asn Pro Leu Val Tyr355 360 365Thr Leu Phe Asn Lys Ile Tyr Arg Arg Ala Phe Ser Asn Tyr Leu Arg370 375 380Cys Asn Tyr Lys Val Glu Lys Lys Pro Pro Val Arg Gln Ile Pro Arg385 390 395 400Val Ala Ala Thr Ala Leu Ser Gly Arg Glu Leu Asn Val Asn Ile Tyr405 410 415Arg His Thr Asn Glu Pro Val Ile Glu Lys Ala Ser Asp Asn Glu Pro420 425 430Gly Ile Glu Met Gln Val Glu Asn Leu Glu Leu Pro Val Asn Pro Ser435 440 445Ser Val Val Ser Glu Arg Ile Ser Ser Val450 455(232)SEQ ID NO231的資料(i)序列特征(A) 長度1068個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO231的序列描述ATGGATCAGT TCCCTGAATC AGTGACAGAA AACTTTGAGT ACGATGATTT GGCTGAGGCC 60TGTTATATTG GGGACATCGT GGTCTTTGGG ACTGTGTTCC TGTCCATATT CTACTCCGTC 120ATCTTTGCCA TTGGCCTGGT GGGAAATTTG TTGGTAGTGT TTGCCCTCAC CAACAGCAAG 180AAGCCCAAGA GTGTCACCGA CATTTACCTC CTGAACCTGG CCTTGTCTGA TCTGCTGTTT 240GTAGCCACTT TGCCCTTCTG GACTCACTAT TTGATAAATG AAAAGGGCCT CCACAATGCC 300ATGTGCAAAT TCACTACCGC CTTCTTCTTC ATCGGCTTTT TTGGAAGCAT ATTCTTCATC 360ACCGTCATCA GCATTGATAG GTACCTGGCC ATCGTCCTGG CCGCCAACTC CATGAACAAC 420CGGACCGTGC AGCATGGCGT CACCATCAGC CTAGGCGTCT GGGCAGCAGC CATTTTGGTG 480GCAGCACCCC AGTTCATGTT CACAAAGCAG AAAGAAAATG AATGCCTTGG TGACTACCCC 540GAGGTCCTCC AGGAAATCTG GCCCGTGCTC CGCAATGTGG AAACAAATTT TCTTGGCTTC 600CTACTCCCCC TGCTCATTAT GAGTTATTGC TACTTCAGAA TCATCCAGAC GCTGTTTTCC 660TGCAAGAACC ACAAGAAAGC CAAAGCCAAG AAACTGATCC TTCTGGTGGT CATCGTGTTT 720TTCCTCTTCT GGACACCCTA CAACGTTATG ATTTTCCTGG AGACGCTTAA GCTCTATGAC 780TTCTTTCCCA GTTGTGACAT GAGGAAGGAT CTGAGGCTGG CCCTCAGTGT GACTGAGACG 840GTTGCATTTA GCCATTGTTG CCTGAATCCT CTCATCTATG CATTTGCTGG GGAGAAGTTC 900AGAAGATACC TTTACCACCT GTATGGGAAA TGCCTGGCTG TCCTGTGTGG GCGCTCAGTC 960CACGTTGATT TCTCCTCATC TGAATCACAA AGGAGCAGGC ATGGAAGTGT TCTGAGCAGC 1020AATTTTACTT ACCACACGAG TGATGGAGAT GCATTGCTCC TTCTCTGA1068(233)SEQ ID NO232的資料(i)序列特征(A)長度355個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO232的序列描述Met Asp Gln Phe Pro Glu Ser Val Thr Glu Asn Phe Glu Tyr Asp Asp1 5 10 15Leu Ala Glu Ala Cys Tyr Ile Gly Asp Ile Val Val Phe Gly Thr Val20 25 30Phe Leu Ser Ile Phe Tyr Ser Val Ile Phe Ala Ile Gly Leu Val Gly35 40 45Asn Leu Leu Val Val Phe Ala Leu Thr Asn Ser Lys Lys Pro Lys Ser50 55 60Val Thr Asp Ile Tyr Leu Leu Asn Leu Ala Leu Ser Asp Leu Leu Phe65 70 75 80Val Ala Thr Leu Pro Phe Trp Thr His Tyr Leu Ile Asn Glu Lys Gly85 90 95Leu His Asn Ala Met Cys Lys Phe Thr Thr Ala Phe Phe Phe Ile Gly
100105 110Phe Phe Gly Ser Ile Phe Phe Ile Thr Val Ile Ser Ile Asp Arg Tyr115 120 125Leu Ala Ile Val Leu Ala Ala Asn Ser Met Asn Asn Arg Thr Val Gln130 135 140His Gly Val Thr Ile Ser Leu Gly Val Trp Ala Ala Ala Ile Leu Val145 150 155 160Ala Ala Pro Gln Phe Met Phe Thr Lys G1n Lys Glu Asn Glu Cys Leu165 170 175Gly Asp Tyr Pro Glu Val Leu Gln Glu Ile Trp Pro Val Leu Arg Asn180 185 190Val Glu Thr Asn Phe Leu Gly Phe Leu Leu Pro Leu Leu Ile Met Ser195 200 205Tyr Cys Tyr Phe Arg Ile Ile Gln Thr Leu Phe Ser Cys Lys Asn His210 215 220Lys Lys Ala Lys Ala Lys Lys Leu Ile Leu Leu Val Val Ile Val Phe225 230 235 240Phe Leu Phe Trp Thr Pro Tyr Asn Val Met Ile Phe Leu Glu Thr Leu245 250 255Lys Leu Tyr Asp Phe Phe Pro Ser Cys Asp Met Arg Lys Asp Leu Arg260 265 270Leu Ala Leu Ser Val Thr Glu Thr Val Ala Phe Ser His Cys Cys Leu275 280 285Asn Pro Leu Ile Tyr Ala Phe Ala G1y Glu Lys Phe Arg Arg Tyr Leu290 295 300Tyr His Leu Tyr Gly Lys Cys Leu Ala Val Leu Cys Gly Arg Ser Val305 310 315 320His Val Asp Phe Ser Ser Ser Glu Ser Gln Arg Ser Arg His Gly Ser325 330 335Val Leu Ser Ser Asn Phe Thr Tyr His Thr Ser Asp Gly Asp Ala Leu340 345 350Leu Leu Leu355(234)SEQ ID NO233的資料(i)序列特征(A)長度29個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(iv)反義否(xi)SEQ ID NO233的序列描述GGCTTAAGAG CATCATCGTG GTGCTGGTG 29(235)SEQ ID NO234的資料(i)序列特征(A)長度34個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(iv)反義是(xi)SEQ ID NO234的序列描述GTCACCACCA GCACCACGAT GATGCTCTTA AGCC 34(236)SEQ ID NO235的資料(i)序列特征(A)長度31個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO235的序列描述CAAAGAAAGT ACTGGGCATC GTCTTCTTCC T 31(237)SEQ ID NO236的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO236的序列描述TGCTCTAGAT TCCAGATAGG TGAAAACTTG 30(238)SEQ ID NO237的資料(i)序列特征(A)長度50個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(iv)反義否(xi)SEQ ID NO237的序列描述CTAGGGGCAC CATGCAGGCT ATCAACAATG AAAGAAAAGC TAAGAAAGTC 50(239)SEQ ID NO238的資料(i)序列特征(A)長度50個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(iv)反義是(xi)SEQ ID NO238的序列描述CAAGGACTTT CTTAGCTTTT CTTTTCATTGT TGATAGCCTG CATGGTGCCC 50(240)SEQ ID NO239的資料(i)序列特征(A)長度35個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO239的序列描述CGGCGGCAGA AGGCGAAACG CATGATCCTC GCGGT 35(241)SEQ ID NO240的資料(i)序列特征(A)長度35個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO240的序列描述ACCGCGAGGA TCATGCGTTT CGCCTTCTGC CGCCG35(242)SEQ ID NO241的資料(i)序列特征(A)長度24個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO241的序列描述GAGACATATT ATCTGCCACG GAGG 24(243)SEQ ID NO242的資料(i)序列特征(A)長度24個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi) SEQ ID NO242的序列描述TTGGCATAGA AACCGGACCC AAGG 24(244)SEQ ID NO243的資料(i)序列特征(A)長度28個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO243的序列描述taagaattcc ataaaaatta tggaatgg 28(245)SEQ ID NO244的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO244的序列描述CCAGGATCCA GCTGAAGTCT TCCATCATTC 30(246)SEQ ID NO245的資料(i)序列特征(A)長度1071個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA (基因組的)(xi)SEQ ID NO245的序列描述ATGAATGGGG TCTCGGAGGG GACCAGAGGC TGCAGTGACA GGCAACCTGG GGTCCTGACA 60CGTGATCGCT CTTGTTCCAG GAAGATGAAC TCTTCCGGAT GCCTGTCTGA GGAGGTGGGG 120TCCCTCCGCC CACTGACTGT GGTTATCCTG TCTGCGTCCA TTGTCGTCGG AGTGCTGGGC 180AATGGGCTGG TGCTGTGGAT GACTGTCTTC CGTATGGCAC GCACGGTCTC CACCGTCTGC 240TTCTTCCACC TGGCCCTTGC CGATTTCATG CTCTCACTGT CTCTGCCCAT TGCCATGTAC 300TATATTGTCT CCAGGCAGTG GCTCCTCGGA GAGTGGGCCT GCAAACTCTA CATCACCTTT 360GTGTTCCTCA GCTACTTTGC CAGTAACTGC CTCCTTGTCT TCATCTCTGT GGACCGTTGC 420ATCTCTGTCC TCTACCCCGT CTGGGCCCTG AACCACCGCA CTGTGCAGCG GGCGAGCTGG 480CTGGCCTTTG GGGTGTGGCT CCTGGCCGCC GCCTTGTGCT CTGCGCACCT GAAATTCCGG 540ACAACCAGAA AATGGAATGG CTGTACGCAC TGCTACTTGG CGTTCAACTC TGACAATGAG 600ACTGCCCAGA TTTGGATTGA AGGGGTCGTG GAGGGACACA TTATAGGGAC CATTGGCCAC 660TTCCTGCTGG GCTTCCTGGG GCCCTTAGCA ATCATAGGCA CCTGCGCCCA CCTCATCCGG 720GCCAAGCTCT TGCGGGAGGG CTGGGTCCAT GCCAACCGGC CCGCGAGGCT GCTGCTGGTG 780CTGGTGAGCG CTTTCTTTAT CTTCTGGTCC CCGTTTAACG TGGTGCTGTT GGTCCATCTG 840TGGCGACGGG TGATGCTCAA GGAAATCTAC CACCCCCGGA TGCTGCTCAT CCTCCAGGCT 900AGCTTTGCCT TGGGCTGTGT CAACAGCAGC CTCAACCCCT TCCTCTACGT CTTCGTTGGC 960AGAGATTTCC AAGAAAAGTT TTTCCAGTCT TTGACTTCTG CCCTGGCGAG GGCGTTTGGA 1020GAGGAGGAGT TTCTGTCATC CTGTCCCCGT GGCAACGCCC CCCGGGAATG A1071(247)SEQ ID NO246的資料(i)序列特征(A)長度356個氨基酸
(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋質(zhì)(xi)SEQ ID NO246的序列描述Met Asn Gly Val Ser Glu Gly Thr Arg Gly Cys Ser Asp Arg Gln Pro1 5 10 15Gly Val Leu Thr Arg Asp Arg Ser Cys Ser Arg Lys Met Asn Ser Ser20 25 30Gly Cys Leu Ser Glu Glu Val Gly Ser Leu Arg Pro Leu Thr Val Val35 40 45Ile Leu Ser Ala Ser Ile Val Val Gly Val Leu Gly Asn Gly Leu Val50 55 60Leu Trp Met Thr Val Phe Arg Met Ala Arg Thr Val Ser Thr Val Cys65 70 75 80Phe Phe His Leu Ala Leu Ala Asp Phe Met Leu Ser Leu Ser Leu Pro85 90 95Ile Ala Met Tyr Tyr Ile Val Ser Arg Gln Trp Leu Leu Gly Glu Trp100 105 110Ala Cys Lys Leu Tyr Ile Thr Phe Val Phe Leu Ser Tyr Phe Ala Ser115 120 125Asn Cys Leu Leu Val Phe Ile Ser Val Asp Arg Cys Ile Ser Val Leu130 135 140Tyr Pro Val Trp Ala Leu Asn His Arg Thr Val Gln Arg Ala Ser Trp145 150 155 160Leu Ala Phe Gly Val Trp Leu Leu Ala Ala Ala Leu Cys Ser Ala His165 170 175Leu Lys Phe Arg Thr Thr Arg Lys Trp Asn Gly Cys Thr His Cys Tyr180 185 190Leu Ala Phe Asn Ser Asp Ash Glu Thr Ala Gln Ile Trp Ile Glu Gly195 200 205Val Val Glu Gly His Ile Ile Gly Thr Ile Gly His Phe Leu Leu Gly210 215 220Phe Leu Gly Pro Leu Ala Ile Ile Gly Thr Cys Ala His Leu Ile Arg225 230 235 240Ala Lys Leu Leu Arg Glu Gly Trp Val His Ala Asn Arg Pro Ala Arg245 250 255Leu Leu Leu Val Leu Val Ser Ala Phe Phe Ile Phe Trp Ser Pro Phe260 265 270Asn Val Val Leu Leu Val His Leu Trp Arg Arg Val Met Leu Lys Glu275 280 285Ile Tyr His Pro Arg Met Leu Leu Ile Leu Gln Ala Ser Phe Ala Leu290 295 300Gly Cys Val Asn Ser Ser Leu Asn Pro Phe Leu Tyr Val Phe Val Gly305 310 315 320Arg Asp Phe Gln Glu Lys Phe Phe Gln Ser Leu Thr Ser Ala Leu Ala325 330 335Arg Ala Phe Gly Glu Glu Glu Phe Leu Ser Ser Cys Pro Arg Gly Asn340 345 350Ala Pro Arg Glu355(248) SEQ ID NO247的資料(i)序列特征(A)長度32個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO247的序列描述GCAGAATTCG GCGGCCCCAT GGACCTGCCC CC 32(249)SEQ ID NO248的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO248的序列描述GCTGGATCCC CCGAGCAGTG GCGTTACTTC30(250) SEQ ID NO249的資料(i)序列特征(A)長度903個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO249的序列描述ATGGACCTGC CCCCGCAGCT CTCCTTCGGC CTCTATGTGG CCGCCTTTGC GCTGGGCTTC 60CCGCTCAACG TCCTGGCCAT CCGAGGCGCG ACGGCCCACG CCCGGCTCCG TCTCACCCCT 120AGCCTGGTCT ACGCCCTGAA CCTGGGCTGC TCCGACCTGC TGCTGACAGT CTCTCTGCCC 180CTGAAGGCGG TGGAGGCGCT AGCCTCCGGG GCCTGGCCTC TGCCGGCCTC GCTGTGCCCC 240GTCTTCGCGG TGGCCCACTT CTTCCCACTC TATGCCGGCG GGGGCTTCCT GGCCGCCCTG 300AGTGCAGGCC GCTACCTGGG AGCAGCCTTC CCCTTGGGCT ACCAAGCCTT CCGGAGGCCG 360TGCTATTCCT GGGGGGTGTG CGCGGCCATC TGGGCCCTCG TCCTGTGTCA CCTGGGTCTG 420GTCTTTGGGT TGGAGGCTCC AGGAGGCTGG CTGGACCACA GCAACACCTC CCTGGGCATC 480AACACACCGG TCAACGGCTC TCCGGTCTGC CTGGAGGCCT GGGACCCGGC CTCTGCCGGC 540CCGGCCCGCT TCAGCCTCTC TCTCCTGCTC TTTTTTCTGC CCTTGGCCAT CACAGCCTTC 600TGCTACGTGG GCTGCCTCCG GGCACTGGCC CGCTCCGGCC TGACGCACAG GCGGAAGCTG 660CGGGCCGCCT GGGTGGCCGG CGGGGCCCTC CTCACGCTGC TGCTCTGCGT AGGACCCTAC 720AACGCCTCCA ACGTGGCCAG CTTCCTGTAC CCCAATCTAG GAGGCTCCTG GCGGAAGCTG 780GGGCTCATCA CGGGTGCCTG GAGTGTGGTG CTTAATCCGC TGGTGACCGG TTACTTGGGA 840AGGGGTCCTG GCCTGAAGAC AGTGTGTGCG GCAAGAACGC AAGGGGGCAA GTCCCAGAAG 900TAA 903(251) SEQ ID NO250的資料(i)序列特征(A)長度300個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO250的序列描述Met Asp Leu Pro Pro Gln Leu Ser Phe Gly Leu Tyr Val Ala Ala Phe1 5 10 15Ala Leu Gly Phe Pro Leu Asn Val Leu Ala Ile Arg Gly Ala Thr Ala20 25 30His Ala Arg Leu Arg Leu Thr Pro Ser Leu Val Tyr Ala Leu Asn Leu35 40 45Gly Cys Ser Asp Leu Leu Leu Thr Val Ser Leu Pro Leu Lys Ala Val
50 55 60Glu Ala Leu Ala Ser Gly Ala Trp Pro Leu Pro Ala Ser Leu Cys Pro65 70 75 80Val Phe Ala Val Ala His Phe Phe Pro Leu Tyr Ala Gly Gly Gly Phe85 90 95Leu Ala Ala Leu Ser Ala Gly Arg Tyr Leu Gly Ala Ala Phe Pro Leu100 105 110Gly Tyr Gln Ala Phe Arg Arg Pro Cys Tyr Ser Trp Gly Val Cys Ala115 120 125Ala Ile Trp Ala Leu Val Leu Cys His Leu Gly Leu Val Phe Gly Leu130 135 140Glu Ala Pro Gly Gly Trp Leu Asp His Ser Asn Thr Ser Leu Gly Ile145 150 155 160Asn Thr Pro Val Asn Gly Ser Pro Val Cys Leu Glu Ala Trp Asp Pro165 170 175Ala Ser Ala Gly Pro Ala Arg Phe Ser Leu Ser Leu Leu Leu Phe Phe180 185 190Leu Pro Leu Ala Ile Thr Ala Phe Cys Tyr Val Gly Cys Leu Arg Ala195 200 205Leu Ala Arg Ser Gly Leu Thr His Arg Arg Lys Leu Arg Ala Ala Trp210 215 220Val Ala Gly Gly Ala Leu Leu Thr Leu Leu Leu Cys Val Gly Pro Tyr225 230 235 240Asn Ala Ser Asn Val Ala Ser Phe Leu Tyr Pro Asn Leu Gly Gly Ser245 250 255Trp Arg Lys Leu Gly Leu Ile Thr Gly Ala Trp Ser Val Val Leu Asn260 265 270Pro Leu Val Thr Gly Tyr Leu Gly Arg Gly Pro Gly Leu Lys Thr Val275 280 285Cys Ala Ala Arg Thr Gln Gly Gly Lys Ser Gln Lys290 295 300(252)SEQ ID NO251的資料(i)序列特征(A)長度31個堿基對
(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO251的序列描述CTCAAGCTTA CTCTCTCTCA CCAGTGGCCA C 31(253)SEQ ID NO252的資料(i)序列特征(A)長度24個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO252的序列描述CCCTCCTCCC CCGGAGGACC TAGC 24(254)SEQ ID NO253的資料(i)序列特征(A)長度1041個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO253的序列描述ATGGATACAG GCCCCGACCA GTCCTACTTC TCCGGCAATC ACTGGTTCGT CTTCTCGGTG 60TACCTTCTCA CTTTCCTGGT GGGGCTCCCC CTCAACCTGC TGGCCCTGGT GGTCTTCGTG 120GGCAAGCTGC AGCGCCGCCC GGTGGCCGTG GACGTGCTCC TGCTCAACCT GACCGCCTCG 180GACCTGCTCC TGCTGCTGTT CCTGCCTTTC CGCATGGTGG AGGCAGCCAA TGGCATGCAC 240TGGCCCCTGC CCTTCATCCT CTGCCCACTC TCTGGATTCA TCTTCTTCAC CACCATCTAT 300CTCACCGCCC TCTTCCTGGC AGCTGTGAGC ATTGAACGCT TCCTGAGTGT GGCCCACCCA 360CTGTGGTACA AGACCCGGCC GAGGCTGGGG CAGGCAGGTC TGGTGAGTGT GGCCTGCTGG 420CTGTTGGCCT CTGCTCACTG CAGCGTGGTC TACGTCATAG AATTCTCAGG GGACATCTCC 480CACAGCCAGG GCACCAATGG GACCTGCTAC CTGGAGTTCC GGAAGGACCA GCTAGCCATC 540CTCCTGCCCG TGCGGCTGGA GATGGCTGTG GTCCTCTTTG TGGTCCCGCT GATCATCACC 600AGCTACTGCT ACAGCCGCCT GGTGTGGATC CTCGGCAGAG GGGGCAGCCA CCGCCGGCAG 660AGGAGGGTGG CGGGGCTGTT GGCGGCCACG CTGCTCAACT TCCTTGTCTG CTTTGGGCCC 720TACAACGTGT CCCATGTCGT GGGCTATATC TGCGGTGAAA GCCCGGCATG GAGGATCTAC 780GTGACGCTTC TCAGCACCCT GAACTCCTGT GTCGACCCCT TTGTCTACTA CTTCTCCTCC 840TCCGGGTTCC AAGCCGACTT TCATGAGCTG CTGAGGAGGT TGTGTGGGCT CTGGGGCCAG 900TGGCAGCAGG AGAGCAGCAT GGAGCTGAAG GAGCAGAAGG GAGGGGAGGA GCAGAGAGCG 960GACCGACCAG CTGAAAGAAA GACCAGTGAA CACTCACAGG GCTGTGGAAC TGGTGGCCAG 1020GTGGCCTGTG CTGAAAGCTA G 1041(255)SEQ ID NO254的資料(i)序列特征(A)長度346個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO254的序列描述Met Asp Thr Gly Pro Asp Gln Ser Tyr Phe Ser Gly Asn His Trp Phe1 5 10 15Val Phe Ser Val Tyr Leu Leu Thr Phe Leu Val Gly Leu Pro Leu Asn20 25 30Leu Leu Ala Leu Val Val Phe Val Gly Lys Leu Gln Arg Arg Pro Val35 40 45Ala Val Asp Val Leu Leu Leu Asn Leu Thr Ala Ser Asp Leu Leu Leu50 55 60Leu Leu Phe Leu Pro Phe Arg Met Val Glu Ala Ala Asn Gly Met His65 70 75 80Trp Pro Leu Pro Phe Ile Leu Cys Pro Leu Ser Gly Phe Ile Phe Phe85 90 95Thr Thr Ile Tyr Leu Thr Ala Leu Phe Leu Ala Ala Val Ser Ile Glu100 105 110Arg Phe Leu Ser Val Ala His Pro Leu Trp Tyr Lys Thr Arg Pro Arg115 120 125Leu Gly Gln Ala Gly Leu Val Ser Val Ala Cys Trp Leu Leu Ala Ser130 135 140Ala His Cys Ser Val Val Tyr Val Ile Glu Phe Ser Gly Asp Ile Ser145 150 155 160His Ser Gln Gly Thr Asn Gly Thr Cys Tyr Leu Glu Phe Arg Lys Asp165 170 175Gln Leu Ala Ile Leu Leu Pro Val Arg Leu Glu Met Ala Val Val Leu180 185 190Phe Val Val Pro Leu Ile Ile Thr Ser Tyr Cys Tyr Ser Arg Leu Val195 200 205Trp Ile Leu Gly Arg Gly Gly Ser His Arg Arg Gln Arg Arg Val Ala210 215 220Gly Leu Leu Ala Ala Thr Leu Leu Asn Phe Leu Val Cys Phe Gly Pro225 230 235 240Tyr Asn Val Ser His Val Val Gly Tyr Ile Cys Gly Glu Ser Pro Ala245 250 255Trp Arg Ile Tyr Val Thr Leu Leu Ser Thr Leu Asn Ser Cys Val Asp260 265 270Pro Phe Val Tyr Tyr Phe Ser Ser Ser Gly Phe Gln Ala Asp Phe His275 280 285Glu Leu Leu Arg Arg Leu Cys Gly Leu Trp Gly Gln Trp Gln Gln Glu290 295 300Ser Ser Met Glu Leu Lys Glu Gln Lys Gly Gly Glu Glu Gln Arg Ala305 310 315 320Asp Arg Pro Ala Glu Arg Lys Thr Ser Glu His Ser Gln Gly Cys Gly325 330 335Thr Gly Gly Gln Val Ala Cys Ala Glu Ser340 345(256)SEQ ID NO255的資料(i)序列特征(A)長度31個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO255的序列描述TTTAAGCTTC CCCTCCAGGA TGCTGCCGGA C 31(257)SEQ ID NO256的資料(i)序列特征(A)長度31個堿基對(B )類型核酸(C)鏈型單鏈(D)拓撲學不相關(ii)分子類型DNA(基因組的)(xi)SEQ ID NO256的序列描述GGCGAATTCT GAAGGTCCAG GGAAACTGCT A 31(258)SEQ ID NO257的資料(i)序列特征(A)長度993個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA (基因組的)(xi)SEQ ID NO257的序列描述ATGCTGCCGG ACTGGAAGAG CTCCTTGATC CTCATGGCTT ACATCATCAT CTTCCTCACT 60GGCCTCCCTG CCAACCTCCT GGCCCTGCGG GCCTTTGTGG GGCGGATCCG CCAGCCCCAG 120CCTGCACCTG TGCACATCCT CCTGCTGAGC CTGACGCTGG CCGACCTCCT CCTGCTGCTG 180CTGCTGCCCT TCAAGATCAT CGAGGCTGCG TCGAACTTCC GCTGGTACCT GCCCAAGGTC 240GTCTGCGCCC TCACGAGTTT TGGCTTCTAC AGCAGCATCT ACTGCAGCAC GTGGCTCCTG 300GCGGGCATCA GCATCGAGCG CTACCTGGGA GTGGCTTTCC CCGTGCAGTA CAAGCTCTCC 360CGCCGGCCTC TGTATGGAGT GATTGCAGCT CTGGTGGCCT GGGTTATGTC CTTTGGTCAC 420TGCACCATCG TGATCATCGT TCAATACTTG AACACGACTG AGCAGGTCAG AAGTGGCAAT 480GAAATTACCT GCTACGAGAA CTTCACCGAT AACCAGTTGG ACGTGGTGCT GCCCGTGCGG 540CTGGAGCTGT GCCTGGTGCT CTTCTTCATC CCCATGGCAG TCACCATCTT CTGCTACTGG 600CGTTTTGTGT GGATCATGCT CTCCCAGCCC CTTGTGGGGG CCCAGAGGCG GCGCCGAGCC 660GTGGGGCTGG CTGTGGTGAC GCTGCTCAAT TTCCTGGTGT GCTTCGGACC TTACAACGTG 720TCCCACCTGG TGGGGTATCA CCAGAGAAAA AGCCCCTGGT GGCGGTCAAT AGCCGTGGTG 780TTCAGTTCAC TCAACGCCAG TCTGGACCCC CTGCTCTTCT ATTTCTCTTC TTCAGTGGTG 840CGCAGGGCAT TTGGGAGAGG GCTGCAGGTG CTGCGGAATC AGGGCTCCTC CCTGTTGGGA 900CGCAGAGGCA AAGACACAGC AGAGGGGACA AATGAGGACA GGGGTGTGGG TCAAGGAGAA 960GGGATGCCAA GTTCGGACTT CACTACAGAG TAG 993(259)SEQ ID NO258的資料(i)序列特征(A)長度362個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi) SEQ ID NO258的序列描述Met Leu Pro Asp Trp Lys Ser Ser Leu Ile Leu Met Ala Tyr Ile Ile1 5 10 15Ile Phe Leu Thr Gly Leu Pro Ala Asn Leu Leu Ala Leu Arg Ala Phe20 25 30Val Gly Arg Ile Arg Gln Pro Gln Pro Ala Pro Val His Ile Leu Leu35 40 45Leu Ser Leu Thr Leu Ala Asp Leu Leu Leu Leu Leu Leu Leu Pro Phe50 55 60Lys Ile Ile Glu Ala Ala Ser Asn Phe Arg Trp Tyr Leu Pro Lys Val65 70 75 80Val Cys Ala Leu Thr Ser Phe Gly Phe Tyr Ser Ser Ile Tyr Cys Ser85 90 95Thr Trp Leu Leu Ala Gly Ile Ser Ile Glu Arg Tyr Leu Gly Val Ala100 105 110Phe Pro Val Gln Tyr Lys Leu Ser Arg Arg Pro Leu Tyr Gly Val Ile115 120 125Ala Ala Leu Val Ala Trp Val Met Ser Phe Gly His Cys Thr Ile Val130 135 140Ile Ile Val Gln Tyr Leu Asn Thr Thr Glu Gln Val Arg Ser Gly Asn145 150 155 160Glu Ile Thr Cys Tyr Glu Asn Phe Thr Asp Asn Gln Leu Asp Val Val165 170 175Leu Pro Val Arg Leu Glu Leu Cys Leu Val Leu Phe Phe Ile Pro Met180 185 190Ala Val Thr Ile Phe Cys Tyr Trp Arg Phe Val Trp Ile Met Leu Ser195 200 205Gln Pro Leu Val Gly Ala Gln Arg Arg Arg Arg Ala Val Gly Leu Ala210 215 220Val Val Thr Leu Leu Asn Phe Leu Val Cys Phe Gly Pro Tyr Asn Val225 230 235 240Ser His Leu Val Gly Tyr His Gln Arg Lys Ser Pro Trp Trp Arg Ser245 250 255Ile Ala Val Val Phe Ser Ser Leu Asn Ala Ser Leu Asp Pro Leu Leu260 265 270Phe Tyr Phe Ser Ser Ser Val Val Arg Arg Ala Phe Gly Arg Gly Leu275 280 285Gln Val Leu Arg Asn Gln Gly Ser Ser Leu Leu Gly Arg Arg Gly Lys290 295 300Asp Thr Ala Glu Gly Thr Asn Glu Asp Arg Gly Val Gly Gln Gly Glu305 310 315 320Gly Met Pro Ser Ser Asp Phe Thr Thr Glu325 330(260)SEQ ID NO259的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO259的列描述CCCAAGCTTC GGGCACCATG GACACCTCCC 30(261)SEQ ID NO260的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO260的序列描述ACAGGATCCA AATGCACAGC ACTGGTAAGC 30(262)SEQ ID NO261的資料(i)序列特征(A)長度25個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO261的序列描述CTATAACTGG GTTACATGGT TTAAC25(263)SEQ ID NO262的資料(i)序列特征
(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO262的序列描述TTTGAATTCA CATATTAATT AGAGACATGG 30(264)SEQ ID NO263的資料(i)序列特征(A)長度2724個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO263的序列描述ATGGACACCT CCCGGCTCGG TGTGCTCCTG TCCTTGCCTG TGCTGCTGCA GCTGGCGACC 60GGGGGCAGCT CTCCCAGGTC TGGTGTGTTG CTGAGGGGCT GCCCCACACA CTGTCATTGC 120GAGCCCGACG GCAGGATGTT GCTCAGGGTG GACTGCTCCG ACCTGGGGCT CTCGGAGCTG 180CCTTCCAACC TCAGCGTCTT CACCTCCTAC CTAGACCTCA GTATGAACAA CATCAGTCAG 240CTGCTCCCGA ATCCCCTGCC CAGTCTCCGC TTCCTGGAGG AGTTACGTCT TGCGGGAAAC 300GCTCTGACAT ACATTCCCAA GGGAGCATTC ACTGGCCTTT ACAGTCTTAA AGTTCTTATG 360CTGCAGAATA ATCAGCTAAG ACACGTACCC ACAGAAGCTC TGCAGAATTT GCGAAGCCTT 420CAATCCCTGC GTCTGGATGC TAACCACATC AGCTATGTGC CCCCAAGCTG TTTCAGTGGC 480CTGCATTCCC TGAGGCACCT GTGGCTGGAT GACAATGCGT TAACAGAAAT CCCCGTCCAG 540GCTTTTAGAA GTTTATCGGC ATTGCAAGCC ATGACCTTGG CCCTGAACAA AATACACCAC 600ATACCAGACT ATGCCTTTGG AAACCTCTCC AGCTTGGTAG TTCTACATCT CCATAACAAT 660AGAATCCACT CCCTGGGAAA GAAATGCTTT GATGGGCTCC ACAGCCTAGA GACTTTAGAT 720TTAAATTACA ATAACCTTGA TGAATTCCCC ACTGCAATTA GGACACTCTC CAACCTTAAA 780GAACTAGGAT TTCATAGCAA CAATATCAGG TCGATACCTG AGAAAGCATT TGTAGGCAAC 840CCTTCTCTTA TTACAATACA TTTCTATGAC AATCCCATCC AATTTGTTGG GAGATCTGCT 900TTTCAACATT TACCTGAACT AAGAACACTG ACTCTGAATG GTGCCTCACA AATAACTGAA 960TTTCCTGATT TAACTGGAAC TGCAAACCTG GAGAGTCTGA CTTTAACTGG AGCACAGATC 1020TCATCTCTTC CTCAAACCGT CTGCAATCAG TTACCTAATC TCCAAGTGCT AGATCTGTCT 1080TACAACCTAT TAGAAGATTT ACCCAGTTTT TCAGTCTGCC AAAAGCTTCA GAAAATTGAC 1140CTAAGACATA ATGAAATCTA CGAAATTAAA GTTGACACTT TCCAGCAGTT GCTTAGCCTC 1200CGATCGCTGA ATTTGGCTTG GAACAAAATT GCTATTATTC ACCCCAATGC ATTTTCCACT 1260TTGCCATCCC TAATAAAGCT GGACCTATCG TCCAACCTCC TGTCGTCTTT TCCTATAACT 1320GGGTTACATG GTTTAACTCA CTTAAAATTA ACAGGAAATC ATGCCTTACA GAGCTTGATA 1380TCATCTGAAA ACTTTCCAGA ACTCAAGGTT ATAGAAATGC CTTATGCTTA CCAGTGCTGT 1440GCATTTGGAG TGTGTGAGAA TGCCTATAAG ATTTCTAATC AATGGAATAA AGGTGACAAC 1500AGCAGTATGG ACGACCTTCA TAAGAAAGAT GCTGGAATGT TTCAGGCTCA AGATGAACGT 1560GACCTTGAAG ATTTCCTGCT TGACTTTGAG GAAGACCTGA AAGCCCTTCA TTCAGTGCAG 1620TGTTCACCTT CCCCAGGCCC CTTCAAACCC TGTGAACACC TGCTTGATGG CTGGCTGATC 1680AGAATTGGAG TGTGGACCAT AGCAGTTCTG GCACTTACTT GTAATGCTTT GGTGACTTCA 1740ACAGTTTTCA GATCCCCTCT GTACATTTCC CCCATTAAAC TGTTAATTGG GGTCATCGCA 1800GCAGTGAACA TGCTCACGGG AGTCTCCAGT GCCGTGCTGG CTGGTGTGGA TGCGTTCACT 1860TTTGGCAGCT TTGCACGACA TGGTGCCTGG TGGGAGAATG GGGTTGGTTG CCATGTCATT 1920GGTTTTTTGT CCATTTTTGC TTCAGAATCA TCTGTTTTCC TGCTTACTCT GGCAGCCCTG 1980GAGCGTGGGT TCTCTGTGAA ATATTCTGCA AAATTTGAAA CGAAAGCTCC ATTTTCTAGC 2040CTGAAAGTAA TCATTTTGCT CTGTGCCCTG CTGGCCTTGA CCATGGCCGC AGTTCCCCTG 2100CTGGGTGGCA GCAAGTATGG CGCCTCCCCT CTCTGCCTGC CTTTGCCTTT TGGGGAGCCC 2160AGCACCATGG GCTACATGGT CGCTCTCATC TTGCTCAATT CCCTTTGCTT CCTCATGATG 2220ACCATTGCCT ACACCAAGCT CTACTGCAAT TTGGACAAGG GAGACCTGGA GAATATTTGG 2280GACTGCTCTA TGGTAAAACA CATTGCCCTG TTGCTCTTCA CCAACTGCAT CCTAAACTGC 2340CCTGTGGCTT TCTTGTCCTT CTCCTCTTTA ATAAACCTTA CATTTATCAG TCCTGAAGTA 2400ATTAAGTTTA TCCTTCTGGT GGTAGTCCCA CTTCCTGCAT GTCTCAATCC CCTTCTCTAC 2460ATCTTGTTCA ATCCTCACTT TAAGGAGGAT CTGGTGAGCC TGAGAAAGCA AACCTACGTC 2520TGGACAAGAT CAAAACACCC AAGCTTGATG TCAATTAACT CTGATGATGT CGAAAAACAG 2580TCCTGTGACT CAACTCAAGC CTTGGTAACC TTTACCAGCT CCAGCATCAC TTATGACCTG 2640CCTCCCAGTT CCGTGCCATC ACCAGCTTAT CCAGTGACTG AGAGCTGCCA TCTTTCCTCT 2700GTGGCATTTG TCCCATGTCT CTAA2724(265)SEQ ID NO264的資料(i)序列特征(A)長度907個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO264的序列描述Met Asp Thr Ser Arg Leu Gly Val Leu Leu Ser Leu Pro Val Leu Leu1 5 10 15Gln Leu Ala Thr Gly Gly Ser Ser Pro Arg Ser Gly Val Leu Leu Arg20 25 30Gly Cys Pro Thr His Cys His Cys Glu Pro Asp Gly Arg Met Leu Leu35 40 45Arg Val Asp Cys Ser Asp Leu Gly Leu Ser Glu Leu Pro Ser Asn Leu50 55 60Ser Val Phe Thr Ser Tyr Leu Asp Leu Ser Met Asn Asn Ile Ser Gln65 70 75 80Leu Leu Pro Asn Pro Leu Pro Ser Leu Arg Phe Leu Glu Glu Leu Arg85 90 95Leu Ala Gly Asn Ala Leu Thr Tyr Ile Pro Lys Gly Ala Phe Thr Gly100 105 110Leu Tyr Ser Leu Lys Val Leu Met Leu Gln Asn Asn Gln Leu Arg His115 120 125Val Pro Thr Glu Ala Leu Gln Asn Leu Arg Ser Leu Gln Ser Leu Arg130 135 140Leu Asp Ala Asn His Ile Ser Tyr Val Pro Pro Ser Cys Phe Ser Gly145 150 155 160Leu His Ser Leu Arg His Leu Trp Leu Asp Asp Asn Ala Leu Thr Glu165 170 175Ile Pro Val Gln Ala Phe Arg Ser Leu Ser Ala Leu Gln Ala Met Thr180 185 190Leu Ala Leu Asn Lys Ile His His Ile Pro Asp Tyr Ala Phe Gly Asn195 200 205Leu Ser Ser Leu Val Val Leu His Leu His Asn Asn Arg Ile His Ser210 215 220Leu Gly Lys Lys Cys Phe Asp Gly Leu His Ser Leu Glu Thr Leu Asp225 230 235 240Leu Asn Tyr Asn Asn Leu Asp Glu Phe Pro Thr Ala Ile Arg Thr Leu245 250 255Ser Asn Leu Lys Glu Leu Gly Phe His Ser Asn Asn Ile Arg Ser Ile260 265 270Pro Glu Lys Ala Phe Val Gly Asn Pro Ser Leu Ile Thr Ile His Phe275 280 285Tyr Asp Asn Pro Ile Gln Phe Val Gly Arg Ser Ala Phe Gln His Leu290 295 300Pro Glu Leu Arg Thr Leu Thr Leu Asn Gly Ala Ser Gln Ile Thr Glu305 310 315 320Phe Pro Asp Leu Thr Gly Thr Ala Asn Leu Glu Ser Leu Thr Leu Thr325 330 335Gly Ala Gln Ile Ser Ser Leu Pro Gln Thr Val Cys Asn Gln Leu Pro340 345 350Asn Leu Gln Val Leu Asp Leu Ser Tyr Asn Leu Leu Glu Asp Leu Pro355 360 365Ser Phe Ser Val Cys Gln Lys Leu Gln Lys Ile Asp Leu Arg His Asn370 375 380Glu Ile Tyr Glu Ile Lys Val Asp Thr Phe Gln Gln Leu Leu Ser Leu385 390 395 400Arg Ser Leu Asn Leu Ala Trp Asn Lys Ile Ala Ile Ile His Pro Asn405 410 415Ala Phe Ser Thr Leu Pro Ser Leu Ile Lys Leu Asp Leu Ser Ser Asn420 425 430Leu Leu Ser Ser Phe Pro Ile Thr Gly Leu His Gly Leu Thr His Leu435 440 445Lys Leu Thr Gly Asn His Ala Leu Gln Ser Leu Ile Ser Ser Glu Asn450 455 460Phe Pro Glu Leu Lys Val Ile Glu Met Pro Tyr Ala Tyr Gln Cys Cys465 470 475 480Ala Phe Gly Val Cys Glu Asn Ala Tyr Lys Ile Ser Asn Gln Trp Asn485 490 495Lys Gly Asp Asn Ser Ser Met Asp Asp Leu His Lys Lys Asp Ala Gly500 505 510Met Phe Gln Ala Gln Asp Glu Arg Asp Leu Glu Asp Phe Leu Leu Asp515 520 525Phe Glu Glu Asp Leu Lys Ala Leu His Ser Val Gln Cys Ser Pro Ser530 535 540Pro Gly Pro Phe Lys Pro Cys Glu His Leu Leu Asp Gly Trp Leu Ile545 550 555 560Arg Ile Gly Val Trp Thr Ile Ala Val Leu Ala Leu Thr Cys Asn Ala565 570 575Leu Val Thr Ser Thr Val Phe Arg Ser Pro Leu Tyr Ile Ser Pro Ile580 585 590Lys Leu Leu Ile Gly Val Ile Ala Ala Val Asn Met Leu Thr Gly Val595 600 605Ser Ser Ala Val Leu Ala Gly Val Asp Ala Phe Thr Phe Gly Ser Phe610 615 620Ala Arg His Gly Ala Trp Trp Glu Asn Gly Val Gly Cys His Val Ile625 630 635 640Gly Phe Leu Ser Ile Phe Ala Ser Glu Ser Ser Val Phe Leu Leu Thr645 650 655Leu Ala Ala Leu Glu Arg Gly Phe Ser Val Lys Tyr Ser Ala Lys Phe660 665 670Glu Thr Lys Ala Pro Phe Ser Ser Leu Lys Val Ile Ile Leu Leu Cys675 680 685Ala Leu Leu Ala Leu Thr Met Ala Ala Val Pro Leu Leu Gly Gly Ser690 695 700Lys Tyr Gly Ala Ser Pro Leu Cys Leu Pro Leu Pro Phe Gly Glu Pro705 710 715 720Ser Thr Met Gly Tyr Met Val Ala Leu Ile Leu Leu Asn Ser Leu Cys725 730 735Phe Leu Met Met Thr Ile Ala Tyr Thr Lys Leu Tyr Cys Asn Leu Asp740 745 750Lys Gly Asp Leu Glu Asn Ile Trp Asp Cys Ser Met Val Lys His Ile755 760 765Ala Leu Leu Leu Phe Thr Asn Cys Ile Leu Asn Cys Pro Val Ala Phe770 775 780Leu Ser Phe Ser Ser Leu Ile Asn Leu Thr Phe Ile Ser Pro Glu Val785 790 795 800Ile Lys Phe Ile Leu Leu Val Val Val Pro Leu Pro Ala Cys Leu Asn805 810 815Pro Leu Leu Tyr Ile Leu Phe Asn Pro His Phe Lys Glu Asp Leu Val820 825 830Ser Leu Arg Lys Gln Thr Tyr Val Trp Thr Arg Ser Lys His Pro Ser835 840 845Leu Met Ser Ile Asn Ser Asp Asp Val Glu Lys Gln Ser Cys Asp Ser850 855 860Thr Gln Ala Leu Val Thr Phe Thr Ser Ser Ser Ile Thr Tyr Asp Leu865 870 875 880Pro Pro Ser Ser Val Pro Ser Pro Ala Tyr Pro Val Thr Glu Ser Cys885 890 895His Leu Ser Ser Val Ala Phe Val Pro Cys Leu900 905(266)SEQ ID NO265的資料(i)序列特征(A)長度30個堿基對
(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO265的序列描述CGGAAGCTGC GGGCCAAATG GGTGGCCGGC 30(267)SEQ ID NO266的資料(i)序列特征(A)長度27個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO266的序列描述CAGAGGAGGG TGAAGGGGCT GTTGGCG 27(268)SEQ ID NO267的資料(i)序列特征(A)長度30個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO267的序列描述GGCGGCGCCG AGCCAAGGGG CTGGCTGTGG30(269)SEQ ID NO268的資料(i)序列特征(A)長度32個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO268的序列描述GGGACTGCTC TATGAAAAAA CACATTGCCC TG32(270)SEQ ID NO269的資料(i)序列特征(A)長度1071個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO269的序列描述ATGAATGGGG TCTCGGAGGG GACCAGAGGC TGCAGTGACA GGCAACCTGG GGTCCTGACA 60CGTGATCGCT CTTGTTCCAG GAAGATGAAC TCTTCCGGAT GCCTGTCTGA GGAGGTGGGG 120TCCCTCCGCC CACTGACTGT GGTTATCCTG TCTGCGTCCA TTGTCGTCGG AGTGCTGGGC 180AATGGGCTGG TGCTGTGGAT GACTGTCTTC CGTATGGCAC GCACGGTCTC CACCGTCTGC 240TTCTTCCACC TGGCCCTTGC CGATTTCATG CTCTCACTGT CTCTGCCCAT TGCCATGTAC 300TATATTGTCT CCAGGCAGTG GCTCCTCGGA GAGTGGGCCT GCAAACTCTA CATCACCTTT 60GTGTTCCTCA GCTACTTTGC CAGTAACTGC CTCCTTGTCT TCATCTCTGT GGACCGTTGC 420ATCTCTGTCC TCTACCCCGT CTGGGCCCTG AACCACCGCA CTGTGCAGCG GGCGAGCTGG 480CTGGCCTTTG GGGTGTGGCT CCTGGCCGCC GCCTTGTGCT CTGCGCACCT GAAATTCCGG 540ACAACCAGAA AATGGAATGG CTGTACGCAC TGCTACTTGG CGTTCAACTC TGACAATGAG 600ACTGCCCAGA TTTGGATTGA AGGGGTCGTG GAGGGACACA TTATAGGGAC CATTGGCCAC 660TTCCTGCTGG GCTTCCTGGG GCCCTTAGCA ATCATAGGCA CCTGCGCCCA CCTCATCCGG 720GCCAAGCTCT TGCGGGAGGG CTGGGTCCAT GCCAACCGGC CCAAGAGGCT GCTGCTGGTG 780CTGGTGAGCG CTTTCTTTAT CTTCTGGTCC CCGTTTAACG TGGTGCTGTT GGTCCATCTG 840TGGCGACGGG TGATCCTCAA GGAAATCTAC CACCCCCGGA TGCTGCTCAT CCTCCAGGCT 900AGCTTTGCCT TGGGCTGTGT CAACAGCAGC CTCAACCCCT TCCTCTACGT CTTCGTTGGC 960AGAGATTTCC AAGAAAAGTT TTTCCAGTCT TTGACTTCTG CCCTGGCGAG GGCGTTTGGA 1020GAGGAGGAGT TTCTGTCATC CTGTCCCCGT GGCAACGCCC CCCGGGAATG A 1071(271)SEQ ID NO270的資料(i)序列特征(A)長度356個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi) SEQ ID NO270的序列描述Met Asn Gly Val Ser Glu Gly Thr Arg Gly Cys Ser Asp Arg Gln Pro1 5 10 15Gly Val Leu Thr Arg Asp Arg Ser Cys Ser Arg Lys Met Asn Ser Ser20 25 30Gly Cys Leu Ser Glu Glu Val Gly Ser Leu Arg Pro Leu Thr Val Val
35 40 45Ile Leu Ser Ala Ser Ile Val Val Gly Val Leu Gly Asn Gly Leu Val50 55 60Leu Trp Met Thr Val Phe Arg Met Ala Arg Thr Val Ser Thr Val Cys65 70 75 80Phe Phe His Leu Ala Leu Ala Asp Phe Met Leu Ser Leu Ser Leu Pro85 90 95Ile Ala Met Tyr Tyr Ile Val Ser Arg Gln Trp Leu Leu Gly Glu Trp100 105 110Ala Cys Lys Leu Tyr Ile Thr Phe Val Phe Leu Ser Tyr Phe Ala Ser115 120 125Asn Cys Leu Leu Val Phe Ile Ser Val Asp Arg Cys Ile Ser Val Leu130 135 140Tyr Pro Val Trp Ala Leu Asn His Arg Thr Val Gln Arg Ala Ser Trp145 150 155 160Leu Ala Phe Gly Val Trp Leu Leu Ala Ala Ala Leu Cys Ser Ala His165 170 175Leu Lys Phe Arg Thr Thr Arg Lys Trp Asn Gly Cys Thr His Cys Tyr180 185 190Leu Ala Phe Asn Ser Asp Asn Glu Thr Ala Gln Ile Trp Ile Glu Gly195 200 205Val Val Glu Gly His Ile Ile Gly Thr Ile Gly His Phe Leu Leu Gly210 215 220Phe Leu Gly Pro Leu Ala Ile Ile Gly Thr Cys Ala His Leu Ile Arg225 230 235 240Ala Lys Leu Leu Arg Glu Gly Trp Val His Ala Asn Arg Pro Lys Arg245 250 255Leu Leu Leu Val Leu Val Ser Ala Phe Phe Ile Phe Trp Ser Pro Phe260 265 270Ash Val Val Leu Leu Val His Leu Trp Arg Arg Val Met Leu Lys Glu275 280 285Ile Tyr His Pro Arg Met Leu Leu Ile Leu Gln Ala Ser Phe Ala Leu290 295 300Gly Cys Val Asn Ser Ser Leu Asn Pro Phe Leu Tyr Val Phe Val Gly305 310 315 320Arg Asp Phe Gln Glu Lys Phe Phe Gln Ser Leu Thr Ser Ala Leu Ala325 330 335Arg Ala Phe Gly Glu Glu Glu Phe Leu Ser Ser Cys Pro Arg Gly Asn340 345 350Ala Pro Arg Glu355(272)SEQ ID NO271的資料(i)序列特征(A)長度903個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO271的序列描述ATGGACCTGC CCCCGCAGCT CTCCTTCGGC CTCTATGTGG CCGCCTTTGC GCTGGGCTTC 60CCGCTCAACG TCCTGGCCAT CCGAGGCGCG ACGGCCCACG CCCGGCTCCG TCTCACCCCT 120AGCCTGGTCT ACGCCCTGAA CCTGGGCTGC TCCGACCTGC TGCTGACAGT CTCTCTGCCC 180CTGAAGGCGG TGGAGGCGCT AGCCTCCGGG GCCTGGCCTC TGCCGGCCTC GCTGTGCCCC 240GTCTTCGCGG TGGCCCACTT CTTCCCACTC TATGCCGGCG GGGGCTTCCT GGCCGCCCTG 300AGTGCAGGCC GCTACCTGGG AGCAGCCTTC CCCTTGGGCT ACCAAGCCTT CCGGAGGCCG 360TGCTATTCCT GGGGGGTGTG CGCGGCCATC TGGGCCCTCG TCCTGTGTCA CCTGGGTCTG 420GTCTTTGGGT TGGAGGCTCC AGGAGGCTGG CTGGACCACA GCAACACCTC CCTGGGCATC 480AACACACCGG TCAACGGCTC TCCGGTCTGC CTGGAGGCCT GGGACCCGGC CTCTGCCGGC 540CCGGCCCGCT TCAGCCTCTC TCTCCTGCTC TTTTTTCTGC CCTTGGCCAT CACAGCCTTC 600TGCTACGTGG GCTGCCTCCG GGCACTGGCC CGCTCCGGCC TGACGCACAG GCGGAAGCTG 660CGGGCCAAAT GGGTGGCCGG CGGGGCCCTC CTCACGCTGC TGCTCTGCGT AGGACCCTAC 720AACGCCTCCA ACGTGGCCAG CTTCCTGTAC CCCAATCTAG GAGGCTCCTG GCGGAAGCTG 780GGGCTCATCA CGGGTGCCTG GAGTGTGGTG CTTAATCCGC TGGTGACCGG TTACTTGGGA 840AGGGGTCCTG GCCTGAAGAC AGTGTGTGCG GCAAGAACGC AAGGGGGCAA GTCCCAGAAG 900TAA 903(273)SEQ ID NO272的資料(i)序列特征(A)長度300個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO272的序列描述Met Asp Leu Pro Pro Gln Leu Ser Phe Gly Leu Tyr Val Ala Ala Phe1 5 10 15Ala Leu Gly Phe Pro Leu Asn Val Leu Ala Ile Arg Gly Ala Thr Ala20 25 30His Ala Arg Leu Arg Leu Thr Pro Ser Leu Val Tyr Ala Leu Asn Leu35 40 45Gly Cys Ser Asp Leu Leu Leu Thr Val Ser Leu Pro Leu Lys Ala Val50 55 60Glu Ala Leu Ala Ser Gly Ala Trp Pro Leu Pro Ala Ser Leu Cys Pro65 70 75 80Val Phe Ala Val Ala His Phe Phe Pro Leu Tyr Ala Gly Gly Gly Phe85 90 95Leu Ala Ala Leu Ser Ala Gly Arg Tyr Leu Gly Ala Ala Phe Pro Leu100 105 110Gly Tyr Gln Ala Phe Arg Arg Pro Cys Tyr Ser Trp Gly Val Cys Ala115 120 125Ala Ile Trp Ala Leu Val Leu Cys His Leu Gly Leu Val Phe Gly Leu130 135 140Glu Ala Pro Gly Gly Trp Leu Asp His Ser Asn Thr Ser Leu Gly Ile145 150 155 160Asn Thr Pro Val Asn Gly Ser Pro Val Cys Leu Glu Ala Trp Asp Pro165 170 175Ala Ser Ala Gly Pro Ala Arg Phe Ser Leu Ser Leu Leu Leu Phe Phe180 185 190Leu Pro Leu Ala Ile Thr Ala Phe Cys Tyr Val Gly Cys Leu Arg Ala195 200 205Leu Ala Arg Ser Gly Leu Thr His Arg Arg Lys Leu Arg Ala Lys Trp210 215 220Val Ala Gly Gly Ala Leu Leu Thr Leu Leu Leu Cys Val Gly Pro Tyr225 230 235 240Asn Ala Ser Asn Val Ala Ser Phe Leu Tyr Pro Asn Leu Gly Gly Ser245 250 255Trp Arg Lys Leu Gly Leu Ile Thr Gly Ala Trp Ser Val Val Leu Asn260 265 270Pro Leu Val Thr Gly Tyr Leu Gly Arg Gly Pro Gly Leu Lys Thr Val
275 280285Cys Ala Ala Arg Thr Gln Gly Gly Lys Ser Gln Lys290 295 300(274)SEQ ID NO273的資料(i)序列特征(A)長度1041個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO273的序列描述ATGGATACAG GCCCCGACCA GTCCTACTTC TCCGGCAATC ACTGGTTCGT CTTCTCGGTG 60TACCTTCTCA CTTTCCTGGT GGGGCTCCCC CTCAACCTGC TGGCCCTGGT GGTCTTCGTG 120GGCAAGCTGC AGCGCCGCCC GGTGGCCGTG GACGTGCTCC TGCTCAACCT GACCGCCTCG 180GACCTGCTCC TGCTGCTGTT CCTGCCTTTC CGCATGGTGG AGGCAGCCAA TGGCATGCAC 240TGGCCCCTGC CCTTCATCCT CTGCCCACTC TCTGGATTCA TCTTCTTCAC CACCATCTAT 300CTCACCGCCC TCTTCCTGGC AGCTGTGAGC ATTGAACGCT TCCTGAGTGT GGCCCACCCA 360CTGTGGTACA AGACCCGGCC GAGGCTGGGG CAGGCAGGTC TGGTGAGTGT GGCCTGCTGG 420CTGTTGGCCT CTGCTCACTG CAGCGTGGTC TACGTCATAG AATTCTCAGG GGACATCTCC 480CACAGCCAGG GCACCAATGG GACCTGCTAC CTGGAGTTCC GGAAGGACCA GCTAGCCATC 540CTCCTGCCCG TGCGGCTGGA GATGGCTGTG GTCCTCTTTG TGGTCCCGCT GATCATCACC 600AGCTACTGCT ACAGCCGCCT GGTGTGGATC CTCGGCAGAG GGGGCAGCCA CCGCCGGCAG 660AGGAGGGTGA AGGGGCTGTT GGCGGCCACG CTGCTCAACT TCCTTGTCTG CTTTGGGCCC 720TACAACGTGT CCCATGTCGT GGGCTATATC TGCGGTGAAA GCCCGGCATG GAGGATCTAC 780GTGACGCTTC TCAGCACCCT GAACTCCTGT GTCGACCCCT TTGTCTACTA CTTCTCCTCC 840TCCGGGTTCC AAGCCGACTT TCATGAGCTG CTGAGGAGGT TGTGTGGGCT CTGGGGCCAG 900TGGCAGCAGG AGAGCAGCAT GGAGCTGAAG GAGCAGAAGG GAGGGGAGGA GCAGAGAGCG 960GACCGACCAG CTGAAAGAAA GACCAGTGAA CACTCACAGG GCTGTGGAAC TGGTGGCCAG 1020GTGGCCTGTG CTGAAAGCTA G 1041(275)SEQ ID NO274的資料(i)序列特征(A)長度346個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO274的序列描述Met Asp Thr Gly Pro Asp Gln Ser Tyr Phe Ser Gly Asn His Trp Phe1 5 10 15Val Phe Ser Val Tyr Leu Leu Thr Phe Leu Val Gly Leu Pro Leu Asn
20 25 30Leu Leu Ala Leu Val Val Phe Val Gly Lys Leu Gln Arg Arg Pro Val35 40 45Ala Val Asp Val Leu Leu Leu Asn Leu Thr Ala Ser Asp Leu Leu Leu50 55 60Leu Leu Phe Leu Pro Phe Arg Met Val Glu Ala Ala Asn Gly Met His65 70 75 80Trp Pro Leu Pro Phe Ile Leu Cys Pro Leu Ser Gly Phe Ile Phe Phe85 90 95Thr Thr Ile Tyr Leu Thr Ala Leu Phe Leu Ala Ala Val Ser Ile Glu100 105 110Arg Phe Leu Ser Val Ala His Pro Leu Trp Tyr Lys Thr Arg Pro Arg115 120 125Leu Gly Gln Ala Gly Leu Val Ser Val Ala Cys Trp Leu Leu Ala Ser130 135 140Ala His Cys Ser Val Val Tyr Val Ile Glu Phe Ser Gly Asp Ile Ser145 150 155 160His Ser Gln Gly Thr Asn Gly Thr Cys Tyr Leu Glu Phe Arg Lys Asp165 170 175Gln Leu Ala Ile Leu Leu Pro Val Arg Leu Glu Met Ala Val Val Leu180 185 190Phe Val Val Pro Leu Ile Ile Thr Ser Tyr Cys Tyr Ser Arg Leu Val195 200 205Trp Ile Leu Gly Arg Gly Gly Ser His Arg Arg Gln Arg Arg Val Lys210 215 220Gly Leu Leu Ala Ala Thr Leu Leu Asn Phe Leu Val Cys Phe Gly Pro225 230 235 240Tyr Asn Val Ser His Val Val Gly Tyr Ile Cys Gly Glu Ser Pro Ala245 250 255Trp Arg Ile Tyr Val Thr Leu Leu Ser Thr Leu Asn Ser Cys Val Asp260 265 270Pro Phe Val Tyr Tyr Phe Ser Ser Ser Gly Phe Gln Ala Asp Phe His275 280 285Glu Leu Leu Arg Arg Leu Cys Gly Leu Trp Gly Gln Trp Gln Gln Glu
290 295 300Ser Ser Met Glu Leu Lys Glu Gln Lys Gly Gly Glu Glu Gln Arg Ala305 310 315 320Asp Arg Pro Ala Glu Arg Lys Thr Ser Glu His Ser Gln Gly Cys Gly325 330 335Thr Gly Gly Gln Val Ala Cys Ala Glu Ser340 345(276)SEQ ID NO275的資料(i)序列特(A)長度993個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO275的序列描述ATGCTGCCGG ACTGGAAGAG CTCCTTGATC CTCATGGCTT ACATCATCAT CTTCCTCACT 60GGCCTCCCTG CCAACCTCCT GGCCCTGCGG GCCTTTGTGG GGCGGATCCG CCAGCCCCAG 120CCTGCACCTG TGCACATCCT CCTGCTGAGC CTGACGCTGG CCGACCTCCT CCTGCTGCTG 180CTGCTGCCCT TCAAGATCAT CGAGGCTGCG TCGAACTTCC GCTGGTACCT GCCCAAGGTC 240GTCTGCGCCC TCACGAGTTT TGGCTTCTAC AGCAGCATCT ACTGCAGCAC GTGGCTCCTG 300GCGGGCATCA GCATCGAGCG CTACCTGGGA GTGGCTTTCC CCGTGCAGTA CAAGCTCTCC 360CGCCGGCCTC TGTATGGAGT GATTGCAGCT CTGGTGGCCT GGGTTATGTC CTTTGGTCAC 420TGCACCATCG TGATCATCGT TCAATACTTG AACACGACTG AGCAGGTCAG AAGTGGCAAT 480GAAATTACCT GCTACGAGAA CTTCACCGAT AACCAGTTGG ACGTGGTGCT GCCCGTGCGG 540CTGGAGCTGT GCCTGGTGCT CTTCTTCATC CCCATGGCAG TCACCATCTT CTGCTACTGG 600CGTTTTGTGT GGATCATGCT CTCCCAGCCC CTTGTGGGGG CCCAGAGGCG GCGCCGAGCC 660AAGGGGCTGG CTGTGGTGAC GCTGCTCAAT TTCCTGGTGT GCTTCGGACC TTACAACGTG 720TCCCACCTGG TGGGGTATCA CCAGAGAAAA AGCCCCTGGT GGCGGTCAAT AGCCGTGGTG 780TTCAGTTCAC TCAACGCCAG TCTGGACCCC CTGCTCTTCT ATTTCTCTTC TTCAGTGGTG 840CGCAGGGCAT TTGGGAGAGG GCTGCAGGTG CTGCGGAATC AGGGCTCCTC CCTGTTGGGA 900CGCAGAGGCA AAGACACAGC AGAGGGGACA AATGAGGACA GGGGTGTGGG TCAAGGAGAA 960GGGATGCCAA GTTCGGACTT CACTACAGAG TAG 993(277)SEQ ID NO276的資料(i)序列特征(A)長度330個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO276的序列描述Met Leu Pro Asp Trp Lys Ser Ser Leu Ile Leu Met Ala Tyr Ile Ile1 5 10 15Ile Phe Leu Thr Gly Leu Pro Ala Asn Leu Leu Ala Leu Arg Ala Phe20 25 30Val Gly Arg Ile Arg Gln Pro Gln Pro Ala Pro Val His Ile Leu Leu35 40 45Leu Ser Leu Thr Leu Ala Asp Leu Leu Leu Leu Leu Leu Leu Pro Phe50 55 60Lys Ile Ile Glu Ala Ala Ser Asn Phe Arg Trp Tyr Leu Pro Lys Val65 70 75 80Val Cys Ala Leu Thr Ser Phe Gly Phe Tyr Ser Ser Ile Tyr Cys Ser85 90 95Thr Trp Leu Leu Ala Gly Ile Ser Ile Glu Arg Tyr Leu Gly Val Ala100 105 110Phe Pro Val Gln Tyr Lys Leu Ser Arg Arg Pro Leu Tyr Gly Val Ile115 120 125Ala Ala Leu Val Ala Trp Val Met Ser Phe Gly His Cys Thr Ile Val130 135 140Ile Ile Val Gln Tyr Leu Asn Thr Thr Glu Gln Val Arg Ser Gly Asn145 150 155 160Glu Ile Thr Cys Tyr Glu Asn Phe Thr Asp Asn Gln Leu Asp Val Val165 170 175Leu Pro Val Arg Leu Glu Leu Cys Leu Val Leu Phe Phe Ile Pro Met180 185 190Ala Val Thr Ile Phe Cys Tyr Trp Arg Phe Val Trp Ile Met Leu Ser195 200 205Gln Pro Leu Val Gly Ala Gln Arg Arg Arg Arg Ala Lys Gly Leu Ala210 215 220Val Val Thr Leu Leu Asn Phe Leu Val Cys Phe Gly Pro Tyr Asn Val225 230 235 240Ser His Leu Val Gly Tyr His Gln Arg Lys Ser Pro Trp Trp Arg Ser245 250 255Ile Ala Val Val Phe Ser Ser Leu Asn Ala Ser Leu Asp Pro Leu Leu260 265 270Phe Tyr Phe Ser Ser Ser Val Val Arg Arg Ala Phe Gly Arg Gly Leu275 280 285Gln Val Leu Arg Asn Gln Gly Ser Ser Leu Leu Gly Arg Arg Gly Lys290 295 300Asp Thr Ala Glu Gly Thr Asn Glu Asp Arg Gly Val Gly Gln Gly Glu305 310 315 320Gly Met Pro Ser Ser Asp Phe Thr Thr Glu325 330(278)SEQ ID NO277的資料(i)序列特征(A)長度2724個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO277的序列描述ATGGACACCT CCCGGCTCGG TGTGCTCCTG TCCTTGCCTG TGCTGCTGCA GCTGGCGACC 60GGGGGCAGCT CTCCCAGGTC TGGTGTGTTG CTGAGGGGCT GCCCCACACA CTGTCATTGC 120GAGCCCGACG GCAGGATGTT GCTCAGGGTG GACTGCTCCG ACCTGGGGCT CTCGGAGCTG 180CCTTCCAACC TCAGCGTCTT CACCTCCTAC CTAGACCTCA GTATGAACAA CATCAGTCAG 240CTGCTCCCGA ATCCCCTGCC CAGTCTCCGC TTCCTGGAGG AGTTACGTCT TGCGGGAAAC 300GCTCTGACAT ACATTCCCAA GGGAGCATTC ACTGGCCTTT ACAGTCTTAA AGTTCTTATG 360CTGCAGAATA ATCAGCTAAG ACACGTACCC ACAGAAGCTC TGCAGAATTT GCGAAGCCTT 420CAATCCCTGC GTCTGGATGC TAACCACATC AGCTATGTGC CCCCAAGCTG TTTCAGTGGC 480CTGCATTCCC TGAGGCACCT GTGGCTGGAT GACAATGCGT TAACAGAAAT CCCCGTCCAG 540GCTTTTAGAA GTTTATCGGC ATTGCAAGCC ATGACCTTGG CCCTGAACAA AATACACCAC 600ATACCAGACT ATGCCTTTGG AAACCTCTCC AGCTTGGTAG TTCTACATCT CCATAACAAT 660AGAATCCACT CCCTGGGAAA GAAATGCTTT GATGGGCTCC ACAGCCTAGA GACTTTAGAT 720TTAAATTACA ATAACCTTGA TGAATTCCCC ACTGCAATTA GGACACTCTC CAACCTTAAA 780GAACTAGGAT TTCATAGCAA CAATATCAGG TCGATACCTG AGAAAGCATT TGTAGGCAAC 840CCTTCTCTTA TTACAATACA TTTCTATGAC AATCCCATCC AATTTGTTGG GAGATCTGCT 900TTTCAACATT TACCTGAACT AAGAACACTG ACTCTGAATG GTGCCTCACA AATAACTGAA 960TTTCCTGATT TAACTGGAAC TGCAAACCTG GAGAGTCTGA CTTTAACTGG AGCACAGATC 1020TCATCTCTTC CTCAAACCGT CTGCAATCAG TTACCTAATC TCCAAGTGCT AGATCTGTCT 1080TACAACCTAT TAGAAGATTT ACCCAGTTTT TCAGTCTGCC AAAAGCTTCA GAAAATTGAC 1140CTAAGACATA ATGAAATCTA CGAAATTAAA GTTGACACTT TCCAGCAGTT GCTTAGCCTC 1200CGATCGCTGA ATTTGGCTTG GAACAAAATT GCTATTATTC ACCCCAATGC ATTTTCCACT 1260TTGCCATCCC TAATAAAGCT GGACCTATCG TCCAACCTCC TGTCGTCTTT TCCTATAACT 1320GGGTTACATG GTTTAACTCA CTTAAAATTA ACAGGAAATC ATGCCTTACA GAGCTTGATA 1380TCATCTGAAA ACTTTCCAGA ACTCAAGGTT ATAGAAATGC CTTATGCTTA CCAGTGCTGT 1440GCATTTGGAG TGTGTGAGAA TGCCTATAAG ATTTCTAATC AATGGAATAA AGGTGACAAC 1500AGCAGTATGG ACGACCTTCA TAAGAAAGAT GCTGGAATGT TTCAGGCTCA AGATGAACGT 1560GACCTTGAAG ATTTCCTGCT TGACTTTGAG GAAGACCTGA AAGCCCTTCA TTCAGTGCAG 1620TGTTCACCTT CCCCAGGCCC CTTCAAACCC TGTGAACACC TGCTTGATGG CTGGCTGATC 1680AGAATTGGAG TGTGGACCAT AGCAGTTCTG GCACTTACTT GTAATGCTTT GGTGACTTCA 1740ACAGTTTTCA GATCCCCTCT GTACATTTCC CCCATTAAAC TGTTAATTGG GGTCATCGCA 1800GCAGTGAACA TGCTCACGGG AGTCTCCAGT GCCGTGCTGG CTGGTGTGGA TGCGTTCACT 1860TTTGGCAGCT TTGCACGACA TGGTGCCTGG TGGGAGAATG GGGTTGGTTG CCATGTCATT 1920GGTTTTTTGT CCATTTTTGC TTCAGAATCA TCTGTTTTCC TGCTTACTCT GGCAGCCCTG 1980GAGCGTGGGT TCTCTGTGAA ATATTCTGCA AAATTTGAAA CGAAAGCTCC ATTTTCTAGC 2040CTGAAAGTAA TCATTTTGCT CTGTGCCCTG CTGGCCTTGA CCATGGCCGC AGTTCCCCTG 2100CTGGGTGGCA GCAAGTATGG CGCCTCCCCT CTCTGCCTGC CTTTGCCTTT TGGGGAGCCC 2160AGCACCATGG GCTACATGGT CGCTCTCATC TTGCTCAATT CCCTTTGCTT CCTCATGATG 2220ACCATTGCCT ACACCAAGCT CTACTGCAAT TTGGACAAGG GAGACCTGGA GAATATTTGG 2280GACTGCTCTA TGAAAAAACA CATTGCCCTG TTGCTCTTCA CCAACTGCAT CCTAAACTGC 2340CCTGTGGCTT TCTTGTCCTT CTCCTCTTTA ATAAACCTTA CATTTATCAG TCCTGAAGTA 2400ATTAAGTTTA TCCTTCTGGT GGTAGTCCCA CTTCCTGCAT GTCTCAATCC CCTTCTCTAC 2460ATCTTGTTCA ATCCTCACTT TAAGGAGGAT CTGGTGAGCC TGAGAAAGCA AACCTACGTC 2520TGGACAAGAT CAAAACACCC AAGCTTGATG TCAATTAACT CTGATGATGT CGAAAAACAG 2580TCCTGTGACT CAACTCAAGC CTTGGTAACC TTTACCAGCT CCAGCATCAC TTATGACCTG 2640CCTCCCAGTT CCGTGCCATC ACCAGCTTAT CCAGTGACTG AGAGCTGCCA TCTTTCCTCT 2700GTGGCATTTG TCCCATGTCT CTAA 2724(279)SEQ ID NO278的資料(i)序列特征(A)長度907個氨基酸(B)類型氨基酸(C)鏈型(D)拓撲學不相關(ii)分子類型蛋白質(zhì)(xi)SEQ ID NO278的序列描述Met Asp Thr Ser Arg Leu Gly Val Leu Leu Ser Leu Pro Val Leu Leu1 5 10 15Gln Leu Ala Thr Gly Gly Ser Ser Pro Arg Ser Gly Val Leu Leu Arg20 25 30Gly Cys Pro Thr His Cys His Cys Glu Pro Asp Gly Arg Met Leu Leu35 40 45Arg Val Asp Cys Ser Asp Leu Gly Leu Ser Glu Leu Pro Ser Asn Leu50 55 60Ser Val Phe Thr Ser Tyr Leu Asp Leu Ser Met Asn Asn Ile Ser Gln65 70 75 80Leu Leu Pro Asn Pro Leu Pro Ser Leu Arg Phe Leu Glu Glu Leu Arg85 90 95Leu Ala Gly Asn Ala Leu Thr Tyr Ile Pro Lys Gly Ala Phe Thr Gly100 105 110Leu Tyr Ser Leu Lys Val Leu Met Leu Gln Asn Asn Gln Leu Arg His115 120 125Val Pro Thr Glu Ala Leu Gln Asn Leu Arg Ser Leu Gln Ser Leu Arg130 135 140Leu Asp Ala Asn His Ile Ser Tyr Val Pro Pro Ser Cys Phe Ser Gly145 150 155 160Leu His Ser Leu Arg His Leu Trp Leu Asp Asp Asn Ala Leu Thr Glu165 170 175Ile Pro Val Gln Ala Phe Arg Ser Leu Ser Ala Leu Gln Ala Met Thr180 185 190Leu Ala Leu Asn Lys Ile His His Ile Pro Asp Tyr Ala Phe Gly Asn195 200 205Leu Ser Ser Leu Val Val Leu His Leu His Asn Asn Arg Ile His Ser210 215 220Leu Gly Lys Lys Cys Phe Asp Gly Leu His Ser Leu Glu Thr Leu Asp225 230 235 240Leu Asn Tyr Asn Asn Leu Asp Glu Phe Pro Thr Ala Ile Arg Thr Leu245 250 255Ser Asn Leu Lys Glu Leu Gly Phe His Ser Asn Asn Ile Arg Ser Ile260 265 270Pro Glu Lys Ala Phe Val Gly Asn Pro Ser Leu Ile Thr Ile His Phe275 280 285Tyr Asp Asn Pro Ile Gln Phe Val Gly Arg Ser Ala Phe Gln His Leu290 295 300Pro Glu Leu Arg Thr Leu Thr Leu Asn Gly Ala Ser Gln Ile Thr Glu305 310 315 320Phe Pro Asp Leu Thr Gly Thr Ala Asn Leu Glu Ser Leu Thr Leu Thr325330 335Gly Ala Gln Ile Ser Ser Leu Pro Gln Thr Val Cys Asn Gln Leu Pro340 345 350Asn Leu Gln Val Leu Asp Leu Ser Tyr Asn Leu Leu Glu Asp Leu Pro355 360 365Ser Phe Ser Val Cys Gln Lys Leu Gln Lys Ile Asp Leu Arg His Asn370 375 380Glu Ile Tyr Glu Ile Lys Val Asp Thr Phe Gln Gln Leu Leu Ser Leu385 390 395 400Arg Ser Leu Asn Leu Ala Trp Asn Lys Ile Ala Ile Ile His Pro Asn405 410 415Ala Phe Ser Thr Leu Pro Ser Leu Ile Lys Leu Asp Leu Ser Ser Asn420 425 430Leu Leu Ser Ser Phe Pro Ile Thr Gly Leu His Gly Leu Thr His Leu435 440 445Lys Leu Thr Gly Asn His Ala Leu Gln Ser Leu Ile Ser Ser Glu Asn450 455 460Phe Pro Glu Leu Lys Val Ile Glu Met Pro Tyr Ala Tyr Gln Cys Cys465 470 475 480Ala Phe Gly Val Cys Glu Asn Ala Tyr Lys Ile Ser Asn Gln Trp Asn485 490 495Lys Gly Asp Asn Ser Ser Met Asp Asp Leu His Lys Lys Asp Ala Gly500 505 510Met Phe Gln Ala Gln Asp Glu Arg Asp Leu Glu Asp Phe Leu Leu Asp515 520 525Phe Glu Glu Asp Leu Lys Ala Leu His Ser Val Gln Cys Ser Pro Ser530 535 540Pro Gly Pro Phe Lys Pro Cys Glu His Leu Leu Asp Gly Trp Leu Ile545 550 555 560Arg Ile Gly Val Trp Thr Ile Ala Val Leu Ala Leu Thr Cys Asn Ala565 570 575Leu Val Thr Ser Thr Val Phe Arg Ser Pro Leu Tyr Ile Ser Pro Ile580 585 590Lys Leu Leu Ile Gly Val Ile Ala Ala Val Asn Met Leu Thr Gly Val595 600 605Ser Ser Ala Val Leu Ala Gly Val Asp Ala Phe Thr Phe Gly Ser Phe610 615 620Ala Arg His Gly Ala Trp Trp Glu Asn Gly Val Gly Cys His Val Ile625 630 635 640Gly Phe Leu Ser Ile Phe Ala Ser Glu Ser Ser Val Phe Leu Leu Thr645 650 655Leu Ala Ala Leu Glu Arg Gly Phe Ser Val Lys Tyr Ser Ala Lys Phe660 665 670Glu Thr Lys Ala Pro Phe Ser Ser Leu Lys Val Ile Ile Leu Leu Cys675 680 685Ala Leu Leu Ala Leu Thr Met Ala Ala Val Pro Leu Leu Gly Gly Ser690 695 700Lys Tyr Gly Ala Ser Pro Leu Cys Leu Pro Leu Pro Phe Gly Glu Pro705 710 715 720Ser Thr Met Gly Tyr Met Val Ala Leu Ile Leu Leu Asn Ser Leu Cys725 730 735Phe Leu Met Met Thr Ile Ala Tyr Thr Lys Leu Tyr Cys Asn Leu Asp740 745 750Lys Gly Asp Leu Glu Asn Ile Trp Asp Cys Ser Met Lys Lys His Ile755 760 765Ala Leu Leu Leu Phe Thr Asn Cys Ile Leu Asn Cys Pro Val Ala Phe770 775 780Leu Ser Phe Ser Ser Leu Ile Asn Leu Thr Phe Ile Ser Pro Glu Val785 790 795 800Ile Lys Phe Ile Leu Leu Val Val Val Pro Leu Pro Ala Cys Leu Asn805 810 815Pro Leu Leu Tyr Ile Leu Phe Asn Pro His Phe Lys Glu Asp Leu Val820 825 830Ser Leu Arg Lys Gln Thr Tyr Val Trp Thr Arg Ser Lys His Pro Ser835 840 845Leu Met Ser Ile Asn Ser Asp Asp Val Glu Lys Gln Ser Cys Asp Ser850 855 860Thr Gln Ala Leu Val Thr Phe Thr Ser Ser Ser Ile Thr Tyr Asp Leu865 870 875 880Pro Pro Ser Ser Val Pro Ser Pro Ala Tyr Pro Val Thr Glu Ser Cys885 890 895His Leu Ser Ser Val Ala Phe Val Pro Cys Leu900 905(280)SEQ ID NO279的資料(i)序列特征
(A)長度32個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO279的序列描述CATGCCAACC GGCCCGCGAG GCTGCTGCTG GT 32(281)SEQ ID NO280的資料(i)序列特征(A)長度32個堿基對(B)類型核酸(C)鏈型單鏈(D)拓撲學線形(ii)分子類型DNA(基因組的)(xi)SEQ ID NO280的序列描述ACCAGCAGCA GCCTCGCGGG CCGGTTGGCA TG 3權利要求
1.一種內(nèi)源的人孤兒G蛋白偶聯(lián)受體(GPCR)的組成型活化的非內(nèi)源形式,該受體含有下列氨基酸殘基(從C-末端到N-末端走向),它們橫跨非內(nèi)源的GPCR的跨膜-6(TM6)和細胞內(nèi)環(huán)-3(IC3)區(qū)域P1AA15X其中(1)P1是位于非內(nèi)源GPCR的TM6區(qū)域內(nèi)的一個氨基酸殘基,其中,P1選自(i)內(nèi)源的孤兒GPCR脯氨酸殘基和(ii)除脯氨酸之外的非內(nèi)源的氨基酸殘基;(2)AA15是15個氨基酸殘基,它們選自(a)內(nèi)源的孤兒GPCR的15個內(nèi)源氨基酸殘基、(b)15個非內(nèi)源的氨基酸殘基和(c)15個組合的氨基酸殘基,其中含有內(nèi)源孤兒GPCR的至少一個內(nèi)源氨基酸殘基和至少一個非內(nèi)源氨基酸殘基的組合,除非在位于GPCR的TM6區(qū)域內(nèi)的15個內(nèi)源氨基酸殘基都不是脯氨酸;和(3)X是位于所說的非內(nèi)源GPCR的IC3區(qū)域內(nèi)的非內(nèi)源氨基酸殘基。
2.權利要求1的非內(nèi)源人GPCR,其中P1是內(nèi)源脯氨酸殘基。
3.權利要求1的非內(nèi)源人GPCR,其中P1是除脯氨酸殘基之外的非內(nèi)源氨基酸殘基。
4.權利要求1的非內(nèi)源人GPCR,其中AA15是內(nèi)源GPCR的15個內(nèi)源氨基酸殘基。
5.權利要求1的非內(nèi)源人GPCR,其中X是從由賴氨酸、組氨酸、精氨酸和丙氨酸殘基中選擇而來,但是當在所說的內(nèi)源人GPCR的X位置的內(nèi)源氨基酸是賴氨酸時,X是從由組氨酸、精氨酸和丙氨酸中選擇而來。
6.權利要求1的非內(nèi)源人GPCR,其中X是賴氨酸殘基,但是當在所說的內(nèi)源人GPCR的X位置的內(nèi)源氨基酸是賴氨酸時,X是除賴氨酸以外的氨基酸。
7.權利要求4的非內(nèi)源人GPCR,其中X是賴氨酸殘基,但是當在所說的內(nèi)源人GPCR的X位置的內(nèi)源氨基酸是賴氨酸時,X是除賴氨酸以外的氨基酸。
8.權利要求1的非內(nèi)源人GPCR,其中P1是脯氨酸殘基,X是賴氨酸殘基,但是當在所說的內(nèi)源人GPCR的X位置的內(nèi)源氨基酸是賴氨酸時,X是除賴氨酸以外的氨基酸。
9.一種包含權利要求1的非內(nèi)源人GPCR的宿主細胞。
10.權利要求9的材料,其中所說的宿主細胞是來自哺乳動物。
11.權利要求1的非內(nèi)源人GPCR,其為經(jīng)純化和分離后的形式。
12.一種編碼內(nèi)源人孤兒G蛋白偶聯(lián)受體(GPCR)的組成型活化的、非內(nèi)源形式的核酸序列,包括下列核酸序列區(qū)域,它們橫跨孤兒GPCR的跨膜-6(TM6)和細胞內(nèi)環(huán)-3(IC3)區(qū)域3’-P密碼子(AA-密碼子)15X密碼子-3’其中(1)P密碼子是位于非內(nèi)源的GPCR的TM6區(qū)內(nèi)的一個核酸編碼區(qū),其中P密碼子編碼從(i)內(nèi)源GPCR的脯氨酸殘基和(ii)除脯氨酸之外的非內(nèi)源氨基酸殘基中選擇出來的氨基酸;(2)(AA-密碼子)15是編碼15個氨基酸的15個密碼子,這些氨基酸殘基選自(a)內(nèi)源孤兒GPCR的15個內(nèi)源氨基酸殘基,(b)15個非內(nèi)源的氨基酸殘基,和(c)15個氨基酸殘基的組合,該組合包括內(nèi)源孤兒GPCR的至少一個內(nèi)源氨基酸殘基和至少一個非內(nèi)源氨基酸殘基,除非位于GPCR的TM6區(qū)域內(nèi)的15個內(nèi)源氨基酸殘基都不是脯氨酸;和(3)X密碼子是編碼位于所說的非內(nèi)源人GPCR的IC3區(qū)域內(nèi)氨基酸殘基的核酸編碼區(qū),其中X密碼子編碼非內(nèi)源氨基酸。
13.權利要求12的核酸序列,其中P密碼子編碼內(nèi)源脯氨酸殘基。
14.權利要求12的核酸序列,其中P密碼子編碼不是脯氨酸的非內(nèi)源脯氨酸殘基。
15.權利要求12的核酸序列,其中X密碼子編碼非內(nèi)源氨基酸,該氨基酸是從賴氨酸、組氨酸、精氨酸和丙氨酸中選擇而來,但是當在所說的內(nèi)源人GPCR的X位置的內(nèi)源氨基酸是賴氨酸時,X密碼子編碼從組氨酸、精氨酸和丙氨酸中選擇而來的氨基酸。
16.權利要求13的核酸序列,其中X密碼子編碼非內(nèi)源賴氨酸,但是當在所說的內(nèi)源人GPCR的X位置的內(nèi)源氨基酸是賴氨酸時,X密碼子編碼從組氨酸、精氨酸和丙氨酸中選擇而來的氨基酸。
17.權利要求12的核酸序列,其中X密碼子是從由AAA、AAG、GCA、GCG、GCC和GCU組成的一組中選擇而來。
18.權利要求12的核酸序列,其中X密碼子是從由AAA和AAG組成的一組中選擇而來。
19.權利要求12的核酸序列,其中P密碼子是從由CCA、CCC、CCG和CCU組成的一組中選擇而來,而X密碼子是從由AAA和AAG組成的一組中選擇而來。
20.一種含有權利要求12的核酸序列的載體。
21.一種含有權利要求12的核酸序列的質(zhì)粒。
22.一種含有權利要求21的核酸序列的宿主細胞。
23.權利要求12的核酸序列,其為經(jīng)純化和分離后的形式。
24.一種選擇改變在人G蛋白偶聯(lián)受體(“GPCR”)的第三個細胞內(nèi)環(huán)內(nèi)的內(nèi)源氨基酸殘基的方法,其中所說的受體包括一個跨膜6區(qū)和一個細胞內(nèi)環(huán)3區(qū),并且當此內(nèi)源氨基酸被改造為非內(nèi)源氨基酸殘基時,組成型活化所說的人GPCR,該方法包括如下步驟(a)識別在人GPCR的跨膜6區(qū)的內(nèi)源脯氨酸殘基;(b)通過從所說的GPCR的羧基末端區(qū)域指向所說的GPCR的氨基末端區(qū)域的方向上移動,來識別距離從所說的脯氨酸殘基起算為第16位的內(nèi)源氨基酸殘基;(c)把步驟(b)的內(nèi)源殘基改造為非內(nèi)源的氨基酸殘基以創(chuàng)造內(nèi)源人GPCR的非內(nèi)源形式;和(d)確定步驟(c)的非內(nèi)源人GPCR是否是組成型活化的。
25.權利要求24的方法,其中按從羧基末端到氨基末端走向距離跨膜6區(qū)內(nèi)所說的脯氨酸殘基兩個殘基的氨基酸是色氨酸。
26.一種由權利要求24的方法生產(chǎn)的組成型活性的、非內(nèi)源人GPCR。
27.一種由權利要求25的方法生產(chǎn)的組成型活性的、非內(nèi)源人GPCR。
28.一種創(chuàng)造內(nèi)源人G蛋白偶聯(lián)受體(GPCR)的非內(nèi)源的、組成型活化形式的算法規(guī)則,其中所說的內(nèi)源GPCR包括一個跨膜6區(qū)和一個細胞內(nèi)環(huán)3區(qū),該算法規(guī)則包括如下步驟(a)選擇在跨膜-6區(qū)含有脯氨酸殘基的內(nèi)源人GPCR;(b)通過在從羧基末端指向氨基末端方向上從步驟(a)所說的脯氨酸殘基起數(shù)16個氨基酸殘基而識別出內(nèi)源氨基酸殘基;(c)把步驟(b)識別的氨基酸殘基改造為非內(nèi)源的氨基酸殘基以創(chuàng)造內(nèi)源人GPCR的非內(nèi)源形式;和(d)確定步驟(c)的內(nèi)源人GPCR的非內(nèi)源形式是否是組成型活化的。
29.權利要求28的算法規(guī)則,其中在從羧基末端指向氨基末端方向上距離跨膜6區(qū)的所說的脯氨酸殘基兩個殘基的氨基酸殘基是色氨酸。
30.一種由權利要求28的算法規(guī)則產(chǎn)生的組成型活性的、非內(nèi)源人GPCR。
31.一種由權利要求29的算法規(guī)則產(chǎn)生的組成型活性的、非內(nèi)源人GPCR。
32.一種直接識別選自非內(nèi)源的、組成型活化的人G蛋白偶聯(lián)受體的反激活劑、激活劑和部分激活劑的化合物的方法,其中所說的受體含有一個跨膜-6區(qū)和細胞內(nèi)環(huán)-3區(qū),該方法包括如下步驟(a)選擇內(nèi)源人GPCR;(b)識別在步驟(a)的GPCR的跨膜-6區(qū)內(nèi)的脯氨酸殘基;(c)在從羧基末端指向氨基末端方向上識別從步驟(b)的脯氨酸殘基起算為第16位的內(nèi)源氨基酸殘基;(d)把步驟(c)的內(nèi)源氨基酸改造為非內(nèi)源的氨基酸;(e)證實步驟(d)的非內(nèi)源人GPCR是組成型活化的;(f)用步驟(e)的非內(nèi)源的、組成型活化的GPCR接觸候選化合物;和(g)通過測量所說的被接觸的受體的化合物效應,確定所說的化合物是否是所說受體的反激活劑、激活劑或部分激活劑。
33.權利要求32的方法,其中步驟(d)的非內(nèi)源氨基酸是賴氨酸。
34.一種經(jīng)權利要求32的方法直接識別的化合物。
35.權利要求32的方法,其中被直接識別的化合物是反激活劑。
36.權利要求32的方法,其中被直接識別的化合物是激活劑。
37.權利要求32的方法,其中被直接識別的化合物是部分激活劑。
38.一種含有權利要求35所述的反激活劑的組合物。
39.一種含有權利要求36所述的反激活劑的組合物。
40.一種含有權利要求37的部分激活劑的組合物。
41.一種直接識別針對非內(nèi)源的、組成型活化的人G蛋白偶聯(lián)受體(“GPCR”)的反激活劑的方法,其中所說的GPCR含有一個跨膜-6區(qū)和細胞內(nèi)環(huán)-3區(qū),該方法包括如下步驟(a)選擇內(nèi)源人GPCR;(b)識別在步驟(a)的GPCR的跨膜-6區(qū)內(nèi)的脯氨酸殘基;(c)識別在從羧基末端指向氨基末端方向上從步驟(b)的脯氨酸殘基起算為第16位的內(nèi)源氨基酸殘基;(d)把步驟(c)的內(nèi)源氨基酸改造為非內(nèi)源的賴氨酸殘基;(e)證實步驟(d)的非內(nèi)源人GPCR是組成型活化的;(f)用步驟(e)的非內(nèi)源的、組成型活化的GPCR接觸候選化合物;和(g)通過測量所說的被接觸的受體的化合物效應,確定所說的化合物是否是所說受體的反激活劑。
42.一種經(jīng)權利要求37的方法直接識別的反激活劑。
43.一種含有權利要求38所述的反激活劑的組合物。
全文摘要
在此公開的是內(nèi)源人G蛋白偶聯(lián)受體(GPCR)的組成型活化的非內(nèi)源形式,該受體含有(a)下列氨基酸區(qū)域(從C-末端到N-末端走向)和/或(b)橫跨GPCR的跨膜-6(TM6)和細胞內(nèi)環(huán)-3(IC3)區(qū)域的下列核酸序列區(qū)域(3’到5’走向),分別是(a)P
文檔編號C07D231/12GK1398298SQ99812091
公開日2003年2月19日 申請日期1999年10月12日 優(yōu)先權日1998年10月13日
發(fā)明者多米尼克·P·比漢, 德里克·T·查默斯, 廖王蓁 申請人:阿瑞那制藥公司