專利名稱:來自ulkenia的PUFA-PKS基因的制作方法
技術領域:
本發(fā)明描述了對于多酮化合物合酶(polyketide synthase)(PKS)特異性的基因編碼序列。由它們合成的PKS的特征是具有產生PUFAs(多不飽和脂肪酸)的酶學能力。本發(fā)明另外包括相應DNA序列的鑒定以及所述核苷酸序列對于生產重組和/或轉基因生物的用途。
術語PUFAs(多不飽和脂肪酸)表示具有鏈長度>C12和至少兩個雙鍵的多重不飽和長鏈脂肪酸。有兩個PUFA的主要家族,其根據相對于烷基末端,在ω-3和在ω-6脂肪酸中的第一個雙鍵的位置而區(qū)別。它們是細胞膜的重要組分,在那里它們以脂質,特別是磷脂的形式存在。PUFAs還作為在人和在動物中的重要分子,例如,前列腺素,白三烯和環(huán)前列腺的初級階段而起作用(A.P.Simopoulos,essential fattyacids in health and chronic disease,Am.J.Clin.Nutr.1999(70),pp.560-569)。ω-3脂肪酸族的重要代表是DHA(二十二碳六烯酸)和EPA(二十碳五烯酸),其可以在魚油和在海洋微生物中發(fā)現。ω-6脂肪酸的重要代表是ARA(花生四烯酸),其出現在,例如,絲狀真菌中,但是也可以從動物組織如肝和腎中分離。DHA和ARA在人母乳中彼此相接出現。
PUFAs對于人來說在適當發(fā)育方面,特別是對于發(fā)育腦,組織形成及其修復是必需的。因而,DHA是人細胞膜的重要組分,特別是神經的細胞膜。它在腦功能的成熟中發(fā)揮重要作用并且對于視力的發(fā)育是必需的。ω-3 PUFAs如DHA和EPA被用作營養(yǎng)添加劑,因為具有DHA充分供應的平衡營養(yǎng)對于某些疾病的預防有利(A.P.Simopoulos,Essential fatty acids in health and chronic disease,AmericanJournal of Clinical Nutrition 1999(70),pp.560-569)。例如,患有非胰島素依賴型糖尿病的成人呈現與后來出現的心臟問題相關的DHA平衡的缺陷或者至少是失衡的DHA平衡。同樣地,神經元疾病如,例如,阿爾茨海默病或精神分裂癥伴隨著低的DHA水平。
有大量的DHA商業(yè)提取物的來源,例如,來自海洋冷水魚的油,蛋黃部分或海洋微生物。適于提取n-3 PUFA的微生物發(fā)現于,例如,弧菌屬(Vibrio)的細菌中(例如,海產弧菌(Vibrio marinus))或腰鞭毛蟲(Dinophyta)中,其中特別是Crypthecodinium屬,如C.cohnii或在Stramenopiles(或Labyrinthulomycota)中,如Pinguiophyceae如,例如,Glossomastix,Phaeomonas,Pinguiochrysis,Pinguiococcus和Polypodochrysis。其它生產PUFA的優(yōu)選微生物特別屬于Thraustochytriales目,(Thraustchytriidea)具有Japonochytrium屬,Schizochytrium屬,Thraustochytrium屬,Althornia屬,Labyrinthuloides屬,Aplanochytrium屬和Ulkenia屬。
提取自商業(yè)上已知的PUFA來源如植物或動物的油的特征經常是非常不均勻的組成。以這種方式提取的油必須進行昂貴的純化處理以便能夠富集一種或幾種PUFAs。另外,來自這些來源的PUFA的供應也會發(fā)生不可控制的波動。因而,疾病和天氣影響能夠減少動物也能夠減少植物的產量。從魚中提取PUFA出現季節(jié)波動并且甚至能夠由于過度捕撈或氣候變化(例如,厄爾尼諾現象)而暫時性地停止。動物油,特別是魚油,可以通過食物鏈從環(huán)境中積聚有害物質。已知動物受有機氯化物,例如,多氯化聯苯高度脅迫,特別是在商業(yè)性魚場中,其抵消了魚類消費的健康方面(Hites等,2004,Global assessmentof organic contaminants in farmed salmon,Science 303,pp.226-229)。魚產品質量的所得損失導致消費者對于魚和魚油作為ω-3 PUFA來源的接受度下降。另外,從魚濃縮DHA因為高度技術需要而相對昂貴。另一方面,DHA存在于少數海洋微生物,占細胞總脂肪組分的大約50%,并且它們能夠在大的發(fā)酵罐中進行相對經濟地培養(yǎng)。微生物的另一個優(yōu)點是提取自它們的油的組成限于少數幾種組分。
對于長鏈PUFA如二十二碳六烯酸(DHA;22:6,n-3)和二十碳五烯酸(EPA;20:5,n-3)的生物合成已知多種生物催化途徑。在真核生物中生產長鏈PUFA的常規(guī)生物合成途徑起始于亞油酸(LA;18:2,n-6)和α亞油酸的δ-6去飽和作用。它導致由亞油酸合成γ亞油酸(GLA;18:3,n-6)以及由α亞油酸合成十八碳四烯酸(OTA;18:4,n-3)。對于n-6以及n-3脂肪酸來說,此去飽和作用后接延伸步驟以及δ-5去飽和作用,致成花生四烯酸(ARA;20:4,n-6)和二十碳五烯酸(EPA;20:5,n-3)。起始自二十碳五烯酸(EPA;20:5,n-3)的二十二碳六烯酸(DHA;22:6,n-3)的合成隨后能夠通過兩種不同的生物合成途徑發(fā)生。在所謂的線性生物合成途徑中,發(fā)生二十碳五烯酸(EPA;20:5,n-3)延伸另外兩個碳單位,隨后發(fā)生δ-4去飽和作用以形成二十二碳六烯酸(DHA;22:6,n-3)。這種生物合成途徑的存在能夠通過生物如破囊壺菌屬(Thraustochytrium)和裸藻屬(Euglena)中δ-4去飽和酶的存在而確證(Qiu,等,Identification of a delta 4 fatty acid desaturase fromThraustochytrium sp.involved in the biosynthesis of docosahexaenoic acidby heterologous expression in Saccharomyces cerevisiae and Brassicajuncea.,J.Biol.Chem.276(2001),pp.31561-31,566和Meyer等,Biosynthesis of docosahexaenoic acid in Euglena gracilisBiochemicaland molecular evidence for the involvement of a delta 4 fatty acyl groupdesaturase.Biochemistry 42(2003),pp.9779-9788)。起始自二十碳五烯酸(EPA;20:5,n-3)的二十二碳六烯酸(DHA;22:6,n-3)合成的第二條途徑,所謂的Sprecher途徑,獨立于δ-4去飽和作用。它由兩個連續(xù)延伸步驟,每步延伸2個碳單位至二十四碳五烯酸(24:5,n-3)以及隨后δ-6去飽和作用至二十四碳六烯酸(24:6,n-3)組成。隨后通過過氧化物酶體β氧化作用縮短兩個碳單位而接著發(fā)生二十二碳六烯酸的形成(H.Sprecher,Metabolism of highly unsaturated n-3 and n-6 fatty acids.Biochimica et Biophysica Acta 1486(2000),pp.219-231)。這一第二生物合成途徑是在哺乳動物中占優(yōu)勢的DHA合成途徑(Leonard等,Identification and expression of mammalian long-chain PUFA elongationenzymes.Lipids 37(2002),pp.733-740)。對于C20 PUFA形成的備選生物合成途徑存在于少數缺δ-6變性酶活性的生物中。這些生物包括,例如,原生生物Acanthamoeba sp.和Euglena gracilis。在備選的C20PUFA合成中的第一步在于C18脂肪酸,亞油酸(LA;18:2,n-6)和α亞油酸(ALA;18:3,n-3)延伸兩個碳單位。隨后通過δ8去飽和作用和接下來的δ5去飽和作用將得到的脂肪酸二十碳二烯酸(20:2,n-6)和二十碳三烯酸(20:3,n-3)轉化成花生四烯酸(ARA;20:4,n-6)和/或二十碳五烯酸(EPA;20:5,n-3)(Sayanova和Napier,Eicosapentaenoic acidBiosynthetic routes and the potential for synthesis in transgenic plants.Phytochemistry 65(2004),pp.147-158;Wallis和Browse;The delta-8desaturase of Euglena gracilisAn alternate pathway for synthesis of20-carbon polyunsaturated fatty acids.Arch.Biochem.Biophys.362(1999),pp.307-316)。
高等植物不具有由初級階段合成C20 PUFA的能力。它們通過各種去飽和酶起始自硬脂酸(18:0),形成油酸(C18:1;δ-9去飽和酶),亞油酸(18:2,n-6,δ12去飽和酶)和α亞油酸(18:3,n-3;δ15去飽和酶)。
不過,某些海洋微生物采取完全不同的生物合成途徑來產生EPA和DHA。這些產生PUFA的微生物包括γ蛋白細菌的海洋代表以及少數幾種cytophaga flavobacterium bacteroides族和到目前為止的真核性原生生物,Schizochytrium sp.ATCC 20888(Metz等,2001,Productionof polyunsaturated fatty acids by polyketide synthases in both prokaryotesand eukaryotes.Science 293290-293)。它們通過所謂的多酮化合物合酶(PKS)來合成長鏈PUFA。這些PKSs代表催化由酮化合物(ketide)單位組成的次級代謝產物合成的大酶(G.W.Wallis,J.L.Watts和J.Browse,Polyunsaturated fatty acid synthesiswhat will they think of next?Trendsin Biochemical Sciences 27(9)(2000)pp.467-473)。多酮化合物的合成包含許多與脂肪酸合成類似的酶反應(Hopwood & Sherman Annu.Rev.Genet.24(1990)pp.37-66;Katz & Donadio Annu.Rev.of Microbiol.47(1993)pp.875-912)。
已知不同PUFA-PKSs(PUFA-合成的PKSs)的基因序列。由此,從海洋細菌Shewanella sp.分離出38kb基因組片段含有生產EPA的信息。隨后對這一片段的測序導致鑒定了8個開放閱讀框(ORFs)(H.Takeyama等,Microbiology 143(1997)pp.2725-2731)。來自Shewanella的這些開放閱讀框,其中五個與多酮化合物合酶基因密切相關。同樣,美國專利號5,798,259描述了來自Shewanella putrefaciens SCRC-2874的EPA基因簇。PUFA-PKS基因也發(fā)現于海洋原核生物Photobacteriumprofundum株SS9中(Allen和Bartlett,Microbiology 2002,148 pp.1903-1913)和Moritella marina株MP-1,早期的Vibrio marinus(Tanaka等,Biotechnol.Letters 1999,21,pp.939-945)。類似的產生PUFA的PKS樣ORFs也能夠在真核性原生生物Schizochytrium中鑒定(Metz等,Science 293(2001)pp.290-293,US專利No.6,556,583及WO02/083870A2)。在Schizochytrium中確定了三種ORFs,其與來自Shewanella的EPA基因簇呈現部分同一性。在少數原核生物和真核生物Schizochytrium中存在這些保守性PKS基因給出了暗示,PUFA-PKS基因可能在原核生物和真核生物之間進行了水平轉移。
即使是使用正常情況下不產生PUFAs的微生物中分離的基因簇對PUFAs進行轉基因生產也已經能夠得以顯示了。因而,存在于來自Shewanella sp.SCRC-2738的簇中的上述五種ORFs(開放閱讀框)足以在非IPA生產者大腸桿菌(E.coli)和Synechoccus sp.中生產可測量量的EPA(Yazawa,Lipids 1996,31,pp.297-300和Takayama等,Microbiology 1997,143,pp.2725-2731)。
通常,對于大規(guī)模生產PUFAs的新的PUFA生產者總是存在需要。首先這種生產是否發(fā)生在,例如,原核生物,原生生物或在植物中并不重要。目標始終是盡可能經濟地和以盡可能保護環(huán)境的方式大量生產高質量的PUFAs。本發(fā)明追求這一目標,因為它介紹了來自特別有效的PUFA生產者Ulkenia sp.的合適的PUFA-PKS基因。
考慮到技術狀態(tài),所以本發(fā)明的任務是從生產DHA的微生物Ulkenia sp.中鑒定和分離另外的PUFA-PKS基因,其極適于生產PUFAs。此外,應當獲得關于這些基因的位置和排列以及它們的調控元件的知識。由此獲得的知識,特別是由此獲得的核酸物質,應當使得PUFA-PKS基因在同系生物以及在轉基因生物中的加強表達成為可能。
通過本發(fā)明的權利要求書中所定義的主題解決了這些任務以及其它未曾被明確地說明但可以從本文件初始討論的聯系中輕易得到或總結的其它任務。
1.PUFA-PKS,其特征是它們a.包括在SEQ ID No.6(ORF 1),7(ORF 2),8和/或80(ORF 3)中所示氨基酸序列的至少其中一種,以及具有與它們有至少70%,優(yōu)選80%,特別優(yōu)選至少90%和更加特別優(yōu)選至少99%和最優(yōu)選100%序列同源性的同源序列,其具有PUFA-PKS的至少一個結構域的生物學活性,或b.包括在SEQ ID No.32,34,45,58,59,60,61,72,74和/或77中所示氨基酸序列的至少其中一種,以及具有與它們有至少70%,優(yōu)選80%,特別優(yōu)選至少90%和更加特別優(yōu)選至少99%和最優(yōu)選100%序列同源性的同源序列,其具有PUFA-PKS的至少一個結構域的生物學活性。
2.具有10個或更多ACP結構域的根據權利要求1的分離的PUFA-PKS。
另外,本發(fā)明在優(yōu)選的方面涉及這樣一種PUFA-PKS,其包含與序列SEQ ID No.6(ORF 1),7(ORF 2)和/或8和/或80(ORF 3)的至少500個直接連續(xù)氨基酸具有至少70%,優(yōu)選至少80%,特別優(yōu)選至少90%和更加特別優(yōu)選至少99%同一性的至少一種氨基酸序列。
在另一個優(yōu)選的方面,本發(fā)明涉及分離的DNA分子,其編碼根據任一項在前權利要求的PUFA-PKS。
后者優(yōu)選特征為它編碼與序列SEQ ID No.6(ORF 1),7(ORF 2)和/或8和/或80(ORF 3)的至少500個直接連續(xù)氨基酸具有至少70%同一性的氨基酸序列。
另外,本發(fā)明涉及這樣的分離DNA分子,其與來自序列SEQ ID No.3,4,5和/或9的至少500個連續(xù)核苷酸具有至少70%,優(yōu)選至少80%,特別優(yōu)選至少90%和更加特別優(yōu)選至少95%的同一性。
在另一個優(yōu)選的方面,本發(fā)明涉及一種重組DNA分子,其包含與控制轉錄的至少一種DNA序列功能性連接的先前所述DNA分子的其中之一,優(yōu)選選自SEQ ID No.3,4和5和/或9或其至少500個核苷酸的部分以及它們的功能性變體。
在又一個優(yōu)選的方面,本發(fā)明涉及包含前述重組DNA分子的重組宿主細胞。
在又一個優(yōu)選的視點下,本發(fā)明涉及內源性表達具有至少10個ACP結構域的根據本發(fā)明的PUFA-PKS的重組宿主細胞。
另外,在又一個優(yōu)選的方面,本發(fā)明涉及一種生產含有PUFA的油的方法,包括培養(yǎng)這種重組宿主細胞,以及涉及以此方式生產的油。
另外,在又一個優(yōu)選的方面,本發(fā)明涉及一種生產含有PUFA,優(yōu)選DHA的生物質量的方法,包括培養(yǎng)這種重組宿主細胞,以及涉及以此方式生產的生物質量。
所以,在又一個優(yōu)選的方面,本發(fā)明還涉及根據權利要求15的重組生物質量,其包含根據權利要求8的核酸和/或根據權利要求1的氨基酸序列或與它同源的至少500個連續(xù)氨基酸的部分。
本發(fā)明在又一個優(yōu)選的方面還涉及SEQ ID No.32,33,34,45,58,59,60,61,72,74和/或77中所示、來自包含SEQ ID No.6,7,8和/或80的PUFA-PKS的個別酶結構域的用途,用于生產人工多酮化合物,例如,多酮化合物抗生素和/或新的,變化的脂肪酸。
根據本發(fā)明,有關核酸的同一性指示在待比較鏈的特定位置上的相同堿基對。不過,缺口是有可能的。以%計算同一性值的可能性由程序blastn和fasta代表。
就氨基酸而言,概念同源性也包含,例如氨基酸序列的保守性交換,其絲毫不影響蛋白質的功能和/或結構。甚至是這些同源性值也通過本領域熟練技術人員已知的程序,例如,blastp,Matrix PAM30,GapPenalties9,Extension1進行計算(Altschul等,NAR 25,3389-3402)。
來自Ulkenia sp.的PUFA-PKS基因的序列信息可由SEQ ID No.3-5和/或9中所定義的核酸序列和氨基酸序列獲得。SEQ ID No.1和2代表目前分離的兩種粘粒上的完整基因組DNA序列(見實施例2和3)。后者對其部分包含PUFA合成所必需的三種相關開放閱讀框ORFs1-3的信息以及它們的側翼調控序列。另外,作為其結果提出了能夠源自基因組序列的蛋白質序列。
本發(fā)明另外包括用根據本發(fā)明的核酸對宿主生物進行同源和異源轉化用來生產高純PUFAs的方法。分離的開放閱讀框優(yōu)選導致在同系生物和轉基因生物中生產PUFA,特別是DHA,EPA和DPA。
由此生產的PUFAs優(yōu)選作為生物質量的組分或作為油而存在。
在本發(fā)明之前,只有真核生物,原生生物Schizochytrium的PUFA-PKS基因是已知的(美國專利號6,566,583,WO02/083870)。隨后測定的序列數據部分源自cDNA和源自染色體DNA。在本發(fā)明中首次從染色體DNA完全描述了對于PUFA合成必需的真核性原生生物的所有PUFA-PKS基因。這不僅導致確定了以往未知的來自Ulkenia sp.的PUFA-PKS編碼基因信息,還另外提供了關于側翼調控元件如轉錄啟動子和終止子的數據。此外,染色體序列信息使得深入了解個別PUFA-PKS基因的位置和排列成為可能。
這里完全令人吃驚的是簇同樣地不再存在,因為以往知道它是來自原核性PUFA-PKS代表如Shewanella,Photobacterium或Moritella。鑒定的粘粒(Seq ID No.1)一開始顯示個別ORFs的線性排列在Ulkenia中被打亂并且還顯示個別ORFs的閱讀方向是反向的(
圖1)。這可能是大段基因轉座的結果。作為轉座的結果,個別ORFs還清楚地呈現彼此的更大間隔。因而,兩個ORFs 1和2具有大約13kb的間隔。第三個ORF直到在另一個粘粒上才能夠在此情況下得以鑒定(SeqID No.2)并且在兩種粘粒之間(Seq ID No.1和2)沒能發(fā)現部分同一性(圖1)。這意味著來自Ulkenia sp.的ORF在空間上不再位于兩種ORFs1和2附近。這作出結論,即PUFA基因簇,已知來自上述原核代表,不再存在于真核生物Ulkenia sp.中。已經部分測定了原生生物Schizochytrium的個別PUFA-PKS基因在基因組上的位置和排列(WO02/083870)并且還顯示了兩種ORFs A和B的相反方向。不過,它們彼此僅僅分離4224個堿基對。在專利申請WO 02/083870中將這一序列片段討論為具有雙向啟動子元件的基因間隔區(qū)。至少對于Ulkenia在同源性ORFs 1和2之間的雙向啟動子元件似乎是不可能的,這是因為對于Ulkenia測定的12.95kb的間隔區(qū)。沒有其它明顯ORFs存在于來自Ulkenia的ORFs 1和ORF2之間的12.95kb區(qū)域之內。表明區(qū)域中發(fā)生了大的重組和/或轉座事件。轉座酶樣事件也能夠基于少數重復序列重復而發(fā)生。
更加令人特別吃驚的是與EPA生產者Shewanella(6xACP)和Photobacterium(5xACP)的PUFA-PKS以及DHA生產者Moritella(5xACP)和Schizochytrium(9xACP)的PUFA-PKS相比,來自Ulkeniasp.的PUFA-PKS具有最大數目的酰基載體蛋白的重復,有10個ACP結構域(圖3)。這意味著分離自Ulkenia sp.的PUFA-PKS相對于來自親緣性原生生物Schizochytrium的PUFA-PKS不僅具有偏移性氨基酸序列,而且在結構上也是獨特的。另一種特性是這樣的事實,即來自Ulkenia sp.的第三個ORF相對于來自Schizochytrium的ORF C短了38個氨基酸并且另包含了丙氨酸富集的結構域,該結構域并不以此方式存在于Schizochytrium中(圖6)中。令人感興趣的是,這種序列類似存在于來自ORF 1的個別ACT結構域之間的區(qū)域并且可能代表連接區(qū)。所述相似性在于序列長度以及丙氨酸連續(xù)僅被個別脯氨酸和纈氨酸打亂的事實。相對于Schizochytrium ORF C缺失的ORF 3中氨基酸的最大部分是刪除的結果,有30個氨基酸長,位于脫水酶/異構酶結構域之間(圖6)。作為結果,這些結構域位于相應的蛋白質上,彼此相距短的間隔,這能夠對于酶學活性具有影響。對于ORF 3來說,即使其它的5’位置上的ATG密碼子也可作為起始密碼子,從而在理論上甚至是最大為1848個氨基酸長的ORF也能夠存在(Seq ID No.9和80)。在此情況下甚至同時出現ORF 3的變體也是可能的。
特別地,來自Ulkenia sp.的ORF 1(Seq ID No.3和6)在一方面包含所謂的β酮?;铣擅附Y構域(Seq ID No.14和32),其特征是靶標(motive)(DXAC)(Seq ID No.12和30)。Ulkenia ORF 1中酶學結構域的活性中心的靶標能夠以優(yōu)選的形式擴展到17個氨基酸的范圍(GMNCVVDAACASSLIAV)Seq ID No.11和29)。完整的β酮酰基合成酶結構域可以分成N末端(Seq ID No.10和28)和分成C末端(Seq ID No.13和31)部分。β酮酰基合成酶結構域的生物學功能是催化脂肪酸和/或PKS合成的縮合反應。進行延伸的?;鶊F通過硫酯鍵結合到酶學結構域的活性中心的半胱氨酸基團并且以幾個步驟轉移到酰基載體蛋白上的丙二?;鶊F的碳原子2上,釋放CO2。β酮酰基合成酶結構域后接丙二酰CoA-ACP轉移酶結構域(Seq ID No.15和33)。此結構域催化丙二酰CoA轉移到酰基載體蛋白(ACP)上的4’-phosphopantetheine基團。丙二酰CoA-ACP轉移酶結構域也將甲基或乙基丙二酸酯轉移到ACP上,期間它們將分枝導入其它的線性碳鏈上。隨后將連接區(qū)域后接富含丙氨酸序列的部分(Seq ID No.16和34),該部分包含10個重復的酰基載體蛋白結構域(ACP結構域)(17-26和35-44)。這些ACP結構域對于它們的部分彼此通過連接區(qū)域相互分離,所述連接區(qū)域主要由丙氨酸和脯氨酸組成。每個ACP結構域的特征是4’-phosphopantetheine分子(LGXDS(L/I))的結合靶標。所述4’-phosphopantetheine分子在這里結合到靶標內的保守絲氨酸上。ACP結構域通過4’-phosphopantetheine基團作為載體起作用來生長脂肪酸和/或多酮化合物鏈。與酮還原酶具有部分同一性的序列(Seq ID No.27和45)隨后接上。這些結構域的生物學功能在于3-酮?;?ACP化合物的NADPH依賴型還原作用。它代表脂肪酸生物合成中的第一次還原反應。這種反應在多酮化合物合成中也經常發(fā)生(還參見圖3)。
來自Ulkenia sp.的ORF 2(Seq ID No.4和7)也以β酮酰基合成酶結構域(Seq ID No.50和58)起始,其特征是靶標(DXAC)(Seq IDNo.48和56)。Ulkenia ORF 2中酶學結構域的活性中心的這種靶標能夠以優(yōu)選的形式擴展到17個氨基酸的范圍(PLHYSVDAACATALYVL)Seq ID No.47和55)。完整的β酮?;铣擅附Y構域可以分成N末端(Seq ID No.46和54)和C末端(Seq ID No.49和57)部分。此結構域的生物學活性對應于ORF1中所述的β酮?;铣擅附Y構域。Kethosynthases在延伸循環(huán)中發(fā)揮關鍵作用并且顯示了比脂肪酸合成的其它酶更高的底物特異性。這再次后接與β酮?;铣擅附Y構域具有較小部分同一性的序列片段。另外,這一結構域缺少用于活性中心的靶標DXAC。它具有來自II型PKS類似系統(tǒng)的所謂鏈長因子(CLF)的特性(Seq ID No.51和59)。CLF氨基酸序列與酮合成酶具有部分同一性,但是沒有具有相應的半胱氨酸基團的特征性活性中心。PKS系統(tǒng)中的CLFs的部分目前正以爭論方式進行討論。最近的結果指出CLF結構的部分在于丙二酰ACP的脫羧作用。產生的乙?;S后可以結合到β酮?;铣擅附Y構域的活性中心上并且因而代表了起始縮合反應的所謂引動分子(priming molecule)。還發(fā)現CLF同源性序列作為分子PKS系統(tǒng)中的負載結構域。具有CLF序列特性的結構域存在于所有先前已知的PUFA-PKS系統(tǒng)。這后接酰基轉移酶結構域(Seq ID No.52和60)。這種結構域催化許多酰基轉移如從?;D移到輔酶A或轉移到ACP結構域。來自ORF 2的終止結構域顯示與氧化還原酶的部分同一性(Seq ID No.53和61)并且很可能代表了一種烯?;€原酶結構域。烯酰基還原酶結構域的生物學活性存在于脂肪酸合成的第二次還原反應中。它催化脂肪酸?;鵄CP的反式雙鍵的還原(也參見圖2)。
來自Ulkenia sp.的ORF 3(Seq ID No.5和8)由兩種脫水酶/異構酶結構域(Seq ID No.66,68,72和74)組成。兩種結構域都包含“活性位點”組氨酸,直接相鄰半胱氨酸(Seq ID No.67和73以及Seq ID No.69和75)。這些結構域的生物學功能是反式雙鍵插入到脂肪酸或多酮化合物分子中,伴隨著H2O的分解和雙鍵隨后轉化成順式異構形式。第二種脫水酶/異構酶結構域并入丙氨酸富集區(qū)(Seq ID No.70和76),所述丙氨酸富集區(qū)沒有已知的功能但是可能代表連接區(qū)。這后接烯?;€原酶結構域(Seq ID No.71和77),其與來自Ulkenia的已經存在于ORF 2中的烯?;€原酶結構域具有高度部分同一性。它的生物學功能對應于上面已經介紹過的烯?;€原酶結構域(也參見圖2)。
優(yōu)選在來自Ulkenia sp.的ORF 1起始ATG密碼子前面給出2000bp(Sequence ID No.62)作為啟動子序列。它們特別優(yōu)選1500bp,更加特別優(yōu)選1000bp在起始密碼子之前。
優(yōu)選可以在終止密碼子TAA之后給出2000bp(Sequence ID No.63)作為ORF 1的終止序列。特別優(yōu)選1500bp,更加特別優(yōu)選1000bp在終止密碼子之后。具有堿基序列AATAAA的ORF 1的mRNA合成的潛在終止信號存在于終止密碼子TAA之后的412bp。
優(yōu)選在來自Ulkenia sp.的ORF 2起始ATG密碼子前面給出2000bp(Sequence ID No.64)作為啟動子序列。它們特別優(yōu)選1500bp,更加特別優(yōu)選1000bp在起始密碼子之前。
優(yōu)選可以在終止密碼子TAA之后給出2000bp(Sequence ID No.65)作為ORF 2的終止序列。具有堿基序列AATAAA的ORF 2的mRNA合成的潛在終止信號存在于終止密碼子TAA之后的1650bp。
優(yōu)選在來自Ulkenia sp.的ORF 3起始ATG密碼子前面給出2000bp(Sequence ID No.78)作為啟動子序列。它們特別優(yōu)選1500bp,更加特別優(yōu)選1000bp在起始密碼子之前。
優(yōu)選可以在終止密碼子TAA之后給出2000bp(Sequence ID No.79)作為ORF 3的終止序列。具有堿基序列AATAAA的ORF 3的mRNA合成的潛在終止信號存在于終止密碼子TAA之后的4229bp。
PUFA,例如,DHA可以在Ulkenia sp.中進行同源生產,此外還可以在宿主,例如,大腸桿菌中利用本發(fā)明測定的序列信息進行異源生產。根據本發(fā)明的核酸序列可以用來提高PUFA的產量,其中它們被用來,例如,提高生產PUFA的生物中PUFA-PKS基因的數目。自然地,甚至是個別核酸片段,例如,編碼ACP結構域的序列片段也可在同源或異源生產生物中進行擴增。特別地,ACP結構域呈現自己提高生產,因為輔因子4-phosphapantheteine的結合位點對于PUFA合成是必需的。自然地,即使是不同調控元件,例如,啟動子,終止子和增強子元件的使用也能夠導致經遺傳修飾的PUFA生產者內產量的提高。在個別序列片段中的遺傳修飾能夠導致獲得產物結構的變化并且因而導致不同PUFAs的生產。另外,PUFA合成酶與多酮化合物合酶的相似性使得混合系統(tǒng)的構建成為可能。這種所謂的組合性生物合成允許新的人工生物活性物質的生產。例如,通過PKS-和PUFA-PKS單位的混合系統(tǒng)在轉基因微生物中生產的新型多酮化合物抗生素是有可能的。
適于這里給出的PUFA基因的異源表達的宿主除了大腸桿菌之外為,例如,酵母如釀酒酵母(Saccharomyces cerevisiae)和畢赤酵母(Pichia Pastoris)或者絲狀真菌,例如,構巢曲霉(Aspergillus nidulans)和Acremonium chrysogenum。通過將根據本發(fā)明的基因導入,例如,大豆,油菜,向日葵,亞麻或其它的,優(yōu)選富含油的植物中來生成生產PUFA的植物。為了PUFA基因的有效異源表達,甚至也可以使用其它的附屬基因,例如,4-phosphopantheteine轉移酶。另外,可以使用宿主特異性啟動子/操縱系統(tǒng)進行加強的或可誘導的基因表達。
可以使用多種原核表達系統(tǒng)進行PUFA的異源生產??梢詷嫿ǔ讼鄳腜UFA基因之外還包含啟動子,核糖體結合位點和轉錄終止子的表達載體。將大腸桿菌色氨酸生物合成的啟動子/操縱子區(qū)和λ噬菌體的啟動子引證作為大腸桿菌中這些調控元件的例子。同樣地,可以將可選擇的標記,例如,對氨芐青霉素、四環(huán)素或氯霉素的抗性用于合適的載體上。對于大腸桿菌的轉化非常合適的載體為pBR322,pCQV2和pUC質粒以及它們的衍生物。這些質??砂《疽约凹毦?梢允褂妹糠N源自大腸桿菌K12的菌株,例如,JM101,JM109,RR1,HB101,DH1或AG1作為大腸桿菌宿主菌株。自然地,所有其它慣用的原核表達系統(tǒng)也可以用于異源PUFA生產(還參見Sambrook等)。還可以使用生油(oil-building)細菌作為宿主系統(tǒng)。
可以將哺乳動物、植物和昆蟲細胞以及真菌,例如,酵母用作真核表達系統(tǒng)。對于酵母系統(tǒng)來說,可以使用來自于糖酵解酶基因的轉錄起始元件。這包括乙醇脫氫酶,甘油醛-3-磷酸脫氫酶,phosphoglukoisomerase,磷酸甘油酯激酶等的調控元件。不過,即使是來自基因如來自酸性磷酸酶,乳糖酶,金屬硫蛋白或葡糖淀粉酶基因的調控元件也可以使用。這里還使用允許加強的或可誘導的表達的啟動子。可由半乳糖誘導的啟動子(GAL1,GAL7和GAL10)也是令人特別感興趣的(Lue等,1987 Mol.Cell.Biol.7,p.3446 ff.和Johnston1987 Mircobiol.Rev.51,p.458 ff.)。3’終止序列還優(yōu)選源自酵母。由于緊鄰起始密碼子(ATG)的核苷酸序列影響酵母中基因的表達,還優(yōu)選來自酵母的有效翻譯起始序列。在使用酵母質粒的情況下,它們包含來自酵母的復制起點并且包含選擇標記。這種選擇標記優(yōu)選是營養(yǎng)缺陷型標記,例如,LEU,TRP或HIS。這種酵母質粒是所謂的YRps(酵母復制性質粒),YCps(酵母著絲點質粒)和YEps(酵母游離質粒)。沒有復制起點的質粒是Yips(酵母整合質粒),其用于整合轉化的DNA至基因組中。特別感興趣的是質粒pYES2和pYX424以及pPICZ質粒。
如果將絲狀真菌,例如,構巢曲霉用作異源PUFA生產者,也可以使用來自對應生物的啟動子??梢詫⒂糜诩訌姳磉_的gpdA啟動子和用于可誘導表達的alcA啟動子用作實例。優(yōu)選使用酵母質粒如pHELP(D.J.Balance和G.Turner(1985)Development of ahigh-frequency transforming vector for Aspergillus nidulans.Gene 36,321-331)和可選擇標記如ura,bio或paba用于轉化絲狀真菌。甚至優(yōu)選來自絲狀真菌的3’調控元件。
通過桿狀病毒表達系統(tǒng)可以在昆蟲細胞中生產PUFA。這些表達系統(tǒng)可由,例如Clonetech或Invitrogen商購。
可以將載體,例如,來自土壤桿菌的Ti質?;蛲暾《救绮嘶踊ㄈ~病毒(CaMV),雙粒病毒,番茄金黃花葉病毒或煙草花葉病毒(TMV)用于植物的轉化。優(yōu)選的啟動子為,例如,CaMV的35S啟動子。對于植物轉化的其它可能性為磷酸鈣法,聚乙二醇法,微注射,電穿孔或原生質體的脂染。還優(yōu)選通過用DNA帶電微粒轟擊(基因槍)進行的轉化。植物中備選的PUFA生產源自葉綠體的轉化。例如,N末端引導肽使得蛋白質在葉綠體中的轉運成為可能。優(yōu)選的引導肽源自核酮糖雙磷酸酯羧化酶的小亞基但是也可以使用其它chloroplastidary蛋白的引導肽。葉綠體基因組的穩(wěn)定轉化提供了另一種可能性。對此尤其可以考慮生物導彈法還可以考慮其它方法(Blowers等Plant Cell 1989 1 pp.123-132,Kline等.Nature 1987 327 pp.70-73和Schrier等Embo J.4 pp.25-32)。
對于哺乳動物細胞還可以使用可以商購的表達系統(tǒng)。其中,可以使用病毒性或非病毒性轉化和表達系統(tǒng),例如,慢病毒或腺病毒系統(tǒng)或Invitrogen的T-Rex系統(tǒng)等作為例子。同樣,來自Invitrogen的Flp-In系統(tǒng),可以用于哺乳動物細胞中DNA的目的性整合。
下面利用幾個實施例介紹構成了根據本發(fā)明方法基礎的核酸和氨基酸。不過,所述序列和本發(fā)明并不限于這些實施例。
附圖簡述圖1描述了來自Ulkenia sp.的PUFA-PKS基因在基因組上的位置。另外,顯示了由這些基因編碼的PUFA-PKS的個別結構域。KS酮合成酶,MAT丙二酰-CoA:ACP?;D移酶,ACP酰基載體蛋白,KR酮還原酶,CLF鏈長因子,AT?;D移酶,ER烯?;€原酶和DH脫水酶/異構酶。
圖2顯示來自Ulkenia sp.的ORF2和ORF3與來自Moritellamarina(GenBank編號AB025342.1),Photobacterium profundum SS9(GenBank編號AF409100),Shewanella sp.SCRC-2783(GenBank編號U73935.1)和Schizochytrium(GenBank編號AF378327,AF378328,AF378329)的相應同源性ORFs的比較。在進化過程中個別ORFs之中和之間的基因轉座也在結構域結構旁邊指出。
圖3顯示來自Ulkenia sp.的ORF1與來自Moritella marina(GenBank編號AB025342.1),Photobacterium profundum SS9(GenBank編號AF409100),Shewanella sp.SCRC-2783(GenBank編號U73935.1)和Schizochytrium(GenBank編號AF378327,AF378328,AF378329)的相應同源性ORFs的比較。強調了ACP結構域和氨基酸連續(xù)LGIDSIKRVEIL重復的數目。
圖4包含了來自Ulkenia sp.的ORF1與來自Schizochytrium的ORF A的序列比較。兩種序列的部分同一性的程度為大約81.5%。
圖5包含了來自Ulkenia sp.的ORF 2與來自Schizochytrium的ORF B的序列比較。兩種序列的部分同一性的程度為大約75.9%。
圖6包含了來自Ulkenia sp.的ORF 3與來自Schizochytrium的ORF C的序列比較。兩種序列的部分同一性的程度為大約80.0%。
圖7描述了由FASTAX進行的,實施例1中所述PCR產物與數據庫序列(Swiss-PROT全文庫)的序列比較。
圖8顯示了用于生產來自實施例2的粘粒庫的Cosmid SuperCosI(Stragagene)的載體圖(card)。
圖9描述了由BLASTX進行的,實施例3中所述PCR產物與數據庫序列(Swiss-PROT全文庫)的序列比較。
實施例實施例1從分離自Ulkenia sp.SAM2179的DNA擴增PUFA-PKS特異性序列1.1包含編碼PUFA-PKS的基因的基因組DNA的分離在250ml帶有阻流板的Erlenmeyer燒瓶中用Ulkenia sp.SAM2179接種50ml DH1培養(yǎng)基(50g/l葡萄糖;12.5g/l酵母提取物;16.65g/l Tropic Marin;pH6.0)并于28℃和150rpm培養(yǎng)48h。隨后用滅菌自來水洗滌細胞,離心下去并將細胞沉淀物冷凍于-85℃中。為了進一步的檢查(workup),隨后將細胞沉淀物轉移入研缽中并以研棒在液氮下粉碎成精細粉末。隨后,將大約1/10研成粉末的細胞材料與2ml裂解緩沖液(50mM tris/Cl pH7.2;50mM EDTA;3%(v/v)SDA;0.01%(v/v)2-巰基乙醇)混合并于68℃溫育1h。隨后加入2ml苯酚/氯仿/異戊醇(25∶24∶1),攪動并于100000rpm離心20min。在除去上層水相后,將后者轉移入兩個新的反應容器中,每個600μl,并且分別再次與600μl苯酚/氯仿/異戊醇(25∶24∶1)混合,攪動并于13000rpm離心15min。隨后將特定上層相每個400μl轉移入新的反應容器中并在每種情況下加入1ml乙醇(100%)后倒轉兩到三次。隨后,將沉淀的DNA纏繞在玻璃棒上,用70%乙醇洗滌,干燥并溶于50μl蒸餾水中。將以此方式提取的DNA與2μl RNase A混合并保存于4℃待用。
1.2利用靶標特異性寡核苷酸進行PCR反應將PCR引物MOF1和MOR1用作靶標特異性寡核苷酸。
MOF15’-CTC GGC ATT GAC TCC ATC-3’(Seq ID No.81)MOR15’-GAG AAT CTC GAC ACG CTT-3’(Seq ID No.82)。將在上面1.1段中所述的來自Ulkenia sp.SAM2179的基因組DNA稀釋1∶100。隨后將2μl的這種稀釋液轉移入50μl體積的PCR反應混合物中(1x緩沖液(Sigma);dNTPs(每種200μM);MOF1(20pmol),MOR1(20pmol)和2.5U Taq-DNA聚合酶(Sigma))。在下列條件下實施PCR起始變性94℃ 3min,隨后為30個循環(huán),每個循環(huán)于94℃ 1min,55℃ 1min,72℃ 1min,和最后8min 72℃。隨后通過凝膠電泳分析PCR產物并通過T/A克隆(Invitrogen)將具有合適大小的片段插入載體pCR2.1 TOPO中。在轉化大腸桿菌TOP 10F’之后,分離質粒DNA(Qiaprep Spin,QUAGEN)并進行測序。
將獲得的序列數據與官方EMBL核苷酸序列數據庫(http://www.ebi.ac.uk/embl/)相比較并進行評估。用FASTAX獲得的序列比較對于來自Ulkenia sp.SAM 2179的PCR主要產物與來自Schizochytrium sp.ATCC 20888的PUFA-PKS(ORF A;ORF開放閱讀框)的酰基載體蛋白產生部分同一性,其在氨基酸水平上為大約90%(圖7)。令人吃驚的是,為了確定在Ulkenia sp.SAM 2179中的這種PUFA-PKS,僅須實施單次PCR實驗。這說明所用寡核苷酸的特別高的效力。
實施例2由來自Ulkenia sp.SAM 2179的基因組DNA生產基因組文庫在500μl體積中以2.5U Sau3AI于37℃ 2min將來自Ulkenia sp.SAM 2179的50μg基因組DNA部分裂解并且接下著立即用相同體積的苯酚/氯仿進行沉淀,隨后用乙醇沉淀并溶解于蒸餾水中。隨后根據生產商的說明書用SAP(蝦堿性磷酸酶;Roche)將Sau3AI裂解的基因組DNA去磷酸化。隨后通過將該反應加熱20分鐘至65℃來進行酶的滅活。將粘粒Supercos I(Stratagene,圖8)用作載體。將10μgSupercos I用XbaI于37℃完全裂解幾小時。隨后將酶于65℃加熱滅活20min并且根據生產商的說明書用SAP(Roche)將剪切的粘粒去磷酸化。在這里也通過將該反應于65℃加熱20分鐘進行酶的滅活。隨后用BamHI于37℃將XbaI裂解的和去磷酸化的Supercos I粘粒完全裂解幾小時。隨后將剪切的粘粒DNA用苯酚/氯仿進行沉淀,用乙醇沉淀并接下來溶解于蒸餾水中。為了進行連接,將1μg用XbaI和BamHI裂解的粘粒DNA,和3.5μl Sau3AI裂解的基因組DNA組合于20μl的體積中并用T4連接酶(Biolabs)根據生產商的說明書連接幾小時。隨后根據生產商的說明書利用Gigapack III XL Packaging Extract(Stratagene)將大約1/7的連接物包裝在噬菌體中。隨后將后者用于轉染大腸桿菌XL1-Blue MR。隨后以PCR篩選的形式由QIAGEN公司(Hilden,Germany)由基因文庫中進行PUFA-PKS特異性粘粒的分離,所述PCR篩選利用Ulkenia-PKS-特異性寡核苷酸PSF25’-ATT ACT CCT CTCTGC ATC CGT-3’(Seq ID No.83)和PSR25’-GCC GAA GACAGC ATC AAA CTC-3’(Seq ID No.84)。隨后對由此確定的粘??寺19F09的粘粒DNA進行分離和測序(Seq ID No.1)。
實施例3來自Ulkenia sp.的ORF3的鑒定為了鑒定來自Ulkenia sp.SAM 2179的ORF,寡核苷酸源自不同PUFA-PKS的高度保守的序列片段。令人感興趣的是,對于PCR擴增似乎合適的非常高的部分同一性出現在個別物種之間編碼脫水酶/異構酶的序列片段區(qū)域。
3.1包含編碼PUFA-PKS的基因的基因組DNA的分離參見實施例1.13.2利用PUFA-PKS-特異性寡核苷酸進行的PCR反應將下列PCR引物用作PUFA-PKS-特異性寡核苷酸CFOR15’-GTC GAG AGT GGC CAG TGC GAT-3’(Seq No.85)CREV35’-AAA GTG GCA GGG AAA GTA CCA-3’(Seq IDNo.86).
將在上述3.1段所述的來自Ulkenia sp.2179的基因組DNA稀釋到1∶10的比例。隨后將2μl這種稀釋液轉移入50μl體積的PCR反應混合物中(1x緩沖液(Sigma);dNTPs(每種200μM);CFOR1(20pmol),CREV3(20pmol)和2.5U Taq-DNA聚合酶(Sigma)。在下列條件下進行PCR94℃初始變性3min,隨后30個循環(huán),每個循環(huán)于94℃1min,60℃ 1min,72℃ 1min,和最后8min 72℃。隨后通過凝膠電泳分析PCR產物并通過T/A克隆(Invitrogen)將合適大小的片段插入載體pCR2.1 TOPO中。在轉化大腸桿菌E.coli TOP10F’之后,分離質粒DNA(Qiaprep Spin,QUAGEN)并進行部分測序。
將獲得的序列數據與官方EMBL核苷酸序列數據庫(http:∥www.ebi.ac.uk/embl/)相比較并進行評估。用FASTAX獲得的序列比較對于來自Ulkenia sp.SAM 2179的PCR主要產物與來自Schizochytrium sp.ATCC 20888的PUFA-PKS合成酶的ORF C產生部分同一性,其在氨基酸水平上為大約80%(圖9)。令人吃驚的是,為了確定在Ulkenia sp.SAM 2179中的這種PUFA-PKS,僅須實施單次PCR實驗。這說明所用寡核苷酸的特別高的效力。隨后以PCR篩選的形式通過QIAGEN公司(Hilden,Germany)由實施例2中所述基因文庫中分離PUFA-PKS特異性粘粒,所述PCR篩選利用已經用于PCR的寡核苷酸CFOR15’-GTC GAG AGT GGC CAG TGC GAT-3’(Seq ID No.85)和CREV35’-AAA GTG GCA GGG AAA GTA CCA-3’(Seq IDNo.86)。隨后對由此確定的粘??寺?58G09的粘粒DNA進行分離和測序(Seq ID No.2)。
序列表<110>努特諾瓦營養(yǎng)產品及食品成分有限公司(Nutrinova Nutrition Specialties and Food Ingredients GmbH)<120>來自ulkenia的PUFA-PKS基因(PUFA-PKS Gene aus Ulkenia)<130>SCT064799-47<160>86<170>PatentIn version 3.1<210>1<211>43372<212>DNA<213>Ulkenia sp.
<400>1ggatccacag cgttcattta ctcaagatca cactcgtgtg cagtccttga accttgggaa60agctcatgtc tctaggtatt gctgtcatgg tttgaaattt tgtcctcaaa agaatcgctt120gtaatttttc acttggtggg gtgcacaatg gtctctcaga accatctgct ctaaggagtc180ctactgacac ctacctacca cccttccttc atacccatgc ctactaacca acctattgat240aactctaacc agggttctat gataggcaaa tcagccaatc tcccgtggaa attagtcttt300tcaatcgttg gccagcaagc accatcgcaa cgacagcgct gcatcagcag gaactcgagt360acgcttcacc gtcatcgtca tcggtatcac cactattcat gaaatcagaa cctagtcacc420cagttacttt ttacgaggca gttgattctg tggagagatg ctcctgatca atggatatgt480ctattttatc tacaggtcac acataatcaa tcattcgggg tcatgatttt ccgccatggc540gatagtccaa aaaaactcag gaggcaaaat cattgttcaa tttacaacta cccacggagt600aaattaatgt aagagctcca atttacaggc aggtatatca tcacggtgtg ctgcagtagg660ttctgggtta tcatcctcaa tcattcataa acataacatt cattcataaa cataacattc720attcattcat aaacataaca ttcattcatt cattcactca ttcactcatt cattcattca780ctcattaatc cgcttaattt aactttaaat tgattgattg attgattgat ggcagaacca840cctattagca attggttact ccttgtattg aaaggcctga ataagtaagc aagcaagcca900ttggtaaacc ttcctcgccg cgactcgagc gacctcgaga gcggtctgag tgagtctctc960acgcaggccc cccgcctcct gagccgtctg tctcgctcaa ctgaagctcc gacaagccaa1020gctcacagct gcaagcttgc aagcaagctc gcttctgtct actcgtcctg catcgaatca1080acaaccttct cttacgccat gacggacgcc tcttccgaga tgcgcaagcg taagcgctac1140gcataccgca tcctcactga tgagtcatcc tcctcccatg caccctctgc tgaggatggt1200tccgtgcagg actctcgtat gctccgccat gccggcagca tctgggatgc cgaagagcgc1260
cgccgcgctg gcaaaatgtc ctcttccgca actgcagcca tgtccagtgt acctcctgga1320gaggaactct ggcttgtgtc tatccctgcg gacttcgacg cccatgacct caatggcctt1380cgcctgtctg ggaagaagcc cctcgcggac caagaaatcc aaattggcgc tacccacacg1440ctcactgctg acctgctctc gggctcttct caggtgcggt gcctgcgccc tactagctcc1500tatgtcaacg gcctgaggct tacaccgcct gccgcgcgtg ttttccacgt cgtagagcgt1560gatgccgctg atgatgaggc cagtgaagcg ggaggcagtg cccaagagga ggaggagcgc1620ctgcgcaagg ctgaagaggt cgtcaagaga cttttgccga agccgcgtga gcaaattgaa1680tttaggactt tttctatggc cgacaaagag gaactgctga agcgcatgca aaaggcaaag1740gcgcgtggag agaagaagag gggcagaaac gcgattaagg aagaagcaga agacgaggag1800gacaaggagg aagagaagtt ggtggccaag acagcaaaga aggacaagaa gaagggcaag1860aaggaaaagg agaaaaggcg caagtctgtg gcctgagctg gaaacccctt taaagtgaat1920aaaggctgtc ttgacatgtt caagaacgct tattcgatac atgaagacgt gctctggggt1980tatttcgatg aagcctgatc taaatactag tctgcttcag aatcatgcac agtgttcaaa2040ttgattctta actacagcct acgctgaagt tcagcttcaa attttggtct attttgaagt2100tcttcaccga aagtcatttc tagagtcccg ccccaaagtc tgatctacac tctctactcc2160attaccgcta atatccttta caactcttat ctttttcgac ttcttcaagc gctaaggagc2220ggaccactaa actgatgcaa gcttgcatca actctacgac cttttttatg tcaacacaag2280ttctggcctt acgctgaact cgtctctgat acacaatatg caacgaacac cgccaagacg2340gtcgctcatg cacatacgca cacatatata caaccaaaca tacaaataaa cacataagca2400ttggtcaagc cagctacagg accaatattc catcttttgc tgcttttctg caatttgggc2460cgctttttta tgtttggctg tatatatttt tcttggcatg caacctaaca agacacatga2520gcagaaaaaa taaatacggt caaagtcttg tctctgatgc tcatgtcttt cttctaatct2580taccagcgag aagacctttc taaagaataa tatcacatat actcaattgt ccaaattgct2640ttcaataagc attctttact ggatagctct cgccaaactg tcattcttag gaacactgct2700aatacgtggc tgaaagcact cccaacatgc acttttattc ctatgcattt tcttcttgga2760gctcaatttg acaaaatgcc ggtcgataag ctcgcggtct tgactttgat gcttacttcc2820ttgtttaact cgaaaacctt ctcatggctc attggaaaat catcaaatgg attatctatc2880atcttcactt aacccaattt ttgtttctct aaaacagccc caactatttt ttaaagaaat2940ttgtgtgctc tatcttctgt ttgcaactca aactaacaag ccacatcaac aaacatttat3000ttttttcaaa cttgataact ttagaccaac tttgcatcct cgatgctcgg gactccatct3060
taccccttgt caggtatgaa gcatctgatg aagcttgcag tattattacc ttttccagaa3120cactactgct accttcaaag atttgttcat ttcttttctt tgggggaaac aatgaatgct3180gattacccga agcgtaatat ggttgttgca tatattcaaa tattttaaac cttctaagta3240tttatatgat aggtatatgt tatttttaaa gacctttaat gcagttattt catatcaata3300accaagctct cgcagttttg cgctgtactg gcagtggtgg aggacccgtt gatctttata3360aaataggatc actggaggaa ggtgagacca ggaaactaag actatataag tttgtgggtt3420tctgtcattg tcactgacaa ggatcaaagt tatcctaatg cagagcatcc aacctttgtc3480tcagggaccc acccaatcca ctcttcaagt tttcactttc aatttcaggc caatttaaga3540caggaataca actcaaacta aatcaggatt cttctttttt aactcccagt catgcgatct3600ttaaaattga tcacattgcc ggcataataa ccatgggttt cgcaacttcc tccctggttt3660ctttgccaaa taaaacttcc acacactcga gagcaaactc cattgccgtg ccaggccctc3720tagacgtcac aattttggcc tcgtgctcaa ccaccacgcg atcctctgaa catcctcctt3780gagcgctctc aagatctttc gcaaatgccg gatggcacgt ggctctgcgc cctttcacaa3840tacccaaagg tgctagcacc accgctggag cggcgcaaat tgcggcaacc caggctccgc3900gcgagttctg tgccaacaat agcgagcgga gaggctcgct cgcggcgaga tttgaagcgc3960cgggcatccc gccaggaacg attacgaggt cgaaagatgg agatgtagaa ctggcatcca4020ggaggtcatc caaacgaaca tcagcctcaa tacgcacgcc tcgagaacag gtgagggttt4080ttccgctatc gcaaacagcg gcaacaatca cggaggcacg ggctctgcgc aggacatcaa4140tcggaatcac gctttccatc tcttcactgc catccgccat gacaaccagc acggacggtg4200gggaggaagg ggaagaagac atcgtcgaat tatgggaaac gtcgagactg gagcaagcgg4260gggcgattgt ttaagcgagc acaaagtgac gaggaattga gttacaatgt gaatctatag4320ataaataggt acctgtgcct tgcgacgaca gaaagatatt ttctcataat aggcctatct4380aaaaccaata attttgaaca ttttcatcat tgacgaaaag ctcctgcctt ccaaattgga4440agtgactatc cttaatatag tgcaataacg cattggacca aacagaatcc tcctggaggt4500gaccaccatg ttaggacctt gaacttcgca attgattggt ttcgaccttt tctccctcct4560tttataaaat aagcggctca aattaattag cctatcacgg tttctctagt ttttgggggt4620ttcgctatta tttggttatt atgaacaaat gtacagcttc ttacttacca gcctcctcgt4680tcagcatggt gaatgcatga aataaggaat caacttcatg actcatgctc tgcgtacaac4740attagattat ttttgcatgt ggtgttgaaa gtaagtcttc aagtcttttt cgtcaggata4800aaaactttct ttcatttgaa gttgtatgca agtcgcacca agatgtgatg actattttgc4860ttttcattaa ctttcctttg cagcaaaaaa gctctgtgcc tatgaaagcg ttagaactta4920
cttatataac ctccaaatgg tagtgactat tccacctaaa ttacatatca taatgattta4980agtctttgtt aaaaagtgga tgtttggtaa gaaactggaa taactaaggg accactaagc5040tccagacact acaagtgaag caaatcttca atttaaatta tcaaagtact tcaaccaaaa5100ttttagcgtc tcaacaagta cccttcgtgt gctatcccgg aggcaatcac atgtgcacaa5160gtaacgatgt tgaacgtacc tatggctctg gtttattttg gcagccatga gcaacgcaac5220actgaccgta tctttctcta cgctacaatg tcctccgcca agcaaaaaga gaatatccca5280gctcatttgc aaagccgaga ttttattcct gccagtggtg tcaactggtc atttacggag5340aggattgcac ttcaaagcca tgcaatgaat gtggtattat ccacgacaat cttggaaaat5400ccaagctttt aaaatgcccc aaaaccatgc aaacacgtag ccgatcgtga tatccacgcc5460ctccagctgc gccacctatc caaggacatg gtttaagaat tgtcgtttgg tcatatgtta5520gttttcaacc cgcaattggg ccttagtcca ccttgttacc ataggaaatg caagctttgc5580aaattttgta ggctaatctc taagtgtagc ttttgtcatt gtaaagacac aattcattga5640catgaggttg aaagctgttc tcatatgtaa caatccgcaa cattgactac gtcacatgtt5700cgtgcataga gggaacactt atcttgcata gtatgccctc acaactctcc tcccccgtac5760agcaatcgca cgcaccatca tttattcaaa tgagacaata cttgctatcg tcccgattgc5820tctttagttg gacatagaac taaatgcgcg tcgcgatgcg accggaaagg tttaccagca5880gactgttctg caatcgttcc gtaccctatt tcacaacatt agtcgatcga tcagaacaaa5940tcaagataga acctgcagga ggggtcgcgc aaagtttagg cacccaggca cagccgctct6000gtaagtggat tttcattcaa ttgtggtcct gtgcattcat tgtttgctcg tgtagcaaat6060agaaccacaa ggggttttgc agaaagaaaa caaggatcat ggggcgaaac cgaggccaga6120cggcgggacc actcgaccgc cagtcgaggt tcatgaccaa ggttctgcgg caccgcgcgg6180cagacatggg tcttgaaatg cgttcagatg ggtttgtgcg cgtagaagac cttctgaaac6240ttcagcaact taaagacatt ggccttgagg atgtcaaagc tattgttgct gctgataaca6300aacagcgatt tggccttcag caggaagagg accagacctg gtggattcgt gccaaccaag6360gtcactctat ggctagtgtc gagacagaag atcttcttga ggaggttgac ctcgatggga6420tttctctctg tttgcacggc acctatttgc ggttctggcc attgatagta cgcgatggtt6480taaagcgtat gcaacgtaac catatccact ttgcaacagg ccttcccggg gacgatggtg6540tccttagtgg atttcgcaac tctgctgagg tgcttattta tcttgatacc gtgcaggcga6600aaaaagctgg actcaaaatg tatcgctctg caaaccaggt gctcctaagt ccaggtcttg6660gcgacagtgg agtaatccct gtcaccttgt ttgctaaggc tgtcgagcgc cgctctggaa6720agctactttg gccaatagag gaaggtaaag agtcgcaacc ccctacagcg cctacttcag6780
accaccaacc tcgacaagga caactagcaa gtaagcgaaa agctggtggc cacaacaaga6840aactatcgca catgcttagc cgtgtcctgc ggcactctgc agttgatgaa ggaatcacca6900ttcgtgaaga tggcttcgtg cgccttgaag atctccaaac caaactcaag cgtttcgaaa6960atgtaactct tgatgacgtt caagctgtgg tgcgtgacaa tgacaaacaa cgcttcacac7020tacgccagga gtcagacggg tcctggatta ttcgcgcaaa ccaaggtcat tccatggctg7080ttgtcaaaga atcttttctc ttgcgggaac ttgaccctac cacaattgat gtgtgtcttc7140atggtactta caaagaagct tgggcaaaga ttcgaaaaac tggtctctcg cgcatgaacc7200gaaaccatat tcactttgct cgtggattgc cctccgactc caatggtgtt atcagtggca7260tgcggaaatc atgcgaagta catctctata ttgatgcctc tgcagcaggc aaagatggga7320ttaaattctt tgaatctgac aacggtgtta tcttaagtcc tggtaatggt gatggcatta7380tccctcctaa atactttaag tctgtcacag atcgccaagg cgcttcctta gaaaacctaa7440aatgacaaat tatgtagatc ttagttgttg aggacttcat gtcctttttg ttgtttgatt7500ccttgtatag cttatacacc ctggttatgt acattgtcat tcttgttaga ggcaattctt7560catctttgat tgatattcta tagaacttcc tcatgggtgt acctatacac aattatttat7620tataccgtgt gatattgtga ggttctaaag ttagcatcgc ctctgacacc tatgatggat7680gcagagtgac gccaatcctt cctctatatt gtgcgtgcct gctcgagaat caaatgatgt7740taaaagtcgt cttcattcat tatataacag agcataatgg aataataaaa ggaggcagga7800gacaagggta cttctgttgt gtaaaattcc attactatgt tcgtgtatag tagtattcct7860tgcctttagg atagtaggga agatattctc tgtgactttc acctacttca ctcttatgca7920agctcttatg caatcacaga tggatgtaga ttccgcttct tcattctcac tacgagaaca7980gcgcaactac aaatcttaag gactgtcaac tggcctgaaa tagtgaccaa ttatatattc8040caaaataaat ttatttgtat aaaattgtaa agatgcagca tgatagctta ggtacacata8100aacaacggtt aagtgtatag ggatacgcaa acgcaagcga gaacatgcaa gcgagaccat8160cgcctttcac cataatgtta taaatgtcta ttcttctgcc aagagcacga tacactcaac8220gttggtctaa gcactaaaga cagcatgtat ttatgtaagg acaacaacaa gcacctatac8280ctcaaaactt agtaataggc ttactaaaca ttctaacact atgatcttca tgtgaaaata8340ctcagcagca tggatgttga agctccacaa atggaataca gaaaacacaa tctagcaaga8400cgatgaaaat tgttcttagg tttcaggatc agaataacca aaatgcgcac cacacctgtt8460tctgatgctg tagctgtcat gttatggtaa aaacgtgcac agggcaccac tagcctgtta8520ttgtgtcgat tttgatacag tttatcacac gagagcttac tgactatgtt gtagaatgta8580aataccctat tcaaataacc ttgtggacac actcatccaa catactctac tcaactctta8640
ctaaaacaac caaaagattc cgctgaacta gaccaaaata atttgagtga tatgctgcaa8700ttcgtttgaa cacaatacat gtattgatgg ctgagatatg acttgccaaa gattgttcgt8760tgcaattaaa gtttactctc tgagtgcata tactcaatac aatgcagctt tatcgtggaa8820atccgggcta agcatgccat taggacccta tagcaggctc tgggcacgat ctttatatct8880tagcgatagt ttgtgcagca aaataatgga taaatcaaac ttcaacgagt cttaattcat8940agtttcgaat ccctacgagg ctatatatat aaagaaggtg tgagtcgaca gcacagttat9000gtaggaaaag ttataattat gtggaaaata accttagttg tcgaatcgtg gtgaataaaa9060gcttcattta agcgttttca gagatgccgg agcccatacc aaatattaat ttgctcaaag9120tcatcaattt cttatttgat agaatctaaa acagctttat attatatgaa gagcatatat9180attttaagct agtttagact tcaaccaagg ggatccaatt ttcgctcgtc actctgcgtc9240aaggtcgttt gcaaaaacat caaatctggt gcaagctcaa atgactaggg tcaataagga9300ctcctactaa ttatagttgt cactattatt tccactagga accgataaaa cagatgtaat9360taactctctt ggcgcttacc ttgtatagca agagtaaaga gtaaatgatg cggcaaaaac9420tatctctgtt acttatatgt tatagagtgc attggctgcg ccatgccata tgatagtagg9480taaactttgg aagttgaaag gggcgagaaa gggatcacag gtgatctata tataaaatgc9540aaatgaaaat tttaaagttt ggaaagttta tatgcgacac ataaaattat aatttgcata9600tgtggattaa gtgaatggaa tgagtctagc tataactact acctatccct atcataatca9660tgggaacaga tcaggagcaa attgggctta caggcgctca gtgggcacgt agatgtcatc9720aatctcggca gcaacctgct tggcgttagc cttcagcggg gcattacgga cagcttcgag9780gcggcgcaag aagcaggcac cacggaggat ctgcaagttg atttgcacaa catcggggta9840ctcgttggca acggcggggt caaggtaggt acccttgatg aagtcgttga aagatccaat9900cgctgggcca caccaaacct ggtagtccat ggcacggtcc gggatgccag cgtttgccca9960gaagctcgcc aaaccaaggt accagcggaa gcacaaggac atcttaagct tggggtcacg10020ctccgcgcgc tcaatcttct ccgggttctg caacctgttg atgtagaagt ccttggtctc10080ttcccaaact tctgacagag acttcttgaa aatgcgcttc tccacacgtt ccagctctcc10140aggagccatg gactcaaagg agtcatactt gacgaagagc tcatagagct tgttggcacg10200cgaggggaac atagttccct tcttgagcac ctggagcttg acaccttcct caaacatgtc10260agctgctggg gccatgcaga tgtcggagta ggtggcttgt gagagctgct tgcgaacggt10320gtcacaggtt ccagcttgct tactcatctg gtttacggta ccagtgacga tgaaggccgc10380gcccatgttg aaggtggcaa tggcggcctg agggcatcca atgccaccac cagcaccaac10440gcgaacgcga aggtgggcag ggtagccgca ctccttgtgc agacgatcac ggaggttgac10500
aatgagaggg aggatgacgt ggatggggcg gttatcggtg tggccaccgg agtccgcctc10560aacggcaatg tcgtctgcca caggcactgt gcgtgcgaga gcagcctgct cttgggtgat10620ctcgccggac ttcagcagct tctcgaggag attctcgggc gcgggacgga taaacattgc10680ggcaagctct gtgcgagaaa ccttaccgat gacgcggttc ttaataaccg tggagccatc10740agcagcgcga gagagacctg cagcacggta gcgcacgagc tgcggggtca aggtcataaa10800ggcggaggct tcaacgacag tgacgccctt ctcgaggaag aggtcgacgt tacccttctc10860gaggttgctg tcgaagggag agtggatgag gttgacagcg taagggccct tgggcagttc10920agcctggata gcttcgagag ccttgcgtac ggtggcgata ggaagaccac cagcaccgag10980agaaccaagg atgccgcgct ttccggcagc gataaccatc tcagcggatg caatgccctt11040tgccatggcg ccggtgtaca tgggggcgga tacaccatat gtctccatga aggcacggct11100gccaagatcc ttgatatcgc acttgggcac aacaatagat gcttcacttg ggcttgcttc11160aacgagatca ccgttggcgt tgacaccaag catcaaagtg ctgttgagct ccaaaagttt11220ggcacggaga gcctcagagg aagccacaac agctccggag acgctgcggg ccggagcagc11280aggggcaagg gcaggagcag cggaaggttt gttatccttg ttgagaatag ggtcgcgtgt11340ggcgtcttgc tcctgaatgt cgagacgctc catgaacttg ggggcaatag gctgcatctt11400gcgagcctgg ataagagcct cgatcttggg gtccgcagga ggaagcttag ctagcacctg11460gggcggcacg agctgctttt tggggtcata gcgaccattg accacaatct tacgcaagaa11520cttgttctta gtaggcttct tgccagccac catatcgttg taactctgcg tagcctcctc11580aacagtctcg gggtggtaca gaggggagac cttcacgcca ggcacgcggt gggcttggag11640agaggcaacc agcttgacca tggttgtcca agcattctcg ttctggcggt ccatggatcc11700ggtgacaaaa ggcttgctat ttccaagggt ggcgcgaatt gcggcgctac ggtgggcgtt11760gggaccagtc tcaacaaaga cgtcaaagtt cttgtcgcta acggtcttgg cgatcttagg11820aaagtctgcc tgaacagtgt acagctgtgc tgcgtattca ccaaagctgg gtgcgtactc11880gtcgctggct ccagtggact tgttaacaag cttcttctgg ttgacgctcg tgtacaggtc11940aaggccggca acctcgggaa tctcgaggac gctatggatc tcagcgatct gcttgccgta12000cggctcgacc acggggcagt ggccacacat accaaggtcc acgggcaaag cagggaggtt12060gctgctcagg cgagcaatgg cagccttgca atcttcaggc ttgccactga tgagagcact12120gttggcatcg ttgacaatgg tcaagtgcac gtacttattg ttggggccga tggccgcttc12180aacggcctcg cgggttccac gtaccacgta tccttgccag aactcgctga caggggtatc12240ttggggaata ttccaggcct tgcggagggc gtcaaactca acagcgaggg ccttacgcca12300
gacctccgag ttgcggagtt tagttgtcag ctcctcagag acaaggccgt tcttctcaga12360aaaggcaaaa accatggaaa tctctccaag gctcagtccg aaagcagcct tgggctggat12420gccaagcacg tcgcgagcga tgtgggtgaa gcacatggac atgagaatac cgagtcggaa12480catctccacc tggttgcggt tgaactcatc ttcctgcgcc ttaagctcct ccttcgtcga12540ggcgcgcggg atcaaccatc tgtcgccttg atcccaaagc ttgttggtct tggcgtttac12600aaactcgtga agttcgggcc agatgcggtg aatgtcaagg ccgataccat agtaagggct12660tcggccttcg ccgtacataa acgcaacgcg atcgcttgac agtggcttgg gtgcaaagtg12720gctgcccgag ggtgatgtcc agtcgcggcc catcttaaga ctccgcggga tgcccttgga12780ggcgagttca agctccttct ggagcttact aggagaggtc accaggcaca gagcgaaggc12840cggcaacggg gtcttggtct cctgggcaat gctctcgccg agcaactcca taaaagcaag12900acgtacatta gcgctaggct gggcgaggcg ctcgcggagc ttgtcaacac gctgcgtgat12960agcgtcatgg gagtctccgc ggattacgag gagtttgacg gcatcgtcat cgagcgaaat13020gcggctcttg gtctcgtggt ggccctccac atcagagagc agcaccgtgt agcatgaacg13080ggtctcggaa acacctgaga cagctgcgtg gcggcgagct ccagggttct tcaaccaggc13140ccgcgaggac tggcacgcgt acagagactt gccccactgt gtctcaggtg caggctcctc13200ccaggaggcg ccgtttgagg gcaagtagcg gttgtacaga cagagagccg tcttgatgag13260actggcagct cctgaggcgt agccggtgtc accgacagtg gacttgacgc tgctgacagc13320gacgttgtgg ggctccacag cttcgttgct agagcgctgg ctgagaatgg cctcaatgcc13380gcggatttcc tcctcagcag tgagttcctt aggcagaacg gaggggttct tgaggtggcg13440ggcagagtca gcggagagct cgagcatctc aacgtccttg gggttgacgc gagcctgggc13500gagagcctcc tccatgcagg ctgccggcat gttgccgggc acgatagcgt ccatgcaggc13560gtaaatgcgt tcgtccttgg tgcagtcgct ctcgcgcttg aggacgaggg caccacatcc13620ctcaccaaca aagtagccgt cagcgccgga gtcgaagctg gcccgcgggc tctcctgctc13680cgagaccttg aaacgacgcg acttcacgta gagattctca gcgctggcgc aaagatccac13740accggcgatc actacggcct cgacctcgcc agtctcgagc aagtacttgc ccaactctgc13800gcaacggtag acggagttgt tgccctctgt gatggtgaaa gaaggaccct cgaaacccca13860ttgtgaagac acgcgggtgg ccacgaggtt gccgatgtag gatgtgtacg aggtagcggt13920accgcaatcg ttgatgtagg acatcatatc attgagggct gaagcggctt cgggacgagc13980acgctccttg agggcaacgc gggcgcggtg acggtagagc tcaaggtcag tgccaaggcc14040gacgaagaca gcgaccttac ctcccttctt gaggccagag ttgagaatgg cacggtcgat14100ggttgtgaca gcaagtagct gcatggggcg caacatgtcg tctggcgtca tgggcgtgcg14160
caggcggcta aagtccacct cgacgtcctc aatgtagcat ccgtggggca cctccttgac14220accgcacagg tccaaaaagt ccttgtcttt accaaggaaa cgccagcgct tctcaggcaa14280tggcacagca ccatgttggc cattgtagat ggcacgctca aaggcgtcca ggcccttgag14340ggagccgaag gtggcatcca taccggtaat agcaatgcgc atgttgccct ccccgccaca14400acgtgagctg agggaactga tgctatcgtg ggtggcacag gcagccttgg agcggtcaaa14460ctcctcaaag actgcgtggg cgttggtgcc accaaagccg aaagcggaga gaccagcgcg14520cttgggctcg ccctcagtgt cgggccatgg gatgggctca gagaccacaa gcgggtccat14580ttgggaagat ccatcgacac caggagtggg cgggatcaca ccatgcttca tggcaaggag14640taccttgcac atgcctgcga aaccagctgc aacgagtgtg tggccaaagt tacccttgga14700gcttccaaag cgaggcacct tgccctcgaa gcaagccttg acggcatcaa tctcaacgcg14760gtctccctgg ggagtacccg ttgcgtggca ctcgacgtac tggatcttgt gcgggtgcac14820gttgacgcgc ttgtaggtat caatgaggca ggacttctcg ctgggcaagt gcggcttgag14880gggaagacca cagccagcat tgctgatggt agcaccgagc agagtaccgt aaatgtggtc14940tccatcgcga atagcgtcgt caaggcgctt gagaaccata atggcaccac cttcaccagg15000ggtgagaccc tgactgtcct tgtgaagcgg gtacgagatg ccgtctcccg atacaggcat15060ggcctggaaa gtggagaatc cggagagaat gaaaaagggc tccgggaagc aagttgcacc15120agcgagcatg acatcagcag caccggaaac gaggtggtcc tgggcgaggc gaaggacgta15180aagggcggtg gcacaggcag catcgacaga gtagtgaaga ggaccgaggt tgagctcttc15240tgctacgaag gatgccgggt ccataaagat gcggcggtca ccagcctcgg ggttctgcga15300ctgctcacgc tcggaccact tggaggcatc cttgaagacg cgagcgccga gtttcttttc15360gacgtggttt tggtacacat tgaggagttc gccctggagg ttgtccatgg gaaaggacag15420gcatccgctc acaataccgc accttgtaga gtcggagacc gatgtctcgg agagagcctt15480cttggagagc ttaaggagaa gctcgtgttc gttatcgacg gagtcatcga cgcagccgta15540gttctcgttg caaaaggtat ctgcaaattt gctacgctct gctttgaagt gctcggctcg15600cttgttggat ccgaggcgtt tatcgctaat cttagtccat gcagcctcac cgcccatgac15660tactttccag aactcttcct tgtctttgca gcccgcgtat tgcacggcca tgcccaccac15720ggcaatgcgc ttctcgtcgt gcatttcgtg agcagcgctc acattcttgc gagaggccat15780ctttttgctt tcttgttgct gcttactgta aacaaaaaaa agagcttgcg tgtcacctga15840ccggcacttt tagatcgatc aaaaagcggt cgtgtagatg gtttgctttg gaggagatgt15900ataaatgatg tgattgacta ccttgagcaa gtgattacag ggatgccaga gcaatcaaat15960aatcaatcag ttaatcaacg ccgtaataaa ggctatcaat caatcaatca atcaatcagc16020
caactagcta gccgaagctg cgatggactg gcgtttggac agcgcgaagc tgtaggaact16080ggcgccgcac gagctgcgag gctgccaagc tagaggctgt ctgcctttgt ctcactcctt16140ttccgaggaa ggagagagag agagagagag agagagagag tggggggatg aaagtttgga16200tgcacgatgc gtgctttgtg gtttgtttcc ttgtttcttt ctttgcttgt tttttctctc16260tttttctttg ttattttgtc tctcttgaag caaatagaaa gaacctcgaa ctagacgctc16320caaagggtct tcaagaggtc tcgaaggcta ggctggcgaa agcgcgcacg ctggtcaagc16380aagcaagcaa agcaagcagg caagcaagca agcaagcaag caagcaaagc aagcaagggg16440tggattccac gaatgcgaga agtcaaaact ctgcttcaaa cagagaacaa atgggcaaac16500gaatgaggat aaatgagcaa ctaagtgaag tttacatttt caaaactcaa caaaacgatt16560acccaatcaa ctatgagacg cgcagacgtc tgcggcagca tctcttttat gattttcaaa16620aacaaaaaca aaaaccaaaa caaaataatt tgcaacaaat taatgaaaag cgaaacaaca16680aacagaaaca ttgtttaaac taaaaagtca tttttattga aaatctgttc ttttcatctg16740tacgtatgta tgtttgtatg tacacacttt gcttcatcgg tttattcgag tgctcttcat16800tcttgaaatt gccttagttc ttgctgttat aactgtcaaa caaacctcgc gaccttgaca16860agcagctcca cctcaccttc gggcctgctc gtttgccttt ctcgcttttt tcgcgatctt16920ctgccatcct tgcctactct gtccttatct catcaggctg ctgcggcctc ttgacctagc16980agttcaagta taattaattt gaaaataaac aaaaaaacac tgccacttat tatgcagatg17040gcactctctc agtgttgcaa aagtagagtg aaattctggt ttacaaaaaa tatttattta17100ataaacaaat aaaataaata taaattcatg ttatgttaga tcattttatt ttgttttctg17160agggcgcgat aaacgcttac ttgagaacca agaaaagcaa gaaaagcaaa ggtgcgaaag17220aagcaaacac attgatttcc ctagttccca ccacttcttt ctttctttgt ttgtatattt17280gtttgtttct ttctttcctg ctttgttttg tttgttttgt ttgttttgtt tgtttgtctg17340tttgtctgtt tatctgtttg ttagtttgtt agttactaga ctgctaattg atttgaaaac17400caagccaaac ccacgcaatg aatacgcaga aagcacagct aaaaagaaga agaagaggag17460gaattccgaa tcaggcgaga aagtctcgaa agcagtgcac caaaatcctc atttggaatc17520aaagccctcc ttcccagcga ctacggaggc ccacgacgac gacgacgccg acgacgccgc17580ccgcccgccc atcctcctct ctctccgcct gctcctcgtc ttctccctcc ctccctccct17640ccctcgcgca cgccgctccg aatggaatga catgactgac gcaagcgcgc aatggccgcc17700gtgcgatggc tcgaagcagc atcgcatcgc attgcattgg cattattcat tgattcattc17760attgattcat tcattaattt attcatttta attcattcat tcattcaatc attcatttat17820tcattcattc attaatttat tcattttaat tcattcatta atttattcat taatttattc17880
atttttattc attcatactc ccgagcgcta cccggcgcta ggtgggtgct aggcgtggat17940ggagcggacc tctctgccag cagaaagagg aatgaatcta tctggatact gcgcgcagct18000tcttgcttgc tttgcttcaa cttgcttgca aacagccagg aggccgaacg gcttcgaccg18060ctcagcgtgt tcgccagcaa agaaccacct ccgccctcgc agtcgccgga tggatgaacg18120agcgaatgcg aatcctcctc cgatcttgaa cctcgaacct tcaatcaact tgccttaatt18180ttactttcat gactctcact attttaaata tacatgtatg tatgtatgta tgtatgtatg18240tatgtatgaa tgcacctcat actgataggg acctgcgggg gactgatacc acctgtctga18300atcaatttgc gagaccgcga gactgagtgg caggtagtag ctagctaagt agctgcctaa18360gagtctatcg gcatgcatga atcaaaaact atcatgtcaa tgttcctttg aggcttcgaa18420gtccgtcatt tgtcacgaaa ggttttgggt gaacgatcca ctgtttcgag agagatggtg18480tgaatgtata ggtgatagtt gccgagctgg cgagccgtcc caagcggtgc cggcactcac18540ccggctgaag cttcttacat gctctccgtt cataatcgtc caaattgatc ctgattcatg18600attcatgatt catgattcat gatgacacga gttggagttg gacgataagt cagcgctcgc18660tcaaccaaac tacctctgct cgcctagctg ctgttaggta gtgctactga ggcaggaccc18720aacttgaagc tacctactgc ctaggtattc ctacgctgtt tcgctgattt gcaatctctt18780cgttaccaag agataaaatt aacgagttat gacattgcgt atgcagacta cataataaag18840attgtgtcat ttatttataa gtggaaaggt gtaagatcaa gaactaagca ctaggtagca18900attaggcgtt atttgttagc gcgtggaaga aaatgcctct ggacagatag ctattaatag18960ctattaatag ccggtgttgt atttacaacc ttctgaaaga atttctccat agaggaaagt19020aaagaaacat cttattctgt gaaaagagat aaacaacttt ctagaaaatg gatgacagag19080caaagaaggt cgatcgtctt caaccgcaga tctgggaatg ctaaggttgg cgccaggctt19140acattatgcg tcatgctgac caaagggcgt aaagtgccga tgggcatccg atatatgcgc19200gttcaaggtg aggaattcaa gatcatcaag tttgtttgaa tttcgaggtt gaaaacacag19260agttttgaca atcgatcaat caatcaatca atcaatcaat caatttaaaa ccaatttaaa19320accaaatgaa tgagtgaatg agtgaatgac tgaatcaatt taaactaaat gaatgaatga19380atttaaaacc aaatgaatga gtccttagcg atttcaagtt ctgcagtgaa atctacaaat19440ctacgacgaa agtagtgaga tcgtatcaac gtgtatagac agacaatgat gctgcggata19500cctaagtgct tgcgtggagg gactacgatg cagatcccga gttttaggtc ctagttcctc19560cgttctctgg taaaaaagaa agcctctcct tcttgacgcc attcagcgac gtggaacaag19620cgagacagag gcacaagttt tggagtcatt gagtcgggtc tgctctgctt tgaggatgaa19680ccaacgacct tcggagtctt gcagatagat ggtccattct tcaaacgaca cagagatcgt19740
cgtctcgcgt aagttggcag tgggtctaga gctagctaaa aacatctgac agagagcaca19800tacagagcta aagaggagtg tactcggcaa aatagcgtgg acggatgaca tcatcaatcg19860ctcagctttt tcgtttctta ccaaaaaatt gacaaaccag agaaataaat agattgactc19920aacaaattaa attaaaacaa taaattaaaa aagatctctt aaagaagttt tctgaaagaa19980accaaaaaca ataaactctg cgacaagaac ttgaggccag aagggatgaa gaaggtacgt20040atctagatgg tgactgggga cacaaagaag caaggtctga attctcagaa gccagctgca20100gccagccagc tactaggagt gtctgccagc tccgtcgtca tgccacgagt gtccctgcca20160acgcttcaag cgtacttgca acttttattt gattactaca ttactacatt ataacttcat20220ctatagcttt aaaaaggaaa taaaggaaat aataaaataa atcaaataat ggtaaaaagt20280tataaataat caacgactaa aaaggaattt tattcgaagg tcctcggcag gaaataagtg20340gaatcaaaga gaaggcggga acggtaggga ccatacatga tagtcccaaa ctgaggaact20400acgaattgcg gggctaagca aattcatagg atcccagtta gggacagacc ctcgaggtcc20460gagttggtat cctgggccaa agcttgcgca agggtgctct agagctacaa ctcaatacca20520gtagttgcat ggccatctct gatagctttc ttcatgaata tggggtgagc ttagagacaa20580gcagtagaca ctctgtgacc tacgagctat atttgctgtc gcagagcatc tcctcaaaat20640aattcatcga agaaagacgg attgaaagtt ttgccttatt tgaacaaagt taatatttta20700actctcggta gttaaaccat gatagctcat ttatagcgta ggctgacaca gaagcgtagg20760ggcttagacg tcatgatgat tcgtgatgaa ataaatcaag gattctcgaa cgttgacacg20820cgcaatggag cgtgccaatg tcaaaagggt attgctgtat catcaacgta ggtaggtagt20880caaacgggct acagctctgt cctattcact cactaagaca aaatgttttc tctcaaacgg20940ccagctcgaa agtaatattg ggagcaagaa tgaaaatcat tctccagtac acttgcagtg21000agatcaagtt tcaagaccat caaacgatac gatacaggag gtactatctt tgctgaagtc21060agtagcagca gcattacgag cctggtagat ataaattgat aaaaagacaa gaggtatatc21120atatttcaga gtagagtaca tactgagctg gaaacataaa actagtgcac gcaatcgacg21180gttcaacttt tctcaagacg cttccagtcg tttcttaatt agctcagatg gtagcaaaag21240tgatatgcgc atcagacttt cgtaaacgta aaactcggca tctgtagatg ttgagtcatt21300gttttcttca ataatttact tctcgcagca gtgcacttgg aaaggtttgt caagtttgac21360ccagctaatg aaacacaaca tcatcaggcg gggctcgaaa agtagatctg aaagtctata21420aagaatgaaa gttactctca acacagaaag caatttgtgc aaacataaga gagaatggcg21480tctatgctgc aagagaaaat tcgacggtcg catcatagtc gtctacactg ctgtgcatgg21540gcaatttata atatcatgtc tgatcacggt ttctgagaac atttaaacga aataagtcaa21600
aacgaatgcg ctctgtcgcg attatagttt tgttctgaca gtaactccta accaaagggc21660caaataagga cgagagaata aaatagattg ctctctcact tcggacccag gaatcccgaa21720tttatataat ttcaatgtac tcacgtaaca ctgacaagct atgcggcgtc aataactcat21780ccacgttggg agaatctcga aacaacgcaa cgagttattt tatcctgatt aataatctag21840cttgaaccgt ttgttgtaac tagaacccaa gctgcaaaga gctacaacca aggtttgatt21900tcgttccaag ctaacatgaa actctcaaac ttcgtcgatt tttttaatgt ttgtcaaaaa21960cctagtacag cggtcctagg taccgatttg agaagcaggc aacccgctta taaataaaag22020aaaaagagtc tttattattt tataaataga aaaaacttta attgggacaa tattctttat22080gtgttctctg tcttcttcct tcatgtatga cgtaatgatc atgctccttt catctccttc22140cttccaaaaa gttcattttt cctactaggt ctttttcaaa attaaaaata taattaagta22200agaaagaaag aaggaaagaa agaaaaacct gggtactaat cagtgtgata tgaggtgaat22260ggtggttttg ttttacttct cggaagtgtc gagtcctata aggagcacta tacctatcct22320agacgctttt ggtaccaagc cctgcgcggc aggcatacgt cagcaagcta cgatagcagt22380acacgctact cagaaaggcc tagtgaggta ggcgagcagg aagtagtgct cttgcgtcat22440gcttatgatg gcatcagcca cgcgagaacc tcattcgaat agtccttttg caattcattc22500acgcatgcat gcattgatgc ctgctacaga gtagctagtg agagagtatg atacttagtt22560agtgctactt atgcgttgtc acctatgcaa tagcattgga tagaaggaat cagattcacc22620gctgactctc gctgagagta agggccatac gcagtgctcc tgagttgttt cattaaacgg22680acttcaagct gagttctggc taggcacctg gtagctgggg ctagagggta cctacctacc22740tacctactga tagctaactt tcaaatgagg aaagattgga gattgaatag aaagaaagtg22800atacatactg tcagccgtat cgaaactccg aagtggcacg cggatggcgt cagcaaactg22860ccgtagcaag tgaataacgc acatctcaat tgggacgtcc atgaaaacaa aaaacaaaaa22920agcaaaaaaa agttgcaatc gatcatgaat cgtgctgatt catgggttgc ttgcttagtt22980gttatgctgg agggtgtcga gacttggatc tggtgagcag tgcgctctcc actcaagttg23040gaccctttgg tatcagggga gtgcgagtgg gcacactacc atagtatcct aaattacctc23100tacgttttga ttgcctttga tcacagcaga taattttcaa tttaaataaa aatcataaaa23160agaagaagaa gaagaagaaa gaaagtgaag gtggcgtttc tgatgtcatc attttcgcag23220tgcttcccag cgaagattta ctgtgaacta ctacgcatgt gagtatggca agcactgggt23280aagtaggtac ctaccactac catgttgtaa aacaaaacaa ggaatatgtt agctagaaca23340gagcgaatcc ggtgtgagtg ggagtcatca tcagatattg aaagttgtcc tctcaattaa23400tataaatatt tctaactaaa gcaattaaac atatatttat taatttaatt ataaattaaa23460
taaatatgct gggtgggtcc gagtcattct gactatcatc tatgatgttt aataataaaa23520tattgaaagc agtcaaggtt atttggaatt atgggatgat cgtgatctgt gtatcattct23580gcatcattgt ggatgctggc ctacgaaact acgacggcat tgcaattgcc acctggcggt23640gcgatcgcgt gcactcctgc aattgcgagt gtcttccgcc ggcttcaagt tgaggtgctg23700cgacagtgcg ggcccagagc tcctaacatt tcgtggatga ccgactgact cagacagagg23760tctctcaagc ttagaaagtg cgctgcaaaa aagggcgcta gctagataag atacgagtga23820gtgagtgagt gagtgagtga gtgagtgagg ttctagctag tgctcctccc aaatcttgga23880gtgccgatgc tcgagaatac atacatactt caagacacga agaacttgaa cccgaagacg23940aatgccgtct tcgacgtcat ctttgccgtc gtcatggccc actgcagcaa cgatccagtg24000cgtgcgagca gcagggccag cccacgatca cgcagctcgt cgggctggac ttggctcaat24060gaatgaatga atcaatcaat gaaagaatga ctcaatgaat caatgaatca gcaagttgcc24120accaaagccc atcgcaacga cgggtcctgc ctgcgtgcgc cattcttagg atccagagca24180agcaagatct tcttcaccta tcgctcagca agcgagaacg caacctccct ctgcatcatg24240atgcaggata agtaagataa atccatcttg gacctcgagc tcaaatcgac gcttgctgca24300tctatctatc tttgtatcta tctatgtatc tatctttgta tctatgtgtc tatctatctc24360tctgcgtgcc tcgtcgtgtt tttgaaaagg agtttcgatc gtggcccaat cggaagagaa24420ggctctctct ccctctctct ctctctctct ctctctgcat cgcacagacc aatgagcctt24480gcggcaacac agcttcaact tcattgcagg atccaatcca tccaaggcat cgcttgggct24540ctcagtgaat gaattcgacc aaagctcgtt ggcaggcaga caaggcctgg acaacataaa24600gcaagggggc acgaaggcaa gatggcaagg aggcagagca ggcaccagcg actgcgatgc24660tggcgagaga agatcaaggc aaagcagagg ctgcaagcaa gctctgcagt agccacctcc24720tcagcagatt cgtcaagatc gggcaaactt cgtctgtggc tgccacgcca gagcagagca24780tgcctgcttc atgatccatg ctcaagaaag aaagacagac aagacagaca agacagatag24840atggatgaca gcgaacttac atttgcagac ttcgaaggtg cctgacgggt attggtgcca24900ctaagacgag aaggagcact tgcttccaga tcgctcacgc cgctcacatc accatgctac24960gtcttcaata cgcctggtcc ggttcgcaag agccgcgcgc cggcgattgg gcgaaaggcg25020gaggagtcga ggtacgcgtt atcagcagaa tgtaggaaca ccgcgacgcg gccgacgacg25080ctggtgagga ggaagaaaga cctggcgcct gtacgtacgt acctacgttc tagcagtagc25140ttgaagtgga ctgtgggtcc cctccatctt cttcaagacc ttcaagttgc ttgctgacgg25200catcgctgtt tgtttgtggc tgttaggtag gtaggtagct agctagctat agctgtgtcc25260tagctgcaca gggagcactc agcctctttc ctagtttctt tggttctgtg cttgtttttc25320
tagcgagtcg tgcaaataac ctgcggcggc cacgagaagt ccgcgttgag gcgatcttgc25380gccagtgcgg cagttgccat cactcgtgca gacagagttg agttgcttct caatcgttac25440caatcgctcc aagcaggcct agacatagat tttccttctc tggaccatct actaaaatga25500tcaagttaga taggtagata gatagataga tagatagcta gggagatact aggcaccttc25560tatgccggca cgtctcgaac aaagcgaaga aagagctgtg ggcaagagca ctcattttga25620tcgtagatga tcgtagacgc gctgtagagg agagctctta gtggcggcta ctgtgatgga25680ctatgagagg ggacttcgca agacctgtct cggtcgcacg tagctgtggg aagcgagaac25740ccgcagagga ctgattctga ttagtgcgga taacttggtc gaggaagagc ggggacccgc25800agggaacccg catagcagcg acgttggcac ccgacgacgc tagggcaaag acgcagcatg25860cgtgcgaggt gcctataagc tgcgcaattc agagaattaa gacagcagcg ctgggaagga25920aggaggagat ttgaaggctc ggcgggagct gtcgagatgg aggcaggcag gcaagcaagc25980aagcaagcga aagaggcggc cagggctcgc gtcgaagccg ctgatggacg agagaatcgc26040acgaagaaga atacggagtg tttgttttca aagccaaaga aagccaaagc caaagccaat26100tcgttcgttc gtgagttaac ttattattta atttaattga catcttcatt tactactgtt26160gttatctatt atttatttat ttatttattt atttatttat ttatttattt atttatttat26220ttatttattt atttatttat ttatttattt atttatttat tgtttatatt tttttaaatt26280aaaaaaattc aaaattcaaa attcaaaatt cacgaataaa ttgcacttga aggagatgaa26340gcaaagcttt gtttcttcta aaaagagtat aaataataca aagtgatgac ggaaagaagc26400atcattctga tggtaagcac ttcggcaaga tgcacgcact agcacttgtc gccttgcttg26460cgatccgcgg aggtaatagt ggaggcgaaa gaaggagttc attcctgtta tttcgcgctg26520gggttacagc agtgccaaga tttcgaatat ttgaattttt gaatttttga atttttggat26580cttcgttccc cttcttcctg aactgttcaa acgactcgga ggttgtcgat cggatcactc26640aatctctcaa tctctcactc actcactcac tcactttttc tcagctgcct gatccttcgc26700aatgctcgcg aagcgcgagg gatatgcgtg ggcgagcacg caccatcttc tctccacgcg26760taaagaagag cagagccaga ggcaggtagg tatctccacc catctcaggc tgtgacttct26820ttgtttcttt ctttctttgc ttgttttctg ttctctctct gtgctctgtc cacacgagaa26880agagaaagag agagagaaag aaccacgggt ttatagagcg cactcgtcct tcctgcttca26940gcagaaagca ctgcgtagga gaactacggg ggaggaggaa gcacgcacgg aggaggcgtg27000gaaggaagga ggagacagag agagagagac actgagggac agagggggag aggcagaggg27060agaggcatct gatgtttgcg agaaaccaat aagttttgaa agtgatttga tttagctgat27120tgactgatct atggcctgaa agaaagcttt taaagcggag ggagatagat gacgagggca27180
gctgcgatgg cgtacggcgc atccgtctct ctctgtgtct ctctctcttt ctctctcgtc27240agggcgtgga gacctcggaa gctgcacgcg gcgcggtgag gaggcagggc agcagaggga27300gaggagagat cccagagtcg aagagcattg attgattgca gatgatcttg ggcaacgcgc27360gtcagcttga gcgaggaatg ctttggactt caggttcttc gcttctgtgt ttcattcttt27420ctcgaagaaa gaaagaatga aagaaagaga gaaagaaaga aagaaagaaa gaaagaaaga27480aagaaagaaa gaatgaatga atgaaagaaa gagagaaaga aagaacgaat gaaagaaaga27540gagaaagaat caaagagaaa gcgcattcgc agttcttctt cgtgaaagaa aaggaaaaga27600gaggcgatgg taggctctga tctcatcatt tctggtttct ctgttgtacc tgtactctgt27660gcttgtggcc ttgcgaaggc tgaagacgcc atgcagacaa ccacgcctcc gcagagactt27720tgcgggaaag cagagggctt ctcgccactc tcgaagaaac gagctcgcca gttttcgggg27780ttgttctcag aattgcgagt gttggcttta tatgggatga tggtatggca cttcgtcatc27840gttactctcg ctcgcttgct tacgaagatt ttcaaaaggg cgaaagaagt gctcagcttt27900taaaataaag tcacaccaaa gactaggccg catagcagaa agctaaagta aacccaatct27960gtctgaagag agtgtcgtgg ttagatactt acgcaagagt ttaaaagctg taaatagtac28020aggaacaaaa acaaataaat atatatatat tcttttttat tagtaaaaca tgaaaccaaa28080aaactccttt aaaataaaat aaaataaaat aaaataaaat aaaataaaat aaatttacta28140ctatatatac atatatatat acaataaata aaaacaactt tttcagacca gaaaaagact28200gagaaaaaag gaaactaatg actctcgagc accgagagcg atataagagt ggattatatt28260tgctaggccc accacgagtg agtcccctag gaggaagcgc cctctgagac aggagcagag28320gcgtcgctgg tgctccaaaa agcgacggcg aatggaaagc aaaacccttt cgagggaggc28380ttgtggccgt gactattcaa atctccagca tctcagctcc agcacagcag aagctacctc28440gcttctcagc tctagctatc acatcgatcg cagcatctag ctcgtagaca gctagcgccg28500caccttcccc caaatcaact tgggcaactt aactcttttt tcaccagaac tcctcttttc28560ctttaatctt cgaaaagaag acgaataaaa gagataatcc tctgccgcag cacattctaa28620aagaaaagcg gcatactggc gtaggcaaga ctttcaagct cttcctcgcc tccaccccgt28680atttccctgt tcatctttgt gaaacgagga aacaagaaat tttataggac aagatggctc28740aacgtgagaa ccgtctcgag gccaacatgg atacccgcat cgctgtgatc ggcatgtccg28800ccatcctccc ctgcggtacc accgttcgtg agtcttggga ggctatccgc gatggtatcg28860actgcctcag tgatctcccc gaggaccgcg tcgatgtgac cgcctacttc gacccggtca28920agaccaccaa ggataagatc tactgcaaac gtggtggatt catccctgag tacgacttcg28980acgcccgtga gttcggcctc aacatgtttc agatggagga ctccgacgca aaccaaaccg29040
tcaccctcct caaggtcaag gaggccctcg aggacgctgg catcgaagcc ctcagcaagg29100aaaagaagaa cattggatgt gttctcggta tcggtggtgg ccagaagtcc agccacgagt29160tctactcccg cttaaactat gttgtcgttg agaaggtcct tcgcaagatg ggcatgcctg29220aggaggatgt tcaagctgct gttgagaagt acaaggccaa cttccctgag tggcgccttg29280actccttccc cggtttcctc ggcaacgtta ctgccggtcg ctgtaccaac accttcaacc29340tcgatggtat gaactgtgtc gtcgatgctg cctgtgctag ttctctcatc gccgttaagg29400ttgccattga tgagcttctc cacggagact gtgacatgat gatcactggt gctacctgca29460cggataactc catcggtatg tacatggcct tctccaagac cccggtgttc tctaccgacc29520ctagcgtccg cgcatacgat gagaagacca agggtatgct tattggcgaa ggctctgcca29580tgcttgtgct taaacgttac gccgacgctg ttcgtgatgg tgacgagatt cacgctgtca29640ttcgcggctg cgcctcttcc tctgacggta aggcctccgg tatttacacc ccgaccatct29700ctggtcaaga ggaggctctt cgccgtgcct acatgcgcgc taacgtcgat cccgccaccg29760tcactcttgt tgagggccac ggtaccggta cccccgttgg tgaccgtatt gagctcaccg29820ctctccgtaa cctcttcgac agtgcctacg gcaacgagaa ggagaaggtc gctgttggca29880gcattaagtc caacatcggt cacctcaagg ctgtcgccgg tcttgccggt atgatcaagg29940tcatcatggc cctcaagcat aagactcttc cggccaccat caacgttgat gagcccccta30000agctttacga caacactccc atcaccgact catcgctgta cattaacacg atgaaccgtc30060cgtggttccc tgctccgggt gtgccccgtc gcgctggtat ctccagtttc ggttttggtg30120gtgccaacta ccacgccgtt cttgaggaag ccgagcccga gcaccagaag gcttaccgtc30180tcaacaaacg cccccagccg gtgcttctga tggcatcttc aacccaggct cttgcttccc30240tctgtgaagc ccagcttaag gaattcgaga aggctatcga ggagaacaag accgtcaaga30300acactgctta catcaagtgc gtcgacttct gtgagaagtt caagttccct ggatctatcc30360cgagctctaa cgctcgcctc ggttttcttg tcaaggaggc cgatgatgcc accgagaccc30420tccgtgccat cgttgcccag ttccaaaagt cagctggcaa ggattcttgg caccttcccc30480gccagggtgt gagctttcgt gctcagggca tcaacaccac tggtggtgtc gctgccctct30540tctctggcca gggtgctcag tacacccaca tgttcagcga ggtcgccatg aactggcctc30600agttccgtga gagcatctct gacatggatc gtgcccaggc taaggttgct ggcgctgaca30660aggactacga gcgtgtctcc caagtcctct acccgcgtaa gccttataac tctgagcccg30720agcaggacca caagaagatc tccctgacct catactctca gccctctacc ctcgcctgcg30780ctcttggtgc ctacgagatc ttcaagcagg ctggtttcaa gcccgacttc gctgccggtc30840actctctcgg tgagtttgcg gccctctacg ctgctgactg cgtcaaccgt gacgacctct30900
ttgagctcgt gtgccgtcgt gcccgcatca tgggtggcaa ggatgcacct gctaccccca30960agggatgcat ggctgctgtc attggaccca atgccgagaa gatccagatt cgcactgctg31020atgtctggct cggcaactgc aactcccctt cgcagactgt catcaccggc tctgttgagg31080gtatcaagaa ggagtccgag cttctccaga gtgagggctt ccgtgttgtc cccctcgcct31140gcgagagtgc cttccactca ccgcagatgc aaaacgcctc ctctgccttc aaggatgttc31200tctccaaggt tgccttccgt cagcctagcg cccagaccaa gctcttcagc aacgtgtctg31260gcgagaccta ctccaacaat gcccaggacc tccttaagga gcacatgacc agcagtgtta31320agttcatctc tcaggttcgc aacatgcact ctgctggtgc tcgcatcttt gtcgagtttg31380gccccaagca ggtgctctct aagcttgttt ccgagaccct caaggacgat ccttccatta31440tcactatctc tgtcaaccct tcctctggca aggatgccga tattcagctt cgcgaggctg31500ctgtgcagct cgttgttgct ggagtcaacc ttcagggctt cgacaagtgg gacgcacctg31560acgccacccg ccttcagccg attaagaaga agaagactac tcttcgtctc tcggctgcca31620cttacgtgtc tgacaagacc aagaaggctc gcgaggctgc catgaacgac ggccgcatgc31680tcagctgtgt cagcaaggtc atcgcccccc ctgacgccaa gcccattgtg gacaccaagg31740ctcaggagga ggttgctcgt ctccagaagc agcttcagga tgcccaggcc cagatccaga31800aggccaaggc cgatgctgct gaggctgaca agaagcttgc cgctgctaag gatgaggcca31860agcgtgccgc cgcttctgca cctgtgcaga agcaggttga caccaccatt gttgataagc31920accgtgctat cctcaagtct atgcttgctg agcttgactg ctactccact cctggtgctg31980tgtccagctc tttccaggca cctgttgctg ctacccctgc tccggtcgct gcgcctgttg32040cagctgctcc tgctccggct gtcaacaatg ctctccttgc caaggctgag tctgttgtca32100tggaggttct tgccgccaag actggttacg agactgacat gatcgagccc gacatggagc32160tcgagactga gctcggcatt gactctatca agcgtgtcga gattctctct gaggtccagg32220cccagctcaa cgtcgaggcc aaggatgttg atgctcttag ccgcacccgc accgtcggtg32280aggttgtcaa cgccatgaag gctgagatcg ctggcagctc tggtgctgcc gctgctgccc32340cggccccggt tgctgctgct cccgctgccc ctgcccctgc tgtcaacagc gctcttcttg32400ccaaggctga gactgttgtc atggaggttc ttgccgccaa gactggttac gagactgaca32460tgattgagcc cgacatggag ctcgagactg agctcggcat tgactccatc aagcgtgtcg32520agattctctc tgaggttcag gcccagctca acgttgaggc caaggatgtt gatgctctta32580gccgcacccg caccgttggt gaggttgtca acgccatgaa ggctgagatc gctggcagct32640ctggtgctgc cgctgctgcc ccggcccctg ttgctgctgc tccggcgccc gtcgctgccg32700ctgcccctgc tgtcagcagc gctctccttg agaaggctga gtctgttgtc atggaggttc32760
ttgccgccaa gactggttac gagactgaca tgattgaggc cgacatggag ctcgagactg32820agctcggcat tgactccatc aagcgtgtcg agattctctc tgaggtccag gcccagctca32880acgtcgaggc caaggatgtc gatgctctta gccgcacccg caccgttggt gaggttgtca32940acgccatgaa ggctgagatc gctggcagct ctggtgctgc tgccccggcc ccggtcgctg33000cggcccctgc tccggtcgct gccgctgccc ctgctgtcaa cagcgctctt cttgagaagg33060ctgagactgt tgtcatggag gttcttgccg ccaagactgg ttacgagact gacatgatcg33120agcccgacat ggagctcgag actgagctcg gcattgactc tatcaagcgt gtcgagattc33180tctctgaggt ccaggcccag ctcaacgttg aggccaagga tgttgatgct cttagccgca33240cccgcaccgt tggtgaggtt gtcaacgcca tgaaggctga gatcgctggc agctctggtg33300ctgccgctgc tgccccggcc ccggttgctg ctgctcccgc tcccgtcgct gcccctgctg33360tcagcagcgc tctccttgag aaggctgagt ctgtcgtcat ggaggttctt gccgccaaga33420ctggttacga gactgacatg attgaggccg acatggagct cgagactgag ctcggcattg33480actccatcaa gcgtgtcgag attctctctg aggtccaggc ccagctcaac gttgaggcca33540aggatgtcga tgctcttagc cgcacccgca ccgttggtga ggttgtcaac gccatgaagg33600ctgagatcgc tggcagctct ggtgctgccg ctgctgcccc ggcccctgtt gctgcctctc33660ccgctcccgt cgctgccgct gcccctgctg tcagcagcgc tctccttgag aaggccgaat33720ctgttgtcat ggaggttctc gccgccaaga ctggttacga gactgacatg attgaggctg33780acatggagct cgagactgag ctcggcattg actctatcaa gcgtgtcgag attctctctg33840aggtccaggc tatgcttaac gttgaggcca aggatgttga tgctcttagc cgcacccgca33900ccgttggtga ggttgtcaac gccatgaagg ctgagatcgc tggcagctct ggtgccgccg33960ctgctgcccc ggccccggtt gctgctgctc cggcgcccgt cactgccgct gcccctgctg34020tcagcagcgc tctccttgag aaggccgaat ctgttgtcat ggaggttctc gccgccaaga34080ctggttacga gactgacatg attgaggccg acatggagct cgagactgag cttggcattg34140actccatcaa gcgtgtcgag attctctctg aggtccaggc tatgcttaac gtcgaggcca34200aggatgttga tgctcttagc cgcacccgca ccgttggtga ggttgtcaac gccatgaagg34260ctgagattgc tagcagctct ggtgctgctg cccctgctcc ggctgctgcc gttgcaccgg34320cccctgctgc tgcccctgct gtcagcagcg ctctccttga gaaggccgaa tctgttgtca34380tggaggttct cgccgccaag actggttacg agactgacat gattgaggcc gacatggagc34440tcgagactga gctcggcatt gactctatca agcgtgtcga gattctctct gaggtccagg34500ctatgcttaa cgttgaggcc aaggatgttg atgctcttag ccgcacccgc accgttggtg34560aggttgtcaa cgccatgaag gctgagattg ctagcagctc tggtgctgct gcccctgctc34620
ctgctgctgc cgctgcaccg gcccctgctg ctgcccctgc tgtcagcagc gctcttcttg34680agaaggctga gtctgttgtc atggaggttc tcgccgccaa gactggttac gagactgaca34740tgattgaggc cgacatggag ctcgagactg agcttggcat tgactccatc aagcgtgtcg34800agattctctc tgaggtccag gctatgctta acgttgaggc caaggatgtt gatgctctta34860gccgcacccg caccgttggt gaggttgtca acgccatgaa ggctgagatt gctagcagct34920ctggtgctgc tgcccctgct cctgctgctg ccgctgcacc ggcccctgct gctgcccctg34980ctgtcagcag cgctcttctt gagaaggctg agtctgttgt catggaggtt ctcgccgcca35040agactggtta cgagactgac atgattgagg ccgacatgga gctcgagact gagcttggca35100ttgactccat caagcgtgtc gagattctct ctgaggtcca ggctatgctt aacgttgagg35160ccaaggatgt tgatgctctt agccgcaccc gcaccgttgg tgaggttgtc aacgccatga35220aggctgagat cgctggcagc tctggtgctg ctactgcctc tgcccctgct gctgcagctg35280ccgcccctgc tatcaagatc tccactgttc acggtgctga ctgcgatgac ctctctgtga35340tgtctgctga gcttgtcgac attcgtcgcg ctgatgagct ccttcttgag cgccctgaga35400accgcccggt ccttattgtc gatgatggta ccgagctcac ctctgctctg gttcgtgttc35460ttggtgctgg tgctgtagtt cttacctttg acggtcttca gttggctcag cgtgctggtg35520ctgctgttcg ccatgtccag gtgaaggacc tctccgctga gagtgccgag aaggctatca35580aggaggctga gcaacgcttc ggccagcttg gaggcttcat ctctcagcag gctgagcgct35640ttgcccctgc tgacattctt ggtttcaccc tcatgtgcgc taagtttgcc aaggcttccc35700tctgcacccc tgtgcagggt ggccgtgcct tcttcattgg tgtggcccgt cttgacggtc35760gccttggttt cacctcccag ggatctactg actccctcac acgtgcccag cgtggtgcta35820tcttcggcct ctgcaagacc attggccttg agtggtctgc taacgaagtg ttcgcccgcg35880gtattgatat tgctcgtgag gtccaccctg aagatgctgc cgtcgccatc actcgcgaaa35940tgtcctgcgc tgacaaccgt atccgcgagg tcggcattgg cctcaaccag aagcgctgca36000ccatccgtgc tgtggacctc aagccgggtg cccccaagat ccagatcagc caggatgacg36060ttctccttgt gtctggtggt gctcgtggta ttactcctct ctgcatccgt gagatcaccc36120gtcaggtccg cggtggtaag tacattctcc tcggtcgctc caaggtccct gctggtgagc36180ctgcttggtg caacggtgtt tctgatgacg atcttggcaa ggctgctatg caggagctga36240agcgtgcttt ctccgccggt gagggcccca agcccacccc gatgacccac aagaagctcg36300ttggcactat tgctggtgcc cgtgaggttc gttcctcaat tgctaacatt gaggctctcg36360gtggcaaggc aatctactcc tcttgtgatg tgaactctgc tgctgatgtc gccaaggctg36420ttcgcgaggc tgaggctcag cttggcgccc gtgtaactgg tgtcgtccac gcttctggtg36480
tccttcgtga ccgcctcatt gagcagaagc gccccgatga gtttgatgct gtcttcggca36540ccaaggtgac tggtctcgag aacctctttg gtgccattga catggccaac cttaagcacc36600tcgtcctctt cagctctctt gctggtttcc acggcaacat tggtcagtct gactacgcca36660tggctaacga ggccctcaac aagatgggtc ttgagctctc tgaccgtgtg tccgtgaagt36720ctatttgctt cggcccctgg gatggtggca tggttacccc ccagctcaag aagcagttcc36780agtctatggg tgttcagatc atcccccgtg agggtggtgc cgatactgtg gctcgcattg36840tcctcggctc ctcccctgct gagatccttg ttggcaactg gaccactccc accaagaagg36900ttggcagtga gcccgttgtg atccaccgca agatcagcgc tgcatccaac ccttttctta36960aggaccacgt catccagggt cgctgtgtgc tccccatgac cattgctgtg ggctgccttg37020ctgagacctg cctgggtcag ttccctggat actccctctg ggctattgag gatgctcaac37080tcttcaaggg tgtcaccgtt gacggtgatg tcaactgtga gatcactctc aagccttccc37140agggtactgc cggccgcgtt atgattcagg ccaccctgaa gaccttcgct agcggcaagc37200ttgttccggc ttaccgtgcc gtgatcgttc tctccactca gggaaagccc cctgctgcta37260ctacttccca gaccccctct ctccaggctg atcctgctgc ccgtggcaac ccttacgacg37320gcaagaccct cttccacggc cctgccttcc agggtcttaa ggagatcatc tcttgcaaca37380agtctcagct tgtcgccgag tgcaccttca ttccgtcttc cgagagcgct ggtgagttcg37440cttctgacta cgagtcccac aaccctttcg tcaacgacat tgctttccag gccatgctcg37500tctggattcg ccgcaccctc ggccaggctg ccctccccaa ctctatccag cgcattgtgc37560agcaccgtgc tcttccccag gacaagccct tctacttgac cctcaagagc aacagcgcga37620gtggccactc tcagcacaag acctccgttc agtttcacaa cgagcagggt gacctcttcg37680tggacatcca ggcttccgtc acctcttctg actcccttgc cttctaaagt tgtgaggctg37740tcttgtcttg tcagtcgcga aagtgtaagc aagaactttg tcatacaaag aagcaaccaa37800cttccgaacc aacacacctt gtaggattac aaccacaact ttctataaat agtgcgcaag37860aataaccagt aagctatcct tcgtgtacct gttacaacaa cgacattttt acttgatctt37920cctacttgtg atgggtagtc ccggcttgta ctgacagtga tgccacagca gagtagatca37980ctgtgaataa gtaaataagc ctacttatta tattcccaaa gtactcgctg ggatattatt38040agtatcacga aaagtgatat gttttataac tcgcttgtct tgccaagatc taaccttttt38100tttttaaatg gccaaaaagt cgccagaaca catcttacaa taaacaaaaa tttagattat38160atcgtatgta taatgtataa tatattatat tattatatac atacgatata atctaaagcc38220attccagact tattcggtga tgaaaaatgc tttcccagct ttatacaaac tattcaaaaa38280gttgcatgac ccattttcag atatatttaa tagtataaga ttatgtccat ttgttttcaa38340
agttattcaa gagtttacat cttgaagttt catcccttta ctactacact gtttttcgtt38400tgggtttttt ctctaacggc gaaagaaaca agtcaccaag cttaactagt aggcatcttt38460gtggtgacga aattaaagtt gaatatataa attatagtta gtcattatgg aatctcagtt38520tgaacgaagc taagctattt ataaaaatca ctgcatggag ataatacttg aattttgatg38580atagtgttta tgaagaagtt taatcttgct ttttattaat gttattctct aatatagaaa38640tatttcaata aaaaaatcat atgaagggat aataaataca gagaatgatc gttatcattt38700gatatgtcga acgctaatct atcatcttat ctaggaaaca aaggtggaaa taaaggaaag38760ccctacacga gttaattcct caaacgaact actttggatt atcaaatcca actgctgaca38820ctggatacat gcatgtattt agtgggtgtt actgtacttc cttatttcct ttaattcaat38880tgtcttgatt tttacttcgg agattctact tgaaaatcat ctcccttcac ttccggttat38940acagaaagac ccttcaattc gaatgctggc caggtacaat aactatcagc gattcccctc39000cactagacat gaccgactgt aagcacctca acccgatttc aagcaacaca tgatgactag39060ctgtttccgc aaaacaacaa ataagagagg tagtggaaaa cacccagttc gctcgagctc39120ccctagtaga ttcgacattc actttctatt tgattgctaa ttgtgggtcc ggctatttaa39180ggaaagaact gatgaaagtc cacctcacgc aatcaaatcg cggtctagtt ggaagctaca39240atggccgacg tatgcgcgcc tctatctttt aggattgtag aacagggcgg caatctgcta39300acataaattt aataccttgc tcaagctgct ttccatactt ttcaatccat ttgtgataat39360cttgcaatgg accaatctcc aaatctgtag aagcaataac aaggacatcg cagggtcccg39420gttcgtttgc atgctcgtct tctggtgcca caacaatgct gcctgttatt atctcatgag39480agtctttata ctgcggatcc gtggctatag cgtgaataaa cgttgtgcgc aagcctatat39540cctcgcgatg gagatactgg cctgctacag tttgcgttcg tctgcctacg acaacgcatg39600gaacattctt tggtgtgcga gtgggccgta gcgttcgacc ctgggcaagg aagccatgca39660gacgtgattc cgagaggcca tctcgcgtgt aagacttatc ccaattttct ggatcctcta39720atttccagct agccataagc tcagtcaaca gaccaagcgt tcttgatctt ctttctaggt39780caaatacatc ttgatggaag cctgcagtaa tttctttgta agatttggaa acgacgttct39840tgaaatgaac acaaactgat attgcattca tgggtgcagg tgacagttgc aaatgaactg39900aaatgtctgg agaaaagttg aggaagcgtg gtttataaag cggccaagct gtcctcgcat39960gcgcaagacc tagtatatta ctaatgactc tgcgaccaca atcctccatg cgttcaaact40020tgctatgcgg aattccacga atgatgttac cttgaggatt tggggctctc caaaggagct40080gttgcagttg ctgtacgtat tcgcggtgtt cgcggacctg atctcgaagt cgggcatttt40140cctctgagca aggccctaca ggtggaaatc tgcacagcat attgtatgtt ctctctagat40200
gtactgcccg ttgccgcaaa tgagctacat ccatctccag tttatttact gtgtcttcga40260gcgcaaacct ttcacagcgc ctgcgtttgc gttcatttct cgaaatctct cgccgccgct40320gcctgattcg ttctgcgcga tcaactcggt catcccctgt gtagcttggt gatgacgtgg40380atccatcttg tgaggcgtca aagccagaca ctgcctttac ttctaaatct cgccattcat40440ctgcaaaatc cctatatcct tccccataag tgtaatcgtc actacctatc aattctgtag40500atgccgcatc tacagtccta attatttgag gatttccttg cattgtaaag caaagatact40560cggaggctgg atttgtcaca aaaggtacga cagccctatt gatcaaattg aaggaagggg40620attgctttta ccagtacacg atgttactgt tgttgctatt gttgttgttc ccaatttctt40680cagacgtagc gtgccgcttc tgacattgcc aatagctgct tgtctttggt cttctttggg40740gaatgggcca gtaaaagaaa ccctaggcag ttcgattatc tactaatcta aagaacctgt40800ggcccctttc ccctcaaccc acgcccttcg ttgctctctt cggtcggtga agcgtttaga40860tgcgaggttt cctccactac gtgcttcttc aatgctaaac gcccaagtca actgaggaca40920ctgaaagcct gcacggagca gaagacccac acagacggtc gcaggatcaa ccctacctac40980gcctcgttgc cacgatggtc gctgccgatc ctcgatctct cgtcgattat tggtctcctg41040ttgcgctctt ggccacgcgg ccactcagac tctgcttctg tggcttctca ctgacgtgat41100gtagaaagaa atagaaagca cagagccact ttaaaaggaa aaggggaaag cagagaggaa41160agggaaaaag aagacctcag attgactcag agattgactc aatcgacgag agaatggaag41220ggaatggacg ccacggagac agaggcgcag cgagacggag cgagacggag gtaggcagag41280gcagaggcag aggtggaggc gaggggccgg gttgtcggca ctggcagagg gagagagaga41340gaaggagagg cggaccagtt tgaaaactct cgccagcttc gatagccgta ctcggtatgt41400atgtatgtat gtatgtatgt atgtatgtat gtatgtatgt atgtatgtat gcactcttct41460acttgtttcc aatgtgctgt tctatgcttt acagtgtttt ccgcgctcgc tacttgctac41520tttcatcagt ctgtctgcct gaggcggcgg tgatgcagaa tgcacctagg tacctatttg41580tcgccaactt tggatttgcg tggcggcagg attcctcttc tcctgcactt tgtttcgact41640cgccttagaa gggttgttgg aagacgccta aacgggtatt gcccggagat aggtgctgct41700ggtagctcat gtagatagtt cgttaggtag ttacactgga acagacagac gctctgtgtt41760tcgtggtgtt gcaggtcatg gactcagagg ggctgcgtga gttttgtgtt cgagagcaga41820gtgttgatat tcttttatgg gcaggacaca ttgcaacttg aagtaccgtg gttgtaacta41880caggacctcc atctgaagcg cggcatcacg tgaaaaagaa atgaaatgaa gagggaaagg41940acacccaaag gttcataatg tttggtttgc aaaggttatt cgaaagacac cttcttcgtg42000gtagatggtg attctgtcga aactgccgag attttgctga gagtgaacca aagcagggtt42060
ttgagataga agaatcaatc gtgcatggac aacctattcg taggattgtt atagctgttg42120tttgttatag gtcaaacttt atagcttcaa cccctcgctg gcaagtacga agggaaagtg42180taaatataca ttcttggttt aacgcataat ctcaagagct tccatgctga aaagttagat42240agtatattct tctgatttta catatttaaa ccaagtaaac aagttccacc aagggactta42300cttggcaact taaccatggt catcataatt tgcgcatcac ttagatcact acgttaacat42360tcgttcttga tctcttcgag cgcctaaata agcaaactgg cagcgaatta ggtcaccata42420tttttccaag gaggaaaaac tgtattgtgc tacccgttgt ggtgtaaaac ttgtaattct42480tcgcatctct aattcctatc gttaaacttg tcatcttact ttctggaagg aagcttggta42540tctcagaaaa tcgaactttg caataatacg aaagcacaag taagggttta tggcagcata42600acattgtctt aagaaattga atttaaaagc agaccgaatg caccgcagaa tacattgtaa42660attggtgcca aatattatga gtagcaatca tcaatctaac gcacgatttt ttgaagaagt42720acaatacaaa tttccccgtc gtagagaatc aaatggtttt acacatctat ttcaacactt42780ttcttggatt gtgatttcat atcaagacaa ggcttaaatg atcttggctt tctctgcaag42840agcggttctc caaatttcct ctcctgtttc tggattcatg tcaaaacata gtttaacaat42900agaaagaagg tgaccaggta ggtacgcaat aatagtttcc gcaatgaatt ggggcttgta42960gcgtgcagag aaatgcatga gatatagggc ctggcagttg tccaatgcac ctcgttttgc43020aaacctcgcg agctcttcaa tgtggatgtg gccacgctct ctagcaaagg agatatcgcc43080atcaaaaaat gtaagctcca tgcaaagtgt ggcagcctga agaaatagag cctcagggat43140atccagggcg tctataattg tgtcacctgt atatgcaaat tcaatcgttt ctctgtacac43200gaaatcctca ggcataggag gtgacttttg aaagcgcttt cgtttctctt ctggactgag43260gttggcaagc tctgacctaa gctcttttcg tttcgtcttc accgcatagc ctacagaggg43320aactctatgc atcgtcttgc acaccacaac acttgcatct cctcctagat cc43372<210>2<211>39976<212>DNA<213>Ulkenia sp.
<220>
<221>misc_feature<222>(32086)..(32086)<223>n steht für irgendeine Base<220>
<221>misc_feature<222>(32086)..(32086)<223>n steht für irgendeine Base<220>
<221>misc_feature
<222>(32084)..(32084)<223>n steht für irgendeine Base<400>2tcaagaattc gcggccgcaa ttaaccctca ctaaagggat ctgatgaact tggagcaaga60ataagaaatc catccattca agtcagcaca cccgatggca tcatcaatct tcgtcaactc120tttgtgcagg cagattggtg cttcgggcaa tcaatcggtt gacggattga ttgatcaatc180gctttgcttg cttgcttgct tgcttgcttg caattgatcg gcaaaagagg ccatccatcg240tagagcgtgc aatcttcaat gctctagcta gaggcgccat caggtagtta gttagctagc300tcgttagtta gttgctcttc ctgaaactaa caatgtatga catcagcatc atcgttcttt360cttctttatc catccaggat ccttcttttc aattcgtttg ttttgttttg tcttgttttg420tctttttctt tcaatgcaag catctcttaa ttcaacaaac caaacgaacc aagagatgaa480actcaaaaaa cgttttaaaa taaacaaaca attaaaatca aatagaaaat gaaattgaaa540gcacttttgt tttcgcctct ctagagagct agctatagct acctactatt cgttctcgct600cttcgtcgtc gggactgctg catcctgtca ttatcgggcc ctaagagtgc cctagtctta660gaaattgatg gcgataagat ggcggtcttt cttatccttc ttctcgttgc tgctgctgtg720ctctttgcct ctcggatcct tttgtttaca gctggccagt cagtcagaca gtcagttaat780cgattaacag gcaagcaagc aagcaagcaa gcacgcaagt cagccagctg gatagacagt840tagatagatc gtggcgtcgt cgttggcttc gtcgctgttt tggtgcttga ggattcgaag900tgcacgaggt tccttctacc tacagctctt cctttcactc ttcacctatt attatgcgct960gcaagttctt ttcgaaaggc tttttcttct ttcattctct ttcttttggc ctttgcgtta1020cagagcggag acgcctagtt ttatagatct aaataaacaa gagggaggac aacagaggcg1080gaaaacaagc aagttcaaga cggcaagaaa gcagcgcctt tgtttctttg tttcttttgt1140ttcttttcaa aagagccctt cctcggaaag ctttctttct ctcttgagcc aacttgaatt1200cgaatctgat cttcaaagcg agttagttcc tcaggcgcca ggcacctctc tccctccctc1260cctccctcta tcgcaggcag gccagcgtga cacctgtgac agcaggcagc tcaggcgtgc1320atgcaacgaa ggcgttgact catgcattgg cgctcactca ctcactcact cactcactca1380ctcgcgtacg tacgcacgca cactcacgca ctcacgcact caatcactca atcactcact1440cactcactca ctcactcacg ccagcattct cgaggagagg ccatgcgtag gtgaggtacg1500aaggaaagga gtccatagtt tggaggcgat gatggcgaat tgcagagcat aacagtgcag1560agggagaaac ttacatccat tcatacgtag ggaggcgcat acttacgtaa ctaagtgcaa1620tcggtggatc aagaaagaag gaatgaaaga atgaatgaag gaatgaatga aagaaagaaa1680gaaagaataa atgaataaat gaatgaatga atgaatgaat gaatgaatag ataaatgaat1740
gaaagaaaga gccccgctta tttggtatcg atctcattgc aaatgttcct gaaagttgct1800tatttgcctc acaactatga gtaggtagtg atgataataa tagtaattgc tattgctatt1860acttgaattt gaatttgaat ttgaattcag gtagacaata aaataagatt agcaaaacat1920tttgagagga agcagaggat atgcagtgca aaaggaggtc ccgagtttcg atcttctttg1980cacctgctac gtatctagtg cacgtagagc aagaaagaat gaaagaaaga acgaaagaaa2040gaaagagaga gagagagaga gagagagaga gaaagcgaag atgatagcgg agagaactct2100tcttcgcagt cactctgttt ctcagtcagt cccgcaacca ataacaactc gaactcgcag2160cagtgttctt cggagtgcca gcgctcgctc gcactgcgtc ggcacagcag cagcagcagc2220aggccccgcg ctcgctgcac tcagcccggg caggagcaac agctgctgag cagctgaggc2280cagctggctg gcggctcgcc tcgcctcgcc tcgcgtcgcg tcgcgagaga aagcgatcga2340ccaactgtca atcgattatt cgagtccttc gagcgcttta tagggcactg attgatcact2400cattgattca ttgactcatt tattctttgc gtggtcagcc aaacggcgtt agcattgggc2460aaagcgggtc tttgctttgc tctaaaatag atttgctcgc gagagtacgt acttgcagga2520gtaggtaggc tctgcctagt acctgggcat ttgaatattt gaacttcgaa cttcgttgag2580tatctgaata tttgaatatc tgaatatttg aatttcgaaa gtttgaatat ttgaatattt2640gaattttgga atattggaat agctgggttt ggagataaga cttactaagc taagcgccga2700cgtaagagcg gcgagtaaat ccacacacaa gagagaggca gagagagagg gagggagaca2760actcgcgcag gcaagctgag cccactggac gcacggggcg cgtcccccct gacgggcgct2820ctggtggtgg cgtgtttggg agggttttgc atgcttgtga taggggctct ggcgcgggct2880ctgtacggtg cttggagatg cacgggcagg gcgagagagg ggacgggttc ccgggaggcg2940ctgcttggag gtgctgagag ggagggagaa ggcgtgcttt gcgatgcgcg gggcgaccta3000ggcgctgctg cgcggtgcag cagcagggac ctcggacgtg agtcgaagcc gtctgcagag3060gagatggtag aagggccgcg gattggtagc agagaagagg aaatagaaga agaagaagaa3120atagaagaag aagaaataga agaagaagaa atagaagaag aagaggagga cgggcaggcg3180ggaaagatgg agaaaggact cgcggcggga aaacaagaga atgtgaactt gggcttgaac3240tttggtttga atttgaatgt ggagaacgag gggttgaatt tgagtttgaa tttgaaagaa3300aacttacgga aagaaagttt agttgaaagt gagaaagaaa aaaatgagaa agaaaaagag3360aaagaaaaag agaaagaaaa agagaaagaa aaagagaaag aaaaagagaa agaaaaagag3420aaagaaaaag agaaagaaaa agagaaagaa aaagagaaag aaaaagagaa agaaaaagag3480aaagaaaaag aagaagaaaa agaagaagaa aaagagaaag aaaaagagaa agaaaaagag3540aaagaaaaag aagaaggaga tttaaaaagt tgtttagttg aaaaaggaga aggaggaaga3600
agcagcgaca gcggcagaag aagaagtagt tgttgtaaga ggggaacgga ggcagtagca3660gtggagcagg cggaggcgac agcaaacctc gaactcgacc ccgtcgagcc gcagcaagaa3720caagagcccg accaggtgga cgaggacgag gtccgcttgt tgtcaggaac aacagaagtt3780gcaggactag ccgagagtgc taccactgca attcttagat ccacagacgc aagagcagaa3840aacttacaac tgctcgccac aacacaagaa ccaccttcag atacaaccag gttcgagaac3900tccacaagtc tagaagcagc aacagctcta gcagataatc aaacaggtcc agaaaaagct3960acgactagaa gagaaattat cgagtcgcaa cttgcaacca tggccactcg cgtgaagacc4020aacaagaaac catgctggga gatgaccaag gaggagctca ccagcggcaa gaacgtcgtt4080ttcgactatg acgagctcct tgagttcgcc gagggtgaca tcagcaaggt cttcggcccc4140gaattcagcc agatcgacca gtacaagcgt cgcgttcgtc tccccgcccg cgagtacctc4200ctcgtcaccc gcgtcaccct catggacgcc gaggtcaaca actaccgcgt cggtgcccgc4260atggtcactg agtacgacct ccccgtcaac ggtgagctct ctgagggtgg tgactgcccc4320tgggccgtgc tcgtcgagag tggtcagtgt gatctcatgc tcatctccta catgggtatt4380gacttccaga acaagagcga ccgcgtctac cgtctgctca acaccaccct caccttctac4440ggtgttgccc aggagggcga gaccctggag tacgacatcc gcgtgaccgg cttcgccaag4500cgtctcgacg gtgacatctc catgttcttc ttcgagtacg actgctacgt caacggccgt4560ctcctcatcg agatgcgcga cggctgtgcc ggtttcttca ccaacgagga gctcgccgcc4620ggcaagggtg tcgtctttac ccgcgctgat ctcctcgccc gcgagaagac caagaagcag4680gacatcaccc cgtacgccat tgccccgcgt cttaacaaga ccgttctcaa cgagactgag4740atgcagtccc tcgtggacaa gaactggacc aaggttttcg gccccgagaa cggcatggac4800cagatcaact acaaactctg cgcccgtaag atgctcatga ttgaccgcgt caccaagatt4860gactacaccg gtggccccta cggccttggt cttctcgttg gtgagaagat cctcgagcgc4920gaccactggt actttccgtg ccacttcgtc ggagaccagg tcatggctgg atccctcgtg4980tctgacggct gcagccagct cctcaagatg tacatgctct ggctcggcct ccaccttaag5040accggtccct tcgacttccg ccccgtcaac ggccacccca acaaggtccg ctgccgtggc5100cagatctccc cgcacaaggg taagctcgta tacgtcatgg agatcaagga gatgggctac5160gacgaggctg gtgacccgta cgccatcgcc gatgtcaaca ttctcgacat tgacttcgag5220aagggccaga ctttcgacct tgccaacctc cacgagtacg gcaagggcga cctcaacaag5280aagatcgtcg tcgacttcaa gggtattgcc ctcaagctcc agaagcgctc tggccctgcc5340gttgtcgctc ccgagaagcc cctcgctctc aacaaggacc tttgcgcccc ggctgttgag5400gccatccctg agcacatcct caagggcgat gctcttgccc ctaaccagat gacctggcac5460
ccgatgtcca agatcgctgg caaccccacg ccctcgttct ctccctcggc ctaccctccc5520cgtcccatca ccttcacccc gttccccggc aacaagaacg acaacaacca cgtgcccggc5580gagatgccgc tctcgtggta caacatggct gagttcatgg ccggcaaggt cagcctctgc5640ctcggccctg agttcgccaa gttcgatgac tccaacacca gccgcagccc tgcatgggac5700cttgctcttg tgactcgtgt ggtctccgtt tctgacatgg agtgggtcca gtggaagaac5760gtggactgca acccgtccaa gggaaccatg gttggcgagt tcgactgccc catcgacgcc5820tggttcttcc agggatcttg taacgacggc cacatgccgt actccatcct catggagatc5880gccctccaga cctctggtgt cctcacctct gtgctcaagg ccccgctcac catggagaag5940aaggacattc tcttccgcaa ccttgacgcc aacgccgaga tggttcgctc tgatattgac6000ctccgcggca agaccatcca caacctcacc aagtgtaccg gctacagcat gctcggagac6060atgggtgtcc accgcttcag cttcgagctc tctgttgatg gtgtagtctt ctacaagggt6120accacctcct tcggctggtt cgtccctgag gtcttcatct cccagactgg tctcgacaac6180ggtcgccgca cccagccctg gcacattgag tccaaggtgc cttccgccca ggtcctcacc6240tacgacgtta cccccaacgg tgccggtcgc acccagctct acgccaacgc ccccaagggc6300gctcagctca ctcgccgctg gaaccagtgc cagtaccttg acaccatcga ccttgtggtc6360gccggtggct ccgccggtct tggctacggt catggccgca agcaggtgaa ccccaaggac6420tggttcttct cgtgccactt ctggttcgac tccgtcatgc ccggctcgct cggtgtggag6480tctatgttcc agctcgtcga gtccatcgct gtcaagcagg acctcgccgg caagtacggc6540atcaccaacc cgaccttcgc tcatgctccg ggcaagatct cctggaagta ccgtggtcag6600ctcaccccca cctccaagtt catggactcc gaggcccaca ttgtctccat cgaggcccac6660gacggcgtcg tcgacatcgt tgccaatggt aacctctggg ctgatggcct ccgcgtctac6720aacgtcagca acatccgtgt gcgcattgtt gctggcgccg cccctgctgc tgctgctgct6780gctgctgctg ttgctgctcc ggctgccgcc cctgctccgg ttgctgcatc tggccctgcc6840cagaccatca ccctcaagca gctcaaggct gagcttcttg acgttgagaa gcctctctac6900atctcctcca gcaacggcca ggtcaagaag cacgccgatg tggctggtgg ccaggccacc6960attgtgcagg cttgcagcct cagtgacctc ggtgatgaag gcttcatgaa gacctacggt7020gttgtggctc ctctctacac cggtgccatg gccaagggta ttgcctctgc tgaccttgtg7080attgccactg gtaagcgcaa gatcctcggt tccttcggtg ctggcggtct ccccatgcac7140attgtccgtg ccgctgttga gaagatccag gctgagctcc cgaacggccc cttcgccgtc7200aacctcatcc actccccctt cgatagcaac cttgagaagg gcaacgttga cctcttcctc7260gagaagggcg ttactgtcgt cgaggcctcc gccttcatga ccttgacccc gcaagtcgtc7320
cgctaccgtg ctgctggtct ttcccgtaac gctgatggct ccattaacat caagaaccgc7380atcatcggta aggtctcccg taccgagctc gctgagatgt tcatccgccc tgccccgcag7440aacctcctcg acaagctcat ccagtctggt gagattacca aggagcaggc tgagcttgcc7500aagctcgtcc ccgtcgccga cgacatcgcc gtcgaggccg actctggtgg ccacaccgac7560aaccgcccca tccacgtcat cctccccctt atcatcaacc tccgcaaccg cctccacaag7620gagtgcggct accccgctca cctccgcgtg cgcgttggag ctggtggtgg tgttggatgc7680ccccaggccg ctgccgctgc tctcgctatg ggtgctgcct tccttgttac cggcactgtc7740aaccaggtcg ccaagcagtc cggcacctgc gacaatgtcc gcaagcagct ctgcatggcc7800acctactctg acgtctgcat ggctcccgct gctgacatgt tcgaggaggg cgtcaagctc7860caggtcctca agaagggaac catgttcccg tccagggcta acaagctcta cgagctcttc7920tgcaagtacg actccttcga gtccatgcct gccacagagc tcgagcgtgt tgagaagcgc7980atcttccagt gccctcttgc tgatgtctgg gctgagacct ccgacttcta catcaaccgc8040ctccacaacc cggagaagat cacccgtgcc gagcgtgacc ccaagctcaa gatgtctctc8100tgcttccgct ggtaccttgg tcttgcctct cgctgggcca acaccggtga ggctggacgc8160gtcatggact accaggtctg gtgtggccct gccattggag ccttcaacga cttcatcaag8220ggctcctacc ttgacccggc cgtctctggt gagtacccgg acgtcgtgca gatcaacttg8280cagatccttc gcggtgcctg ctacctccgc cgtctcaatg tcatccgcaa cgacccgcgt8340gtcagcattg aggtcgagga tgctgagttc gtctacgagc ccaccaacgc cctctaagcg8400agttatatct gtctagaaaa cttggcatgg ctagcaattt atgtctagct attccataca8460cacggtaatg ccagtagcct gttagttata gctcttttgg ttgttgtctc acaatacact8520gacatcagca gaacaaaatg aaaggggcct tggctaccat gaaatcaata cttcaaaagg8580tctcttggtt tctttactcg catgtcgcta tttacttaca ttcctcgagt acataacata8640tcatacatca aagaaattaa aaagaaaaca aacattcaaa tatgcattac tttccctact8700gtactagtaa gtacgtttct ggtattaagt tgttttttct caaaagaaca atgtgcttac8760ttgtaaaatc cacagctgct tacttgtaag cctcaactag ttagtgatgt gattatcata8820aaatgttcga cactgtacct cctttccagc tatcttccta cacctcctct gacgcaggtt8880gacggaggag gcgtgggggt tgattgaagt gcaacacaac gttttgttta agatattcct8940tgccttggcc gactccaaat ggatagcaca gaagcctaat gataatttga attaatttta9000tttcgagctt atttaatgct cttatcagag tccgtaggta tctcttttcc tactaattgt9060tgaaaaagga tgttttggac atagcaggtc atcatactat ttggttccat caaattcata9120tccatttctt tcgttcaagt gcttcccttc ctacttatta tatatattat atatccataa9180
atgtaaaaga gacgattacg aatactttgc atacatgtat agcgaaacag agatggtagc9240aaaagttcac cttcactaat ctaagaatct ctccacgtgg gtaaaaactt cagcagtaag9300attgtaaatg atgtccaaga acaaaacgtc atgctagtcc aggggttact gagctaacga9360ttaataatgt ttcgtagtct tcctaattgc accatcaaaa cttgtctgca caagttttaa9420agtattggag cctttactga agaatcagag gacatagatg gggcacgttc gccttgaaaa9480aaatagtctt ctttacctgc atggtgttac aaacaaaaac gagttgaaaa tagctgtgca9540aggaggcaaa catgattgga aaagaaaaac gaggggaccc ttatacagga gggcgccaca9600tagtagaatg agtagattgt tagagtaggg tacgctttat gtgattgatt gaatgggcga9660gtgaaagttg ctgtcaaggt tctaaacaaa aggatgtttg agtttgtgag tattgtttgc9720ggcaaaaaga ttcagtagag agaaatgcac aaaaagataa tacgtgtgta gggcgattat9780ggaggcatgc atttggggga aatcatcgca tgcgcatgag tttctccatc tgccgaatct9840ttgcaaaggc attttcaagc tccatttgca tagcgtaggc ttgctgctca aactgagcgc9900gctgatgcgc cagattttct tcatgtcttt tgttcaaact acgctcaaga ccctcaagag9960ccgcaacctt gagcttgcgt tccttttgct gaatctccat aactcttcgt ttcacctgga10020gctcaatttc tgcagcatcc gtggtctttg cagcggcctg tgcgtcttgt gcggcctgtg10080cgttgtttgc gagctccttt cgcagctcct ccatctccgc gttctttttc tcctccatcc10140atttggcacc gagtttggca gcttgatcga tgcggccctt gagaacttct tcgttctcct10200caagttctgc gatacgcgcg tgtaagccga ggatctcctc cgagacagcc tcgccattga10260tcattatttc acttcccgag tcttgaatga caacatcagc cttggtgcca ggttcaccgg10320tatctcgctc gcaaccctgc tggcgcatag acagcataag gcgcgcatta tcctcacgca10380gatcatccac ctgttctgat aaaagtttga ctgcctgctc aagattacgg gggttcactt10440cgtgaaaaat ttcttgaagg tctcgaagct cagaaagctt ggcagagcaa gtgtgcatcg10500ctctgcactt tttaagacgt gcaagtgcat catcaagttt ggcattattt accttcatgg10560aggcttcagc tacttcggct tcttcgatta caattttctg cagctctaca acatcatggc10620caattaactt gcgatgcagc tcggcaatca ccccatgcat cttttcggta tggcctggac10680gcgcctcatc ctgcgttctt cggatctcct cctctagttc tcgatttaga cgaagggctg10740gtccaagggg cgggtaatta gcctgagtca agccaagctc tgttgctagt ccaaggcagt10800cggaaagtcg cagccggtcc ctatcagaaa cagccttttg caagtctacg ctcaaacgca10860cttcttgagc cttgcgcacc atcttcggtt ctgcctgtcg cagaagtttc gagtcgtagc10920cagcttgcca cgctagcacg atggcacgcg caagtgacct cagttgaccg ctgttcatgg10980cagacttgag caacattttg atttgcacaa atacctcatc tgattcatca tcttcagctt11040
cctcaagctc tgcaggtgtc ttgcgctctc cagagacttg aagagcaggg ttcaaaccgc11100cctccaggac ctcgctcgca agcgcctcct ctgtctcagc tttgcgcaat agcgcagcag11160cattctccgc cattgtgttt gtcactcacg agattaatat cgttgccaga gtatacggta11220atgcgagtta aggattcaca gaatctctca aattaatctt ttcacctaat gatatccaca11280aaacgttgca atcgctcagc ccaacgacaa gcgtgcttct tgttttaaga ctgcaactgc11340tcctttttct attagtcaat atggaccgtc ctccaaacgt ccagaaaata gcacagaatt11400taccagcagc cgctgcagac aagaagtgca agagagcagg caagcaagtg agggtttgag11460caaataggcc aacctctcca cgcagaattc tagggtcgca accggaactc acagtcctta11520gaaaccgtgc gaagccctgg gctcaacttc aatttgtcca cgggaccttc agcaagcacc11580aagctcagca gcgtgaaggc aggcgctgac cacagtttga gctcagaggg cttggtgtgc11640ctcgcgattg atattgaagt caattgcgca ggacggcagc aacggaccag gtggtgaaga11700aggtaatctc cagcggagtg atgatggagc tcgaccgact actccggaat cgaccagggg11760aggtgcgggc gcccttcaca agcgggcgag aggcagggga gagaaggctc gactccacgt11820cttgaagcgt gtacgtgtgc gcgctcacgc gtgcgacacg ccggcaaggg cgccttagtg11880gcctgctgct gctgctggtc gccacgctgc gagcccaaga gatttgaatt gaactcgaag11940aaaataacta tcatttatca attccaatca atcaatgcat tatgaagcac ctctgaagtg12000aactattctc ctctccaata tacaacaaaa aacacacaca gtgggtttta ccctataacc12060tattgttccg cgagcgatca actactctat agagcgaatg accagttttt ctttctttct12120ttctttcttt ctttctttct ttctttcttt ctttctttct ttctttcttt ctttctgttt12180tcctatctaa taaccccttt aatcgaggaa acctttcgat ttaaaaggaa agctctgtct12240gtatatatct gttacagata ctgctatcat gccatgcaga aagaaacaca aaagaaaaac12300aaaagaaaga gagaaagaga gaaagaaaga gagaaagaaa gaaagaaaga aagaaagaag12360agcttttctc aatcggtttc ctcatcgacc gctcacatat ctacgattgt ggcaaagaaa12420gaaagaaaga aagaaggaaa gcctcagcag agtccgcacg aaagccttca ttgagccacc12480atgtcgtggt ccgctgcagt cagtgccgcc tctctgtgaa ttgagtgagt gagtgagtga12540gtgagttggt tggttagtta gttagtgcct cttcagctca aagcctttca cggtcgctct12600tcgagcgttt gctttttcat aaacaaataa acaaaccatc gaacgaacca tcgaacgaac12660gaacaatggt accccagaat agacggaatt aattgctaag taaaccagta acagtaagtt12720agtgtttctg acctgagccg ttttctttat ttattcctct cagctctgtg aagagaattt12780gggatgaaaa gaaacgtttt tatttattta aaagtttagt aacaagaaaa acatggtccc12840tcttcttcct tcatgtaaaa ataagtaagt aaaaaaaaga aaagaaaaaa aaaaaagctt12900
ttaaagtagt aaagcgaggt agagataaaa gttctttctc agggctccta gtaggcactt12960aggaggtacg tctaagaccg cctcgtggga agaaaagaga aaacaagaag agaaaagaga13020gagagaaaca gcgctgaccc gagaggctca tgcgcagagc ccaaatctgc ccaactttgg13080caaaatgcag cgccgcctct gcggcggaga cggtcatgtg aatccgcaga gctgcacgca13140cgcgtcacag gctacagctg gatatttttt atacgagccc gcgcgagacc gcggcggaga13200aacggggtcc cgcgcgaagg gcctctgaaa agcaggcagc gaaccaggcc tgcaccagcg13260ccgacctccg cgagacttcc ttcgatctca ggaaggacct tctgaagagt ggctcaaagc13320agcgcaggcg gaggcagcgg cggagggcac gcccagcgag ggcatcggct cgaggctcca13380gggctgccag gtcgcgaggc atgcacggcc tcggttcgtg atcttggccc tgccgggtgt13440gccgggatcc aatatggtgc gcaccgtttt tgaagctgtc gctcttttct cgcgtcgcac13500attacgatgc gcagaactga gtgagtggac aaacgaagag ggcgatcgat ggcttggaat13560gcgaactccg tccatcgaca tcgacatcga tcaacccatc gacccatcca ctccgtgcac13620aagctgcact ccgtgcacaa gctggagacg agcgaccgaa gaggtgacga ttcgctctcg13680ctcgggatgc ttggatgatt ggatgattgg gtgcacgagc tgccacttgt tgttcttgtg13740ttgttcttgt tgctgttctt cttcttcttg gcggtcgttg agcgaatgcg ctgtttgtcg13800agaaccatga aatgagcgtc ttgaatatgg gtggcctcgg gaatccgcag aacgatggta13860tcgcattcgc atccctggtt gcaagaaggc ttgcgatgag gtaagcacat gccgactcgc13920cgatcgacca gcgcgggcct ctgtgccgaa ggagcgacag cttggacgca ggggaatggg13980gcctcgaagt tcttgtggtc actcaggaca gaaactcttg ttttaatttt tctagttgct14040tagctcaagt tagttagcca gttggctagt ttgcttttaa ttaaaaatga agaaaactaa14100aattgagttc tcaagtctga aagaacaagc aaacaaaagc gaaggatgtg ctgtgcatgc14160acgagcttcg gctcaggcag aggaagattg ccagctcgca tgaccttgga tcttccatac14220tgcgtaatgc tgagcgtcag agaaagatgc gggccaggtg ccggaagata taccttcatg14280gactttccgc agaggtgaag atcagcgatg atcatgtgga agtgacacga cgcacctcga14340gcatcccagg aattgcagtg tttgcccagg caggcagtga gtgcctggtc aattatggaa14400tagtcaatct agtaatatga gtgagtggaa ggcagaaaat aatttccatt ccttcattcc14460atgactagct gcatcaacat catgatgttg cttcagctcg tcagcagggt gaacaacgtg14520cgggctagaa gaattagaaa agaacaatga gtgtctatga atgcatgaga atcgagtgta14580atgcaataca gaaacgtgag aaattgcagg attgattaga aagtattagt agggcaagaa14640cagagagatt agagaagtga aaagggatga cggtgaaacc agtgtagtcg tagtaaagag14700tggcttgcaa ataggtgcac cgcatccatc aattggtcaa cgagcaaatt agtgcagcca14760
gcgtactagc tatttactgc gacgatgtaa cgaagtcctc caaggacgcg tacacggtgg14820ccggcaagtc ttcattggcc ttgagcttgt ccaagataat gcggggaaac tggattgcct14880ggtcaatcac agccttacgg gcctgctcat cgttgacaag tgtagggtcg cgctcgaggt14940ggtcggtgaa gcgagagtgc aaatcaagag ctacagcaat gatgcgctca gccttgagct15000catcacggtc ccactcaact tcctgttgca taagcgaacg gatgcgcttt acaaggcagt15060ctcggacctt gggcaggtca ttaataccat cgtaataaat actcatgacc ttccacatgt15120tgggctcact cttacactta gaacgcattt tgtcaaagag ttcctccaat tgtgcgcgca15180aactagacac aagttcaggg ctcatctcct tacgaggggc ctccggcgtg tgcgggtctt15240cactagaggt ctgagaatcc gacttacgga cagagaccaa ggcatctaca actacaagga15300gactctgtaa gtcaaccatc tcagcaatgg cctcacgagt gccagcgcgc acatccacaa15360ggttgctcat accatcaatg gccatacccc actgatgagt gttgacagcc aacataatgt15420agttctccca aatacgccag ttggaacgtg actgacgaac ggcctcaata acagccttca15480aggcagcagg gtagtcgttg agctgaatca aaatcgaact caagttagcc caagcatcac15540cgctatccgg atcctggcga gtcacatgcg caaaggctgt acgggccaag gtccactgtt15600caaggcgcat agcgcatgag ccaaggcgga accacgactc cgggtacaac gggttgatct15660taagggcatc ctgaagatgg tcaatactct cctgcaaatc accgcggtca aatgccatga15720gggccaattc acgcttagca cgcgcatggc gcttgccaga aaactcccac gccttgctga15780accagtcctc atcctgaagc aaagagccca aaacgcacat aaggtgtgca gtgggctcaa15840ccgcaaggcg ctcacggatc aacttctcag cacgagcgcg cttgtccatg atcacaaggc15900agtccacagc ctcttcccaa agacgaacct cctcaaagat ctgcaacgca ctaccagcag15960cgcccacttc atagtacaat tgcgcgaggc cgcgcttgag ctcccaaact gcgggccagg16020atagcgcgtg cagaaaggcg agacgctcgg tcactggggc agcgttgtcc acgtcacgct16080ggcgaggctg tgttggtgtg agccggtcag tctgctggtc aacgaggact tgcatctgta16140aaatggcacg ctccttggtc ttgttgcgct caaactcaag ctgagactta attagaagtg16200cagtggagta taccatccag ttttcaggac tctggaggac acgctccacg taagcaagca16260tctcctctgc agtgagagcc tccatagcgt agctgttctt cacatccata cacaagccga16320gcacgataca ttggtccaaa aggctcaacg ttccgcggcg catgcgagcc tcctcatcac16380tggtctcctt tgcgtactgg atctcctcgt gaagtggagt ctcagcgtca acttcttcga16440ggcgcacctt ggggataccg agaacagaag tggatcctac aatctcgcca ccgttctctt16500cctcagttgc atcattatca tcctctgcca caatctcgtt gggtacctcg gtctcgatgg16560ccttgactgc aggggaatta gaattcttag gagcttcagt gtcttgctcg cgattttctg16620
ggtctttggt ggtagctgac gatgcaagaa gaataagctg ggtcttctct tgtttctgaa16680acttggtacg ctttcccatt acaccagtca tctgaatatt cagctgtgct gtttcctttg16740cctttgcaaa agcacgctta gctccatctg cagccttgaa cttgtgccgt gctacaccgc16800attcaaccca cacgagggac tccagaagtt tatcatctgg gtacatgatc tgcactgctc16860ggaccgtgcg agcaaatcct ctctctgcct cctgctcgag gctaggagcc gctgcctcct16920tggcttcaag ggtctcctgg tgaacaacag cagatcgtgc tgcccaccag ctaggtgtca16980gcaaatgccg aagagctccg cggactgtgt tggagatctg ttcatgaggg ttagcgctca17040tgccgccgtt ggtctttccg ggaagatcgt cctcgccatc ttcttcatct tcatcagcgc17100caatgacagc cccgttctcg tccacaaggc gagcctcagc gagcatatca gttgggtcca17160cagcaccagg ggttactgtc ggattagcga cgacgcgaag aatgacgcga gcaacaagca17220agaagtgaag gtacttggca tcacggtaaa cctcctcacc gttcgcgcca agcataagag17280ctgttcttgc gtggagctcc ggatacgcgt ccaactgtga ctcgtggaag ctggcatcct17340tgccagacac agcggcagca gccttagcat cggaggcgag agcggagaga gcagtctcca17400catacctgag acccggctca gaggcagact tggagctagt atctacagaa ttcttggggg17460cattggggtt gtcgtagacg ccgcgaacaa aagggagggg gtagaacttg tcaataccat17520ggctagagac aggaggacct gtccagttgg cctggacgaa gatgtgaaga caagcaacac17580ctgcgaacat gcaggccata gctcgcagag cacgttggtt tacgtggtcc tcaggactcg17640agacactcgg gtccttgaag gagctgcgac ctgtgcaggg gccaatacca gtctcgatgt17700gctcaacaac acgctcatga agaaaacgac cttcacggtt tttaacgcga cttacagagt17760atcccttctt aagatcttct gcggcaaaga ggccttgcgc cgcaggagag gcgaggacct17820cgaagaaatc gccttgggca agagcgcacg ccatacgaag gacctcaagt tggagctctt17880tcacttcagg ctcgtcacgg agctcctcgc ggtcatctgc agcagccgat gcagccacaa17940gaacatcttc gagaccctcg gcattgctct cgagagcgag acgctcgaca agtcgcagcg18000agtaaagagg tttcgcaact ccagagccct ttttggagtt gaagacacca gcgccgttaa18060gatcaccatc gtcagcctcg tcgatctcat catcaggacc gtcagggagc tcaaagtcct18120gtggaaggcc taggaactcg ttcaggtctg cgtcagagct cgaatccgac gcgtagtccg18180ccatcctggc ctacaggacc gccgaaacag gttgcggcag ccgcccaaag tctaagctgc18240aagagtcaac cctcaatcgc gagcttgcgg cacaacgtcg ccgcaggatc tcgcgccaag18300acgtctccaa atgcaagtct ggtgctcaag tcatcctggc cacccgcgcc tttgcccctt18360aagctaggtc acctacctta aaccagagtt gccccgcggt gtcatattgt aaacatttta18420taacaatata cgtcatatta aaaacctaga tgtggggaca atgttataaa taagtaacaa18480
atatagacta catcgagaag aaagaattct tcggcactcc gtgtgagttt gggcgaaact18540gcaatcacga agccatgcaa agtcttcgta tatctgagtg gagcctcgct ggagagaaga18600ccccatgtga atgggtgtag aacgacgaat ctacgcagcg ttgtctccgt tgagacgctc18660tgtccagata tgaggtccct cactattctc gtatttgatc atgccaagca tctccagttc18720caacaatgga gttttctatt gaaagaacat agacatgttt ggaacggttc ctttcagagg18780ggaaaaacta atcaaaaatc aattgaggaa tgcagggggg ttatttgctg cagttttagc18840aataaaataa aaatcctttg ttgatgtgat ttcattcgtt cctttgacat tcaatcattg18900aattgctctt caccggagct tttcaaggtg cccaactgcg atctccgctg cggctgctcg18960cggccgggct ctgagctcta tctccgtgtg ggaggcggga agccagcagg tgcggcgacc19020ctctccaaat agaggccgcg gcgaccttga ggcactcgcg tggcgggcgg attggcgatt19080ctgtgttcaa ccgagatatt tcatacatat tatttgctaa ttattagcaa atagaaataa19140atatacagac tttgcaagct cagtagagaa agtgaagatc caaaatgtcg gcctcttcct19200cgcaatctac ttcggagcag cgcaagtcac gcgtggcgta cttttacaaa cctgagattg19260gcagctacta ctatgggtaa gttagtatgg gaaaattggc gacagaaaaa tataataaaa19320aaagcaactg tatcgccacc gtttattcac ggtagttaga aggtatttgc ttcctgcgca19380cactcgatct gcaggatgta catgtcttga gtggcattgt ccaacgatcg ttctgtttgg19440cggaacattg cttttaaaca aaaacgagat agtgaatata ttctacccaa ctaccaccat19500ccggtttaag gagacaaata aatctgtctt tcgacccagg ataaggaggc ttgcatggga19560atcttttata atctagtctt tatgtcaaat tttcgcaggt tccagcctac catctctcat19620gctatttgtg attgcacaag atgatatgaa agtaaagaaa caaggcaaag gatataagat19680gcataaggat gtgcagaaaa ctaactagaa acattcatgt gatgaaacct tcctcttgaa19740aactcacctc ggtttgtttt ggatcttggt ttgtctttgc tcactttttt tcattattta19800cagcccgtcc catccgatga agcctcaccg cctgaaactg actcacaacc tgcttcttac19860atacggactc ttccgacaca tggaagttct gcgcccgcac gacgcgactg cggaagacat19920ggagcgtttc cactcgcacg aatatgttga ctttctaaag cgcatttctc ccgacaccga19980gcaagagttc gagaagcaaa tgacccgttt caacgttggt ccctattctg attgccctat20040ttttgacggc ttatacaatt ttatgtctag ctgctccggc gcatcgttgg atgccgcaat20100taagatcaac cacggacagg ccgatgtttg tgtcaactgg tctggtggtc ttcaccacgc20160aaagaagggt gaagcttctg gtttttgcta catcaacgat attgttctct gtattgttga20220gctcctcaag tatcaccctc gtgtactcta tgtggatatc gacattcacc atggtgacgg20280agttgaggaa gcgttttaca caaccaatcg tgtgatgacc tgctcttttc acaagtatgg20340
tgacttcttt cccggtagtg gtgcctacac agataccggc gctcgcgctg gtaagaacta20400cgccgtaaac tttccgctca aggatggtct tgacgatgcc agctttgaga gcatcttcaa20460gcctgttctt gatggcatca tgaagcactt tcagcccggt gctgtggtga tgtgctgtgg20520tgctgattcc atctctggtg atcgccttgg gtgctggaac atgtcattgc gaggccatgg20580ctacgctgta cagtacgtga aatcctttgg cgtacctgtt gtgcttcttg gtggtggagg20640ttacaccccg cgtaacgtgg ctcgctgctg ggcttacgaa accggcattg cactcggcaa20700gcatgaggat atgcagaatg atattccatg gaacaactac cacaactact ttggccctaa20760ccatcttctt cacattactc ctgacccgca gatgaagaac gccaattcac gcacctacat20820ggacaagtac accaacatta ttctcgagaa cctttcgaag cttgaagcgg tgcccagtgt20880acagttccaa gatcgcccta acgactttgc aaacccagat gagcgtgctc gtattgctct20940tgacaacgct gaccctgatg aaaaggatta cattcaacgt cctcagcacg aggccgaata21000ttacgaagac gagaaacacc aagactcgga ccgtcccaat ccggctgatg gtggtgccga21060ctcaaaggta aagtctgaaa aatcctcagg cgatggagct gcggacgaag cggagaccgg21120atccagaaag ccttacaaaa agggcactga atgcggtggt ctacttgaaa ttgacgaggc21180tgtcatggaa gtggactcca atgaagcgcc caaggagact gctcctgctt cagattctgc21240tatcaagact gaggatgctc ctgctgctga gtctgctgcc tccccctcgg atgccaaggc21300ctaaacatga agactttgtt ttaatgcaat agacgtgctc ttttgctgct cgagtagcgg21360caaccctagt gccatgtcct ccttttttct tactcacttc tctctctacc tttgaaagag21420accaagtgga accaagcagc catttctgtg ttccacattg caatagatta tcttttaaca21480attctcatac atacatattt tcttcatttt tcttttctat gtatttttaa aataaaatat21540aacaacaaag tagtagtttg tatgaatttc ggccatgcag gtgacaaaag gtgaaagtaa21600tgagcgtcat tttggatcac attaccagcg aatccactca acgactcttc tcttctcgag21660ctttagaagc tgactgtgag ataatagaac agagcacggt ccatcaatca aaatacataa21720ttagctcgca atagcttcgc ctcacagtga tcgtttcacc tcatgatacc cttgttgggc21780gctcgctctt aggctctccc ttgttgttat atgatgcaac gatcatctaa gtgctgtccg21840cagtcatcaa gacatcctat tctgtagcaa gcaagcaagc aagcaagcta gctagtttag21900ctggctagct agtttagctg gctgagttcg cagtgaataa acaattaaca cctcaagtct21960tgaaggagca ggaaacttgg ctcctatgat atgccatcct ggaaggccat gttttggggg22020gtatgagaga caggtctttc cttttctact ctggttcggt ggatgacgag acaacaacca22080gacgtcccgc ctagtacctg ggtggtcgat ctgtcctccg ttcactccga gtgcagggct22140tgtgggacga ctcgctctgt tgaattgagg tccttcacgc gagcctatct gggcatcgat22200
cgacctcatc catcaacaca cacacatatg ttcaatccgc gccaccctcg ctgactccca22260gactgcccag cgaaactttg aaaacttccc catctcgaaa cagcactccc aaaagacgca22320cacaagcaac gcttgagcct aggcaggctc tccgctggac gcacaaacca cctcgcagcc22380atccactctc tgactcccca agcatgcatg gccttctccc tcgatttggc gcttcgcgtt22440gctgtcttcg aagtcctcaa acacgaactt ttcactaatc atcctcgacc tcagcaggat22500gccccccctc ctaagctctg tttgctatgt atttattaga ggaaggacgg caagctgggg22560gtctgcggaa cgcattttgg gggtttgaaa attttcgaat tttcaaactc cccgaaacgg22620ccatggtttc ttccgagaag cggtagttag gtggggaaat gagagcacgg cggagttggc22680gagaagcata aatctgggcg ggcaagcaaa ccccaaacta tcctgcaatc aacaaaacac22740acgcactccg caatcaactt gcaccgtaag tctttggaat tgattatggt atctgcttcg22800ccgtcttcaa ctttaacttt gcgcctcgca acgagacttt gttttgtaat gtgcctttag22860atttgacgaa acatctttaa gcgagatagt acagcagcgc gttggtacca agagagatag22920atcctgggac cttttgaaat aaataaactg tgtgatgaac ggtcgactaa ctgggcttgt22980aattgatata ttgatgatac tcttggtcca catgggagtg agcacagtcc acaaacaact23040tgctaaccca cacaaaaacc tcccaaactt gcagacccgt tctgcattct tgtaaacaca23100taatcacaca gcacacataa tcacaatgac ctacggcaca gcacacaact acgtgcagga23160gcagattgag ttggacgaat gcttcaacaa ctttggcgaa gaagtgagca gctctgttga23220gcctcggtgg cagcgcaagg ccttggccgc tcgcactccc aagtctagcc gcaagcgtag23280ccgcaccggc aagaccccga gcaagggcaa gtctacgccc cagcacgacc gattcatccc23340caaccgtggc gccatggacc tcgctaacgc tcacttcaac ctcatgaagg agaacagcag23400ctccgcctct aaccagtgcg agtcccctac tcgtgctgaa ttcaacaagg ctttggcgtc23460cagcatgggt gcgggtgagt cccgtgtttt ggccttcaag aagaaggctc cggcaccgcc23520tgagggatat gaaaactccc tcaaggtttt gtacacgcag aacaaggaga agatggcgcg23580cactcagaag cccgttcgtc acattccttc ggcaccggag cgtatcctcg acgcacccga23640cctcttggac gactactacc tcaaccttgt cgactggggc gcctccaaca tgctcgccgt23700ggcccttggc cagacggtgt acttgtggaa cgccgagacc ggcggcattg aggagctctg23760ccagtgtgat gccgaggatg actacatcac ctcggttaag tttgttcagg agggcggtgg23820ctacttggct gtgggcacga acttcagcga gaccaagctc tttgatgtgg agacctgcaa23880gcttctccgc aacatggacg gtcacagctc tcgcgtgtcc tcgctctcgt ggaaccagca23940catcctttcc agtggcagcc gcgactcgac tattgtgcac cacgacgttc gcgtggccag24000ccacaaggtc ggtgttcttg agggtcacgt gcaggaggtc tgtgggcttt catggtcccc24060
ggatggccag accttggcct ccggaggcaa cgacaacctg ctgtgcctct gggacgctcg24120ttactctggc gacggtcgct cccagcagac cgtgcagacc ccgcgtctta agatcgctga24180ccacctcgct gctgtgaagg ctcttgcctg gtgcccgcac cagcgcaatg tccttgccag24240cggaggtggt actgccgatc gcacgattaa gatctggaac gctgccaatg gcgcctgcct24300caacagcgtc gacactggat cccaggtgtg ctccctcctc tggaacccac acgagaagga24360gcttctgtct tctcacggct tcagtgagaa ccagctcagt ctctggaagt tcccttccat24420ggctcgtgtc aaggatcttc gcagccactc cgctcgcgtt ctccacttgg cgatgtctcc24480ggacggaacc actgtctgct ccgctgctgc tgacgagacc cttcgattct ggaaggtctt24540cgaggcagct aacccggtca agcgcaacaa gcgcgccgct ggagctgcca ctgcctctca24600cggtggcctc gcccgcatga gcatccggta agtttccccc cttcccttgt ccggttaatt24660cactttcgac tactgtctta cacagaagca aagcatggtt atgcaagcaa acttgctggc24720atgctctctt ttgtctcttc agtagcgaga ggccgtggtc aaggggctca tgcgggagct24780ccaatgtaat ctaccaccac ccggcctctc atgtatacat atatatatat ctatttatat24840gctgatcatg atgcaaaaaa atcccacgcc gtcatactaa agcgcgtcag tgtttacaat24900actgttggcg tatagttcgg tagtgaaaat taaaatcctt cagggtttgt acctatagct24960tttggtgatg aatgtgatct actactactg acgtgacaga agcaacaatt cttgtgaatc25020tgacttcttt tttgtgtatt ctatttcgca tgactgcctg attgtatgat atgggtctga25080tttggtcgac tgtactctat tttgcatgcc atgtaacttt ttgttcgatt atactatgaa25140tctgtggcaa cttttgctga gaagaaggga tggcagacag tttgattttc ttgatcaatg25200tgtttcgctg tcccgctgtg ttgaaagaat gcagtaaatg acccgagtat cggactggag25260tgcgtatgtt tcacgctgcc ttatgaatcc ccaggggttc gcagcagcac tttccctcgt25320ctgtctctgt gtttgctgtt tgttcgctcg taaatgtgtt ttgcctgtat catatgcatg25380taggatagaa agttattacg cagtgtgtat tatagattta tggaagatca ggtggactcg25440tatatgctga ctggtgggta tgcttcacgg gatactcgca ttaagttcaa attcgaggca25500atggttgctg ctgaagtcgc tgacgaagga gagctcattg ttcttgtcgc caatttgtaa25560gtaggtggca cctgattcct ctttcctctg ggaagagatg cagcgctctt gggatcagtt25620tctctctcaa tcacgcttgc cgagcagttt ttagtagcaa gcaataggtc tttaatgact25680tctagaacta gatgagcagg tatttgcatc atgcaaggct ggcatgtttg gtggctttgc25740aatttctctg tcttgaactt agctggatag atagcgagag agtgaagttg gtacaaacat25800aaccgacagc atgtagccgc tgccttcgct cgcagctcta gcgctcgcct gcagagacgg25860aagagtgtat aattgcccag tgtcaacttt tgggtggtgg gtctgactca caatcaatgg25920
taccgttcag gtatctttcg gtagattatg acactggcca cttttctgaa gtgatttgag25980atttggtatc gatgatgaag agtgagagaa ttttgaaaga aatacctcat taacttccaa26040tagtcagtat cttgatgaaa aacgctgacc tgaaagctgc gcgtgttttg ttgacacggt26100ccttttattt tgttttttga tgatctattg gtacttatac ctgcgatttt tcttttgcaa26160gctaaggcac attcgacttt gtctagaagg aaagtgatca tcacgcttcg gcacacatct26220gttttcctca gttaagtttt cttcttggtt caggtatggt attacatgca ggaagaaagg26280ggatgcgggg acagccgtat agatgccacc aactttaaca tggtttgtgt tttggggaaa26340caaggaaaga gagcatacgc tatgagctac ttaaactagt gacacaagaa gcaacttatc26400ataccggaga tcacaatgga gtgattaggt tctatcagat agtagaagca gagtatgcga26460cctgcggtgg ctacgtacat gggtgaaaat aatagaacac ctcgcgtagc gtcgaaaacc26520gcctcgtaga ctctgtgtca ggtatgaacc acccactttt tttgtcctct ttatctccac26580actatttcct tcatggagac aaactcattc tcgaaagaca aacaatcaaa tcaatccatt26640accctcatgt tctcatgatg ggtatgttat acatatatgt ctcagacata tgtttatcct26700ttttaaaaca catacttaat aggcacttag cactgttact gctatagaaa actcatccat26760tcaagaggag ggagagaaca gagttggcaa aatcttggaa gggcaaagtt tatagcaagt26820aagtagtagc acagagagag tattatgtat gtgttcatct agcaaaatct aaatagaaga26880gccgatcgac tcagtcagtt gtaattagga ctagtcgtta atcatgacat ggctcataaa26940caactagtca gtttcttgat ttacttggca ctcaggaaca aagtatgttg ccatccctgg27000gcaatagatt tgatcccgtg cgttgagata aagcttgcca aggtcgggtc atgtaactgc27060agaggcactg ggcgtagatt ccagtcccag acataaggaa cagcaagatc ctcaccaacc27120acgcaaatgc cctcagttcc aattgtaact tcaagctgag gagtcttgtg ctcggcggaa27180agctcgaaag gggtaaaaac aggtacaggg tcaaggactg tgcgagctgt ggccttgtat27240ttgttggtgg acttccaaaa tccctcctcc atgaatggtt caatctgctt ggtcacagcc27300tcggagcttg aagtttcctt gtcggacatg agaccccact ggtaaagctt gcagccgtgg27360ccctgagaat ctttaactaa agcgacataa ctctgcgggc ctgcccaaat gtcaagcacc27420gggcccgcct cagggccgaa ctcgacctct cttggtgaaa cctggtcctc gtagttgctt27480ttggcctcgt ccaactggcg agattgcatc ttgccccaca caaagacacg accatccttg27540agcaaggctg cgctgctgtt catgccagca gcaaccttga tggccggtcc aggtagatct27600ctgacctctt gcatgacgaa gaagtcgtca ataccgcgca gaccgattcc gagttgaccg27660cgctgtccct tgccccacgc gaagactttg ccgctcactt tcgtagccac aactccgtgt27720ctgaatccca acgcaacgct ggccacggca tcatcatctt caggaagacc aattgtagtc27780
cttgggtccc agaagtacga gtctgtggtt cccgtcgcgc actgtccata gacattctcg27840ccaaatacaa aaagcgtgtc cgtttccttc gtaatgaaag ctgtcacacc ggcaccacac27900acaacttccc gaataggttc tgtcgagtaa ccctcaaact ttgtctcaag accccttttc27960cgtgagtcct gctcaatatc gtcctcacca aggtccttgt acaccttgta gctcaatacc28020tcctttggct caatcgcatc cacagatgtc tccacaccca tcatccgcat gacatactgc28080atcaccatgc ttgatttcgc atagcgaccc agacgcacag taagccgggt atcgtgggtg28140cgaccaaaga gataaacgcg accttgggcg tcaagaacgg cgctgtggcc aaagcccgct28200gcaagtttta caggctgggc ctgcttggtg tcgaggtcgc cgtggatctg tgtagggctg28260tcagcgttat cgagactacc tgtaccgagg gcaccgttga taccaatgcc tcgagcccat28320acgccgcgaa gggcggttcc ggcagaagag ctaagcatcc gcttggcacc tgttagggag28380cccagcgccg tcatggtggt ggtctgtatg tcaatgtatc tgtagaaagg cagccagcta28440actaaccagc tgtactgtga accacagaag aggcttttgc aaaagatgct cgagagcaaa28500atggatgatc ggtggagatg cggagaagcg cacagcacga tccgagtccg aacttgattg28560aactcaagtt cggagtttgc aatttttcta caactaggta taccttcgta gtatcacgta28620gtaggtggta gtactagtag tcctttgaat tgcggcaggg aatttacgac agcaactctg28680gtaaattaat ttaggacgcc tcttttgtac taaagtcctt ctctttagaa cggaaagaac28740atatgatatt gagacatcat gaggacatgg gaaagggttg tgcatctttg gaactgtatt28800gcccagtatg gctggacttc accttggact tattcataga atgaccacag ctattcctgg28860ggtagatgga ggtctgacaa tgctcgagct aaccctgccc atccatgatc aagacgcacc28920caagcactat ggccgcaagt ttcagttcat ggagagcaga gctgctcaaa tttagcttct28980gcggtcgatt ggtcttggca caaccgctct taagagtcat ctacgacagg ctaccatcca29040ctcaagataa aaatggactc acagatagat agatagatag atagatagat agatagatgg29100caggcgacca atcgcagcgc actctcgctc tcaagatatg cccgcccatc gaaacacggc29160cttctcatgc ggcctgtttc gtctcaagct cgagcaggcg tcggcccatg ctccagcgca29220acgggcccgc aactttcagt ttcgagcttg gtcttgcttt tgagtttgct tttgcttttg29280agtttgagtt tgagtttgag tttgagttca aaattcaaat tcttcaaatt caaattcttc29340gaattcaaac tcaaattgga gaatccatct tttcaaaaac tcaattcacg ctctcgaaga29400agttcaaact ccgcagtcgc atccagctga ggcacgcact ccccatcgca tcgccggcgc29460tctctcctcg ctcctgccgc gtctaagcgt gctcgcgtct ctgtcctgct gctgcttgct29520tgccagtatc tccacttctc gcgagcagaa ggaggacgag cagaagaaga aggaaggatc29580aagaatcatc aagaaggaac actctctttg tttctgtggt tcgtcattag tttgttgtag29640
cttgaaggag aaggagaaga cggagaagat ggagaagaag ggaatgaaca gcagtggcgt29700ttatctgtct ctagctagct aggtacctta cctaccaggt agagttagga ggagaggata29760gccgagacta aggaagcaag ccgtagtttt attttactat gtctgttgtt ctttctctcg29820actaccttct ctcgctaccc ccgtgggaag gaggtctctt gtgtcgagtc tgatccacgt29880ggacgcctcg aggatcttcc ctcgcacccc gggcccggtc gctgccggtg caaaacctcc29940tcagtggcct tgctcgcgct gtgtgctttc gttcctgcgt ctggaacgtc agatagcaga30000taaagagata taagatagtt agttgacgga agcagtcaaa gcaaacctcg aacggattga30060agcgaagcga ggacgctctc gcctctttgc tgactgctcc gcctattgct gctctggccc30120tcactctgag atattactat gtctgaacct gccgcagccg caccgccggc cgagcccaaa30180tcgtcgtggg cggatgaagt cgataatgac acggagggag acgctgtggc cgctctgagc30240gaacatgcgg ctaagttgga cctcgacgtc cacggagctc cagacctgca cagcggtgct30300cttgtagtac gcgaggccgg gtgccccgtg gacgagccca agacgcaggc agtgacaagt30360ttctcagccc ttgcgattga tgacgacctc aagaagtcta tcgcgaacgt caagggctgg30420agcactatgt ctaagatcca gcaaattgga cttccgcttg tgatcagcga ccctccacga30480aaccttatcg ggcaggctca agccggcacg ggtaagaccg gtacctttgt catctctatg30540cttgcaagga tctctgcaga taagaagccc agcacgcctc aggccattat cttggctgta30600actcaggagc tgtgcacgca gattgcacag gaggtcaacg cactgggatc cgacaagggc30660attaaagcac gcagagttat gtctgctagg tccaaaaatg gacccctcgc ggaagggagc30720gcggcggcgc cgtgggcact tagtgaaggt gaagactttg atgagcaggt cgttgtggga30780acacctggaa tggtcaagaa ctacctcaaa aatgccatgg gacgcaagaa gcgcaagccc30840atgatcgatc cgtctgagtg ccgcgttctt attcttgatg aagctgacaa gatggtgcag30900cagccacctc acggatttgg acaggacgtt caggagattc gcgacattat tctcaagaag30960cgcaaggaca agccgtgcca aattttgctc ttttcggcca ccttcaccga aaatgtacga31020cagattgccc gccagttcgt tggtggacat gacatggacg agtccaagta ccacgagatc31080acgctgcgca aggaggatgt cactctcgac aaagtcgtca acttcgttgt ctatattgga31140gacgagaatg agcgcaacga agaggaaatc tataagaaga agtttgaggc cattaatgag31200atctgggaga acctctctca gctcagcgag gggcagtccg ttatcttttg caatcgtaaa31260gatcgtgtac aacgcctcgc ggattatctt cgcgggctaa acttcccggt cggtcagatc31320catggtgaca tggataaggc cgagcgtgac attgtgctca gtgagttcaa gcgcggtgag31380cgcaaggctc tcgtttctac tgatgtcacc tcgcgcggta ttgacaaccc caatgtgact31440ttagttatta atgtcgacct tcctgttaac cgcgagcagg aagctgaccc ggagaatttt31500
gtgcacagga taggccgctc gggacgttgg actaagaagg gtgcttctgt ttctcttgtg31560gctcgcagcc ctgccttccg tgaccttggc ctcatgaagg acattgagcg tgcactcttc31620gctaatgcag aggtaaaccg tccgcttatc cccgtcgatg atctctccaa ccttgagagc31680aagatcattt ctgctcttga agcatacaac taagtgccta cctaccttaa tcagccctta31740tcacttgcat tgcgagcccg ggtttccgca gcgcttgccc tgtgttgcta gagactgggc31800aagctggctc gcctgtctct ttctcgcatt caacaatgca ttcaccgttt ctcctagctg31860cacccgccct ctctcttgcg cccacgacaa gaaaaataca gttcatatca gcatcccccc31920caaaacaacc ataacaatta cgtaaatgaa ggccgtttat tctaccgtgc atcatgagca31980ctgcaccttt tctctcctcc atcgcgcctt ataccgataa acaaaaaata gataacacct32040ttttgtagag caaccaccac cattgtttcc cttccctccc tccncnctcc ctcccaaaat32100aacttgcttt gtttgtacgg cgttccttct atctactttt tctttaatct tcaatcatgt32160ctgacggttc ctttacttat tatgcgttgt tttattcggt cacaaggagg tacagccttg32220atggtcctgc gatagatgcc gtactttatt gtcatatgtt tataactttt aaaaaattaa32280ttttttagta cttatattca aaattcaaaa ttcaaaatat aaaattcaaa attcaaaaat32340tcaaaaattc gaaattcgaa attcgaaatt caatttagat tgtaatctga ttatctttga32400atccgtcacc ttctttttat tattttttaa aataatttat ttttaatgtt tttagttaag32460ctaattttgt aaaaacaatt atattgttat aataacctta tcacctgaat aataagatag32520aaaacgaaga tgcatcctta cctcagcata agaccaaaca gactaaaacg aaacatcttg32580gattgcattt tgtctcgact atatcccatc tcaagagagc aataaaagtt attactgagc32640cttttcaagt cagaaatgtg tagtcgtgtt caaatttgaa ctttagtttt cgctaaataa32700catataagat ctgaattttg caacgactgt gacacaacac tttggttctc aagagaacac32760aagttcttgg ttggccagtg cttgttattc cgtatagtat tttgggataa tggacaagga32820tccaaaccaa gcacaattga gaagcataat tgcaacacca aacctgaaaa gtaactattt32880tgaagacatt accttgtggt gcagtttgat cgatacgaga gcaacgaacg gagcattgag32940gttaagcgag gggagtcaaa gaaagttatg ggacaggcac tcaactccac gatgaatgcc33000atgcatgtat ccaaggctgg ctgctcctct gggtggatgg gtgtcggggc acatgattat33060gtagaggaca aagatgtccc ttctcttgag ccttctgagc atagccaggc accttttcgt33120tgttcttgcg tacaatctcg ggttgtaggc cccaaaagtc acgttgaaaa ggtaatgggc33180tcacgatgtt gtcaaagccc tcgatgtagc gcgggcaaag gcacgcttgc agaactcgac33240gaggtcatgg acaacaaagt ccgaatttct ctagacgttg gcgaagacgt cgatgtcggc33300catgaagtcg gcaaagaaga taagacgagg ggcaaggcga ctcttcatga tggaggtgtt33360
agagacaaac tcagaatcgc tgatggtgtt attggctcta attatgttga tctcaaggcc33420aaagaaagtc atcaagcacc aggctcggtt atccaatcgc agcggcactc tcgagccgag33480aagtaccgcc gagcagacgc tgttcgcgaa gctcttcgcg atttgtcttc tttctcgcca33540agtacttcaa tgaatacttt tcctgactct tcgagccgaa caacacctgc atgctcccct33600gaatcagaaa ctagccttga tgaggagaag gagaatatag ggctggtaaa taacgttcta33660cttgaggaag aacacgttag tcgcccacga tcaatgacgt ttgatgcttc actttcgatg33720acggagctgg aaacccaaaa cgaagtggag cacgctgtgt tgacttcgtc tgtcatgtat33780gcagccgaga aaactctaag ttttattaag gagaattccg gagaattggg caaacatatc33840ggaaccgaag gcggaagtaa tatcaaagac attgttgaag aacatgcaaa tcaaaaatcg33900caagaaagtg ataatgaaat gtttatgagg ttgcttgaag atctgcctac tcaggcccaa33960caagtagttt ccgaaagttt gggaacacct actaccaaac atcattactt ttccagcgcc34020aacacgagca gtggagcatc gcgaagcttg cagtcaggtc gatcaagcac cccaaactgt34080gtcacggtat ctccatgcac agagctgggc tctcctcgtt gcgggcttga ctctgtactt34140ggtaaccaaa ttgatgaaaa acatggtgaa gggcttgacg atcaccatag gatcccgcag34200tttgatctct tacaacatga gcttttacaa gatagcaact ctattacagc acacagagat34260ggtgaaacga cttcgtcccc agttgcctgg gctggagatc ttcaagatga tcttacgcgc34320tctctgttga cagaagttga acatcctttc atctgtcgag aaacaaatat accaccggtc34380cattcaaaag ggaacgaggg tttgagaaca tgcaatggtt cgtcgcatag atctagtctg34440ggagcaattt tgcacgagat tctcgaaacc aagggagact ttcgtaaaaa cggtgaactg34500atcaccgacc tcgacatctt cctaggcgat aaattgccaa aaggcaaaac attttggtcg34560ctcttgacaa gtagcgagct aggtgagctt ggtgaaagag ttgaactcga aataatgagc34620cgccccctcg cgcaccagcc ttaccgagaa tcactctggt gtgttgcatt tcagacaatc34680cagctcactc cctatcgcca aagattggcg ctcagctgtc gcgatagact tttgcctcac34740gagcgggctt taagcgggtt ctccattgct caactaggtc gtgcgtgttt tgtacttcgg34800caaaggctcg tagactgctt ccaccacaac ggcaggataa agttcaaatg ttacaggcga34860acatgcaagt tgctggaagc aaggatgtgg caatgagcct caaaacatag gcttggcaca34920gggtgttgaa gcgcctttct gagacccatg aaactcctag tttgtttgct ttgcatcgct34980ctgtatcaat cgtgccgcat gcaaatgcaa taagctaaca ctcaaatcat ggtacagtct35040tttaatttgg accgagtcta gggcacccga ggcatttcga tgcaaacatc tttctcatca35100aagacttatt taggcgagtt aggcattgga gctcaccttc cctggcaggt cgcctttacg35160tggtaagtta tataagtcaa gaggaaaacc cgagcgacgc tggtctctat aagattgaca35220
gatccctgga ggtgataaag gttgtatcgt acaacttgtt ctacgagaat caaatcttgt35280acgctccaag ccagcagctt gaaattggca gatgagttgt atctgcgtca ggagttatca35340gagagcttac tggactatca aatggtagac atgttgacac tgcgcacctg aaaagctctg35400ccaagcacct ccgctcccca gaaagcctgg tttacatgaa gtgtgatgta gtctgcagtt35460caagatctaa tctcatcaga gagcgcttag tacccattgg tgatctgtca cattttgagg35520ctacgcacag tttggatgac gctcttcgcg ctgtatgcaa cacatccgac gaacgagatg35580aacctacttc caaagactcg tgtgctggtt ggcgcgcggc ctagacctgg tcggggcact35640ggcgcatgct atgagattgc tggacgcgaa aaatgtggcg aagctgtgta cgcagtgaac35700tggggtgcca aatcaatgat tctaagagtg tttgccccaa agtatggctt aaaatgtttc35760aaactaccca agggttcccc gacatgaggc cacatgtggg aagtgtattt gccccccatt35820tgagaagttg ggacagagcg cttcgtcagg gatgatcatg aagcatgttc tatgaacttg35880caccacttgt ttagaacgga agtgtggctg gaatgaaacc tatatgtcag catatctgcg35940ggtaatcccc aactacataa tatttgctgg tatgcttgct ttaagcagca atcaagtttc36000tagcaacagg gtaataacca ggtcaccggt caatcgcaca atggcctttt tagttcggaa36060aatttgacaa cctgtggatg tttggggagt ccatggataa atgtggagct gtttggtgta36120acagaacatt gcaaagggtg acgccttaga tccttttctc atgacaggct tcgatcacaa36180agttgtacac tttcaaggtt gtaggtgcgt attgaacttg gcatttctgg aacaaacaga36240cactatatct cgaatctggg tctgcctgcc cctctagctc aggccctgat agtttgacta36300gagcatcgcc gtctcgtgta ttctctccga atctttctgc acattgagtt agacttctcg36360tcgtgtttgg agcatgtgta aatacatcag cgatattttt ttactcctaa aaatggcaaa36420ttcgcattta cctactgcaa ataatgaatc aaaatgagga aacaatgtgc tatatgaacc36480gtgctctttg gaacacaaat aaaaaataaa taaagtcaaa gatcgtgcca aatccgccca36540acttgagaga aaggcttggc tggtgacctg ccctgttgtg gcatcatcct atcttggctg36600ccgccctcca aagagaaatg tgagcctcgg aagagcgggc taggctggta accaatgaga36660gctatgtaaa tagcaaagga agagagaata aatctttggg aataaacctg tcagcaaggc36720tccaaagctt gctttctggg caaggcttac atgttgcttg atatgatttc acagaagcat36780ttggacacgc caaactctgc tactttgact gtgcctaggt ggtaaaccaa gcaactgcta36840tctttgacgc caccatgcag gtttccatca aaatagagat agaggagaag ttaccatatt36900tgaatccacc aattcttcaa gtgtgtggag acgctcgagt aatgagcata cttgaggaag36960atgctcatgg accttccgtg tgtttttctc ccgaggtatt acacgatatt ttcgtatttg37020caatgttgca gagtcttgat atcgtgtgac agtggaaaca aatgctacag ttgattcctt37080
gatccccttc atcgcaaaga gcttgttatt ctctataata agagctagtt accggcaccg37140tagtcgcttt tgctcagcaa gtggcccttt tccagcatga gataagacct cctaattttg37200gctcgttttc tgattacaaa tgaaggtcct tgccaactac accatggtca cagctttctc37260tgccgagctc agggatgcaa ctgtcggctt agacaccaag tcagcgtcgg ttgcaagtgc37320tgcttctgag agctgactgc tgtagtgtgt gggtttgctc cacctatgag tgggtatgag37380taggtctgct ccacctatga ggaccaccaa gtttgctctc catgtgctac agcgcctgcg37440tctcttgtgc ggtgagacat attttttgag cttggtcttt acgaaatgaa ggcctgcgac37500agacaacgat cgcaacaatt ctgcctcgaa ggcgcttatc cctacgtaga cgtaggtctc37560tgttcccact aaagccactc ctgcgtcaat agaacaaaag caaaagctct tatggctgct37620gtacaaatag agtaaaactt cacctttcta ctcgtaacac tacagttata agtagcaagt37680caatcagagc aagacctttg cgagtaaacc tgcattgctc tatcgcagtc ttccagcatc37740ttcgcgaggc ggtctcgcac aacttcagtc agtctgtaat aacaggagct ttagcaccag37800ccaaagcagt tgcgttgcaa ccagcagaag acttggcatc atgctcattc ccgctgtgga37860cgtggccgtg ggcggtgctg tggcgtcctc tgagaagttt gatctcctca aacgcctgag37920ttggtgcggc ccccttcgca tcatcccttc acttgactct gtctccgcac caagtgtggg37980tgcccctgag gagaaggact tctggaaatc tgctgttcgc aagtggggca aagctttgtg38040ttcgtaccct tgccaagttg gtcccatcgc cgctacaagc gttgaggaag tgacgcaatg38100gctcaacgaa ggcgctgtcc aagtcattgt tgagggttct ttcgacgacc tcgaggacat38160tgcttcgcag cttcctcgtg aacgtcttgt tgccagattt tccgagaagg tccttgaaga38220cgacggtctc ctgagcaaac tttctggcag cgttgggggc gtttcaatta tttctgaggc38280caaaaattct gaagaagtcg tcaaggtcgc agagagggca tggcagcttt tgggaaaacg38340ccttgctatc gcattagagg tccccgagat cgaggccgga ggcgaggcgc agaagattaa38400caaccagctt gttggtaagc tccatggact ccactccaca gactttcctg tgaacgttgt38460gtctgagaac gtttccatgc caacagaagg gtctcttgcg acagatactg actcagaagc38520tgccttttgc gtggcaaggt cttttgtagc gtgccttcgc accgaccgta cagatggtct38580ctttgcgacg gtcgtcaccg atgagaatgg cgtggcactt ggcctcgtgt actccagcga38640acagtctgtg gttgcctcgt tggcgtgtgg ccgcggcgtg tactggtcaa gatccaggca38700gagtctgtgg cgcaagggcg acacaagcgg tgcctttcag gagcttgtgt ctatcgcatt38760tgactgtgat gccgacgcga tgaggttcaa ggtgcgccag cgtggaaacc ctcctgcatt38820ttgccatcaa cagacccgca catgctgggg ttatgacggt ggcatccccc acctctttcg38880cactcttgag tcccgcaagc ttaacgcccc agaaggatca tacacaaaac gtctttttga38940
ggacaaggca ttgctgcgta acaagctcat tgaggaggca caagaggtaa ttgaggctat39000tgaggagaat gacccagagc atgttgcccg cgaggtcgca gacctcgcat acttcctctt39060tgccgcgtgc acgtgcggaa atgcgtcgct cgaggacgtt acacggcagc ttgacatgcg39120ttccctcaag gtcaagcgga ggccaggcaa tgcaaaggca gatcgcatcg ctgctggtga39180ggcagttctc caggctcagc agcagaaaaa gtctgcagag gagcccccag cagctcccaa39240ggaccaggcc taaattgcat gcttattatt acacccaaat cctgcttatt gtgacttgtc39300tgcacccttt tcacattgaa gaagcgtgtt ttcttacccg tcacaccacc actaagtctc39360atcctttctt tcttaccttt ttactagtcc gaacgatata aactttatct ttgcaaggct39420cttgttatac tgcaattgtt atttagtttg ttttctattg ataggcaaac cagacgtaat39480cgtctgagag tgtttgaaga ggataaaaca aagaatcatt aacaggtttt gtgtttctgt39540acacttgaat agttttatgc ctatctactt ctagagcctg ggcggagttg gcatttgtat39600aatctcaaca ttcgataaca aattgcttca aatgaagaac aaaaacagga aatgatttga39660attaaaatct aatatttgta gaaaagaaaa agcgagctga catcattcca tcaaattgac39720caattgactc cttagcacag tagatatttc ctaaacgact tcaactcatt cctcattatc39780ctcgctgttc ctgcttccgt gagtaccctt gctgattcgt acttccaaat cgccgccatc39840ctcccggtca tcatcatctt cgtcatcttc gtcttcatca tcagcccctg acgaggagta39900aatgtcaagg taaggtttgg gattctcgag ctttcgcaat tctccaatac ttattggttg39960gccacagacc ggatcc39976<210>3<211>8994<212>DNA<213>Ulkenia sp.
<400>3atggctcaac gtgagaaccg tctcgaggcc aacatggata cccgcatcgc tgtgatcggc60atgtccgcca tcctcccctg cggtaccacc gttcgtgagt cttgggaggc tatccgcgat120ggtatcgact gcctcagtga tctccccgag gaccgcgtcg atgtgaccgc ctacttcgac180ccggtcaaga ccaccaagga taagatctac tgcaaacgtg gtggattcat ccctgagtac240gacttcgacg cccgtgagtt cggcctcaac atgtttcaga tggaggactc cgacgcaaac300caaaccgtca ccctcctcaa ggtcaaggag gccctcgagg acgctggcat cgaagccctc360agcaaggaaa agaagaacat tggatgtgtt ctcggtatcg gtggtggcca gaagtccagc420cacgagttct actcccgctt aaactatgtt gtcgttgaga aggtccttcg caagatgggc480atgcctgagg aggatgttca agctgctgtt gagaagtaca aggccaactt ccctgagtgg540
cgccttgact ccttccccgg tttcctcggc aacgttactg ccggtcgctg taccaacacc600ttcaacctcg atggtatgaa ctgtgtcgtc gatgctgcct gtgctagttc tctcatcgcc660gttaaggttg ccattgatga gcttctccac ggagactgtg acatgatgat cactggtgct720acctgcacgg ataactccat cggtatgtac atggccttct ccaagacccc ggtgttctct780accgacccta gcgtccgcgc atacgatgag aagaccaagg gtatgcttat tggcgaaggc840tctgccatgc ttgtgcttaa acgttacgcc gacgctgttc gtgatggtga cgagattcac900gctgtcattc gcggctgcgc ctcttcctct gacggtaagg cctccggtat ttacaccccg960accatctctg gtcaagagga ggctcttcgc cgtgcctaca tgcgcgctaa cgtcgatccc1020gccaccgtca ctcttgttga gggccacggt accggtaccc ccgttggtga ccgtattgag1080ctcaccgctc tccgtaacct cttcgacagt gcctacggca acgagaagga gaaggtcgct1140gttggcagca ttaagtccaa catcggtcac ctcaaggctg tcgccggtct tgccggtatg1200atcaaggtca tcatggccct caagcataag actcttccgg ccaccatcaa cgttgatgag1260ccccctaagc tttacgacaa cactcccatc accgactcat cgctgtacat taacacgatg1320aaccgtccgt ggttccctgc tccgggtgtg ccccgtcgcg ctggtatctc cagtttcggt1380tttggtggtg ccaactacca cgccgttctt gaggaagccg agcccgagca ccagaaggct1440taccgtctca acaaacgccc ccagccggtg cttctgatgg catcttcaac ccaggctctt1500gcttccctct gtgaagccca gcttaaggaa ttcgagaagg ctatcgagga gaacaagacc1560gtcaagaaca ctgcttacat caagtgcgtc gacttctgtg agaagttcaa gttccctgga1620tctatcccga gctctaacgc tcgcctcggt tttcttgtca aggaggccga tgatgccacc1680gagaccctcc gtgccatcgt tgcccagttc caaaagtcag ctggcaagga ttcttggcac1740cttccccgcc agggtgtgag ctttcgtgct cagggcatca acaccactgg tggtgtcgct1800gccctcttct ctggccaggg tgctcagtac acccacatgt tcagcgaggt cgccatgaac1860tggcctcagt tccgtgagag catctctgac atggatcgtg cccaggctaa ggttgctggc1920gctgacaagg actacgagcg tgtctcccaa gtcctctacc cgcgtaagcc ttataactct1980gagcccgagc aggaccacaa gaagatctcc ctgacctcat actctcagcc ctctaccctc2040gcctgcgctc ttggtgccta cgagatcttc aagcaggctg gtttcaagcc cgacttcgct2100gccggtcact ctctcggtga gtttgcggcc ctctacgctg ctgactgcgt caaccgtgac2160gacctctttg agctcgtgtg ccgtcgtgcc cgcatcatgg gtggcaagga tgcacctgct2220acccccaagg gatgcatggc tgctgtcatt ggacccaatg ccgagaagat ccagattcgc2280actgctgatg tctggctcgg caactgcaac tccccttcgc agactgtcat caccggctct2340gttgagggta tcaagaagga gtccgagctt ctccagagtg agggcttccg tgttgtcccc2400
ctcgcctgcg agagtgcctt ccactcaccg cagatgcaaa acgcctcctc tgccttcaag2460gatgttctct ccaaggttgc cttccgtcag cctagcgccc agaccaagct cttcagcaac2520gtgtctggcg agacctactc caacaatgcc caggacctcc ttaaggagca catgaccagc2580agtgttaagt tcatctctca ggttcgcaac atgcactctg ctggtgctcg catctttgtc2640gagtttggcc ccaagcaggt gctctctaag cttgtttccg agaccctcaa ggacgatcct2700tccattatca ctatctctgt caacccttcc tctggcaagg atgccgatat tcagcttcgc2760gaggctgctg tgcagctcgt tgttgctgga gtcaaccttc agggcttcga caagtgggac2820gcacctgacg ccacccgcct tcagccgatt aagaagaaga agactactct tcgtctctcg2880gctgccactt acgtgtctga caagaccaag aaggctcgcg aggctgccat gaacgacggc2940cgcatgctca gctgtgtcag caaggtcatc gccccccctg acgccaagcc cattgtggac3000accaaggctc aggaggaggt tgctcgtctc cagaagcagc ttcaggatgc ccaggcccag3060atccagaagg ccaaggccga tgctgctgag gctgacaaga agcttgccgc tgctaaggat3120gaggccaagc gtgccgccgc ttctgcacct gtgcagaagc aggttgacac caccattgtt3180gataagcacc gtgctatcct caagtctatg cttgctgagc ttgactgcta ctccactcct3240ggtgctgtgt ccagctcttt ccaggcacct gttgctgcta cccctgctcc ggtcgctgcg3300cctgttgcag ctgctcctgc tccggctgtc aacaatgctc tccttgccaa ggctgagtct3360gttgtcatgg aggttcttgc cgccaagact ggttacgaga ctgacatgat cgagcccgac3420atggagctcg agactgagct cggcattgac tctatcaagc gtgtcgagat tctctctgag3480gtccaggccc agctcaacgt cgaggccaag gatgttgatg ctcttagccg cacccgcacc3540gtcggtgagg ttgtcaacgc catgaaggct gagatcgctg gcagctctgg tgctgccgct3600gctgccccgg ccccggttgc tgctgctccc gctgcccctg cccctgctgt caacagcgct3660cttcttgcca aggctgagac tgttgtcatg gaggttcttg ccgccaagac tggttacgag3720actgacatga ttgagcccga catggagctc gagactgagc tcggcattga ctccatcaag3780cgtgtcgaga ttctctctga ggttcaggcc cagctcaacg ttgaggccaa ggatgttgat3840gctcttagcc gcacccgcac cgttggtgag gttgtcaacg ccatgaaggc tgagatcgct3900ggcagctctg gtgctgccgc tgctgccccg gcccctgttg ctgctgctcc ggcgcccgtc3960gctgccgctg cccctgctgt cagcagcgct ctccttgaga aggctgagtc tgttgtcatg4020gaggttcttg ccgccaagac tggttacgag actgacatga ttgaggccga catggagctc4080gagactgagc tcggcattga ctccatcaag cgtgtcgaga ttctctctga ggtccaggcc4140cagctcaacg tcgaggccaa ggatgtcgat gctcttagcc gcacccgcac cgttggtgag4200gttgtcaacg ccatgaaggc tgagatcgct ggcagctctg gtgctgctgc cccggccccg4260
gtcgctgcgg cccctgctcc ggtcgctgcc gctgcccctg ctgtcaacag cgctcttctt4320gagaaggctg agactgttgt catggaggtt cttgccgcca agactggtta cgagactgac4380atgatcgagc ccgacatgga gctcgagact gagctcggca ttgactctat caagcgtgtc4440gagattctct ctgaggtcca ggcccagctc aacgttgagg ccaaggatgt tgatgctctt4500agccgcaccc gcaccgttgg tgaggttgtc aacgccatga aggctgagat cgctggcagc4560tctggtgctg ccgctgctgc cccggccccg gttgctgctg ctcccgctcc cgtcgctgcc4620cctgctgtca gcagcgctct ccttgagaag gctgagtctg tcgtcatgga ggttcttgcc4680gccaagactg gttacgagac tgacatgatt gaggccgaca tggagctcga gactgagctc4740ggcattgact ccatcaagcg tgtcgagatt ctctctgagg tccaggccca gctcaacgtt4800gaggccaagg atgtcgatgc tcttagccgc acccgcaccg ttggtgaggt tgtcaacgcc4860atgaaggctg agatcgctgg cagctctggt gctgccgctg ctgccccggc ccctgttgct4920gcctctcccg ctcccgtcgc tgccgctgcc cctgctgtca gcagcgctct ccttgagaag4980gccgaatctg ttgtcatgga ggttctcgcc gccaagactg gttacgagac tgacatgatt5040gaggctgaca tggagctcga gactgagctc ggcattgact ctatcaagcg tgtcgagatt5100ctctctgagg tccaggctat gcttaacgtt gaggccaagg atgttgatgc tcttagccgc5160acccgcaccg ttggtgaggt tgtcaacgcc atgaaggctg agatcgctgg cagctctggt5220gccgccgctg ctgccccggc cccggttgct gctgctccgg cgcccgtcac tgccgctgcc5280cctgctgtca gcagcgctct ccttgagaag gccgaatctg ttgtcatgga ggttctcgcc5340gccaagactg gttacgagac tgacatgatt gaggccgaca tggagctcga gactgagctt5400ggcattgact ccatcaagcg tgtcgagatt ctctctgagg tccaggctat gcttaacgtc5460gaggccaagg atgttgatgc tcttagccgc acccgcaccg ttggtgaggt tgtcaacgcc5520atgaaggctg agattgctag cagctctggt gctgctgccc ctgctccggc tgctgccgtt5580gcaccggccc ctgctgctgc ccctgctgtc agcagcgctc tccttgagaa ggccgaatct5640gttgtcatgg aggttctcgc cgccaagact ggttacgaga ctgacatgat tgaggccgac5700atggagctcg agactgagct cggcattgac tctatcaagc gtgtcgagat tctctctgag5760gtccaggcta tgcttaacgt tgaggccaag gatgttgatg ctcttagccg cacccgcacc5820gttggtgagg ttgtcaacgc catgaaggct gagattgcta gcagctctgg tgctgctgcc5880cctgctcctg ctgctgccgc tgcaccggcc cctgctgctg cccctgctgt cagcagcgct5940cttcttgaga aggctgagtc tgttgtcatg gaggttctcg ccgccaagac tggttacgag6000actgacatga ttgaggccga catggagctc gagactgagc ttggcattga ctccatcaag6060cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat6120
gctcttagcc gcacccgcac cgttggtgag gttgtcaacg ccatgaaggc tgagattgct6180agcagctctg gtgctgctgc ccctgctcct gctgctgccg ctgcaccggc ccctgctgct6240gcccctgctg tcagcagcgc tcttcttgag aaggctgagt ctgttgtcat ggaggttctc6300gccgccaaga ctggttacga gactgacatg attgaggccg acatggagct cgagactgag6360cttggcattg actccatcaa gcgtgtcgag attctctctg aggtccaggc tatgcttaac6420gttgaggcca aggatgttga tgctcttagc cgcacccgca ccgttggtga ggttgtcaac6480gccatgaagg ctgagatcgc tggcagctct ggtgctgcta ctgcctctgc ccctgctgct6540gcagctgccg cccctgctat caagatctcc actgttcacg gtgctgactg cgatgacctc6600tctgtgatgt ctgctgagct tgtcgacatt cgtcgcgctg atgagctcct tcttgagcgc6660cctgagaacc gcccggtcct tattgtcgat gatggtaccg agctcacctc tgctctggtt6720cgtgttcttg gtgctggtgc tgtagttctt acctttgacg gtcttcagtt ggctcagcgt6780gctggtgctg ctgttcgcca tgtccaggtg aaggacctct ccgctgagag tgccgagaag6840gctatcaagg aggctgagca acgcttcggc cagcttggag gcttcatctc tcagcaggct6900gagcgctttg cccctgctga cattcttggt ttcaccctca tgtgcgctaa gtttgccaag6960gcttccctct gcacccctgt gcagggtggc cgtgccttct tcattggtgt ggcccgtctt7020gacggtcgcc ttggtttcac ctcccaggga tctactgact ccctcacacg tgcccagcgt7080ggtgctatct tcggcctctg caagaccatt ggccttgagt ggtctgctaa cgaagtgttc7140gcccgcggta ttgatattgc tcgtgaggtc caccctgaag atgctgccgt cgccatcact7200cgcgaaatgt cctgcgctga caaccgtatc cgcgaggtcg gcattggcct caaccagaag7260cgctgcacca tccgtgctgt ggacctcaag ccgggtgccc ccaagatcca gatcagccag7320gatgacgttc tccttgtgtc tggtggtgct cgtggtatta ctcctctctg catccgtgag7380atcacccgtc aggtccgcgg tggtaagtac attctcctcg gtcgctccaa ggtccctgct7440ggtgagcctg cttggtgcaa cggtgtttct gatgacgatc ttggcaaggc tgctatgcag7500gagctgaagc gtgctttctc cgccggtgag ggccccaagc ccaccccgat gacccacaag7560aagctcgttg gcactattgc tggtgcccgt gaggttcgtt cctcaattgc taacattgag7620gctctcggtg gcaaggcaat ctactcctct tgtgatgtga actctgctgc tgatgtcgcc7680aaggctgttc gcgaggctga ggctcagctt ggcgcccgtg taactggtgt cgtccacgct7740tctggtgtcc ttcgtgaccg cctcattgag cagaagcgcc ccgatgagtt tgatgctgtc7800ttcggcacca aggtgactgg tctcgagaac ctctttggtg ccattgacat ggccaacctt7860aagcacctcg tcctcttcag ctctcttgct ggtttccacg gcaacattgg tcagtctgac7920tacgccatgg ctaacgaggc cctcaacaag atgggtcttg agctctctga ccgtgtgtcc7980
gtgaagtcta tttgcttcgg cccctgggat ggtggcatgg ttacccccca gctcaagaag8040cagttccagt ctatgggtgt tcagatcatc ccccgtgagg gtggtgccga tactgtggct8100cgcattgtcc tcggctcctc ccctgctgag atccttgttg gcaactggac cactcccacc8160aagaaggttg gcagtgagcc cgttgtgatc caccgcaaga tcagcgctgc atccaaccct8220tttcttaagg accacgtcat ccagggtcgc tgtgtgctcc ccatgaccat tgctgtgggc8280tgccttgctg agacctgcct gggtcagttc cctggatact ccctctgggc tattgaggat8340gctcaactct tcaagggtgt caccgttgac ggtgatgtca actgtgagat cactctcaag8400ccttcccagg gtactgccgg ccgcgttatg attcaggcca ccctgaagac cttcgctagc8460ggcaagcttg ttccggctta ccgtgccgtg atcgttctct ccactcaggg aaagccccct8520gctgctacta cttcccagac cccctctctc caggctgatc ctgctgcccg tggcaaccct8580tacgacggca agaccctctt ccacggccct gccttccagg gtcttaagga gatcatctct8640tgcaacaagt ctcagcttgt cgccgagtgc accttcattc cgtcttccga gagcgctggt8700gagttcgctt ctgactacga gtcccacaac cctttcgtca acgacattgc tttccaggcc8760atgctcgtct ggattcgccg caccctcggc caggctgccc tccccaactc tatccagcgc8820attgtgcagc accgtgctct tccccaggac aagcccttct acttgaccct caagagcaac8880agcgcgagtg gccactctca gcacaagacc tccgttcagt ttcacaacga gcagggtgac8940ctcttcgtgg acatccaggc ttccgtcacc tcttctgact cccttgcctt ctaa 8994<210>4<211>6093<212>DNA<213>Ulkenia sp.
<400>4atggcctctc gcaagaatgt gagcgctgct cacgaaatgc acgacgagaa gcgcattgcc60gtggtgggca tggccgtgca atacgcgggc tgcaaagaca aggaagagtt ctggaaagta120gtcatgggcg gtgaggctgc atggactaag attagcgata aacgcctcgg atccaacaag180cgagccgagc acttcaaagc agagcgtagc aaatttgcag ataccttttg caacgagaac240tacggctgcg tcgatgactc cgtcgataac gaacacgagc ttctccttaa gctctccaag300aaggctctct ccgagacatc ggtctccgac tctacaaggt gcggtattgt gagcggatgc360ctgtcctttc ccatggacaa cctccagggc gaactcctca atgtgtacca aaaccacgtc420gaaaagaaac tcggcgctcg cgtcttcaag gatgcctcca agtggtccga gcgtgagcag480tcgcagaacc ccgaggctgg tgaccgccgc atctttatgg acccggcatc cttcgtagca540gaagagctca acctcggtcc tcttcactac tctgtcgatg ctgcctgtgc caccgccctt600
tacgtccttc gcctcgccca ggaccacctc gtttccggtg ctgctgatgt catgctcgct660ggtgcaactt gcttcccgga gccctttttc attctctccg gattctccac tttccaggcc720atgcctgtat cgggagacgg catctcgtac ccgcttcaca aggacagtca gggtctcacc780cctggtgaag gtggtgccat tatggttctc aagcgccttg acgacgctat tcgcgatgga840gaccacattt acggtactct gctcggtgct accatcagca atgctggctg tggtcttccc900ctcaagccgc acttgcccag cgagaagtcc tgcctcattg atacctacaa gcgcgtcaac960gtgcacccgc acaagatcca gtacgtcgag tgccacgcaa cgggtactcc ccagggagac1020cgcgttgaga ttgatgccgt caaggcttgc ttcgagggca aggtgcctcg ctttggaagc1080tccaagggta actttggcca cacactcgtt gcagctggtt tcgcaggcat gtgcaaggta1140ctccttgcca tgaagcatgg tgtgatcccg cccactcctg gtgtcgatgg atcttcccaa1200atggacccgc ttgtggtctc tgagcccatc ccatggcccg acactgaggg cgagcccaag1260cgcgctggtc tctccgcttt cggctttggt ggcaccaacg cccacgcagt ctttgaggag1320tttgaccgct ccaaggctgc ctgtgccacc cacgatagca tcagttccct cagctcacgt1380tgtggcgggg agggcaacat gcgcattgct attaccggta tggatgccac cttcggctcc1440ctcaagggcc tggacgcctt tgagcgtgcc atctacaatg gccaacatgg tgctgtgcca1500ttgcctgaga agcgctggcg tttccttggt aaagacaagg actttttgga cctgtgcggt1560gtcaaggagg tgccccacgg atgctacatt gaggacgtcg aggtggactt tagccgcctg1620cgcacgccca tgacgccaga cgacatgttg cgccccatgc agctacttgc tgtcacaacc1680atcgaccgtg ccattctcaa ctctggcctc aagaagggag gtaaggtcgc tgtcttcgtc1740ggccttggca ctgaccttga gctctaccgt caccgcgccc gcgttgccct caaggagcgt1800gctcgtcccg aagccgcttc agccctcaat gatatgatgt cctacatcaa cgattgcggt1860accgctacct cgtacacatc ctacatcggc aacctcgtgg ccacccgcgt gtcttcacaa1920tggggtttcg agggtccttc tttcaccatc acagagggca acaactccgt ctaccgttgc1980gcagagttgg gcaagtactt gctcgagact ggcgaggtcg aggccgtagt gatcgccggt2040gtggatcttt gcgccagcgc tgagaatctc tacgtgaagt cgcgtcgttt caaggtctcg2100gagcaggaga gcccgcgggc cagcttcgac tccggcgctg acggctactt tgttggtgag2160ggatgtggtg ccctcgtcct caagcgcgag agcgactgca ccaaggacga acgcatttac2220gcctgcatgg acgctatcgt gcccggcaac atgccggcag cctgcatgga ggaggctctc2280gcccaggctc gcgtcaaccc caaggacgtt gagatgctcg agctctccgc tgactctgcc2340cgccacctca agaacccctc cgttctgcct aaggaactca ctgctgagga ggaaatccgc2400ggcattgagg ccattctcag ccagcgctct agcaacgaag ctgtggagcc ccacaacgtc2460
gctgtcagca gcgtcaagtc cactgtcggt gacaccggct acgcctcagg agctgccagt2520ctcatcaaga cggctctctg tctgtacaac cgctacttgc cctcaaacgg cgcctcctgg2580gaggagcctg cacctgagac acagtggggc aagtctctgt acgcgtgcca gtcctcgcgg2640gcctggttga agaaccctgg agctcgccgc cacgcagctg tctcaggtgt ttccgagacc2700cgttcatgct acacggtgct gctctctgat gtggagggcc accacgagac caagagccgc2760atttcgctcg atgacgatgc cgtcaaactc ctcgtaatcc gcggagactc ccatgacgct2820atcacgcagc gtgttgacaa gctccgcgag cgcctcgccc agcctagcgc taatgtacgt2880cttgctttta tggagttgct cggcgagagc attgcccagg agaccaagac cccgttgccg2940gccttcgctc tgtgcctggt gacctctcct agtaagctcc agaaggagct tgaactcgcc3000tccaagggca tcccgcggag tcttaagatg ggccgcgact ggacatcacc ctcgggcagc3060cactttgcac ccaagccact gtcaagcgat cgcgttgcgt ttatgtacgg cgaaggccga3120agcccttact atggtatcgg ccttgacatt caccgcatct ggcccgaact tcacgagttt3180gtaaacgcca agaccaacaa gctttgggat caaggcgaca gatggttgat cccgcgcgcc3240tcgacgaagg aggagcttaa ggcgcaggaa gatgagttca accgcaacca ggtggagatg3300ttccgactcg gtattctcat gtccatgtgc ttcacccaca tcgctcgcga cgtgcttggc3360atccagccca aggctgcttt cggactgagc cttggagaga tttccatggt ttttgccttt3420tctgagaaga acggccttgt ctctgaggag ctgacaacta aactccgcaa ctcggaggtc3480tggcgtaagg ccctcgctgt tgagtttgac gccctccgca aggcctggaa tattccccaa3540gatacccctg tcagcgagtt ctggcaagga tacgtggtac gtggaacccg cgaggccgtt3600gaagcggcca tcggccccaa caataagtac gtgcacttga ccattgtcaa cgatgccaac3660agtgctctca tcagtggcaa gcctgaagat tgcaaggctg ccattgctcg cctgagcagc3720aacctccctg ctttgcccgt ggaccttggt atgtgtggcc actgccccgt ggtcgagccg3780tacggcaagc agatcgctga gatccatagc gtcctcgaga ttcccgaggt tgccggcctt3840gacctgtaca cgagcgtcaa ccagaagaag cttgttaaca agtccactgg agccagcgac3900gagtacgcac ccagctttgg tgaatacgca gcacagctgt acactgttca ggcagacttt3960cctaagatcg ccaagaccgt tagcgacaag aactttgacg tctttgttga gactggtccc4020aacgcccacc gtagcgccgc aattcgcgcc acccttggaa atagcaagcc ttttgtcacc4080ggatccatgg accgccagaa cgagaatgct tggacaacca tggtcaagct ggttgcctct4140ctccaagccc accgcgtgcc tggcgtgaag gtctcccctc tgtaccaccc cgagactgtt4200gaggaggcta cgcagagtta caacgatatg gtggctggca agaagcctac taagaacaag4260ttcttgcgta agattgtggt caatggtcgc tatgacccca aaaagcagct cgtgccgccc4320
caggtgctag ctaagcttcc tcctgcggac cccaagatcg aggctcttat ccaggctcgc4380aagatgcagc ctattgcccc caagttcatg gagcgtctcg acattcagga gcaagacgcc4440acacgcgacc ctattctcaa caaggataac aaaccttccg ctgctcctgc ccttgcccct4500gctgctccgg cccgcagcgt ctccggagct gttgtggctt cctctgaggc tctccgtgcc4560aaacttttgg agctcaacag cactttgatg cttggtgtca acgccaacgg tgatctcgtt4620gaagcaagcc caagtgaagc atctattgtt gtgcccaagt gcgatatcaa ggatcttggc4680agccgtgcct tcatggagac atatggtgta tccgccccca tgtacaccgg cgccatggca4740aagggcattg catccgctga gatggttatc gctgccggaa agcgcggcat ccttggttct4800ctcggtgctg gtggtcttcc tatcgccacc gtacgcaagg ctctcgaagc tatccaggct4860gaactgccca agggccctta cgctgtcaac ctcatccact ctcccttcga cagcaacctc4920gagaagggta acgtcgacct cttcctcgag aagggcgtca ctgtcgttga agcctccgcc4980tttatgacct tgaccccgca gctcgtgcgc taccgtgctg caggtctctc tcgcgctgct5040gatggctcca cggttattaa gaaccgcgtc atcggtaagg tttctcgcac agagcttgcc5100gcaatgttta tccgtcccgc gcccgagaat ctcctcgaga agctgctgaa gtccggcgag5160atcacccaag agcaggctgc tctcgcacgc acagtgcctg tggcagacga cattgccgtt5220gaggcggact ccggtggcca caccgataac cgccccatcc acgtcatcct ccctctcatt5280gtcaacctcc gtgatcgtct gcacaaggag tgcggctacc ctgcccacct tcgcgttcgc5340gttggtgctg gtggtggcat tggatgccct caggccgcca ttgccacctt caacatgggc5400gcggccttca tcgtcactgg taccgtaaac cagatgagta agcaagctgg aacctgtgac5460accgttcgca agcagctctc acaagccacc tactccgaca tctgcatggc cccagcagct5520gacatgtttg aggaaggtgt caagctccag gtgctcaaga agggaactat gttcccctcg5580cgtgccaaca agctctatga gctcttcgtc aagtatgact cctttgagtc catggctcct5640ggagagctgg aacgtgtgga gaagcgcatt ttcaagaagt ctctgtcaga agtttgggaa5700gagaccaagg acttctacat caacaggttg cagaacccgg agaagattga gcgcgcggag5760cgtgacccca agcttaagat gtccttgtgc ttccgctggt accttggttt ggcgagcttc5820tgggcaaacg ctggcatccc ggaccgtgcc atggactacc aggtttggtg tggcccagcg5880attggatctt tcaacgactt catcaagggt acctaccttg accccgccgt tgccaacgag5940taccccgatg ttgtgcaaat caacttgcag atcctccgtg gtgcctgctt cttgcgccgc6000ctcgaagctg tccgtaatgc cccgctgaag gctaacgcca agcaggttgc tgccgagatt6060gatgacatct acgtgcccac tgagcgcctg taa 6093
<210>5<211>4398<212>DNA<213>Ulkenia sp.
<400>5atggccactc gcgtgaagac caacaagaaa ccatgctggg agatgaccaa ggaggagctc60accagcggca agaacgtcgt tttcgactat gacgagctcc ttgagttcgc cgagggtgac120atcagcaagg tcttcggccc cgaattcagc cagatcgacc agtacaagcg tcgcgttcgt180ctccccgccc gcgagtacct cctcgtcacc cgcgtcaccc tcatggacgc cgaggtcaac240aactaccgcg tcggtgcccg catggtcact gagtacgacc tccccgtcaa cggtgagctc300tctgagggtg gtgactgccc ctgggccgtg ctcgtcgaga gtggtcagtg tgatctcatg360ctcatctcct acatgggtat tgacttccag aacaagagcg accgcgtcta ccgtctgctc420aacaccaccc tcaccttcta cggtgttgcc caggagggcg agaccctgga gtacgacatc480cgcgtgaccg gcttcgccaa gcgtctcgac ggtgacatct ccatgttctt cttcgagtac540gactgctacg tcaacggccg tctcctcatc gagatgcgcg acggctgtgc cggtttcttc600accaacgagg agctcgccgc cggcaagggt gtcgtcttta cccgcgctga tctcctcgcc660cgcgagaaga ccaagaagca ggacatcacc ccgtacgcca ttgccccgcg tcttaacaag720accgttctca acgagactga gatgcagtcc ctcgtggaca agaactggac caaggttttc780ggccccgaga acggcatgga ccagatcaac tacaaactct gcgcccgtaa gatgctcatg840attgaccgcg tcaccaagat tgactacacc ggtggcccct acggccttgg tcttctcgtt900ggtgagaaga tcctcgagcg cgaccactgg tactttccgt gccacttcgt cggagaccag960gtcatggctg gatccctcgt gtctgacggc tgcagccagc tcctcaagat gtacatgctc1020tggctcggcc tccaccttaa gaccggtccc ttcgacttcc gccccgtcaa cggccacccc1080aacaaggtcc gctgccgtgg ccagatctcc ccgcacaagg gtaagctcgt atacgtcatg1140gagatcaagg agatgggcta cgacgaggct ggtgacccgt acgccatcgc cgatgtcaac1200attctcgaca ttgacttcga gaagggccag actttcgacc ttgccaacct ccacgagtac1260ggcaagggcg acctcaacaa gaagatcgtc gtcgacttca agggtattgc cctcaagctc1320cagaagcgct ctggccctgc cgttgtcgct cccgagaagc ccctcgctct caacaaggac1380ctttgcgccc cggctgttga ggccatccct gagcacatcc tcaagggcga tgctcttgcc1440cctaaccaga tgacctggca cccgatgtcc aagatcgctg gcaaccccac gccctcgttc1500tctccctcgg cctaccctcc ccgtcccatc accttcaccc cgttccccgg caacaagaac1560gacaacaacc acgtgcccgg cgagatgccg ctctcgtggt acaacatggc tgagttcatg1620gccggcaagg tcagcctctg cctcggccct gagttcgcca agttcgatga ctccaacacc1680
agccgcagcc ctgcatggga ccttgctctt gtgactcgtg tggtctccgt ttctgacatg1740gagtgggtcc agtggaagaa cgtggactgc aacccgtcca agggaaccat ggttggcgag1800ttcgactgcc ccatcgacgc ctggttcttc cagggatctt gtaacgacgg ccacatgccg1860tactccatcc tcatggagat cgccctccag acctctggtg tcctcacctc tgtgctcaag1920gccccgctca ccatggagaa gaaggacatt ctcttccgca accttgacgc caacgccgag1980atggttcgct ctgatattga cctccgcggc aagaccatcc acaacctcac caagtgtacc2040ggctacagca tgctcggaga catgggtgtc caccgcttca gcttcgagct ctctgttgat2100ggtgtagtct tctacaaggg taccacctcc ttcggctggt tcgtccctga ggtcttcatc2160tcccagactg gtctcgacaa cggtcgccgc acccagccct ggcacattga gtccaaggtg2220ccttccgccc aggtcctcac ctacgacgtt acccccaacg gtgccggtcg cacccagctc2280tacgccaacg cccccaaggg cgctcagctc actcgccgct ggaaccagtg ccagtacctt2340gacaccatcg accttgtggt cgccggtggc tccgccggtc ttggctacgg tcatggccgc2400aagcaggtga accccaagga ctggttcttc tcgtgccact tctggttcga ctccgtcatg2460cccggctcgc tcggtgtgga gtctatgttc cagctcgtcg agtccatcgc tgtcaagcag2520gacctcgccg gcaagtacgg catcaccaac ccgaccttcg ctcatgctcc gggcaagatc2580tcctggaagt accgtggtca gctcaccccc acctccaagt tcatggactc cgaggcccac2640attgtctcca tcgaggccca cgacggcgtc gtcgacatcg ttgccaatgg taacctctgg2700gctgatggcc tccgcgtcta caacgtcagc aacatccgtg tgcgcattgt tgctggcgcc2760gcccctgctg ctgctgctgc tgctgctgct gttgctgctc cggctgccgc ccctgctccg2820gttgctgcat ctggccctgc ccagaccatc accctcaagc agctcaaggc tgagcttctt2880gacgttgaga agcctctcta catctcctcc agcaacggcc aggtcaagaa gcacgccgat2940gtggctggtg gccaggccac cattgtgcag gcttgcagcc tcagtgacct cggtgatgaa3000ggcttcatga agacctacgg tgttgtggct cctctctaca ccggtgccat ggccaagggt3060attgcctctg ctgaccttgt gattgccact ggtaagcgca agatcctcgg ttccttcggt3120gctggcggtc tccccatgca cattgtccgt gccgctgttg agaagatcca ggctgagctc3180ccgaacggcc ccttcgccgt caacctcatc cactccccct tcgatagcaa ccttgagaag3240ggcaacgttg acctcttcct cgagaagggc gttactgtcg tcgaggcctc cgccttcatg3300accttgaccc cgcaagtcgt ccgctaccgt gctgctggtc tttcccgtaa cgctgatggc3360tccattaaca tcaagaaccg catcatcggt aaggtctccc gtaccgagct cgctgagatg3420ttcatccgcc ctgccccgca gaacctcctc gacaagctca tccagtctgg tgagattacc3480aaggagcagg ctgagcttgc caagctcgtc cccgtcgccg acgacatcgc cgtcgaggcc3540
gactctggtg gccacaccga caaccgcccc atccacgtca tcctccccct tatcatcaac3600ctccgcaacc gcctccacaa ggagtgcggc taccccgctc acctccgcgt gcgcgttgga3660gctggtggtg gtgttggatg cccccaggcc gctgccgctg ctctcgctat gggtgctgcc3720ttccttgtta ccggcactgt caaccaggtc gccaagcagt ccggcacctg cgacaatgtc3780cgcaagcagc tctgcatggc cacctactct gacgtctgca tggctcccgc tgctgacatg3840ttcgaggagg gcgtcaagct ccaggtcctc aagaagggaa ccatgttccc gtccagggct3900aacaagctct acgagctctt ctgcaagtac gactccttcg agtccatgcc tgccacagag3960ctcgagcgtg ttgagaagcg catcttccag tgccctcttg ctgatgtctg ggctgagacc4020tccgacttct acatcaaccg cctccacaac ccggagaaga tcacccgtgc cgagcgtgac4080cccaagctca agatgtctct ctgcttccgc tggtaccttg gtcttgcctc tcgctgggcc4140aacaccggtg aggctggacg cgtcatggac taccaggtct ggtgtggccc tgccattgga4200gccttcaacg acttcatcaa gggctcctac cttgacccgg ccgtctctgg tgagtacccg4260gacgtcgtgc agatcaactt gcagatcctt cgcggtgcct gctacctccg ccgtctcaat4320gtcatccgca acgacccgcg tgtcagcatt gaggtcgagg atgctgagtt cgtctacgag4380cccaccaacg ccctctaa 4398<210>6<211>2997<212>PRT<213>Ulkenia sp.
<400>6Met Ala Gln Arg Glu Asn Arg Leu Glu Ala Asn Met Asp Thr Arg Ile1 5 1015Ala Val Ile Gly Met Ser Ala Ile Leu Pro Cys Gly Thr Thr Val Arg20 25 30Glu Ser Trp Glu Ala Ile Arg Asp Gly Ile Asp Cys Leu Ser Asp Leu35 40 45Pro Glu Asp Arg Val Asp Val Thr Ala Tyr Phe Asp Pro Val Lys Thr5055 60Thr Lys Asp Lys Ile Tyr Cys Lys Arg Gly Gly Phe Ile Pro Glu Tyr65 70 75 80Asp Phe Asp Ala Arg Glu Phe Gly Leu Asn Met Phe Gln Met Glu Asp85 90 95Ser Asp Ala Asn Gln Thr Val Thr Leu Leu Lys Val Lys Glu Ala Leu100 105 110Glu Asp Ala Gly Ile Glu Ala Leu Ser Lys Glu Lys Lys Asn Ile Gly115 120 125
Cys Val Leu Gly Ile Gly Gly Gly Gln Lys Ser Ser His Glu Phe Tyr130 135 140Ser Arg Leu Asn Tyr Val Val Val Glu Lys Val Leu Arg Lys Met Gly145 150 155 160Met Pro Glu Glu Asp Val Gln Ala Ala Val Glu Lys Tyr Lys Ala Asn165 170 175Phe Pro Glu Trp Arg Leu Asp Ser Phe Pro Gly Phe Leu Gly Asn Val180 185 190Thr Ala Gly Arg Cys Thr Asn Thr Phe Asn Leu Asp Gly Met Asn Cys195 200 205Val Val Asp Ala Ala Cys Ala Ser Ser Leu Ile Ala Val Lys Val Ala210 215 220Ile Asp Glu Leu Leu His Gly Asp Cys Asp Met Met Ile Thr Gly Ala225 230 235 240Thr Cys Thr Asp Asn Ser Ile Gly Met Tyr Met Ala Phe Ser Lys Thr245 250 255Pro Val Phe Ser Thr Asp Pro Ser Val Arg Ala Tyr Asp Glu Lys Thr260 265 270Lys Gly Met Leu Ile Gly Glu Gly Ser Ala Met Leu Val Leu Lys Arg275 280 285Tyr Ala Asp Ala Val Arg Asp Gly Asp Glu Ile His Ala Val Ile Arg290 295 300Gly Cys Ala Ser Ser Ser Asp Gly Lys Ala Ser Gly Ile Tyr Thr Pro305 310 315 320Thr Ile Ser Gly Gln Glu Glu Ala Leu Arg Arg Ala Tyr Met Arg Ala325 330 335Asn Val Asp Pro Ala Thr Val Thr Leu Val Glu Gly His Gly Thr Gly340 345 350Thr Pro Val Gly Asp Arg Ile Glu Leu Thr Ala Leu Arg Asn Leu Phe355 360 365Asp Ser Ala Tyr Gly Asn Glu Lys Glu Lys Val Ala Val Gly Ser Ile370 375 380Lys Ser Asn Ile Gly His Leu Lys Ala Val Ala Gly Leu Ala Gly Met385 390 395 400Ile Lys Val Ile Met Ala Leu Lys His Lys Thr Leu Pro Ala Thr Ile405 410 415Asn Val Asp Glu Pro Pro Lys Leu Tyr Asp Asn Thr Pro Ile Thr Asp420 425 430Ser Ser Leu Tyr Ile Asrn Thr Met Asn Arg Pro Trp Phe Pro Ala Pro435440 445Gly Val Pro Arg Arg Ala Gly Ile Ser Ser Phe Gly Phe Gly Gly Ala
450 455 460Asn Tyr His Ala Val Leu Glu Glu Ala Glu Pro Glu His Gln Lys Ala465 470 475 480Tyr Arg Leu Asn Lys Arg Pro Gln Pro Val Leu Leu Met Ala Ser Ser485 490 495Thr Gln Ala Leu Ala Ser Leu Cys Glu Ala Gln Leu Lys Glu Phe Glu500 505 510Lys Ala Ile Glu Glu Asn Lys Thr Val Lys Asn Thr Ala Tyr Ile Lys515 520 525Cys Val Asp Phe Cys Glu Lys Phe Lys Phe Pro Gly Ser Ile Pro Ser530 535 540Ser Asn Ala Arg Leu Gly Phe Leu Val Lys Glu Ala Asp Asp Ala Thr545 550 555 560Glu Thr Leu Arg Ala Ile Val Ala Gln Phe Gln Lys Ser Ala Gly Lys565 570 575Asp Ser Trp His Leu Pro Arg Gln Gly Val Ser Phe Arg Ala Gln Gly580 585 590Ile Asn Thr Thr Gly Gly Val Ala Ala Leu Phe Ser Gly Gln Gly Ala595 600 605Gln Tyr Thr His Met Phe Ser Glu Val Ala Met Asn Trp Pro Gln Phe610 615 620Arg Glu Ser Ile Ser Asp Met Asp Arg Ala Gln Ala Lys Val Ala Gly625 630 635 640Ala Asp Lys Asp Tyr Glu Arg Val Ser Gln Val Leu Tyr Pro Arg Lys645 650 655Pro Tyr Asn Ser Glu Pro Glu Gln Asp His Lys Lys Ile Ser Leu Thr660 665 670Ser Tyr Ser Gln Pro Ser Thr Leu Ala Cys Ala Leu Gly Ala Tyr Glu675 680 685Ile Phe Lys Gln Ala Gly Phe Lys Pro Asp Phe Ala Ala Gly His Ser690 695 700Leu Gly Glu Phe Ala Ala Leu Tyr Ala Ala Asp Cys Val Asn Arg Asp705 710 715 720Asp Leu Phe Glu Leu Val Cys Arg Arg Ala Arg Ile Met Gly Gly Lys725 730 735Asp Ala Pro Ala Thr Pro Lys Gly Cys Met Ala Ala Val Ile Gly Pro740 745 750Asn Ala Glu Lys Ile Gln Ile Arg Thr Ala Asp Val Trp Leu Gly Asn755 760 765Cys Asn Ser Pro Ser Gln Thr Val Ile Thr Gly Ser Val Glu Gly Ile770 775 780
Lys Lys Glu Ser Glu Leu Leu Gln Ser Glu Gly Phe Arg Val Val Pro785 790 795 800Leu Ala Cys Glu Ser Ala Phe His Ser Pro Gln Met Gln Asn Ala Ser805 810 815Ser Ala Phe Lys Asp Val Leu Ser Lys Val Ala Phe Arg Gln Pro Ser820 825 830Ala Gln Thr Lys Leu Phe Ser Asn Val Ser Gly Glu Thr Tyr Ser Asn835 840 845Asn Ala Gln Asp Leu Leu Lys Glu His Met Thr Ser Ser Val Lys Phe850 855 860Ile Ser Gln Val Arg Asn Met His Ser Ala Gly Ala Arg Ile Phe Val865 870 875 880Glu Phe Gly Pro Lys Gln Val Leu Ser Lys Leu Val Ser Glu Thr Leu885 890 895Lys Asp Asp Pro Ser Ile Ile Thr Ile Ser Val Asn Pro Ser Ser Gly900 905 910Lys Asp Ala Asp Ile Gln Leu Arg Glu Ala Ala Val Gln Leu Val Val915 920 925Ala Gly Val Asn Leu Gln Gly Phe Asp Lys Trp Asp Ala Pro Asp Ala930 935 940Thr Arg Leu Gln Pro Ile Lys Lys Lys Lys Thr Thr Leu Arg Leu Ser945 950 955 960Ala Ala Thr Tyr Val Ser Asp Lys Thr Lys Lys Ala Arg Glu Ala Ala965 970 975Met Asn Asp Gly Arg Met Leu Ser Cys Val Ser Lys Val Ile Ala Pro980 985 990Pro Asp Ala Lys Pro Ile Val Asp Thr Lys Ala Gln Glu Glu Val Ala995 1000 1005Arg Leu Gln Lys Gln Leu Gln Asp Ala Gln Ala Gln Ile Gln Lys1010 1015 1020Ala Lys Ala Asp Ala Ala Glu Ala Asp Lys Lys Leu Ala Ala Ala1025 1030 1035Lys Asp Glu Ala Lys Arg Ala Ala Ala Ser Ala Pro Val Gln Lys1040 1045 1050Gln Val Asp Thr Thr Ile Val Asp Lys His Arg Ala Ile Leu Lys1055 1060 1065Ser Met Leu Ala Glu Leu Asp Cys Tyr Ser Thr Pro Gly Ala Val1070 1075 1080Ser Ser Ser Phe Gln Ala Pro Val Ala Ala Thr Pro Ala Pro Val1085 1090 1095Ala Ala Pro Val Ala Ala Ala Pro Ala Pro Ala Val Asn Asn Ala
1100 1105 1110Leu Leu Ala Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala1115 1120 1125Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met Glu Leu1130 1135 1140Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu1145 1150 1155Ser Glu Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp1160 1165 1170Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met1175 1180 1185Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro1190 1195 1200Ala Pro Val Ala Ala Ala Pro Ala Ala Pro Ala Pro Ala Val Asn1205 1210 1215Ser Ala Leu Leu Ala Lys Ala Glu Thr Val Val Met Glu Val Leu1220 1225 1230Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met1235 1240 1245Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu1250 1255 1260Ile Leu Ser Glu Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp1265 1270 1275Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn1280 1285 1290Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Ala Ala1295 1300 1305Ala Pro Ala Pro Val Ala Ala Ala Pro Ala Pro Val Ala Ala Ala1310 1315 1320Ala Pro Ala Val Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser Val1325 1330 1335Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met1340 1345 1350Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser1355 1360 1365Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Gln Leu Asn1370 1375 1380Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val1385 1390 1395Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala Gly Ser Ser1400 1405 1410
Gly Ala Ala Ala Pro Ala Pro Val Ala Ala Ala Pro Ala Pro Val1415 1420 1425Ala Ala Ala Ala Pro Ala Val Asn Ser Ala Leu Leu Glu Lys Ala1430 1435 1440Glu Thr Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu1445 1450 1455Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu Thr Glu Leu Gly1460 1465 1470Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala1475 1480 1485Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr1490 1495 1500Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala1505 1510 1515Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro Ala Pro Val Ala Ala1520 1525 1530Ala Pro Ala Pro Val Ala Ala Pro Ala Val Ser Ser Ala Leu Leu1535 1540 1545Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys Thr1550 1555 1560Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr1565 1570 1575Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu1580 1585 1590Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu1595 1600 1605Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala1610 1615 1620Glu Ile Ala Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro Ala Pro1625 1630 1635Val Ala Ala Ser Pro Ala Pro Val Ala Ala Ala Ala Pro Ala Val1640 1645 1650Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val1655 1660 1665Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp1670 1675 1680Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val1685 1690 1695Glu Ile Leu Ser Glu Val Gln Ala Met Leu Asn Val Glu Ala Lys1700 1705 1710Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val
1715 1720 1725Asn Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Ala1730 1735 1740Ala Ala Pro Ala Pro Val Ala Ala Ala Pro Ala Pro Val Thr Ala1745 1750 1755Ala Ala Pro Ala Val Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser1760 1765 1770Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp1775 1780 1785Met Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp1790 1795 1800Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Met Leu1805 1810 1815Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr1820 1825 1830Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala Ser Ser1835 1840 1845Ser Gly Ala Ala Ala Pro Ala Pro Ala Ala Ala Val Ala Pro Ala1850 1855 1860Pro Ala Ala Ala Pro Ala Val Ser Ser Ala Leu Leu Glu Lys Ala1865 1870 1875Glu Ser Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu1880 1885 1890Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu Gly1895 1900 1905Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala1910 1915 1920Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr1925 1930 1935Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala1940 1945 1950Ser Ser Ser Gly Ala Ala Ala Pro Ala Pro Ala Ala Ala Ala Ala1955 1960 1965Pro Ala Pro Ala Ala Ala Pro Ala Val Ser Ser Ala Leu Leu Glu1970 1975 1980Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys Thr Gly1985 1990 1995Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr Glu2000 2005 2010Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val2015 2020 2025
Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser2030 2035 2040Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu2045 2050 2055Ile Ala Ser Ser Ser Gly Ala Ala Ala Pro Ala Pro Ala Ala Ala2060 2065 2070Ala Ala Pro Ala Pro Ala Ala Ala Pro Ala Val Ser Ser Ala Leu2075 2080 2085Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys2090 2095 2100Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu2105 2110 2115Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser2120 2125 2130Glu Val Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala2135 2140 2145Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys2150 2155 2160Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Thr Ala Ser Ala Pro2165 2170 2175Ala Ala Ala Ala Ala Ala Pro Ala Ile Lys Ile Ser Thr Val His2180 2185 2190Gly Ala Asp Cys Asp Asp Leu Ser Val Met Ser Ala Glu Leu Val2195 2200 2205Asp Ile Arg Arg Ala Asp Glu Leu Leu Leu Glu Arg Pro Glu Asn2210 2215 2220Arg Pro Val Leu Ile Val Asp Asp Gly Thr Glu Leu Thr Ser Ala2225 2230 2235Leu Val Arg Val Leu Gly Ala Gly Ala Val Val Leu Thr Phe Asp2240 2245 2250Gly Leu Gln Leu Ala Gln Arg Ala Gly Ala Ala Val Arg His Val2255 2260 2265Gln Val Lys Asp Leu Ser Ala Glu Ser Ala Glu Lys Ala Ile Lys2270 2275 2280Glu Ala Glu Gln Arg Phe Gly Gln Leu Gly Gly Phe Ile Ser Gln2285 2290 2295Gln Ala Glu Arg Phe Ala Pro Ala Asp Ile Leu Gly Phe Thr Leu2300 2305 2310Met Cys Ala Lys Phe Ala Lys Ala Ser Leu Cys Thr Pro Val Gln2315 2320 2325Gly Gly Arg Ala Phe Phe Ile Gly Val Ala Arg Leu Asp Gly Arg
2330 2335 2340Leu Gly Phe Thr Ser Gln Gly Ser Thr Asp Ser Leu Thr Arg Ala2345 2350 2355Gln Arg Gly Ala Ile Phe Gly Leu Cys Lys Thr Ile Gly Leu Glu2360 2365 2370Trp Ser Ala Asn Glu Val Phe Ala Arg Gly Ile Asp Ile Ala Arg2375 2380 2385Glu Val His Pro Glu Asp Ala Ala Val Ala Ile Thr Arg Glu Met2390 2395 2400Ser Cys Ala Asp Asn Arg Ile Arg Glu Val Gly Ile Gly Leu Asn2405 2410 2415Gln Lys Arg Cys Thr Ile Arg Ala Val Asp Leu Lys Pro Gly Ala2420 2425 2430Pro Lys Ile Gln Ile Ser Gln Asp Asp Val Leu Leu Val Ser Gly2435 2440 2445Gly Ala Arg Gly Ile Thr Pro Leu Cys Ile Arg Glu Ile Thr Arg2450 2455 2460Gln Val Arg Gly Gly Lys Tyr Ile Leu Leu Gly Arg Ser Lys Val2465 2470 2475Pro Ala Gly Glu Pro Ala Trp Cys Asn Gly Val Ser Asp Asp Asp2480 2485 2490Leu Gly Lys Ala Ala Met Gln Glu Leu Lys Arg Ala Phe Ser Ala2495 2500 2505Gly Glu Gly Pro Lys Pro Thr Pro Met Thr His Lys Lys Leu Val2510 2515 2520Gly Thr Ile Ala Gly Ala Arg Glu Val Arg Ser Ser Ile Ala Asn2525 2530 2535Ile Glu Ala Leu Gly Gly Lys Ala Ile Tyr Ser Ser Cys Asp Val2540 2545 2550Asn Ser Ala Ala Asp Val Ala Lys Ala Val Arg Glu Ala Glu Ala2555 2560 2565Gln Leu Gly Ala Arg Val Thr Gly Val Val His Ala Ser Gly Val2570 2575 2580Leu Arg Asp Arg Leu Ile Glu Gln Lys Arg Pro Asp Glu Phe Asp2585 2590 2595Ala Val Phe Gly Thr Lys Val Thr Gly Leu Glu Asn Leu Phe Gly2600 2605 2610Ala Ile Asp Met Ala Asn Leu Lys His Leu Val Leu Phe Ser Ser2615 2620 2625Leu Ala Gly Phe His Gly Asn Ile Gly Gln Ser Asp Tyr Ala Met2630 2635 2640
Ala Asn Glu Ala Leu Asn Lys Met Gly Leu Glu Leu Ser Asp Arg2645 2650 2655Val Ser Val Lys Ser Ile Cys Phe Gly Pro Trp Asp Gly Gly Met2660 2665 2670Val Thr Pro Gln Leu Lys Lys Gln Phe Gln Ser Met Gly Val Gln2675 2680 2685Ile Ile Pro Arg Glu Gly Gly Ala Asp Thr Val Ala Arg Ile Val2690 2695 2700Leu Gly Ser Ser Pro Ala Glu Ile Leu Val Gly Asn Trp Thr Thr2705 2710 2715Pro Thr Lys Lys Val Gly Ser Glu Pro Val Val Ile His Arg Lys2720 2725 2730Ile Ser Ala Ala Ser Asn Pro Phe Leu Lys Asp His Val Ile Gln2735 2740 2745Gly Arg Cys Val Leu Pro Met Thr Ile Ala Val Gly Cys Leu Ala2750 2755 2760Glu Thr Cys Leu Gly Gln Phe Pro Gly Tyr Ser Leu Trp Ala Ile2765 2770 2775Glu Asp Ala Gln Leu Phe Lys Gly Val Thr Val Asp Gly Asp Val2780 2785 2790Asn Cys Glu Ile Thr Leu Lys Pro Ser Gln Gly Thr Ala Gly Arg2795 2800 2805Val Met Ile Gln Ala Thr Leu Lys Thr Phe Ala Ser Gly Lys Leu2810 2815 2820Val Pro Ala Tyr Arg Ala Val Ile Val Leu Ser Thr Gln Gly Lys2825 2830 2835Pro Pro Ala Ala Thr Thr Ser Gln Thr Pro Ser Leu Gln Ala Asp2840 2845 2850Pro Ala Ala Arg Gly Asn Pro Tyr Asp Gly Lys Thr Leu Phe His2855 2860 2865Gly Pro Ala Phe Gln Gly Leu Lys Glu Ile Ile Ser Cys Asn Lys2870 2875 2880Ser Gln Leu Val Ala Glu Cys Thr Phe Ile Pro Ser Ser Glu Ser2885 2890 2895Ala Gly Glu Phe Ala Ser Asp Tyr Glu Ser His Asn Pro Phe Val2900 2905 2910Asn Asp Ile Ala Phe Gln Ala Met Leu Val Trp Ile Arg Arg Thr2915 2920 2925Leu Gly Gln Ala Ala Leu Pro Asn Ser Ile Gln Arg Ile Val Gln2930 2935 2940His Arg Ala Leu Pro Gln Asp Lys Pro Phe Tyr Leu Thr Leu Lys
2945 2950 2955Ser Asn Ser Ala Ser Gly His Ser Gln His Lys Thr Ser Val Gln2960 2965 2970Phe His Asn Glu Gln Gly Asp Leu Phe Val Asp Ile Gln Ala Ser2975 2980 2985Val Thr Ser Ser Asp Ser Leu Ala Phe2990 2995<210>7<211>2030<212>PRT<213>Ulkenia sp.
<400>7Met Ala Ser Arg Lys Asn Val Ser Ala Ala His Glu Met His Asp Glu1 510 15Lys Arg Ile Ala Val Val Gly Met Ala Val Gln Tyr Ala Gly Cys Lys20 25 30Asp Lys Glu Glu Phe Trp Lys Val Val Met Gly Gly Glu Ala Ala Trp35 40 45Thr Lys Ile Ser Asp Lys Arg Leu Gly Ser Asn Lys Arg Ala Glu His50 55 60Phe Lys Ala Glu Arg Ser Lys Phe Ala Asp Thr Phe Cys Asn Glu Asn65 70 75 80Tyr Gly Cys Val Asp Asp Ser Val Asp Asn Glu His Glu Leu Leu Leu85 90 95Lys Leu Ser Lys Lys Ala Leu Ser Glu Thr Ser Val Ser Asp Ser Thr100 105 110Arg Cys Gly Ile Val Ser Gly Cys Leu Ser Phe Pro Met Asp Asn Leu115 120 125Gln Gly Glu Leu Leu Asn Val Tyr Gln Asn His Val Glu Lys Lys Leu130 135 140Gly Ala Arg Val Phe Lys Asp Ala Ser Lys Trp Ser Glu Arg Glu Gln145 150 155 160Ser Gln Asn Pro Glu Ala Gly Asp Arg Arg Ile Phe Met Asp Pro Ala165 170 175Ser Phe Val Ala Glu Glu Leu Asn Leu Gly Pro Leu His Tyr Ser Val180 185 190Asp Ala Ala Cys Ala Thr Ala Leu Tyr Val Leu Arg Leu Ala Gln Asp195 200 205His Leu Val Ser Gly Ala Ala Asp Val Met Leu Ala Gly Ala Thr Cys210 215 220Phe Pro Glu Pro Phe Phe Ile Leu Ser Gly Phe Ser Thr Phe Gln Ala
225 230 235 240Met Pro Val Ser Gly Asp Gly Ile Ser Tyr Pro Leu His Lys Asp Ser245 250 255Gln Gly Leu Thr Pro Gly Glu Gly Gly Ala Ile Met Val Leu Lys Arg260 265 270Leu Asp Asp Ala Ile Arg Asp Gly Asp His Ile Tyr Gly Thr Leu Leu275 280 285Gly Ala Thr Ile Ser Asn Ala Gly Cys Gly Leu Pro Leu Lys Pro His290 295 300Leu Pro Ser Glu Lys Ser Cys Leu Ile Asp Thr Tyr Lys Arg Val Asn305 310 315 320Val His Pro His Lys Ile Gln Tyr Val Glu Cys His Ala Thr Gly Thr325 330 335Pro Gln Gly Asp Arg Val Glu Ile Asp Ala Val Lys Ala Cys Phe Glu340 345 350Gly Lys Val Pro Arg Phe Gly Ser Ser Lys Gly Asn Phe Gly His Thr355 360 365Leu Val Ala Ala Gly Phe Ala Gly Met Cys Lys Val Leu Leu Ala Met370 375 380Lys His Gly Val Ile Pro Pro Thr Pro Gly Val Asp Gly Ser Ser Gln385 390 395 400Met Asp Pro Leu Val Val Ser Glu Pro Ile Pro Trp Pro Asp Thr Glu405 410 415Gly Glu Pro Lys Arg Ala Gly Leu Ser Ala Phe Gly Phe Gly Gly Thr420 425 430Asn Ala His Ala Val Phe Glu Glu Phe Asp Arg Ser Lys Ala Ala Cys435 440 445Ala Thr His Asp Ser Ile Ser Ser Leu Ser Ser Arg Cys Gly Gly Glu450 455 460Gly Asn Met Arg Ile Ala Ile Thr Gly Met Asp Ala Thr Phe Gly Ser465 470 475 480Leu Lys Gly Leu Asp Ala Phe Glu Arg Ala Ile Tyr Asn Gly Gln His485 490 495Gly Ala Val Pro Leu Pro Glu Lys Arg Trp Arg Phe Leu Gly Lys Asp500 505 510Lys Asp Phe Leu Asp Leu Cys Gly Val Lys Glu Val Pro His Gly Cys515 520 525Tyr Ile Glu Asp Val Glu Val Asp Phe Ser Arg Leu Arg Thr Pro Met530 535 540Thr Pro Asp Asp Met Leu Arg Pro Met Gln Leu Leu Ala Val Thr Thr545 550 555 560
Ile Asp Arg Ala Ile Leu Asn Ser Gly Leu Lys Lys Gly Gly Lys Val565 570 575Ala Val Phe Val Gly Leu Gly Thr Asp Leu Glu Leu Tyr Arg His Arg580 585 590Ala Arg Val Ala Leu Lys Glu Arg Ala Arg Pro Glu Ala Ala Ser Ala595 600 605Leu Asn Asp Met Met Ser Tyr Ile Asn Asp Cys Gly Thr Ala Thr Ser610 615 620Tyr Thr Ser Tyr Ile Gly Asn Leu Val Ala Thr Arg Val Ser Ser Gln625 630 635 640Trp Gly Phe Glu Gly Pro Ser Phe Thr Ile Thr Glu Gly Asn Asn Ser645 650 655Val Tyr Arg Cys Ala Glu Leu Gly Lys Tyr Leu Leu Glu Thr Gly Glu660 665 670Val Glu Ala Val Val Ile Ala Gly Val Asp Leu Cys Ala Ser Ala Glu675 680 685Asn Leu Tyr Val Lys Ser Arg Arg Phe Lys Val Ser Glu Gln Glu Ser690 695 700Pro Arg Ala Ser Phe Asp Ser Gly Ala Asp Gly Tyr Phe Val Gly Glu705 710 715 720Gly Cys Gly Ala Leu Val Leu Lys Arg Glu Ser Asp Cys Thr Lys Asp725 730 735Glu Arg Ile Tyr Ala Cys Met Asp Ala Ile Val Pro Gly Asn Met Pro740 745 750Ala Ala Cys Met Glu Glu Ala Leu Ala Gln Ala Arg Val Asn Pro Lys755 760 765Asp Val Glu Met Leu Glu Leu Ser Ala Asp Ser Ala Arg His Leu Lys770 775 780Asn Pro Ser Val Leu Pro Lys Glu Leu Thr Ala Glu Glu Glu Ile Arg785 790 795 800Gly Ile Glu Ala Ile Leu Ser Gln Arg Ser Ser Asn Glu Ala Val Glu805 810 815Pro His Asn Val Ala Val Ser Ser Val Lys Ser Thr Val Gly Asp Thr820 825 830Gly Tyr Ala Ser Gly Ala Ala Ser Leu Ile Lys Thr Ala Leu Cys Leu835 840 845Tyr Asn Arg Tyr Leu Pro Ser Asn Gly Ala Ser Trp Glu Glu Pro Ala850 855 860Pro Glu Thr Gln Trp Gly Lys Ser Leu Tyr Ala Cys Gln Ser Ser Arg865 870 875 880Ala Trp Leu Lys Asn Pro Gly Ala Arg Arg His Ala Ala Val Ser Gly
885 890 895Val Ser Glu Thr Arg Ser Cys Tyr Thr Val Leu Leu Ser Asp Val Glu900 905 910Gly His His Glu Thr Lys Ser Arg Ile Ser Leu Asp Asp Asp Ala Val915 920 925Lys Leu Leu Val Ile Arg Gly Asp Ser His Asp Ala Ile Thr Gln Arg930 935 940Val Asp Lys Leu Arg Glu Arg Leu Ala Gln Pro Ser Ala Asn Val Arg945 950 955 960Leu Ala Phe Met Glu Leu Leu Gly Glu Ser Ile Ala Gln Glu Thr Lys965 970 975Thr Pro Leu Pro Ala Phe Ala Leu Cys Leu Val Thr Ser Pro Ser Lys980 985 990Leu Gln Lys Glu Leu Glu Leu Ala Ser Lys Gly Ile Pro Arg Ser Leu995 1000 1005Lys Met Gly Arg Asp Trp Thr Ser Pro Ser Gly Ser His Phe Ala1010 1015 1020Pro Lys Pro Leu Ser Ser Asp Arg Val Ala Phe Met Tyr Gly Glu1025 1030 1035Gly Arg Ser Pro Tyr Tyr Gly Ile Gly Leu Asp Ile His Arg Ile1040 1045 1050Trp Pro Glu Leu His Glu Phe Val Asn Ala Lys Thr Asn Lys Leu1055 1060 1065Trp Asp Gln Gly Asp Arg Trp Leu Ile Pro Arg Ala Ser Thr Lys1070 1075 1080Glu Glu Leu Lys Ala Gln Glu Asp Glu Phe Asn Arg Asn Gln Val1085 1090 1095Glu Met Phe Arg Leu Gly Ile Leu Met Ser Met Cys Phe Thr His1100 1105 1110Ile Ala Arg Asp Val Leu Gly Ile Gln Pro Lys Ala Ala Phe Gly1115 1120 1125Leu Ser Leu Gly Glu Ile Ser Met Val Phe Ala Phe Ser Glu Lys1130 1135 1140Asn Gly Leu Val Ser Glu Glu Leu Thr Thr Lys Leu Arg Asn Ser1145 1150 1155Glu Val Trp Arg Lys Ala Leu Ala Val Glu Phe Asp Ala Leu Arg1160 1165 1170Lys Ala Trp Asn Ile Pro Gln Asp Thr Pro Val Ser Glu Phe Trp1175 1180 1185Gln Gly Tyr Val Val Arg Gly Thr Arg Glu Ala Val Glu Ala Ala1190 1195 1200
Ile Gly Pro Asn Asn Lys Tyr Val His Leu Thr Ile Val Asn Asp1205 1210 1215Ala Asn Ser Ala Leu Ile Ser Gly Lys Pro Glu Asp Cys Lys Ala1220 1225 1230Ala Ile Ala Arg Leu Ser Ser Asn Leu Pro Ala Leu Pro Val Asp1235 1240 1245Leu Gly Met Cys Gly His Cys Pro Val Val Glu Pro Tyr Gly Lys1250 1255 1260Gln Ile Ala Glu Ile His Ser Val Leu Glu Ile Pro Glu Val Ala1265 1270 1275Gly Leu Asp Leu Tyr Thr Ser Val Asn Gln Lys Lys Leu Val Asn1280 1285 1290Lys Ser Thr Gly Ala Ser Asp Glu Tyr Ala Pro Ser Phe Gly Glu1295 1300 1305Tyr Ala Ala Gln Leu Tyr Thr Val Gln Ala Asp Phe Pro Lys Ile1310 1315 1320Ala Lys Thr Val Ser Asp Lys Asn Phe Asp Val Phe Val Glu Thr1325 1330 1335Gly Pro Asn Ala His Arg Ser Ala Ala Ile Arg Ala Thr Leu Gly1340 1345 1350Asn Ser Lys Pro Phe Val Thr Gly Ser Met Asp Arg Gln Asn Glu1355 1360 1365Asn Ala Trp Thr Thr Met Val Lys Leu Val Ala Ser Leu Gln Ala1370 1375 1380His Arg Val Pro Gly Val Lys Val Ser Pro Leu Tyr His Pro Glu1385 1390 1395Thr Val Glu Glu Ala Thr Gln Ser Tyr Asn Asp Met Val Ala Gly1400 1405 1410Lys Lys Pro Thr Lys Asn Lys Phe Leu Arg Lys Ile Val Val Asn1415 1420 1425Gly Arg Tyr Asp Pro Lys Lys Gln Leu Val Pro Pro Gln Val Leu1430 1435 1440Ala Lys Leu Pro Pro Ala Asp Pro Lys Ile Glu Ala Leu Ile Gln1445 1450 1455Ala Arg Lys Met Gln Pro Ile Ala Pro Lys Phe Met Glu Arg Leu1460 1465 1470Asp Ile Gln Glu Gln Asp Ala Thr Arg Asp Pro Ile Leu Asn Lys1475 1480 1485Asp Asn Lys Pro Ser Ala Ala Pro Ala Leu Ala Pro Ala Ala Pro1490 1495 1500Ala Arg Ser Val Ser Gly Ala Val Val Ala Ser Ser Glu Ala Leu
1505 1510 1515Arq Ala Lys Leu Leu Glu Leu Asn Ser Thr Leu Met Leu Gly Val1520 1525 1530Asn Ala Asn Gly Asp Leu Val Glu Ala Ser Pro Ser Glu Ala Ser1535 1540 1545Ile Val Val Pro Lys Cys Asp Ile Lys Asp Leu Gly Ser Arg Ala1550 1555 1560Phe Met Glu Thr Tyr Gly Val Ser Ala Pro Met Tyr Thr Gly Ala1565 1570 1575Met Ala Lys Gly Ile Ala Ser Ala Glu Met Val Ile Ala Ala Gly1580 1585 1590Lys Arg Gly Ile Leu Gly Ser Leu Gly Ala Gly Gly Leu Pro Ile1595 1600 1605Ala Thr Val Arg Lys Ala Leu Glu Ala Ile Gln Ala Glu Leu Pro1610 1615 1620Lys Gly Pro Tyr Ala Val Asn Leu Ile His Ser Pro Phe Asp Ser1625 1630 1635Asn Leu Glu Lys Gly Asn Val Asp Leu Phe Leu Glu Lys Gly Val1640 1645 1650Thr Val Val Glu Ala Ser Ala Phe Met Thr Leu Thr Pro Gln Leu1655 1660 1665Val Arg Tyr Arg Ala Ala Gly Leu Ser Arg Ala Ala Asp Gly Ser1670 1675 1680Thr Val Ile Lys Asn Arg Val Ile Gly Lys Val Ser Arg Thr Glu1685 1690 1695Leu Ala Ala Met Phe Ile Arg Pro Ala Pro Glu Asn Leu Leu Glu1700 1705 1710Lys Leu Leu Lys Ser Gly Glu Ile Thr Gln Glu Gln Ala Ala Leu1715 1720 1725Ala Arg Thr Val Pro Val Ala Asp Asp Ile Ala Val Glu Ala Asp1730 1735 1740Ser Gly Gly His Thr Asp Asn Arg Pro Ile His Val Ile Leu Pro1745 1750 1755Leu Ile Val Asn Leu Arg Asp Arg Leu His Lys Glu Cys Gly Tyr1760 1765 1770Pro Ala His Leu Arg Val Arg Val Gly Ala Gly Gly Gly Ile Gly1775 1780 1785Cys Pro Gln Ala Ala Ile Ala Thr Phe Asn Met Gly Ala Ala Phe1790 1795 1800Ile Val Thr Gly Thr Val Asn Gln Met Ser Lys Gln Ala Gly Thr1805 1810 1815
Cys Asp Thr Val Arg Lys Gln Leu Ser Gln Ala Thr Tyr Ser Asp1820 1825 1830Ile Cys Met Ala Pro Ala Ala Asp Met Phe Glu Glu Gly Val Lys1835 1840 1845Leu Gln Val Leu Lys Lys Gly Thr Met Phe Pro Ser Arg Ala Asn1850 1855 1860Lys Leu Tyr Glu Leu Phe Val Lys Tyr Asp Ser Phe Glu Ser Met1865 1870 1875Ala Pro Gly Glu Leu Glu Arg Val Glu Lys Arg Ile Phe Lys Lys1880 1885 1890Ser Leu Ser Glu Val Trp Glu Glu Thr Lys Asp Phe Tyr Ile Asn1895 1900 1905Arg Leu Gln Asn Pro Glu Lys Ile Glu Arg Ala Glu Arg Asp Pro1910 1915 1920Lys Leu Lys Met Ser Leu Cys Phe Arg Trp Tyr Leu Gly Leu Ala1925 1930 1935Ser Phe Trp Ala Asn Ala Gly Ile Pro Asp Arg Ala Met Asp Tyr1940 1945 1950Gln Val Trp Cys Gly Pro Ala Ile Gly Ser Phe Asn Asp Phe Ile1955 1960 1965Lys Gly Thr Tyr Leu Asp Pro Ala Val Ala Asn Glu Tyr Pro Asp1970 1975 1980Val Val Gln Ile Asn Leu Gln Ile Leu Arg Gly Ala Cys Phe Leu1985 1990 1995Arg Arg Leu Glu Ala Val Arg Asn Ala Pro Leu Lys Ala Asn Ala2000 2005 2010Lys Gln Val Ala Ala Glu Ile Asp Asp Ile Tyr Val Pro Thr Glu2015 2020 2025Arg Leu2030<210>8<211>1465<212>PRT<213>Ulkenia sp.
<400>8Met Ala Thr Arg Val Lys Thr Asn Lys Lys Pro Cys Trp Glu Met Thr1 510 15Lys Glu Glu Leu Thr Ser Gly Lys Asn Val Val Phe Asp Tyr Asp Glu20 25 30Leu Leu Glu Phe Ala Glu Gly Asp Ile Ser Lys Val Phe Gly Pro Glu35 40 45
Phe Ser Gln Ile Asp Gln Tyr Lys Arg Arg Val Arg Leu Pro Ala Arg50 5560Glu Tyr Leu Leu Val Thr Arg Val Thr Leu Met Asp Ala Glu Val Asn65 70 75 80Asn Tyr Arg Val Gly Ala Arg Met Val Thr Glu Tyr Asp Leu Pro Val85 90 95Asn Gly Glu Leu Ser Glu Gly Gly Asp Cys Pro Trp Ala Val Leu Val100 105 110Glu Ser Gly Gln Cys Asp Leu Met Leu Ile Ser Tyr Met Gly Ile Asp115 120 125Phe Gln Asn Lys Ser Asp Arg Val Tyr Arg Leu Leu Asn Thr Thr Leu130 135 140Thr Phe Tyr Gly Val Ala Gln Glu Gly Glu Thr Leu Glu Tyr Asp Ile145 150 155 160Arg Val Thr Gly Phe Ala Lys Arg Leu Asp Gly Asp Ile Ser Met Phe165 170 175Phe Phe Glu Tyr Asp Cys Tyr Val Asn Gly Arg Leu Leu Ile Glu Met180 185 190Arg Asp Gly Cys Ala Gly Phe Phe Thr Asn Glu Glu Leu Ala Ala Gly195 200 205Lys Gly Val Val Phe Thr Arg Ala Asp Leu Leu Ala Arg Glu Lys Thr210 215 220Lys Lys Gln Asp Ile Thr Pro Tyr Ala Ile Ala Pro Arg Leu Asn Lys225 230 235 240Thr Val Leu Asn Glu Thr Glu Met Gln Ser Leu Val Asp Lys Asn Trp245 250 255Thr Lys Val Phe Gly Pro Glu Asn Gly Met Asp Gln Ile Asn Tyr Lys260 265 270Leu Cys Ala Arg Lys Met Leu Met Ile Asp Arg Val Thr Lys Ile Asp275 280 285Tyr Thr Gly Gly Pro Tyr Gly Leu Gly Leu Leu Val Gly Glu Lys Ile290 295 300Leu Glu Arg Asp His Trp Tyr Phe Pro Cys His Phe Val Gly Asp Gln305 310 315 320Val Met Ala Gly Ser Leu Val Ser Asp Gly Cys Ser Gln Leu Leu Lys325 330 335Met Tyr Met Leu Trp Leu Gly Leu His Leu Lys Thr Gly Pro Phe Asp340 345 350Phe Arg Pro Val Asn Gly His Pro Asn Lys Val Arg Cys Arg Gly Gln355 360 365Ile Ser Pro His Lys Gly Lys Leu Val Tyr Val Met Glu Ile Lys Glu
370 375 380Met Gly Tyr Asp Glu Ala Gly Asp Pro Tyr Ala Ile Ala Asp Val Asn385 390 395 400Ile Leu Asp Ile Asp Phe Glu Lys Gly Gln Thr Phe Asp Leu Ala Asn405 410 415Leu His Glu Tyr Gly Lys Gly Asp Leu Asn Lys Lys Ile Val Val Asp420 425 430Phe Lys Gly Ile Ala Leu Lys Leu Gln Lys Arg Ser Gly Pro Ala Val435 440 445Val Ala Pro Glu Lys Pro Leu Ala Leu Asn Lys Asp Leu Cys Ala Pro450 455 460Ala Val Glu Ala Ile Pro Glu His Ile Leu Lys Gly Asp Ala Leu Ala465 470 475 480Pro Asn Gln Met Thr Trp His Pro Met Ser Lys Ile Ala Gly Asn Pro485 490 495Thr Pro Ser Phe Ser Pro Ser Ala Tyr Pro Pro Arg Pro Ile Thr Phe500 505 510Thr Pro Phe Pro Gly Asn Lys Asn Asp Asn Asn His Val Pro Gly Glu515 520 525Met Pro Leu Ser Trp Tyr Asn Met Ala Glu Phe Met Ala Gly Lys Val530 535 540Ser Leu Cys Leu Gly Pro Glu Phe Ala Lys Phe Asp Asp Ser Asn Thr545 550 555 560Ser Arg Ser Pro Ala Trp Asp Leu Ala Leu Val Thr Arg Val Val Ser565 570 575Val Ser Asp Met Glu Trp Val Gln Trp Lys Asn Val Asp Cys Asn Pro580 585 590Ser Lys Gly Thr Met Val Gly Glu Phe Asp Cys Pro Ile Asp Ala Trp595 600 605Phe Phe Gln Gly Ser Cys Asn Asp Gly His Met Pro Tyr Ser Ile Leu610 615 620Met Glu Ile Ala Leu Gln Thr Ser Gly Val Leu Thr Ser Val Leu Lys625 630 635 640Ala Pro Leu Thr Met Glu Lys Lys Asp Ile Leu Phe Arg Asn Leu Asp645 650 655Ala Asn Ala Glu Met Val Arg Ser Asp Ile Asp Leu Arg Gly Lys Thr660 665 670Ile His Asn Leu Thr Lys Cys Thr Gly Tyr Ser Met Leu Gly Asp Met675 680 685Gly Val His Arg Phe Ser Phe Glu Leu Ser Val Asp Gly Val Val Phe690 695 700
Tyr Lys Gly Thr Thr Ser Phe Gly Trp Phe Val Pro Glu Val Phe Ile705 710 715 720Ser Gln Thr Gly Leu Asp Asn Gly Arg Arg Thr Gln Pro Trp His Ile725 730 735Glu Ser Lys Val Pro Ser Ala Gln Val Leu Thr Tyr Asp Val Thr Pro740 745 750Asn Gly Ala Gly Arg Thr Gln Leu Tyr Ala Asn Ala Pro Lys Gly Ala755 760 765Gln Leu Thr Arg Arg Trp Asn Gln Cys Gln Tyr Leu Asp Thr Ile Asp770 775 780Leu Val Val Ala Gly Gly Ser Ala Gly Leu Gly Tyr Gly His Gly Arg785 790 795 800Lys Gln Val Asn Pro Lys Asp Trp Phe Phe Ser Cys His Phe Trp Phe805 810 815Asp Ser Val Met Pro Gly Ser Leu Gly Val Glu Ser Met Phe Gln Leu820 825 830Val Glu Ser Ile Ala Val Lys Gln Asp Leu Ala Gly Lys Tyr Gly Ile835 840 845Thr Asn Pro Thr Phe Ala His Ala Pro Gly Lys Ile Ser Trp Lys Tyr850 855 860Arg Gly Gln Leu Thr Pro Thr Ser Lys Phe Met Asp Ser Glu Ala His865 870 875 880Ile Val Ser Ile Glu Ala His Asp Gly Val Val Asp Ile Val Ala Asn885 890 895Gly Asn Leu Trp Ala Asp Gly Leu Arg Val Tyr Asn Val Ser Asn Ile900 905 910Arg Val Arg Ile Val Ala Gly Ala Ala Pro Ala Ala Ala Ala Ala Ala915 920 925Ala Ala Val Ala Ala Pro Ala Ala Ala Pro Ala Pro Val Ala Ala Ser930 935 940Gly Pro Ala Gln Thr Ile Thr Leu Lys Gln Leu Lys Ala Glu Leu Leu945 950 955 960Asp Val Glu Lys Pro Leu Tyr Ile Ser Ser Ser Asn Gly Gln Val Lys965 970 975Lys His Ala Asp Val Ala Gly Gly Gln Ala Thr Ile Val Gln Ala Cys980 985 990Ser Leu Ser Asp Leu Gly Asp Glu Gly Phe Met Lys Thr Tyr Gly Val995 1000 1005Val Ala Pro Leu Tyr Thr Gly Ala Met Ala Lys Gly Ile Ala Ser1010 1015 1020Ala Asp Leu Val Ile Ala Thr Gly Lys Arg Lys Ile Leu Gly Ser
1025 1030 1035Phe Gly Ala Gly Gly Leu Pro Met His Ile Val Arg Ala Ala Val1040 1045 1050Glu Lys Ile Gln Ala Glu Leu Pro Asn Gly Pro Phe Ala Val Asn1055 1060 1065Leu Ile His Ser Pro Phe Asp Ser Asn Leu Glu Lys Gly Asn Val1070 1075 1080Asp Leu Phe Leu Glu Lys Gly Val Thr Val Val Glu Ala Ser Ala1085 1090 1095Phe Met Thr Leu Thr Pro Gln Val Val Arg Tyr Arg Ala Ala Gly1100 1105 1110Leu Ser Arg Asn Ala Asp Gly Ser Ile Asn Ile Lys Asn Arg Ile1115 1120 1125Ile Gly Lys Val Ser Arg Thr Glu Leu Ala Glu Met Phe Ile Arg1130 1135 1140Pro Ala Pro Gln Asn Leu Leu Asp Lys Leu Ile Gln Ser Gly Glu1145 1150 1155Ile Thr Lys Glu Gln Ala Glu Leu Ala Lys Leu Val Pro Val Ala1160 1165 1170Asp Asp Ile Ala Val Glu Ala Asp Ser Gly Gly His Thr Asp Asn1175 1180 1185Arg Pro Ile His Val Ile Leu Pro Leu Ile Ile Asn Leu Arg Asn1190 1195 1200Arg Leu His Lys Glu Cys Gly Tyr Pro Ala His Leu Arg Val Arg1205 1210 1215Val Gly Ala Gly Gly Gly Val Gly Cys Pro Gln Ala Ala Ala Ala1220 1225 1230Ala Leu Ala Met Gly Ala Ala Phe Leu Val Thr Gly Thr Val Asn1235 1240 1245Gln Val Ala Lys Gln Ser Gly Thr Cys Asp Asn Val Arg Lys Gln1250 1255 1260Leu Cys Met Ala Thr Tyr Ser Asp Val Cys Met Ala Pro Ala Ala1265 1270 1275Asp Met Phe Glu Glu Gly Val Lys Leu Gln Val Leu Lys Lys Gly1280 1285 1290Thr Met Phe Pro Ser Arg Ala Asn Lys Leu Tyr Glu Leu Phe Cys1295 1300 1305Lys Tyr Asp Ser Phe Glu Ser Met Pro Ala Thr Glu Leu Glu Arg1310 1315 1320Val Glu Lys Arg Ile Phe Gln Cys Pro Leu Ala Asp Val Trp Ala1325 1330 1335
Glu Thr Ser Asp Phe Tyr Ile Asn Arg Leu His Asn Pro Glu Lys1340 1345 1350Ile Thr Arg Ala Glu Arg Asp Pro Lys Leu Lys Met Ser Leu Cys1355 1360 1365Phe Arg Trp Tyr Leu Gly Leu Ala Ser Arg Trp Ala Asn Thr Gly1370 1375 1380Glu Ala Gly Arg Val Met Asp Tyr Gln Val Trp Cys Gly Pro Ala1385 1390 1395Ile Gly Ala Phe Asn Asp Phe Ile Lys Gly Ser Tyr Leu Asp Pro1400 1405 1410Ala Val Ser Gly Glu Tyr Pro Asp Val Val Gln Ile Asn Leu Gln1415 1420 1425Ile Leu Arg Gly Ala Cys Tyr Leu Arg Arg Leu Asn Val Ile Arg1430 1435 1440Asn Asp Pro Arg Val Ser Ile Glu Val Glu Asp Ala Glu Phe Val1445 1450 1455Tyr Glu Pro Thr Asn Ala Leu1460 1465<210>9<211>5547<212>DNA<213>Ulkenia sp.
<400>9atgcttgtga taggggctct ggcgcgggct ctgtacggtg cttggagatg cacgggcagg60gcgagagagg ggacgggttc ccgggaggcg ctgcttggag gtgctgagag ggagggagaa120ggcgtgcttt gcgatgcgcg gggcgaccta ggcgctgctg cgcggtgcag cagcagggac180ctcggacgtg agtcgaagcc gtctgcagag gagatggtag aagggccgcg gattggtagc240agagaagagg aaatagaaga agaagaagaa atagaagaag aagaaataga agaagaagaa300atagaagaag aagaggagga cgggcaggcg ggaaagatgg agaaaggact cgcggcggga360aaacaagaga atgtgaactt gggcttgaac tttggtttga atttgaatgt ggagaacgag420gggttgaatt tgagtttgaa tttgaaagaa aacttacgga aagaaagttt agttgaaagt480gagaaagaaa aaaatgagaa agaaaaagag aaagaaaaag agaaagaaaa agagaaagaa540aaagagaaag aaaaagagaa agaaaaagag aaagaaaaag agaaagaaaa agagaaagaa600aaagagaaag aaaaagagaa agaaaaagag aaagaaaaag aagaagaaaa agaagaagaa660aaagagaaag aaaaagagaa agaaaaagag aaagaaaaag aagaaggaga tttaaaaagt720tgtttagttg aaaaaggaga aggaggaaga agcagcgaca gcggcagaag aagaagtagt780tgttgtaaga ggggaacgga ggcagtagca gtggagcagg cggaggcgac agcaaacctc840
gaactcgacc ccgtcgagcc gcagcaagaa caagagcccg accaggtgga cgaggacgag900gtccgcttgt tgtcaggaac aacagaagtt gcaggactag ccgagagtgc taccactgca960attcttagat ccacagacgc aagagcagaa aacttacaac tgctcgccac aacacaagaa1020ccaccttcag atacaaccag gttcgagaac tccacaagtc tagaagcagc aacagctcta1080gcagataatc aaacaggtcc agaaaaagct acgactagaa gagaaattat cgagtcgcaa1140cttgcaacca tggccactcg cgtgaagacc aacaagaaac catgctggga gatgaccaag1200gaggagctca ccagcggcaa gaacgtcgtt ttcgactatg acgagctcct tgagttcgcc1260gagggtgaca tcagcaaggt cttcggcccc gaattcagcc agatcgacca gtacaagcgt1320cgcgttcgtc tccccgcccg cgagtacctc ctcgtcaccc gcgtcaccct catggacgcc1380gaggtcaaca actaccgcgt cggtgcccgc atggtcactg agtacgacct ccccgtcaac1440ggtgagctct ctgagggtgg tgactgcccc tgggccgtgc tcgtcgagag tggtcagtgt1500gatctcatgc tcatctccta catgggtatt gacttccaga acaagagcga ccgcgtctac1560cgtctgctca acaccaccct caccttctac ggtgttgccc aggagggcga gaccctggag1620tacgacatcc gcgtgaccgg cttcgccaag cgtctcgacg gtgacatctc catgttcttc1680ttcgagtacg actgctacgt caacggccgt ctcctcatcg agatgcgcga cggctgtgcc1740ggtttcttca ccaacgagga gctcgccgcc ggcaagggtg tcgtctttac ccgcgctgat1800ctcctcgccc gcgagaagac caagaagcag gacatcaccc cgtacgccat tgccccgcgt1860cttaacaaga ccgttctcaa cgagactgag atgcagtccc tcgtggacaa gaactggacc1920aaggttttcg gccccgagaa cggcatggac cagatcaact acaaactctg cgcccgtaag1980atgctcatga ttgaccgcgt caccaagatt gactacaccg gtggccccta cggccttggt2040cttctcgttg gtgagaagat cctcgagcgc gaccactggt actttccgtg ccacttcgtc2100ggagaccagg tcatggctgg atccctcgtg tctgacggct gcagccagct cctcaagatg2160tacatgctct ggctcggcct ccaccttaag accggtccct tcgacttccg ccccgtcaac2220ggccacccca acaaggtccg ctgccgtggc cagatctccc cgcacaaggg taagctcgta2280tacgtcatgg agatcaagga gatgggctac gacgaggctg gtgacccgta cgccatcgcc2340gatgtcaaca ttctcgacat tgacttcgag aagggccaga ctttcgacct tgccaacctc2400cacgagtacg gcaagggcga cctcaacaag aagatcgtcg tcgacttcaa gggtattgcc2460ctcaagctcc agaagcgctc tggccctgcc gttgtcgctc ccgagaagcc cctcgctctc2520aacaaggacc tttgcgcccc ggctgttgag gccatccctg agcacatcct caagggcgat2580gctcttgccc ctaaccagat gacctggcac ccgatgtcca agatcgctgg caaccccacg2640ccctcgttct ctccctcggc ctaccctccc cgtcccatca ccttcacccc gttccccggc2700
aacaagaacg acaacaacca cgtgcccggc gagatgccgc tctcgtggta caacatggct2760gagttcatgg ccggcaaggt cagcctctgc ctcggccctg agttcgccaa gttcgatgac2820tccaacacca gccgcagccc tgcatgggac cttgctcttg tgactcgtgt ggtctccgtt2880tctgacatgg agtgggtcca gtggaagaac gtggactgca acccgtccaa gggaaccatg2940gttggcgagt tcgactgccc catcgacgcc tggttcttcc agggatcttg taacgacggc3000cacatgccgt actccatcct catggagatc gccctccaga cctctggtgt cctcacctct3060gtgctcaagg ccccgctcac catggagaag aaggacattc tcttccgcaa ccttgacgcc3120aacgccgaga tggttcgctc tgatattgac ctccgcggca agaccatcca caacctcacc3180aagtgtaccg gctacagcat gctcggagac atgggtgtcc accgcttcag cttcgagctc3240tctgttgatg gtgtagtctt ctacaagggt accacctcct tcggctggtt cgtccctgag3300gtcttcatct cccagactgg tctcgacaac ggtcgccgca cccagccctg gcacattgag3360tccaaggtgc cttccgccca ggtcctcacc tacgacgtta cccccaacgg tgccggtcgc3420acccagctct acgccaacgc ccccaagggc gctcagctca ctcgccgctg gaaccagtgc3480cagtaccttg acaccatcga ccttgtggtc gccggtggct ccgccggtct tggctacggt3540catggccgca agcaggtgaa ccccaaggac tggttcttct cgtgccactt ctggttcgac3600tccgtcatgc ccggctcgct cggtgtggag tctatgttcc agctcgtcga gtccatcgct3660gtcaagcagg acctcgccgg caagtacggc atcaccaacc cgaccttcgc tcatgctccg3720ggcaagatct cctggaagta ccgtggtcag ctcaccccca cctccaagtt catggactcc3780gaggcccaca ttgtctccat cgaggcccac gacggcgtcg tcgacatcgt tgccaatggt3840aacctctggg ctgatggcct ccgcgtctac aacgtcagca acatccgtgt gcgcattgtt3900gctggcgccg cccctgctgc tgctgctgct gctgctgctg ttgctgctcc ggctgccgcc3960cctgctccgg ttgctgcatc tggccctgcc cagaccatca ccctcaagca gctcaaggct4020gagcttcttg acgttgagaa gcctctctac atctcctcca gcaacggcca ggtcaagaag4080cacgccgatg tggctggtgg ccaggccacc attgtgcagg cttgcagcct cagtgacctc4140ggtgatgaag gcttcatgaa gacctacggt gttgtggctc ctctctacac cggtgccatg4200gccaagggta ttgcctctgc tgaccttgtg attgccactg gtaagcgcaa gatcctcggt4260tccttcggtg ctggcggtct ccccatgcac attgtccgtg ccgctgttga gaagatccag4320gctgagctcc cgaacggccc cttcgccgtc aacctcatcc actccccctt cgatagcaac4380cttgagaagg gcaacgttga cctcttcctc gagaagggcg ttactgtcgt cgaggcctcc4440gccttcatga ccttgacccc gcaagtcgtc cgctaccgtg ctgctggtct ttcccgtaac4500gctgatggct ccattaacat caagaaccgc atcatcggta aggtctcccg taccgagctc4560
gctgagatgt tcatccgccc tgccccgcag aacctcctcg acaagctcat ccagtctggt4620gagattacca aggagcaggc tgagcttgcc aagctcgtcc ccgtcgccga cgacatcgcc4680gtcgaggccg actctggtgg ccacaccgac aaccgcccca tccacgtcat cctccccctt4740atcatcaacc tccgcaaccg cctccacaag gagtgcggct accccgctca cctccgcgtg4800cgcgttggag ctggtggtgg tgttggatgc ccccaggccg ctgccgctgc tctcgctatg4860ggtgctgcct tccttgttac cggcactgtc aaccaggtcg ccaagcagtc cggcacctgc4920gacaatgtcc gcaagcagct ctgcatggcc acctactctg acgtctgcat ggctcccgct4980gctgacatgt tcgaggaggg cgtcaagctc caggtcctca agaagggaac catgttcccg5040tccagggcta acaagctcta cgagctcttc tgcaagtacg actccttcga gtccatgcct5100gccacagagc tcgagcgtgt tgagaagcgc atcttccagt gccctcttgc tgatgtctgg5160gctgagacct ccgacttcta catcaaccgc ctccacaacc cggagaagat cacccgtgcc5220gagcgtgacc ccaagctcaa gatgtctctc tgcttccgct ggtaccttgg tcttgcctct5280cgctgggcca acaccggtga ggctggacgc gtcatggact accaggtctg gtgtggccct5340gccattggag ccttcaacga cttcatcaag ggctcctacc ttgacccggc cgtctctggt5400gagtacccgg acgtcgtgca gatcaacttg cagatccttc gcggtgcctg ctacctccgc5460cgtctcaatg tcatccgcaa cgacccgcgt gtcagcattg aggtcgagga tgctgagttc5520gtctacgagc ccaccaacgc cctctaa5547<210>10<211>837<212>DNA<213>Ulkania sp.
<400>10acccgcatcg ctgtgatcgg catgtccgcc atcctcccct gcggtaccac cgttcgtgag60tcttgggagg ctatccgcga tggtatcgac tgcctcagtg atctccccga ggaccgcgtc120gatgtgaccg cctacttcga cccggtcaag accaccaagg ataagatcta ctgcaaacgt180ggtggattca tccctgagta cgacttcgac gcccgtgagt tcggcctcaa catgtttcag240atggaggact ccgacgcaaa ccaaaccgtc accctcctca aggtcaagga ggccctcgag300gacgctggca tcgaagccct cagcaaggaa aagaagaaca ttggatgtgt tctcggtatc360ggtggtggcc agaagtccag ccacgagttc tactcccgct taaactatgt tgtcgttgag420aaggtccttc gcaagatggg catgcctgag gaggatgttc aagctgctgt tgagaagtac480aaggccaact tccctgagtg gcgccttgac tccttccccg gtttcctcgg caacgttact540gccggtcgct gtaccaacac cttcaacctc gatggtatga actgtgtcgt cgatgctgcc600
tgtgctagtt ctctcatcgc cgttaaggtt gccattgatg agcttctcca cggagactgt660gacatgatga tcactggtgc tacctgcacg gataactcca tcggtatgta catggccttc720tccaagaccc cggtgttctc taccgaccct agcgtccgcg catacgatga gaagaccaag780ggtatgctta ttggcgaagg ctctgccatg cttgtgctta aacgttacgc cgacgct 837<210>11<211>51<212>DNA<213>Ulkenia sp.
<400>11ggtatgaact gtgtcgtcga tgctgcctgt gctagttctc tcatcgccgtt 51<210>12<211>12<212>DNA<213>Ulkenia sp.
<400>12gatgctgcct gt12<210>13<211>522<212>DNA<213>Ulkenia sp.
<400>13cacgctgtca ttcgcggctg cgcctcttcc tctgacggta aggcctccgg tatttacacc60ccgaccatct ctggtcaaga ggaggctctt cgccgtgcct acatgcgcgc taacgtcgat120cccgccaccg tcactcttgt tgagggccac ggtaccggta cccccgttgg tgaccgtatt180gagctcaccg ctctccgtaa cctcttcgac agtgcctacg gcaacgagaa ggagaaggtc240gctgttggca gcattaagtc caacatcggt cacctcaagg ctgtcgccgg tcttgccggt300atgatcaagg tcatcatggc cctcaagcat aagactcttc cggccaccat caacgttgat360gagcccccta agctttacga caacactccc atcaccgact catcgctgta cattaacacg420atgaaccgtc cgtggttccc tgctccgggt gtgccccgtc gcgctggtat ctccagtttc480ggttttggtg gtgccaacta ccacgccgtt cttgaggaag cc 522<210>14<211>1380<212>DNA<213>Ulkenia sp.
<400>14acccgcatcg ctgtgatcgg catgtccgcc atcctcccct gcggtaccac cgttcgtgag60tcttgggagg ctatccgcga tggtatcgac tgcctcagtg atctccccga ggaccgcgtc120
gatgtgaccg cctacttcga cccggtcaag accaccaagg ataagatcta ctgcaaacgt180ggtggattca tccctgagta cgacttcgac gcccgtgagt tcggcctcaa catgtttcag240atggaggact ccgacgcaaa ccaaaccgtc accctcctca aggtcaagga ggccctcgag300gacgctggca tcgaagccct cagcaaggaa aagaagaaca ttggatgtgt tctcggtatc360ggtggtggcc agaagtccag ccacgagttc tactcccgct taaactatgt tgtcgttgag420aaggtccttc gcaagatggg catgcctgag gaggatgttc aagctgctgt tgagaagtac480aaggccaact tccctgagtg gcgccttgac tccttccccg gtttcctcgg caacgttact540gccggtcgct gtaccaacac cttcaacctc gatggtatga actgtgtcgt cgatgctgcc600tgtgctagtt ctctcatcgc cgttaaggtt gccattgatg agcttctcca cggagactgt660gacatgatga tcactggtgc tacctgcacg gataactcca tcggtatgta catggccttc720tccaagaccc cggtgttctc taccgaccct agcgtccgcg catacgatga gaagaccaag780ggtatgctta ttggcgaagg ctctgccatg cttgtgctta aacgttacgc cgacgctgtt840cgtgatggtg acgagattca cgctgtcatt cgcggctgcg cctcttcctc tgacggtaag900gcctccggta tttacacccc gaccatctct ggtcaagagg aggctcttcg ccgtgcctac960atgcgcgcta acgtcgatcc cgccaccgtc actcttgttg agggccacgg taccggtacc1020cccgttggtg accgtattga gctcaccgct ctccgtaacc tcttcgacag tgcctacggc1080aacgagaagg agaaggtcgc tgttggcagc attaagtcca acatcggtca cctcaaggct1140gtcgccggtc ttgccggtat gatcaaggtc atcatggccc tcaagcataa gactcttccg1200gccaccatca acgttgatga gccccctaag ctttacgaca acactcccat caccgactca1260tcgctgtaca ttaacacgat gaaccgtccg tggttccctg ctccgggtgt gccccgtcgc1320gctggtatct ccagtttcgg ttttggtggt gccaactacc acgccgttct tgaggaagcc1380<210>15<211>996<212>DNA<213>Ulkenia sp.
<400>15ctcttctctg gccagggtgc tcagtacacc cacatgttca gcgaggtcgc catgaactgg60cctcagttcc gtgagagcat ctctgacatg gatcgtgccc aggctaaggt tgctggcgct120gacaaggact acgagcgtgt ctcccaagtc ctctacccgc gtaagcctta taactctgag180cccgagcagg accacaagaa gatctccctg acctcatact ctcagccctc taccctcgcc240tgcgctcttg gtgcctacga gatcttcaag caggctggtt tcaagcccga cttcgctgcc300ggtcactctc tcggtgagtt tgcggccctc tacgctgctg actgcgtcaa ccgtgacgac360
ctctttgagc tcgtgtgccg tcgtgcccgc atcatgggtg gcaaggatgc acctgctacc420cccaagggat gcatggctgc tgtcattgga cccaatgccg agaagatcca gattcgcact480gctgatgtct ggctcggcaa ctgcaactcc ccttcgcaga ctgtcatcac cggctctgtt540gagggtatca agaaggagtc cgagcttctc cagagtgagg gcttccgtgt tgtccccctc600gcctgcgaga gtgccttcca ctcaccgcag atgcaaaacg cctcctctgc cttcaaggat660gttctctcca aggttgcctt ccgtcagcct agcgcccaga ccaagctctt cagcaacgtg720tctggcgaga cctactccaa caatgcccag gacctcctta aggagcacat gaccagcagt780gttaagttca tctctcaggt tcgcaacatg cactctgctg gtgctcgcat ctttgtcgag840tttggcccca agcaggtgct ctctaagctt gtttccgaga ccctcaagga cgatccttcc900attatcacta tctctgtcaa cccttcctct ggcaaggatg ccgatattca gcttcgcgag960gctgctgtgc agctcgttgt tgctggagtc aacctt 996<210>16<211>3510<212>DNA<213>Ulkenia sp.
<400>16gcccaggccc agatccagaa ggccaaggcc gatgctgctg aggctgacaa gaagcttgcc60gctgctaagg atgaggccaa gcgtgccgcc gcttctgcac ctgtgcagaa gcaggttgac120accaccattg ttgataagca ccgtgctatc ctcaagtcta tgcttgctga gcttgactgc180tactccactc ctggtgctgt gtccagctct ttccaggcac ctgttgctgc tacccctgct240ccggtcgctg cgcctgttgc agctgctcct gctccggctg tcaacaatgc tctccttgcc300aaggctgagt ctgttgtcat ggaggttctt gccgccaaga ctggttacga gactgacatg360atcgagcccg acatggagct cgagactgag ctcggcattg actctatcaa gcgtgtcgag420attctctctg aggtccaggc ccagctcaac gtcgaggcca aggatgttga tgctcttagc480cgcacccgca ccgtcggtga ggttgtcaac gccatgaagg ctgagatcgc tggcagctct540ggtgctgccg ctgctgcccc ggccccggtt gctgctgctc ccgctgcccc tgcccctgct600gtcaacagcg ctcttcttgc caaggctgag actgttgtca tggaggttct tgccgccaag660actggttacg agactgacat gattgagccc gacatggagc tcgagactga gctcggcatt720gactccatca agcgtgtcga gattctctct gaggttcagg cccagctcaa cgttgaggcc780aaggatgttg atgctcttag ccgcacccgc accgttggtg aggttgtcaa cgccatgaag840gctgagatcg ctggcagctc tggtgctgcc gctgctgccc cggcccctgt tgctgctgct900ccggcgcccg tcgctgccgc tgcccctgct gtcagcagcg ctctccttga gaaggctgag960
tctgttgtca tggaggttct tgccgccaag actggttacg agactgacat gattgaggcc1020gacatggagc tcgagactga gctcggcatt gactccatca agcgtgtcga gattctctct1080gaggtccagg cccagctcaa cgtcgaggcc aaggatgtcg atgctcttag ccgcacccgc1140accgttggtg aggttgtcaa cgccatgaag gctgagatcg ctggcagctc tggtgctgct1200gccccggccc cggtcgctgc ggcccctgct ccggtcgctg ccgctgcccc tgctgtcaac1260agcgctcttc ttgagaaggc tgagactgtt gtcatggagg ttcttgccgc caagactggt1320tacgagactg acatgatcga gcccgacatg gagctcgaga ctgagctcgg cattgactct1380atcaagcgtg tcgagattct ctctgaggtc caggcccagc tcaacgttga ggccaaggat1440gttgatgctc ttagccgcac ccgcaccgtt ggtgaggttg tcaacgccat gaaggctgag1500atcgctggca gctctggtgc tgccgctgct gccccggccc cggttgctgc tgctcccgct1560cccgtcgctg cccctgctgt cagcagcgct ctccttgaga aggctgagtc tgtcgtcatg1620gaggttcttg ccgccaagac tggttacgag actgacatga ttgaggccga catggagctc1680gagactgagc tcggcattga ctccatcaag cgtgtcgaga ttctctctga ggtccaggcc1740cagctcaacg ttgaggccaa ggatgtcgat gctcttagcc gcacccgcac cgttggtgag1800gttgtcaacg ccatgaaggc tgagatcgct ggcagctctg gtgctgccgc tgctgccccg1860gcccctgttg ctgcctctcc cgctcccgtc gctgccgctg cccctgctgt cagcagcgct1920ctccttgaga aggccgaatc tgttgtcatg gaggttctcg ccgccaagac tggttacgag1980actgacatga ttgaggctga catggagctc gagactgagc tcggcattga ctctatcaag2040cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat2100gctcttagcc gcacccgcac cgttggtgag gttgtcaacg ccatgaaggc tgagatcgct2160ggcagctctg gtgccgccgc tgctgccccg gccccggttg ctgctgctcc ggcgcccgtc2220actgccgctg cccctgctgt cagcagcgct ctccttgaga aggccgaatc tgttgtcatg2280gaggttctcg ccgccaagac tggttacgag actgacatga ttgaggccga catggagctc2340gagactgagc ttggcattga ctccatcaag cgtgtcgaga ttctctctga ggtccaggct2400atgcttaacg tcgaggccaa ggatgttgat gctcttagcc gcacccgcac cgttggtgag2460gttgtcaacg ccatgaaggc tgagattgct agcagctctg gtgctgctgc ccctgctccg2520gctgctgccg ttgcaccggc ccctgctgct gcccctgctg tcagcagcgc tctccttgag2580aaggccgaat ctgttgtcat ggaggttctc gccgccaaga ctggttacga gactgacatg2640attgaggccg acatggagct cgagactgag ctcggcattg actctatcaa gcgtgtcgag2700attctctctg aggtccaggc tatgcttaac gttgaggcca aggatgttga tgctcttagc2760cgcacccgca ccgttggtga ggttgtcaac gccatgaagg ctgagattgc tagcagctct2820
ggtgctgctg cccctgctcc tgctgctgcc gctgcaccgg cccctgctgc tgcccctgct2880gtcagcagcg ctcttcttga gaaggctgag tctgttgtca tggaggttct cgccgccaag2940actggttacg agactgacat gattgaggcc gacatggagc tcgagactga gcttggcatt3000gactccatca agcgtgtcga gattctctct gaggtccagg ctatgcttaa cgttgaggcc3060aaggatgttg atgctcttag ccgcacccgc accgttggtg aggttgtcaa cgccatgaag3120gctgagattg ctagcagctc tggtgctgct gcccctgctc ctgctgctgc cgctgcaccg3180gcccctgctg ctgcccctgc tgtcagcagc gctcttcttg agaaggctga gtctgttgtc3240atggaggttc tcgccgccaa gactggttac gagactgaca tgattgaggc cgacatggag3300ctcgagactg agcttggcat tgactccatc aagcgtgtcg agattctctc tgaggtccag3360gctatgctta acgttgaggc caaggatgtt gatgctctta gccgcacccg caccgttggt3420gaggttgtca acgccatgaa ggctgagatc gctggcagct ctggtgctgc tactgcctct3480gcccctgctg ctgcagctgc cgcccctgct 3510<210>17<211>219<212>DNA<213>Ulkenia sp.
<400>17ctccttgcca aggctgagtc tgttgtcatg gaggttcttg ccgccaagac tggttacgag60actgacatga tcgagcccga catggagctc gagactgagc tcggcattga ctctatcaag120cgtgtcgaga ttctctctga ggtccaggcc cagctcaacg tcgaggccaa ggatgttgat180gctcttagcc gcacccgcac cgtcggtgag gttgtcaac 219<210>18<211>219<212>DNA<213>Ulkenia sp.
<400>18cttcttgcca aggctgagac tgttgtcatg gaggttcttg ccgccaagac tggttacgag60actgacatga ttgagcccga catggagctc gagactgagc tcggcattga ctccatcaag120cgtgtcgaga ttctctctga ggttcaggcc cagctcaacg ttgaggccaa ggatgttgat180gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219<210>19<211>219<212>DNA<213>Ulkenia sp.
<400>19
ctccttgaga aggctgagtc tgttgtcatg gaggttcttg ccgccaagac tggttacgag60actgacatga ttgaggccga catggagctc gagactgagc tcggcattga ctccatcaag120cgtgtcgaga ttctctctga ggtccaggcc cagctcaacg tcgaggccaa ggatgtcgat180gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219<210>20<211>219<212>DNA<213>Ulkenia sp.
<400>20cttcttgaga aggctgagac tgttgtcatg gaggttcttg ccgccaagac tggttacgag60actgacatga tcgagcccga catggagctc gagactgagc tcggcattga ctctatcaag120cgtgtcgaga ttctctctga ggtccaggcc cagctcaacg ttgaggccaa ggatgttgat180gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219<210>21<211>219<212>DNA<213>Ulkenia sp.
<400>21ctccttgaga aggctgagtc tgtcgtcatg gaggttcttg ccgccaagac tggttacgag60actgacatga ttgaggccga catggagctc gagactgagc tcggcattga ctccatcaag120cgtgtcgaga ttctctctga ggtccaggcc cagctcaacg ttgaggccaa ggatgtcgat180gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219<210>22<211>219<212>DNA<213>Ulkenia sp.
<400>22ctccttgaga aggccgaatc tgttgtcatg gaggttctcg ccgccaagac tggttacgag60actgacatga ttgaggctga catggagctc gagactgagc tcggcattga ctctatcaag120cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat180gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219<210>23<211>219<212>DNA<213>Ulkenia sp.
<400>23ctccttgaga aggccgaatc tgttgtcatg gaggttctcg ccgccaagac tggttacgag 60
actgacatga ttgaggccga catggagctc gagactgagc ttggcattga ctccatcaag120cgtgtcgaga ttctctctga ggtccaggct atgcttaacg tcgaggccaa ggatgttgat180gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219<210>24<211>219<212>DNA<213>Ulkenia sp.
<400>24ctccttgaga aggccgaatc tgttgtcatg gaggttctcg ccgccaagac tggttacgag60actgacatga ttgaggccga catggagctc gagactgagc tcggcattga ctctatcaag120cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat180gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219<210>25<211>219<212>DNA<213>Ulkenia sp.
<400>25cttcttgaga aggctgagtc tgttgtcatg gaggttctcg ccgccaagac tggttacgag60actgacatga ttgaggccga catggagctc gagactgagc ttggcattga ctccatcaag120cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat180gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219<210>26<211>219<212>DNA<213>Ulkenia sp.
<400>26cttcttgaga aggctgagtc tgttgtcatg gaggttctcg ccgccaagac tggttacgag60actgacatga ttgaggccga catggagctc gagactgagc ttggcattga ctccatcaag120cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat180gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219<210>27<211>609<212>DNA<213>Ulkenia sp.
<400>27aagaagctcg ttggcactat tgctggtgcc cgtgaggttc gttcctcaat tgctaacatt60gaggctctcg gtggcaaggc aatctactcc tcttgtgatg tgaactctgc tgctgatgtc120
gccaaggctg ttcgcgaggc tgaggctcag cttggcgccc gtgtaactgg tgtcgtccac180gcttctggtg tccttcgtga ccgcctcatt gagcagaagc gccccgatga gtttgatgct240gtcttcggca ccaaggtgac tggtctcgag aacctctttg gtgccattga catggccaac300cttaagcacc tcgtcctctt cagctctctt gctggtttcc acggcaacat tggtcagtct360gactacgcca tggctaacga ggccctcaac aagatgggtc ttgagctctc tgaccgtgtg420tccgtgaagt ctatttgctt cggcccctgg gatggtggca tggttacccc ccagctcaag480aagcagttcc agtctatggg tgttcagatc atcccccgtg agggtggtgc cgatactgtg540gctcgcattg tcctcggctc ctcccctgct gagatccttg ttggcaactg gaccactccc600accaagaag609<210>28<211>279<212>PRT<213>Ulkenia sp.
<400>28Thr Arg Ile Ala Val Ile Gly Met Ser Ala Ile Leu Pro Cys Gly Thr1 510 15Thr Val Arg Glu Ser Trp Glu Ala Ile Arg Asp Gly Ile Asp Cys Leu20 25 30Ser Asp Leu Pro Glu Asp Arg Val Asp Val Thr Ala Tyr Phe Asp Pro35 40 45Val Lys Thr Thr Lys Asp Lys Ile Tyr Cys Lys Arg Gly Gly Phe Ile50 5560Pro Glu Tyr Asp Phe Asp Ala Arg Glu Phe Gly Leu Asn Met Phe Gln65 70 75 80Met Glu Asp Ser Asp Ala Asn Gln Thr Val Thr Leu Leu Lys Val Lys8590 95Glu Ala Leu Glu Asp Ala Gly Ile Glu Ala Leu Ser Lys Glu Lys Lys100 105 110Asn Ile Gly Cys Val Leu Gly Ile Gly Gly Gly Gln Lys Ser Ser His115 120 125Glu Phe Tyr Ser Arg Leu Asn Tyr Val Val Val Glu Lys Val Leu Arg130 135 140Lys Met Gly Met Pro Glu Glu Asp Val Gln Ala Ala Val Glu Lys Tyr145 150 155 160Lys Ala Asn Phe pro Glu Trp Arg Leu Asp Ser Phe Pro Gly Phe Leu165 170 175Gly Asn Val Thr Ala Gly Arg Cys Thr Asn Thr Phe Asn Leu Asp Gly180 185 190
Met Asn Cys Val Val Asp Ala Ala Cys Ala Ser Ser Leu Ile Ala Val195 200 205Lys Val Ala Ile Asp Glu Leu Leu His Gly Asp Cys Asp Met Met Ile210 215 220Thr Gly Ala Thr Cys Thr Asp Asn Ser Ile Gly Met Tyr Met Ala Phe225 230 235 240Ser Lys Thr Pro Val Phe Ser Thr Asp Pro Ser Val Arg Ala Tyr Asp245 250 255Glu Lys Thr Lys Gly Met Leu Ile Gly Glu Gly Ser Ala Met Leu Val260 265 270Leu Lys Arg Tyr Ala Asp Ala275<210>29<211>17<212>PRT<213>Ulkenia sp.
<400>29Gly Met Asn Cys Val Val Asp Ala Ala Cys Ala Ser Ser Leu Ile Ala1 510 15Val<210>30<211>4<212>PRT<213>Ulkenia sp.
<400>30Asp Ala Ala Cys1<210>31<211>174<212>PRT<213>Ulkenia sp.
<400>31His Ala Val Ile Arg Gly Cys Ala Ser Ser Ser Asp Gly Lys Ala Ser1 510 15Gly Ile Tyr Thr Pro Thr Ile Ser Gly Gln Glu Glu Ala Leu Arg Arg20 25 30Ala Tyr Met Arg Ala Asn Val Asp Pro Ala Thr Val Thr Leu Val Glu35 40 45Gly His Gly Thr Gly Thr Pro Val Gly Asp Arg Ile Glu Leu Thr Ala50 5560Leu Arg Asn Leu Phe Asp Ser Ala Tyr Gly Asn Glu Lys Glu Lys Val65 70 75 80
Ala Val Gly Ser Ile Lys Ser Asrn Ile Gly His Leu Lys Ala Val Ala85 9095Gly Leu Ala Gly Met Ile Lys Val Ile Met Ala Leu Lys His Lys Thr100 105 110Leu Pro Ala Thr Ile Asn Val Asp Glu Pro Pro Lys Leu Tyr Asp Asn115 120 125Thr Pro Ile Thr Asp Ser Ser Leu Tyr Ile Asn Thr Met Asn Arg Pro130 135 140Trp Phe Pro Ala Pro Gly Val Pro Arg Arg Ala Gly Ile Ser Ser Phe145 150 155 160Gly Phe Gly Gly Ala Asn Tyr His Ala Val Leu Glu Glu Ala165 170<210>32<211>460<212>PRT<213>Ulkenia sp.
<400>32Thr Arg Ile Ala Val Ile Gly Met Ser Ala Ile Leu Pro Cys Gly Thr1 510 15Thr Val Arg Glu Ser Trp Glu Ala Ile Arg Asp Gly Ile Asp Cys Leu20 25 30Ser Asp Leu Pro Glu Asp Arg Val Asp Val Thr Ala Tyr Phe Asp pro35 40 45Val Lys Thr Thr Lys Asp Lys Ile Tyr Cys Lys Arg Gly Gly Phe Ile50 5560Pro Glu Tyr Asp Phe Asp Ala Arg Glu Phe Gly Leu Asn Met Phe Gln65 70 75 80Met Glu Asp Ser Asp Ala Asn Gln Thr Val Thr Leu Leu Lys Val Lys8590 95Glu Ala Leu Glu Asp Ala Gly Ile Glu Ala Leu Ser Lys Glu Lys Lys100 105 110Asn Ile Gly Cys Val Leu Gly Ile Gly Gly Gly Gln Lys Ser Ser His115 120 125Glu Phe Tyr Ser Arg Leu Asn Tyr Val Val Val Glu Lys Val Leu Arg130 135 140Lys Met Gly Met Pro Glu Glu Asp Val Gln Ala Ala Val Glu Lys Tyr145 150 155 160Lys Ala Asn Phe Pro Glu Trp Arg Leu Asp Ser Phe Pro Gly Phe Leu165 170 175Gly Asn Val Thr Ala Gly Arg Cys Thr Asn Thr Phe Asn Leu Asp Gly180 185 190
Met Asn Cys Val Val Asp Ala Ala Cys Ala Ser Ser Leu Ile Ala Val195 200 205Lys Val Ala Ile Asp Glu Leu Leu His Gly Asp Cys Asp Met Met Ile210 215 220Thr Gly Ala Thr Cys Thr Asp Asn Ser Ile Gly Met Tyr Met Ala Phe225 230 235 240Ser Lys Thr Pro Val Phe Ser Thr Asp Pro Ser Val Arg Ala Tyr Asp245 250 255Glu Lys Thr Lys Gly Met Leu Ile Gly Glu Gly Ser Ala Met Leu Val260 265 270Leu Lys Arg Tyr Ala Asp Ala Val Arg Asp Gly Asp Glu Ile His Ala275 280 285Val Ile Arg Gly Cys Ala Ser Ser Ser Asp Gly Lys Ala Ser Gly Ile290 295 300Tyr Thr Pro Thr Ile Ser Gly Gln Glu Glu Ala Leu Arg Arg Ala Tyr305 310 315 320Met Arg Ala Asn Val Asp Pro Ala Thr Val Thr Leu Val Glu Gly His325 330 335Gly Thr Gly Thr Pro Val Gly Asp Arg Ile Glu Leu Thr Ala Leu Arg340 345 350Asn Leu Phe Asp Ser Ala Tyr Gly Asn Glu Lys Glu Lys Val Ala Val355 360 365Gly Ser Ile Lys Ser Asn Ile Gly His Leu Lys Ala Val Ala Gly Leu370 375 380Ala Gly Met Ile Lys Val Ile Met Ala Leu Lys His Lys Thr Leu Pro385 390 395 400Ala Thr Ile Asn Val Asp Glu Pro Pro Lys Leu Tyr Asp Asn Thr Pro405 410 415Ile Thr Asp Ser Ser Leu Tyr Ile Asn Thr Met Asn Arg Pro Trp Phe420 425 430Pro Ala Pro Gly Val Pro Arg Arg Ala Gly Ile Ser Ser Phe Gly Phe435 440 445Gly Gly Ala Asn Tyr His Ala Val Leu Glu Glu Ala450 455 460<210>33<211>332<212>PRT<213>Ulkenia sp.
<400>33Leu Phe Ser Gly Gln Gly Ala Gln Tyr Thr His Met Phe Ser Glu Val1 5 10 15
Ala Met Asn Trp Pro Gln Phe Arg Glu Ser Ile Ser Asp Met Asp Arg20 25 30Ala Gln Ala Lys Val Ala Gly Ala Asp Lys Asp Tyr Glu Arg Val Ser35 40 45Gln Val Leu Tyr Pro Arg Lys Pro Tyr Asn Ser Glu Pro Glu Gln Asp50 5560His Lys Lys Ile Ser Leu Thr Ser Tyr Ser Gln Pro Ser Thr Leu Ala65 70 75 80Cys Ala Leu Gly Ala Tyr Glu Ile Phe Lys Gln Ala Gly Phe Lys Pro85 9095Asp Phe Ala Ala Gly His Ser Leu Gly Glu Phe Ala Ala Leu Tyr Ala100 105 110Ala Asp Cys Val Asn Arg Asp Asp Leu Phe Glu Leu Val Cys Arg Arg115 120 125Ala Arg Ile Met Gly Gly Lys Asp Ala Pro Ala Thr Pro Lys Gly Cys130 135 140Met Ala Ala Val Ile Gly Pro Asn Ala Glu Lys Ile Gln Ile Arg Thr145 150 155 160Ala Asp Val Trp Leu Gly Asn Cys Asn Ser Pro Ser Gln Thr Val Ile165 170 175Thr Gly Ser Val Glu Gly Ile Lys Lys Glu Ser Glu Leu Leu Gln Ser180 185 190Glu Gly Phe Arg Val Val Pro Leu Ala Cys Glu Ser Ala Phe His Ser195 200 205Pro Gln Met Gln Asn Ala Ser Ser Ala Phe Lys Asp Val Leu Ser Lys210 215 220Val Ala Phe Arq Gln Pro Ser Ala Gln Thr Lys Leu Phe Ser Asn Val225 230 235 240Ser Gly Glu Thr Tyr Ser Asn Asn Ala Gln Asp Leu Leu Lys Glu His245 250 255Met Thr Ser Ser Val Lys Phe Ile Ser Gln Val Arg Asn Met His Ser260 265 270Ala Gly Ala Arg Ile Phe Val Glu Phe Gly Pro Lys Gln Val Leu Ser275 280 285Lys Leu Val Ser Glu Thr Leu Lys Asp Asp Pro Ser Ile Ile Thr Ile290 295 300Ser Val Asn Pro Ser Ser Gly Lys Asp Ala Asp Ile Gln Leu Arg Glu305 310 315 320Ala Ala Val Gln Leu Val Val Ala Gly Val Asn Leu325 330
<210>34<211>1170<212>PRT<213>Ulkenia sp.
<400>34Ala Gln Ala Gln Ile Gln Lys Ala Lys Ala Asp Ala Ala Glu Ala Asp1 510 15Lys Lys Leu Ala Ala Ala Lys Asp Glu Ala Lys Arg Ala Ala Ala Ser20 25 30Ala Pro Val Gln Lys Gln Val Asp Thr Thr Ile Val Asp Lys His Arg35 40 45Ala Ile Leu Lys Ser Met Leu Ala Glu Leu Asp Cys Tyr Ser Thr Pro50 5560Gly Ala Val Ser Ser Ser Phe Gln Ala Pro Val Ala Ala Thr Pro Ala65 70 75 80Pro Val Ala Ala Pro Val Ala Ala Ala Pro Ala Pro Ala Val Asn Asn85 9095Ala Leu Leu Ala Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala100 105 110Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu115 120 125Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu130 135 140Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser145 150 155 160Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile165 170 175Ala Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro Ala Pro Val Ala Ala180 185 190Ala Pro Ala Ala Pro Ala Pro Ala Val Asn Ser Ala Leu Leu Ala Lys195 200 205Ala Glu Thr Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu210 215 220Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu Thr Glu Leu Gly Ile225 230 235 240Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Gln Leu245 250 255Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val260 265 270Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Gly275 280 285Ala Ala Ala Ala Ala Pro Ala Pro Val Ala Ala Ala Pro Ala Pro Val
290 295 300Ala Ala Ala Ala Pro Ala Val Ser Ser Ala Leu Leu Glu Lys Ala Glu305 310 315 320Ser Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp325 330 335Met Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser340 345 350Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Gln Leu Asn Val355 360 365Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu370 375 380Val Val Asn Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala385 390 395 400Ala Pro Ala Pro Val Ala Ala Ala Pro Ala Pro Val Ala Ala Ala Ala405 410 415Pro Ala Val Asn Ser Ala Leu Leu Glu Lys Ala Glu Thr Val Val Met420 425 430Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro435 440 445Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val450 455 460Glu Ile Leu Ser Glu Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp465 470 475 480Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala485 490 495Met Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro500 505 5l0Ala Pro Val Ala Ala Ala Pro Ala Pro Val Ala Ala Pro Ala Val Ser515 520 525Ser Ala Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala530 535 540Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu545 550 555 560Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser565 570 575Glu Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu580 585 590Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu595 600 605Ile Ala Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro Ala Pro Val Ala610 615 620
Ala Ser Pro Ala Pro Val Ala Ala Ala Ala Pro Ala Val Ser Ser Ala625 630 635 640Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys645 650 655Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr660 665 670Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val675 680 685Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg690 695 700Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala705 710 715 720Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro Ala Pro Val Ala Ala Ala725 730 735Pro Ala Pro Val Thr Ala Ala Ala Pro Ala Val Ser Ser Ala Leu Leu740 745 750Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys Thr Gly755 760 765Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu770 775 780Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala785 790 795 800Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg805 810 815Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala Ser Ser820 825 830Ser Gly Ala Ala Ala Pro Ala Pro Ala Ala Ala Val Ala Pro Ala Pro835 840 845Ala Ala Ala Pro Ala Val Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser850 855 860Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met865 870 875 880Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile885 890 895Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Met Leu Asn Val Glu900 905 910Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val915 920 925Val Asn Ala Met Lys Ala Glu Ile Ala Ser Ser Ser Gly Ala Ala Ala930 935 940Pro Ala Pro Ala Ala Ala Ala Ala Pro Ala Pro Ala Ala Ala Pro Ala
945 950 955 960Val Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val965 970 975Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met980 985 990Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile995 1000 1005Leu Ser Glu Val Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val1010 1015 1020Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala1025 1030 1035Met Lys Ala Glu Ile Ala Ser Ser Ser Gly Ala Ala Ala Pro Ala1040 1045 1050Pro Ala Ala Ala Ala Ala Pro Ala Pro Ala Ala Ala Pro Ala Val1055 1060 1065Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val1070 1075 1080Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp1085 1090 1095Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val1100 1105 1110Glu Ile Leu Ser Glu Val Gln Ala Met Leu Asn Val Glu Ala Lys1115 1120 1125Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val1130 1135 1140Asn Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Thr1145 1150 1155Ala Ser Ala Pro Ala Ala Ala Ala Ala Ala Pro Ala1160 1165 1170<210>35<211>73<212>PRT<213>Ulkenia sp.
<400>35Leu Leu Ala Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys1 510 15Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu Thr20 25 30Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val35 40 45Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
5055 60Thr Arg Thr Val Gly Glu Val Val Asn65 70<210>36<211>73<212>PRT<213>Ulkenia sp.
<400>36Leu Leu Ala Lys Ala Glu Thr Val Val Met Glu Val Leu Ala Ala Lys1 510 15Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu Thr20 25 30Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val35 40 45Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg5055 60Thr Arg Thr Val Gly Glu Val Val Asn65 70<210>37<211>73<212>PRT<213>Ulkenia sp.
<400>37Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys1 510 15Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr20 25 30Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val35 40 45Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg50 5560Thr Arg Thr Val Gly Glu Val Val Asn65 70<210>38<211>73<212>PRT<213>Ulkenia sp.
<400>38Leu Leu Glu Lys Ala Glu Thr Val Val Met Glu Val Leu Ala Ala Lys1 510 15Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu Thr20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val35 40 45Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg50 5560Thr Arg Thr Val Gly Glu Val Val Asn65 70<210>39<211>73<212>PRT<213>Ulkenia sp.
<400>39Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys1 5 10 15Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr20 25 30Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val35 40 45Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg50 5560Thr Arg Thr Val Gly Glu Val Val Asn65 70<210>40<211>73<212>PRT<213>Ulkenia sp.
<400>40Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys1 5 1015Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr20 25 30Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val35 40 45Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg50 5560Thr Arg Thr Val Gly Glu Val Val Asn65 70<210>41<211>73<212>PRT<213>Ulkenia sp.
<400>41Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys
1 510 15Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr20 25 30Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val35 40 45Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg50 5560Thr Arg Thr Val Gly Glu Val Val Asn65 70<210>42<211>73<212>PRT<213>Ulkenia sp.
<400>42Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys1 510 15Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr20 25 30Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val3540 45Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg50 5560Thr Arg Thr Val Gly Glu Val Val Asn65 70<210>43<211>73<212>PRT<213>Ulkenia sp.
<400>43Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys1 5 1015Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr20 25 30Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val35 4045Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg50 5560Thr Arg Thr Val Gly Glu Val Val Asn65 70<210>44<211>73<212>PRT
<213>Ulkenia sp.
<400>44Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys1 510 15Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr20 25 30Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val35 40 45Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg50 5560Thr Arg Thr Val Gly Glu Val Val Asn65 70<210>45<211>203<212>PRT<213>Ulkenia sp.
<400>45Lys Lys Leu Val Gly Thr Ile Ala Gly Ala Arg Glu Val Arg Ser Ser1 510 15Ile Ala Asn Ile Glu Ala Leu Gly Gly Lys Ala Ile Tyr Ser Ser Cys20 25 30Asp Val Asn Ser Ala Ala Asp Val Ala Lys Ala Val Arg Glu Ala Glu35 40 45Ala Gln Leu Gly Ala Arg Val Thr Gly Val Val His Ala Ser Gly Val5055 60Leu Arg Asp Arg Leu Ile Glu Gln Lys Arg Pro Asp Glu Phe Asp Ala65 70 75 80Val Phe Gly Thr Lys Val Thr Gly Leu Glu Asn Leu Phe Gly Ala Ile85 9095Asp Met Ala Asn Leu Lys His Leu Val Leu Phe Ser Ser Leu Ala Gly100 105 110Phe His Gly Asn Ile Gly Gln Ser Asp Tyr Ala Met Ala Asn Glu Ala115 120 125Leu Asn Lys Met Gly Leu Glu Leu Ser Asp Arg Val Ser Val Lys Ser130 135 140Ile Cys Phe Gly Pro Trp Asp Gly Gly Met Val Thr Pro Gln Leu Lys145 150 155 160Lys Gln Phe Gln Ser Met Gly Val Gln Ile Ile Pro Arg Glu Gly Gly165 170 175Ala Asp Thr Val Ala Arg Ile Val Leu Gly Ser Ser Pro Ala Glu Ile180 185190
Leu Val Gly Asn Trp Thr Thr Pro Thr Lys Lys195 200<210>46<211>780<212>DNA<213>Ulkenia sp.
<400>46aagcgcattg ccgtggtggg catggccgtg caatacgcgg gctgcaaaga caaggaagag60ttctggaaag tagtcatggg cggtgaggct gcatggacta agattagcga taaacgcctc120ggatccaaca agcgagccga gcacttcaaa gcagagcgta gcaaatttgc agataccttt180tgcaacgaga actacggctg cgtcgatgac tccgtcgata acgaacacga gcttctcctt240aagctctcca agaaggctct ctccgagaca tcggtctccg actctacaag gtgcggtatt300gtgagcggat gcctgtcctt tcccatggac aacctccagg gcgaactcct caatgtgtac360caaaaccacg tcgaaaagaa actcggcgct cgcgtcttca aggatgcctc caagtggtcc420gagcgtgagc agtcgcagaa ccccgaggct ggtgaccgcc gcatctttat ggacccggca480tccttcgtag cagaagagct caacctcggt cctcttcact actctgtcga tgctgcctgt540gccaccgccc tttacgtcct tcgcctcgcc caggaccacc tcgtttccgg tgctgctgat600gtcatgctcg ctggtgcaac ttgcttcccg gagccctttt tcattctctc cggattctcc660actttccagg ccatgcctgt atcgggagac ggcatctcgt acccgcttca caaggacagt720cagggtctca cccctggtga aggtggtgcc attatggttc tcaagcgcct tgacgacgct780<210>47<211>51<212>DNA<213>Ulkenia sp.
<400>47cctcttcact actctgtcga tgctgcctgt gccaccgccc tttacgtcct t51<210>48<211>12<212>DNA<213>Ulkenia sp.
<400>48gatgctgcct gt12<210>49<211>477<212>DNA<213>Ulkenia sp.
<400>49
tacggtactc tgctcggtgc taccatcagc aatgctggct gtggtcttcc cctcaagccg60cacttgccca gcgagaagtc ctgcctcatt gatacctaca agcgcgtcaa cgtgcacccg120cacaagatcc agtacgtcga gtgccacgca acgggtactc cccagggaga ccgcgttgag180attgatgccg tcaaggcttg cttcgagggc aaggtgcctc gctttggaag ctccaagggt240aactttggcc acacactcgt tgcagctggt ttcgcaggca tgtgcaaggt actccttgcc300atgaagcatg gtgtgatccc gcccactcct ggtgtcgatg gatcttccca aatggacccg360cttgtggtct ctgagcccat cccatggccc gacactgagg gcgagcccaa gcgcgctggt420ctctccgctt tcggctttgg tggcaccaac gcccacgcag tctttgagga gtttgac 477<210>50<211>1278<212>DNA<213>Ulkenia sp.
<400>50aagcgcattg ccgtggtggg catggccgtg caatacgcgg gctgcaaaga caaggaagag60ttctggaaag tagtcatggg cggtgaggct gcatggacta agattagcga taaacgcctc120ggatccaaca agcgagccga gcacttcaaa gcagagcgta gcaaatttgc agataccttt180tgcaacgaga actacggctg cgtcgatgac tccgtcgata acgaacacga gcttctcctt240aagctctcca agaaggctct ctccgagaca tcggtctccg actctacaag gtgcggtatt300gtgagcggat gcctgtcctt tcccatggac aacctccagg gcgaactcct caatgtgtac360caaaaccacg tcgaaaagaa actcggcgct cgcgtcttca aggatgcctc caagtggtcc420gagcgtgagc agtcgcagaa ccccgaggct ggtgaccgcc gcatctttat ggacccggca480tccttcgtag cagaagagct caacctcggt cctcttcact actctgtcga tgctgcctgt540gccaccgccc tttacgtcct tcgcctcgcc caggaccacc tcgtttccgg tgctgctgat600gtcatgctcg ctggtgcaac ttgcttcccg gagccctttt tcattctctc cggattctcc660actttccagg ccatgcctgt atcgggagac ggcatctcgt acccgcttca caaggacagt720cagggtctca cccctggtga aggtggtgcc attatggttc tcaagcgcct tgacgacgct780attcgcgatg gagaccacat ttacggtact ctgctcggtg ctaccatcag caatgctggc840tgtggtcttc ccctcaagcc gcacttgccc agcgagaagt cctgcctcat tgatacctac900aagcgcgtca acgtgcaccc gcacaagatc cagtacgtcg agtgccacgc aacgggtact960ccccagggag accgcgttga gattgatgcc gtcaaggctt gcttcgaggg caaggtgcct1020cgctttggaa gctccaaggg taactttggc cacacactcg ttgcagctgg tttcgcaggc1080atgtgcaagg tactccttgc catgaagcat ggtgtgatcc cgcccactcc tggtgtcgat1140
ggatcttccc aaatggaccc gcttgtggtc tctgagccca tcccatggcc cgacactgag1200ggcgagccca agcgcgctgg tctctccgct ttcggctttg gtggcaccaa cgcccacgca1260gtctttgagg agtttgac 1278<210>51<211>801<212>DNA<213>Ulkenia sp.
<400>51atgcgcattg ctattaccgg tatggatgcc accttcggct ccctcaaggg cctggacgcc60tttgagcgtg ccatctacaa tggccaacat ggtgctgtgc cattgcctga gaagcgctgg120cgtttccttg gtaaagacaa ggactttttg gacctgtgcg gtgtcaagga ggtgccccac180ggatgctaca ttgaggacgt cgaggtggac tttagccgcc tgcgcacgcc catgacgcca240gacgacatgt tgcgccccat gcagctactt gctgtcacaa ccatcgaccg tgccattctc300aactctggcc tcaagaaggg aggtaaggtc gctgtcttcg tcggccttgg cactgacctt360gagctctacc gtcaccgcgc ccgcgttgcc ctcaaggagc gtgctcgtcc cgaagccgct420tcagccctca atgatatgat gtcctacatc aacgattgcg gtaccgctac ctcgtacaca480tcctacatcg gcaacctcgt ggccacccgc gtgtcttcac aatggggttt cgagggtcct540tctttcacca tcacagaggg caacaactcc gtctaccgtt gcgcagagtt gggcaagtac600ttgctcgaga ctggcgaggt cgaggccgta gtgatcgccg gtgtggatct ttgcgccagc660gctgagaatc tctacgtgaa gtcgcgtcgt ttcaaggtct cggagcagga gagcccgcgg720gccagcttcg actccggcgc tgacggctac tttgttggtg agggatgtgg tgccctcgtc780ctcaagcgcg agagcgactg c 801<210>52<211>792<212>DNA<213>Ulkenia sp.
<400>52gctgctttcg gactgagcct tggagagatt tccatggttt ttgccttttc tgagaagaac60ggccttgtct ctgaggagct gacaactaaa ctccgcaact cggaggtctg gcgtaaggcc120ctcgctgttg agtttgacgc cctccgcaag gcctggaata ttccccaaga tacccctgtc180agcgagttct ggcaaggata cgtggtacgt ggaacccgcg aggccgttga agcggccatc240ggccccaaca ataagtacgt gcacttgacc attgtcaacg atgccaacag tgctctcatc300agtggcaagc ctgaagattg caaggctgcc attgctcgcc tgagcagcaa cctccctgct360ttgcccgtgg accttggtat gtgtggccac tgccccgtgg tcgagccgta cggcaagcag420
atcgctgaga tccatagcgt cctcgagatt cccgaggttg ccggccttga cctgtacacg480agcgtcaacc agaagaagct tgttaacaag tccactggag ccagcgacga gtacgcaccc540agctttggtg aatacgcagc acagctgtac actgttcagg cagactttcc taagatcgcc600aagaccgtta gcgacaagaa ctttgacgtc tttgttgaga ctggtcccaa cgcccaccgt660agcgccgcaa ttcgcgccac ccttggaaat agcaagcctt ttgtcaccgg atccatggac720cgccagaacg agaatgcttg gacaaccatg gtcaagctgg ttgcctctct ccaagcccac780cgcgtgcctg gc792<210>53<211>1302<212>DNA<213>Ulkenia sp.
<400>53agccgtgcct tcatggagac atatggtgta tccgccccca tgtacaccgg cgccatggca60aagggcattg catccgctga gatggttatc gctgccggaa agcgcggcat ccttggttct120ctcggtgctg gtggtcttcc tatcgccacc gtacgcaagg ctctcgaagc tatccaggct180gaactgccca agggccctta cgctgtcaac ctcatccact ctcccttcga cagcaacctc240gagaagggta acgtcgacct cttcctcgag aagggcgtca ctgtcgttga agcctccgcc300tttatgacct tgaccccgca gctcgtgcgc taccgtgctg caggtctctc tcgcgctgct360gatggctcca cggttattaa gaaccgcgtc atcggtaagg tttctcgcac agagcttgcc420gcaatgttta tccgtcccgc gcccgagaat ctcctcgaga agctgctgaa gtccggcgag480atcacccaag agcaggctgc tctcgcacgc acagtgcctg tggcagacga cattgccgtt540gaggcggact ccggtggcca caccgataac cgccccatcc acgtcatcct ccctctcatt600gtcaacctcc gtgatcgtct gcacaaggag tgcggctacc ctgcccacct tcgcgttcgc660gttggtgctg gtggtggcat tggatgccct caggccgcca ttgccacctt caacatgggc720gcggccttca tcgtcactgg taccgtaaac cagatgagta agcaagctgg aacctgtgac780accgttcgca agcagctctc acaagccacc tactccgaca tctgcatggc cccagcagct840gacatgtttg aggaaggtgt caagctccag gtgctcaaga agggaactat gttcccctcg900cgtgccaaca agctctatga gctcttcgtc aagtatgact cctttgagtc catggctcct960ggagagctgg aacgtgtgga gaagcgcatt ttcaagaagt ctctgtcaga agtttgggaa1020gagaccaagg acttctacat caacaggttg cagaacccgg agaagattga gcgcgcggag1080cgtgacccca agcttaagat gtccttgtgc ttccgctggt accttggttt ggcgagcttc1140tgggcaaacg ctggcatccc ggaccgtgcc atggactacc aggtttggtg tggcccagcg1200
attggatctt tcaacgactt catcaagggt acctaccttg accccgccgt tgccaacgag1260taccccgatg ttgtgcaaat caacttgcag atcctccgtg gt1302<210>54<211>260<212>PRT<213>Ulkenia sp.
<400>54Lys Arg Ile Ala Val Val Gly Met Ala Val Gln Tyr Ala Gly Cys Lys1 510 15Asp Lys Glu Glu Phe Trp Lys Val Val Met Gly Gly Glu Ala Ala Trp20 25 30Thr Lys Ile Ser Asp Lys Arg Leu Gly Ser Asn Lys Arg Ala Glu His3540 45Phe Lys Ala Glu Arg Ser Lys Phe Ala Asp Thr Phe Cys Asn Glu Asn50 5560Tyr Gly Cys Val Asp Asp Ser Val Asp Asn Glu His Glu Leu Leu Leu65 70 75 80Lys Leu Ser Lys Lys Ala Leu Ser Glu Thr Ser Val Ser Asp Ser Thr85 9095Arg Cys Gly Ile Val Ser Gly Cys Leu Ser Phe Pro Met Asp Asn Leu100 105 110Gln Gly Glu Leu Leu Asn Val Tyr Gln Asn His Val Glu Lys Lys Leu115 120 125Gly Ala Arg Val Phe Lys Asp Ala Ser Lys Trp Ser Glu Arg Glu Gln130 135 140Ser Gln Asn Pro Glu Ala Gly Asp Arg Arg Ile Phe Met Asp Pro Ala145 150 155 160Ser Phe Val Ala Glu Glu Leu Asn Leu Gly Pro Leu His Tyr Ser Val165 170 175Asp Ala Ala Cys Ala Thr Ala Leu Tyr Val Leu Arg Leu Ala Gln Asp180 185 190His Leu Val Ser Gly Ala Ala Asp Val Met Leu Ala Gly Ala Thr Cys195 200 205Phe Pro Glu Pro Phe Phe Ile Leu Ser Gly Phe Ser Thr Phe Gln Ala210 215 220Met Pro Val Ser Gly Asp Gly Ile Ser Tyr Pro Leu His Lys Asp Ser225 230 235 240Gln Gly Leu Thr Pro Gly Glu Gly Gly Ala Ile Met Val Leu Lys Arg245 250 255Leu Asp Asp Ala260
<210>55<211>17<212>PRT<213>Ulkenia sp.
<400>55Pro Leu His Tyr Ser Val Asp Ala Ala Cys Ala Thr Ala Leu Tyr Val1 510 15Leu<210>56<211> 4<212>PRT<213>Ulkenia sp.
<400>56Asp Ala Ala Cys1<210>57<211>159<212>PRT<213>Ulkenia sp.
<400>57Tyr Gly Thr Leu Leu Gly Ala Thr Ile Ser Asn Ala Gly Cys Gly Leu1 5 1015Pro Leu Lys Pro His Leu Pro Ser Glu Lys Ser Cys Leu Ile Asp Thr20 25 30Tyr Lys Arg Val Asn Val His Pro His Lys Ile Gln Tyr Val Glu Cys35 40 45His Ala Thr Gly Thr Pro Gln Gly Asp Arg Val Glu Ile Asp Ala Val5055 60Lys Ala Cys Phe Glu Gly Lys Val Pro Arg Phe Gly Ser Ser Lys Gly65 70 7580Asn Phe Gly His Thr Leu Val Ala Ala Gly Phe Ala Gly Met Cys Lys85 90 95Val Leu Leu Ala Met Lys His Gly Val Ile Pro Pro Thr Pro Gly Val100 105 110Asp Gly Ser Ser Gln Met Asp Pro Leu Val Val Ser Glu Pro Ile Pro115 120 125Trp Pro Asp Thr Glu Gly Glu Pro Lys Arg Ala Gly Leu Ser Ala Phe130 135 140Gly Phe Gly Gly Thr Asn Ala His Ala Val Phe Glu Glu Phe Asp145 150 155<210>58<211>426
<212>PRT<213>Ulkenia sp.
<400>58Lys Arg Ile Ala Val Val Gly Met Ala Val Gln Tyr Ala Gly Cys Lys1 5 1015Asp Lys Glu Glu Phe Trp Lys Val Val Met Gly Gly Glu Ala Ala Trp20 25 30Thr Lys Ile Ser Asp Lys Arg Leu Gly Ser Asn Lys Arg Ala Glu His35 40 45Phe Lys Ala Glu Arg Ser Lys Phe Ala Asp Thr Phe Cys Asn Glu Asn50 55 60Tyr Gly Cys Val Asp Asp Ser Val Asp Asn Glu His Glu Leu Leu Leu65 70 75 80Lys Leu Ser Lys Lys Ala Leu Ser Glu Thr Ser Val Ser Asp Ser Thr85 90 95Arg Cys Gly Ile Val Ser Gly Cys Leu Ser Phe Pro Met Asp Asn Leu100 105 110Gln Gly Glu Leu Leu Asn Val Tyr Gln Asn His Val Glu Lys Lys Leu115 120 125Gly Ala Arg Val Phe Lys Asp Ala Ser Lys Trp Ser Glu Arg Glu Gln130 135 140Ser Gln Asn Pro Glu Ala Gly Asp Arg Arg Ile Phe Met Asp Pro Ala145 150 155 160Ser Phe Val Ala Glu Glu Leu Asn Leu Gly Pro Leu His Tyr Ser Val165 170 175Asp Ala Ala Cys Ala Thr Ala Leu Tyr Val Leu Arg Leu Ala Gln Asp180 185 190His Leu Val Ser Gly Ala Ala Asp Val Met Leu Ala Gly Ala Thr Cys195 200 205Phe Pro Glu Pro Phe Phe Ile Leu Ser Gly Phe Ser Thr Phe Gln Ala210 215 220Met Pro Val Ser Gly Asp Gly Ile Ser Tyr Pro Leu His Lys Asp Ser225 230 235 240Gln Gly Leu Thr Pro Gly Glu Gly Gly Ala Ile Met Val Leu Lys Arg245 250 255Leu Asp Asp Ala Ile Arg Asp Gly Asp His Ile Tyr Gly Thr Leu Leu260 265 270Gly Ala Thr Ile Ser Asn Ala Gly Cys Gly Leu Pro Leu Lys Pro His275 280 285Leu Pro Ser Glu Lys Ser Cys Leu Ile Asp Thr Tyr Lys Arg Val Asn290 295 300
Val His Pro His Lys Ile Gln Tyr Val Glu Cys His Ala Thr Gly Thr305 310 315 320Pro Gln Gly Asp Arg Val Glu Ile Asp Ala Val Lys Ala Cys Phe Glu325 330 335Gly Lys Val Pro Arg Phe Gly Ser Ser Lys Gly Asn Phe Gly His Thr340 345 350Leu Val Ala Ala Gly Phe Ala Gly Met Cys Lys Val Leu Leu Ala Met355 360 365Lys His Gly Val Ile Pro Pro Thr Pro Gly Val Asp Gly Ser Ser Gln370 375 380Met Asp Pro Leu Val Val Ser Glu Pro Ile Pro Trp Pro Asp Thr Glu385 390 395 400Gly Glu Pro Lys Arg Ala Gly Leu Ser Ala Phe Gly Phe Gly Gly Thr405 410 415Asn Ala His Ala Val Phe Glu Glu Phe Asp420 425<210>59<211>267<212>PRT<213>Ulkenia sp.
<400>59Met Arg Ile Ala Ile Thr Gly Met Asp Ala Thr Phe Gly Ser Leu Lys1 5 1015Gly Leu Asp Ala Phe Glu Arg Ala Ile Tyr Asn Gly Gln His Gly Ala20 25 30Val Pro Leu Pro Glu Lys Arg Trp Arg Phe Leu Gly Lys Asp Lys Asp35 40 45Phe Leu Asp Leu Cys Gly Val Lys Glu Val Pro His Gly Cys Tyr Ile50 5560Glu Asp Val Glu Val Asp Phe Ser Arg Leu Arg Thr Pro Met Thr Pro65 70 75 80Asp Asp Met Leu Arg Pro Met Gln Leu Leu Ala Val Thr Thr Ile Asp8590 95Arg Ala Ile Leu Asn Ser Gly Leu Lys Lys Gly Gly Lys Val Ala Val100 105 110Phe Val Gly Leu Gly Thr Asp Leu Glu Leu Tyr Arg His Arg Ala Arg115 120 125Val Ala Leu Lys Glu Arg Ala Arg Pro Glu Ala Ala Ser Ala Leu Asn130 135 140Asp Met Met Ser Tyr Ile Asn Asp Cys Gly Thr Ala Thr Ser Tyr Thr145 150 155160
Ser Tyr Ile Gly Asn Leu Val Ala Thr Arg Val Ser Ser Gln Trp Gly165 170 175Phe Glu Gly Pro Ser Phe Thr Ile Thr Glu Gly Asn Asn Ser Val Tyr180 185 190Arg Cys Ala Glu Leu Gly Lys Tyr Leu Leu Glu Thr Gly Glu Val Glu195 200 205Ala Val Val Ile Ala Gly Val Asp Leu Cys Ala Ser Ala Glu Asn Leu210 215 220Tyr Val Lys Ser Arg Arg Phe Lys Val Ser Glu Gln Glu Ser Pro Arg225 230 235 240Ala Ser Phe Asp Ser Gly Ala Asp Gly Tyr Phe Val Gly Glu Gly Cys245 250 255Gly Ala Leu Val Leu Lys Arg Glu Ser Asp Cys260 265<210>60<211>264<212>PRT<213>Ulkenia sp.
<400>60Ala Ala Phe Gly Leu Ser Leu Gly Glu Ile Ser Met Val Phe Ala Phe1 5 1015Ser Glu Lys Asn Gly Leu Val Ser Glu Glu Leu Thr Thr Lys Leu Arg20 25 30Asn Ser Glu Val Trp Arg Lys Ala Leu Ala Val Glu Phe Asp Ala Leu35 40 45Arg Lys Ala Trp Asn Ile Pro Gln Asp Thr Pro Val Ser Glu Phe Trp50 5560Gln Gly Tyr Val Val Arg Gly Thr Arg Glu Ala Val Glu Ala Ala Ile65 70 75 80Gly Pro Asn Asn Lys Tyr Val His Leu Thr Ile Val Asn Asp Ala Asn85 90 95Ser Ala Leu Ile Ser Gly Lys Pro Glu Asp Cys Lys Ala Ala Ile Ala100 105 110Arq Leu Ser Ser Asn Leu Pro Ala Leu Pro Val Asp Leu Gly Met Cys115 120 125Gly His Cys Pro Val Val Glu Pro Tyr Gly Lys Gln Ile Ala Glu Ile130 135 140His Ser Val Leu Glu Ile Pro Glu Val Ala Gly Leu Asp Leu Tyr Thr145 150 155 160Ser Val Asn Gln Lys Lys Leu Val Asn Lys Ser Thr Gly Ala Ser Asp165 170 175
Glu Tyr Ala Pro Ser Phe Gly Glu Tyr Ala Ala Gln Leu Tyr Thr Val180 185 190Gln Ala Asp Phe Pro Lys Ile Ala Lys Thr Val Ser Asp Lys Asn Phe195 200 205Asp Val Phe Val Glu Thr Gly Pro Asn Ala His Arg Ser Ala Ala Ile210 215 220Arg Ala Thr Leu Gly Asn Ser Lys Pro Phe Val Thr Gly Ser Met Asp225 230 235 240Arg Gln Asn Glu Asn Ala Trp Thr Thr Met Val Lys Leu Val Ala Ser245 250 255Leu Gln Ala His Arg Val Pro Gly260<210>61<211>434<212>PRT<213>Ulkenia sp.
<400>61Ser Arg Ala Phe Met Glu Thr Tyr Gly Val Ser Ala Pro Met Tyr Thr1 5 1015Gly Ala Met Ala Lys Gly Ile Ala Ser Ala Glu Met Val Ile Ala Ala20 25 30Gly Lys Arg Gly Ile Leu Gly Ser Leu Gly Ala Gly Gly Leu Pro Ile35 4045Ala Thr Val Arg Lys Ala Leu Glu Ala Ile Gln Ala Glu Leu Pro Lys50 5560Gly Pro Tyr Ala Val Asn Leu Ile His Ser Pro Phe Asp Ser Asn Leu65 70 75 80Glu Lys Gly Asn Val Asp Leu Phe Leu Glu Lys Gly Val Thr Val Val85 90 95Glu Ala Ser Ala Phe Met Thr Leu Thr Pro Gln Leu Val Arg Tyr Arg100 105 110Ala Ala Gly Leu Ser Arg Ala Ala Asp Gly Ser Thr Val Ile Lys Asn115 120 125Arg Val Ile Gly Lys Val Ser Arg Thr Glu Leu Ala Ala Met Phe Ile130 135 140Arq Pro Ala Pro Glu Asn Leu Leu Glu Lys Leu Leu Lys Ser Gly Glu145 150 155 160Ile Thr Gln Glu Gln Ala Ala Leu Ala Arg Thr Val Pro Val Ala Asp165 170 175Asp Ile Ala Val Glu Ala Asp Ser Gly Gly His Thr Asp Asn Arg Pro180 185 190
Ile His Val Ile Leu Pro Leu Ile Val Asn Leu Arg Asp Arg Leu His195 200 205Lys Glu Cys Gly Tyr Pro Ala His Leu Arg Val Arg Val Gly Ala Gly210 215 220Gly Gly Ile Gly Cys Pro Gln Ala Ala Ile Ala Thr Phe Asn Met Gly225 230 235 240Ala Ala Phe Ile Val Thr Gly Thr Val Asn Gln Met Ser Lys Gln Ala245 250 255Gly Thr Cys Asp Thr Val Arg Lys Gln Leu Ser Gln Ala Thr Tyr Ser260 265 270Asp Ile Cys Met Ala Pro Ala Ala Asp Met Phe Glu Glu Gly Val Lys275 280 285Leu Gln Val Leu Lys Lys Gly Thr Met Phe Pro Ser Arg Ala Asn Lys290 295 300Leu Tyr Glu Leu Phe Val Lys Tyr Asp Ser Phe Glu Ser Met Ala Pro305 310 315 320Gly Glu Leu Glu Arg Val Glu Lys Arg Ile Phe Lys Lys Ser Leu Ser325 330 335Glu Val Trp Glu Glu Thr Lys Asp Phe Tyr Ile Asn Arg Leu Gln Asn340 345 350Pro Glu Lys Ile Glu Arg Ala Glu Arg Asp Pro Lys Leu Lys Met Ser355 360 365Leu Cys Phe Arg Trp Tyr Leu Gly Leu Ala Ser Phe Trp Ala Asn Ala370 375 380Gly Ile Pro Asp Arg Ala Met Asp Tyr Gln Val Trp Cys Gly Pro Ala385 390 395 400Ile Gly Ser Phe Asn Asp Phe Ile Lys Gly Thr Tyr Leu Asp Pro Ala405 410 415Val Ala Asn Glu Tyr Pro Asp Val Val Gln Ile Asn Leu Gln Ile Leu420 425 430Arg Gly<210>62<211>2000<212>DNA<213>Ulkenia sp.
<400>62gagcacgcac catcttctct ccacgcgtaa agaagagcag agccagaggc aggtaggtat60ctccacccat ctcaggctgt gacttctttg tttctttctt tctttgcttg ttttctgttc120tctctctgtg ctctgtccac acgagaaaga gaaagagaga gagaaagaac cacgggttta180tagagcgcac tcgtccttcc tgcttcagca gaaagcactg cgtaggagaa ctacggggga240
ggaggaagca cgcacggagg aggcgtggaa ggaaggagga gacagagaga gagagacact300gagggacaga gggggagagg cagagggaga ggcatctgat gtttgcgaga aaccaataag360ttttgaaagt gatttgattt agctgattga ctgatctatg gcctgaaaga aagcttttaa420agcggaggga gatagatgac gagggcagct gcgatggcgt acggcgcatc cgtctctctc480tgtgtctctc tctctttctc tctcgtcagg gcgtggagac ctcggaagct gcacgcggcg540cggtgaggag gcagggcagc agagggagag gagagatccc agagtcgaag agcattgatt600gattgcagat gatcttgggc aacgcgcgtc agcttgagcg aggaatgctt tggacttcag660gttcttcgct tctgtgtttc attctttctc gaagaaagaa agaatgaaag aaagagagaa720agaaagaaag aaagaaagaa agaaagaaag aaagaaagaa tgaatgaatg aaagaaagag780agaaagaaag aacgaatgaa agaaagagag aaagaatcaa agagaaagcg cattcgcagt840tcttcttcgt gaaagaaaag gaaaagagag gcgatggtag gctctgatct catcatttct900ggtttctctg ttgtacctgt actctgtgct tgtggccttg cgaaggctga agacgccatg960cagacaacca cgcctccgca gagactttgc gggaaagcag agggcttctc gccactctcg1020aagaaacgag ctcgccagtt ttcggggttg ttctcagaat tgcgagtgtt ggctttatat1080gggatgatgg tatggcactt cgtcatcgtt actctcgctc gcttgcttac gaagattttc1140aaaagggcga aagaagtgct cagcttttaa aataaagtca caccaaagac taggccgcat1200agcagaaagc taaagtaaac ccaatctgtc tgaagagagt gtcgtggtta gatacttacg1260caagagttta aaagctgtaa atagtacagg aacaaaaaca aataaatata tatatattct1320tttttattag taaaacatga aaccaaaaaa ctcctttaaa ataaaataaa ataaaataaa1380ataaaataaa ataaaataaa tttactacta tatatacata tatatataca ataaataaaa1440acaacttttt cagaccagaa aaagactgag aaaaaaggaa actaatgact ctcgagcacc1500gagagcgata taagagtgga ttatatttgc taggcccacc acgagtgagt cccctaggag1560gaagcgccct ctgagacagg agcagaggcg tcgctggtgc tccaaaaagc gacggcgaat1620ggaaagcaaa accctttcga gggaggcttg tggccgtgac tattcaaatc tccagcatct1680cagctccagc acagcagaag ctacctcgct tctcagctct agctatcaca tcgatcgcag1740catctagctc gtagacagct agcgccgcac cttcccccaa atcaacttgg gcaacttaac1800tcttttttca ccagaactcc tcttttcctt taatcttcga aaagaagacg aataaaagag1860ataatcctct gccgcagcac attctaaaag aaaagcggca tactggcgta ggcaagactt1920tcaagctctt cctcgcctcc accccgtatt tccctgttca tctttgtgaa acgaggaaac1980aagaaatttt ataggacaag2000
<210>63<211>2000<212>DNA<213>Ulkenia sp.
<400>63agttgtgagg ctgtcttgtc ttgtcagtcg cgaaagtgta agcaagaact ttgtcataca60aagaagcaac caacttccga accaacacac cttgtaggat tacaaccaca actttctata120aatagtgcgc aagaataacc agtaagctat ccttcgtgta cctgttacaa caacgacatt180tttacttgat cttcctactt gtgatgggta gtcccggctt gtactgacag tgatgccaca240gcagagtaga tcactgtgaa taagtaaata agcctactta ttatattccc aaagtactcg300ctgggatatt attagtatca cgaaaagtga tatgttttat aactcgcttg tcttgccaag360atctaacctt ttttttttaa atggccaaaa agtcgccaga acacatctta caataaacaa420aaatttagat tatatcgtat gtataatgta taatatatta tattattata tacatacgat480ataatctaaa gccattccag acttattcgg tgatgaaaaa tgctttccca gctttataca540aactattcaa aaagttgcat gacccatttt cagatatatt taatagtata agattatgtc600catttgtttt caaagttatt caagagttta catcttgaag tttcatccct ttactactac660actgtttttc gtttgggttt tttctctaac ggcgaaagaa acaagtcacc aagcttaact720agtaggcatc tttgtggtga cgaaattaaa gttgaatata taaattatag ttagtcatta780tggaatctca gtttgaacga agctaagcta tttataaaaa tcactgcatg gagataatac840ttgaattttg atgatagtgt ttatgaagaa gtttaatctt gctttttatt aatgttattc900tctaatatag aaatatttca ataaaaaaat catatgaagg gataataaat acagagaatg960atcgttatca tttgatatgt cgaacgctaa tctatcatct tatctaggaa acaaaggtgg1020aaataaagga aagccctaca cgagttaatt cctcaaacga actactttgg attatcaaat1080ccaactgctg acactggata catgcatgta tttagtgggt gttactgtac ttccttattt1140cctttaattc aattgtcttg atttttactt cggagattct acttgaaaat catctccctt1200cacttccggt tatacagaaa gacccttcaa ttcgaatgct ggccaggtac aataactatc1260agcgattccc ctccactaga catgaccgac tgtaagcacc tcaacccgat ttcaagcaac1320acatgatgac tagctgtttc cgcaaaacaa caaataagag aggtagtgga aaacacccag1380ttcgctcgag ctcccctagt agattcgaca ttcactttct atttgattgc taattgtggg1440tccggctatt taaggaaaga actgatgaaa gtccacctca cgcaatcaaa tcgcggtcta1500gttggaagct acaatggccg acgtatgcgc gcctctatct tttaggattg tagaacaggg1560cggcaatctg ctaacataaa tttaatacct tgctcaagct gctttccata cttttcaatc1620catttgtgat aatcttgcaa tggaccaatc tccaaatctg tagaagcaat aacaaggaca1680
tcgcagggtc ccggttcgtt tgcatgctcg tcttctggtg ccacaacaat gctgcctgtt1740attatctcat gagagtcttt atactgcgga tccgtggcta tagcgtgaat aaacgttgtg1800cgcaagccta tatcctcgcg atggagatac tggcctgcta cagtttgcgt tcgtctgcct1860acgacaacgc atggaacatt ctttggtgtg cgagtgggcc gtagcgttcg accctgggca1920aggaagccat gcagacgtga ttccgagagg ccatctcgcg tgtaagactt atcccaattt1980tctggatcct ctaatttcca2000<210>64<211>2000<212>DNA<213>Ulkenia sp.
<400>64aaattaatga atgaatcaat gaatgaatca atgaataatg ccaatgcaat gcgatgcgat60gctgcttcga gccatcgcac ggcggccatt gcgcgcttgc gtcagtcatg tcattccatt120cggagcggcg tgcgcgaggg agggagggag ggagggagaa gacgaggagc aggcggagag180agaggaggat gggcgggcgg gcggcgtcgt cggcgtcgtc gtcgtcgtgg gcctccgtag240tcgctgggaa ggagggcttt gattccaaat gaggattttg gtgcactgct ttcgagactt300tctcgcctga ttcggaattc ctcctcttct tcttcttttt agctgtgctt tctgcgtatt360cattgcgtgg gtttggcttg gttttcaaat caattagcag tctagtaact aacaaactaa420caaacagata aacagacaaa cagacaaaca aacaaaacaa acaaaacaaa caaaacaaag480caggaaagaa agaaacaaac aaatatacaa acaaagaaag aaagaagtgg tgggaactag540ggaaatcaat gtgtttgctt ctttcgcacc tttgcttttc ttgcttttct tggttctcaa600gtaagcgttt atcgcgccct cagaaaacaa aataaaatga tctaacataa catgaattta660tatttatttt atttgtttat taaataaata ttttttgtaa accagaattt cactctactt720ttgcaacact gagagagtgc catctgcata ataagtggca gtgttttttt gtttattttc780aaattaatta tacttgaact gctaggtcaa gaggccgcag cagcctgatg agataaggac840agagtaggca aggatggcag aagatcgcga aaaaagcgag aaaggcaaac gagcaggccc900gaaggtgagg tggagctgct tgtcaaggtc gcgaggtttg tttgacagtt ataacagcaa960gaactaaggc aatttcaaga atgaagagca ctcgaataaa ccgatgaagc aaagtgtgta1020catacaaaca tacatacgta cagatgaaaa gaacagattt tcaataaaaa tgacttttta1080gtttaaacaa tgtttctgtt tgttgtttcg cttttcatta atttgttgca aattattttg1140ttttggtttt tgtttttgtt tttgaaaatc ataaaagaga tgctgccgca gacgtctgcg1200cgtctcatag ttgattgggt aatcgttttg ttgagttttg aaaatgtaaa cttcacttag1260
ttgctcattt atcctcattc gtttgcccat ttgttctctg tttgaagcag agttttgact1320tctcgcattc gtggaatcca ccccttgctt gctttgcttg cttgcttgct tgcttgcttg1380cctgcttgct ttgcttgctt gcttgaccag cgtgcgcgct ttcgccagcc tagccttcga1440gacctcttga agaccctttg gagcgtctag ttcgaggttc tttctatttg cttcaagaga1500gacaaaataa caaagaaaaa gagagaaaaa acaagcaaag aaagaaacaa ggaaacaaac1560cacaaagcac gcatcgtgca tccaaacttt catcccccca ctctctctct ctctctctct1620ctctctctcc ttcctcggaa aaggagtgag acaaaggcag acagcctcta gcttggcagc1680ctcgcagctc gtgcggcgcc agttcctaca gcttcgcgct gtccaaacgc cagtccatcg1740cagcttcggc tagctagttg gctgattgat tgattgattg attgatagcc tttattacgg1800cgttgattaa ctgattgatt atttgattgc tctggcatcc ctgtaatcac ttgctcaagg1860tagtcaatca catcatttat acatctcctc caaagcaaac catctacacg accgcttttt1920gatcgatcta aaagtgccgg tcaggtgaca cgcaagctct tttttttgtt tacagtaagc1980agcaacaaga aagcaaaaag2000<210>65<211>2000<212>DNA<213>Ulkenia sp.
<400>65gcccaatttg ctcctgatct gttcccatga ttatgatagg gataggtagt agttatagct60agactcattc cattcactta atccacatat gcaaattata attttatgtg tcgcatataa120actttccaaa ctttaaaatt ttcatttgca ttttatatat agatcacctg tgatcccttt180ctcgcccctt tcaacttcca aagtttacct actatcatat ggcatggcgc agccaatgca240ctctataaca tataagtaac agagatagtt tttgccgcat catttactct ttactcttgc300tatacaaggt aagcgccaag agagttaatt acatctgttt tatcggttcc tagtggaaat360aatagtgaca actataatta gtaggagtcc ttattgaccc tagtcatttg agcttgcacc420agatttgatg tttttgcaaa cgaccttgac gcagagtgac gagcgaaaat tggatcccct480tggttgaagt ctaaactagc ttaaaatata tatgctcttc atataatata aagctgtttt540agattctatc aaataagaaa ttgatgactt tgagcaaatt aatatttggt atgggctccg600gcatctctga aaacgcttaa atgaagcttt tattcaccac gattcgacaa ctaaggttat660tttccacata attataactt ttcctacata actgtgctgt cgactcacac cttctttata720tatatagcct cgtagggatt cgaaactatg aattaagact cgttgaagtt tgatttatcc780attattttgc tgcacaaact atcgctaaga tataaagatc gtgcccagag cctgctatag840
ggtcctaatg gcatgcttag cccggatttc cacgataaag ctgcattgta ttgagtatat900gcactcagag agtaaacttt aattgcaacg aacaatcttt ggcaagtcat atctcagcca960tcaatacatg tattgtgttc aaacgaattg cagcatatca ctcaaattat tttggtctag1020ttcagcggaa tcttttggtt gttttagtaa gagttgagta gagtatgttg gatgagtgtg1080tccacaaggt tatttgaata gggtatttac attctacaac atagtcagta agctctcgtg1140tgataaactg tatcaaaatc gacacaataa caggctagtg gtgccctgtg cacgttttta1200ccataacatg acagctacag catcagaaac aggtgtggtg cgcattttgg ttattctgat1260cctgaaacct aagaacaatt ttcatcgtct tgctagattg tgttttctgt attccatttg1320tggagcttca acatccatgc tgctgagtat tttcacatga agatcatagt gttagaatgt1380ttagtaagcc tattactaag ttttgaggta taggtgcttg ttgttgtcct tacataaata1440catgctgtct ttagtgctta gaccaacgtt gagtgtatcg tgctcttggc agaagaatag1500acatttataa cattatggtg aaaggcgatg gtctcgcttg catgttctcg cttgcgtttg1560cgtatcccta tacacttaac cgttgtttat gtgtacctaa gctatcatgc tgcatcttta1620caattttata caaataaatt tattttggaa tatataattg gtcactattt caggccagtt1680gacagtcctt aagatttgta gttgcgctgt tctcgtagtg agaatgaaga agcggaatct1740acatccatct gtgattgcat aagagcttgc ataagagtga agtaggtgaa agtcacagag1800aatatcttcc ctactatcct aaaggcaagg aatactacta tacacgaaca tagtaatgga1860attttacaca acagaagtac ccttgtctcc tgcctccttt tattattcca ttatgctctg1920ttatataatg aatgaagacg acttttaaca tcatttgatt ctcgagcagg cacgcacaat1980atagaggaag gattggcgtc2000<210>66<211>1212<212>DNA<213>Ulkenia sp.
<400>66ggcaagaacg tcgttttcga ctatgacgag ctccttgagt tcgccgaggg tgacatcagc60aaggtcttcg gccccgaatt cagccagatc gaccagtaca agcgtcgcgt tcgtctcccc120gcccgcgagt acctcctcgt cacccgcgtc accctcatgg acgccgaggt caacaactac180cgcgtcggtg cccgcatggt cactgagtac gacctccccg tcaacggtga gctctctgag240ggtggtgact gcccctgggc cgtgctcgtc gagagtggtc agtgtgatct catgctcatc300tcctacatgg gtattgactt ccagaacaag agcgaccgcg tctaccgtct gctcaacacc360accctcacct tctacggtgt tgcccaggag ggcgagaccc tggagtacga catccgcgtg420
accggcttcg ccaagcgtct cgacggtgac atctccatgt tcttcttcga gtacgactgc480tacgtcaacg gccgtctcct catcgagatg cgcgacggct gtgccggttt cttcaccaac540gaggagctcg ccgccggcaa gggtgtcgtc tttacccgcg ctgatctcct cgcccgcgag600aagaccaaga agcaggacat caccccgtac gccattgccc cgcgtcttaa caagaccgtt660ctcaacgaga ctgagatgca gtccctcgtg gacaagaact ggaccaaggt tttcggcccc720gagaacggca tggaccagat caactacaaa ctctgcgccc gtaagatgct catgattgac780cgcgtcacca agattgacta caccggtggc ccctacggcc ttggtcttct cgttggtgag840aagatcctcg agcgcgacca ctggtacttt ccgtgccact tcgtcggaga ccaggtcatg900gctggatccc tcgtgtctga cggctgcagc cagctcctca agatgtacat gctctggctc960ggcctccacc ttaagaccgg tcccttcgac ttccgccccg tcaacggcca ccccaacaag1020gtccgctgcc gtggccagat ctccccgcac aagggtaagc tcgtatacgt catggagatc1080aaggagatgg gctacgacga ggctggtgac ccgtacgcca tcgccgatgt caacattctc1140gacattgact tcgagaaggg ccagactttc gaccttgcca acctccacga gtacggcaag1200ggcgacctca ac1212<210>67<211>21<212>DNA<213>Ulkenia sp.
<400>67tggtactttc cgtgccacttc 21<210>68<211>1197<212>DNA<213>Ulkenia sp.
<400>68gtgcccggcg agatgccgct ctcgtggtac aacatggctg agttcatggc cggcaaggtc60agcctctgcc tcggccctga gttcgccaag ttcgatgact ccaacaccag ccgcagccct120gcatgggacc ttgctcttgt gactcgtgtg gtctccgttt ctgacatgga gtgggtccag180tggaagaacg tggactgcaa cccgtccaag ggaaccatgg ttggcgagtt cgactgcccc240atcgacgcct ggttcttcca gggatcttgt aacgacggcc acatgccgta ctccatcctc300atggagatcg ccctccagac ctctggtgtc ctcacctctg tgctcaaggc cccgctcacc360atggagaaga aggacattct cttccgcaac cttgacgcca acgccgagat ggttcgctct420gatattgacc tccgcggcaa gaccatccac aacctcacca agtgtaccgg ctacagcatg480ctcggagaca tgggtgtcca ccgcttcagc ttcgagctct ctgttgatgg tgtagtcttc540
tacaagggta ccacctcctt cggctggttc gtccctgagg tcttcatctc ccagactggt600ctcgacaacg gtcgccgcac ccagccctgg cacattgagt ccaaggtgcc ttccgcccag660gtcctcacct acgacgttac ccccaacggt gccggtcgca cccagctcta cgccaacgcc720cccaagggcg ctcagctcac tcgccgctgg aaccagtgcc agtaccttga caccatcgac780cttgtggtcg ccggtggctc cgccggtctt ggctacggtc atggccgcaa gcaggtgaac840cccaaggact ggttcttctc gtgccacttc tggttcgact ccgtcatgcc cggctcgctc900ggtgtggagt ctatgttcca gctcgtcgag tccatcgctg tcaagcagga cctcgccggc960aagtacggca tcaccaaccc gaccttcgct catgctccgg gcaagatctc ctggaagtac1020cgtggtcagc tcacccccac ctccaagttc atggactccg aggcccacat tgtctccatc1080gaggcccacg acggcgtcgt cgacatcgtt gccaatggta acctctgggc tgatggcctc1140cgcgtctaca acgtcagcaa catccgtgtg cgcattgttg ctggcgccgc ccctgct 1197<210>69<211>21<212>DNA<213>Ulkenia sp.
<400>69tggttcttct cgtgccactt c 21<210>70<211>90<212>DNA<213>Ulkenia sp.
<400>70gctggcgccg cccctgctgc tgctgctgct gctgctgctg ttgctgctcc ggctgccgcc60cctgctccgg ttgctgcatc tggccctgcc 90<210>71<211>1299<212>DNA<213>Ulkenia sp.
<400>71gaaggcttca tgaagaccta cggtgttgtg gctcctctct acaccggtgc catggccaag60ggtattgcct ctgctgacct tgtgattgcc actggtaagc gcaagatcct cggttccttc120ggtgctggcg gtctccccat gcacattgtc cgtgccgctg ttgagaagat ccaggctgag180ctcccgaacg gccccttcgc cgtcaacctc atccactccc ccttcgatag caaccttgag240aagggcaacg ttgacctctt cctcgagaag ggcgttactg tcgtcgaggc ctccgccttc300atgaccttga ccccgcaagt cgtccgctac cgtgctgctg gtctttcccg taacgctgat360
ggctccatta acatcaagaa ccgcatcatc ggtaaggtct cccgtaccga gctcgctgag420atgttcatcc gccctgcccc gcagaacctc ctcgacaagc tcatccagtc tggtgagatt480accaaggagc aggctgagct tgccaagctc gtccccgtcg ccgacgacat cgccgtcgag540gccgactctg gtggccacac cgacaaccgc cccatccacg tcatcctccc ccttatcatc600aacctccgca accgcctcca caaggagtgc ggctaccccg ctcacctccg cgtgcgcgtt660ggagctggtg gtggtgttgg atgcccccag gccgctgccg ctgctctcgc tatgggtgct720gccttccttg ttaccggcac tgtcaaccag gtcgccaagc agtccggcac ctgcgacaat780gtccgcaagc agctctgcat ggccacctac tctgacgtct gcatggctcc cgctgctgac840atgttcgagg agggcgtcaa gctccaggtc ctcaagaagg gaaccatgtt cccgtccagg900gctaacaagc tctacgagct cttctgcaag tacgactcct tcgagtccat gcctgccaca960gagctcgagc gtgttgagaa gcgcatcttc cagtgccctc ttgctgatgt ctgggctgag1020acctccgact tctacatcaa ccgcctccac aacccggaga agatcacccg tgccgagcgt1080gaccccaagc tcaagatgtc tctctgcttc cgctggtacc ttggtcttgc ctctcgctgg1140gccaacaccg gtgaggctgg acgcgtcatg gactaccagg tctggtgtgg ccctgccatt1200ggagccttca acgacttcat caagggctcc taccttgacc cggccgtctc tggtgagtac1260ccggacgtcg tgcagatcaa cttgcagatc cttcgcggt 1299<210>72<211>404<212>PRT<213>Ulkenia sp.
<400>72Gly Lys Asn Val Val Phe Asp Tyr Asp Glu Leu Leu Glu Phe Ala Glu1 510 15Gly Asp Ile Ser Lys Val Phe Gly Pro Glu Phe Ser Gln Ile Asp Gln20 25 30Tyr Lys Arg Arg Val Arg Leu Pro Ala Arg Glu Tyr Leu Leu Val Thr35 40 45Arg Val Thr Leu Met Asp Ala Glu Val Asn Asn Tyr Arg Val Gly Ala50 5560Arg Met Val Thr Glu Tyr Asp Leu Pro Val Asn Gly Glu Leu Ser Glu65 70 75 80Gly Gly Asp Cys Pro Trp Ala Val Leu Val Glu Ser Gly Gln Cys Asp8590 95Leu Met Leu Ile Ser Tyr Met Gly Ile Asp Phe Gln Asn Lys Ser Asp100 105 110Arg Val Tyr Arg Leu Leu Asn Thr Thr Leu Thr Phe Tyr Gly Val Ala
115 120 125Gln Glu Gly Glu Thr Leu Glu Tyr Asp Ile Arg Val Thr Gly Phe Ala130 135 140Lys Arg Leu Asp Gly Asp Ile Ser Met Phe Phe Phe Glu Tyr Asp Cys145 150 155 160Tyr Val Asn Gly Arg Leu Leu Ile Glu Met Arg Asp Gly Cys Ala Gly165 170 175Phe Phe Thr Asn Glu Glu Leu Ala Ala Gly Lys Gly Val Val Phe Thr180 185 190Arg Ala Asp Leu Leu Ala Arg Glu Lys Thr Lys Lys Gln Asp Ile Thr195 200 205Pro Tyr Ala Ile Ala Pro Arg Leu Asn Lys Thr Val Leu Asn Glu Thr210 215 220Glu Met Gln Ser Leu Val Asp Lys Asn Trp Thr Lys Val Phe Gly Pro225 230 235 240Glu Asn Gly Met Asp Gln Ile Asn Tyr Lys Leu Cys Ala Arg Lys Met245 250 255Leu Met Ile Asp Arg Val Thr Lys Ile Asp Tyr Thr Gly Gly Pro Tyr260 265 270Gly Leu Gly Leu Leu Val Gly Glu Lys Ile Leu Glu Arg Asp His Trp275 280 285Tyr Phe Pro Cys His Phe Val Gly Asp Gln Val Met Ala Gly Ser Leu290 295 300Val Ser Asp Gly Cys Ser Gln Leu Leu Lys Met Tyr Met Leu Trp Leu305 310 315 320Gly Leu His Leu Lys Thr Gly Pro Phe Asp Phe Arg Pro Val Asn Gly325 330 335His Pro Asn Lys Val Arg Cys Arg Gly Gln Ile Ser Pro His Lys Gly340 345 350Lys Leu Val Tyr Val Met Glu Ile Lys Glu Met Gly Tyr Asp Glu Ala355 360 365Gly Asp Pro Tyr Ala Ile Ala Asp Val Asn Ile Leu Asp Ile Asp Phe370 375 380Glu Lys Gly Gln Thr Phe Asp Leu Ala Asn Leu His Glu Tyr Gly Lys385 390 395 400Gly Asp Leu Asn<210>73<211>7<212>PRT<213>Ulkenia sp.
<400>73
Trp Tyr Phe Pro Cys His Phe1 5<210>74<211>399<212>PRT<213>Ulkenia sp.
<400>74Val Pro Gly Glu Met Pro Leu Ser Trp Tyr Asn Met Ala Glu Phe Met1 510 15Ala Gly Lys Val Ser Leu Cys Leu Gly Pro Glu Phe Ala Lys Phe Asp20 25 30Asp Ser Asn Thr Ser Arg Ser Pro Ala Trp Asp Leu Ala Leu Val Thr35 40 45Arg Val Val Ser Val Ser Asp Met Glu Trp Val Gln Trp Lys Asn Val50 5560Asp Cys Asn Pro Ser Lys Gly Thr Met Val Gly Glu Phe Asp Cys Pro65 70 75 80Ile Asp Ala Trp Phe Phe Gln Gly Ser Cys Asn Asp Gly His Met Pro85 90 95Tyr Ser Ile Leu Met Glu Ile Ala Leu Gln Thr Ser Gly Val Leu Thr100 105 110Ser Val Leu Lys Ala Pro Leu Thr Met Glu Lys Lys Asp Ile Leu Phe115 120 125Arg Asn Leu Asp Ala Asn Ala Glu Met Val Arg Ser Asp Ile Asp Leu130 135 140Arg Gly Lys Thr Ile His Asn Leu Thr Lys Cys Thr Gly Tyr Ser Met145 150 155 160Leu Gly Asp Met Gly Val His Arg Phe Ser Phe Glu Leu Ser Val Asp165 170 175Gly Val Val Phe Tyr Lys Gly Thr Thr Ser Phe Gly Trp Phe Val Pro180 185 190Glu Val Phe Ile Ser Gln Thr Gly Leu Asp Asn Gly Arg Arg Thr Gln195 200 205Pro Trp His Ile Glu Ser Lys Val Pro Ser Ala Gln Val Leu Thr Tyr210 215 220Asp Val Thr Pro Asn Gly Ala Gly Arg Thr Gln Leu Tyr Ala Asn Ala225 230 235 240Pro Lys Gly Ala Gln Leu Thr Arg Arg Trp Asn Gln Cys Gln Tyr Leu245 250 255Asp Thr Ile Asp Leu Val Val Ala Gly Gly Ser Ala Gly Leu Gly Tyr260 265 270
Gly His Gly Arg Lys Gln Val Asn Pro Lys Asp Trp Phe Phe Ser Cys275 280 285His Phe Trp Phe Asp Ser Val Met Pro Gly Ser Leu Gly Val Glu Ser290 295 300Met Phe Gln Leu Val Glu Ser Ile Ala Val Lys Gln Asp Leu Ala Gly305 310 315 320Lys Tyr Gly Ile Thr Asn Pro Thr Phe Ala His Ala Pro Gly Lys Ile325 330 335Ser Trp Lys Tyr Arg Gly Gln Leu Thr Pro Thr Ser Lys Phe Met Asp340 345 350Ser Glu Ala His Ile Val Ser Ile Glu Ala His Asp Gly Val Val Asp355 360 365Ile Val Ala Asn Gly Asn Leu Trp Ala Asp Gly Leu Arg Val Tyr Asn370 375 380Val Ser Asn Ile Arg Val Arg Ile Val Ala Gly Ala Ala Pro Ala385 390 395<210>75<211>7<212>PRT<213>Ulkenia sp.
<400>75Trp Phe Phe Ser Cys His Phe1 5<210>76<211>30<212>PRT<213>Ulkenia sp.
<400>76Ala Gly Ala Ala Pro Ala Ala Ala Ala Ala Ala Ala Ala Val Ala Ala1 510 15Pro Ala Ala Ala Pro Ala Pro Val Ala Ala Ser Gly Pro Ala20 25 30<210>77<211>433<212>PRT<213>Ulkenia sp.
<400>77Glu Gly Phe Met Lys Thr Tyr Gly Val Val Ala Pro Leu Tyr Thr Gly1 5 1015Ala Met Ala Lys Gly Ile Ala Ser Ala Asp Leu Val Ile Ala Thr Gly20 25 30Lys Arg Lys Ile Leu Gly Ser Phe Gly Ala Gly Gly Leu Pro Met His
3540 45Ile Val Arg Ala Ala Val Glu Lys Ile Gln Ala Glu Leu Pro Asn Gly50 5560Pro Phe Ala Val Asn Leu Ile His Ser Pro Phe Asp Ser Asn Leu Glu65 70 75 80Lys Gly Asn Val Asp Leu Phe Leu Glu Lys Gly Val Thr Val Val Glu85 90 95Ala Ser Ala Phe Met Thr Leu Thr Pro Gln Val Val Arg Tyr Arg Ala100 105 110Ala Gly Leu Ser Arg Asn Ala Asp Gly Ser Ile Asn Ile Lys Asn Arg115 120 125Ile Ile Gly Lys Val Ser Arg Thr Glu Leu Ala Glu Met Phe Ile Arg130 135 140Pro Ala Pro Gln Asn Leu Leu Asp Lys Leu Ile Gln Ser Gly Glu Ile145 150 155 160Thr Lys Glu Gln Ala Glu Leu Ala Lys Leu Val Pro Val Ala Asp Asp165 170 175Ile Ala Val Glu Ala Asp Ser Gly Gly His Thr Asp Asn Arg Pro Ile180 185 190His Val Ile Leu Pro Leu Ile Ile Asn Leu Arg Asn Arg Leu His Lys195 200 205Glu Cys Gly Tyr Pro Ala His Leu Arg Val Arg Val Gly Ala Gly Gly210 215 220Gly Val Gly Cys Pro Gln Ala Ala Ala Ala Ala Leu Ala Met Gly Ala225 230 235 240Ala Phe Leu Val Thr Gly Thr Val Asn Gln Val Ala Lys Gln Ser Gly245 250 255Thr Cys Asp Asn Val Arg Lys Gln Leu Cys Met Ala Thr Tyr Ser Asp260 265 270Val Cys Met Ala Pro Ala Ala Asp Met Phe Glu Glu Gly Val Lys Leu275 280 285Gln Val Leu Lys Lys Gly Thr Met Phe Pro Ser Arg Ala Asn Lys Leu290 295 300Tyr Glu Leu Phe Cys Lys Tyr Asp Ser Phe Glu Ser Met Pro Ala Thr305 310 315 320Glu Leu Glu Arg Val Glu Lys Arg Ile Phe Gln Cys Pro Leu Ala Asp325 330 335Val Trp Ala Glu Thr Ser Asp Phe Tyr Ile Asn Arg Leu His Asn Pro340 345 350Glu Lys Ile Thr Arg Ala Glu Arg Asp Pro Lys Leu Lys Met Ser Leu355 360 365
Cys Phe Arg Trp Tyr Leu Gly Leu Ala Ser Arg Trp Ala Asn Thr Gly370 375 380Glu Ala Gly Arg Val Met Asp Tyr Gln Val Trp Cys Gly Pro Ala Ile385 390 395 400Gly Ala Phe Asn Asp Phe Ile Lys Gly Ser Tyr Leu Asp Pro Ala Val405 410 415Ser Gly Glu Tyr Pro Asp Val Val Gln Ile Asn Leu Gln Ile Leu Arg420 425 430Gly<210>78<211>2000<212>DNA<213>Ulkenia sp.
<400>78gcacgtagag caagaaagaa tgaaagaaag aacgaaagaa agaaagagag agagagagag60agagagagag agaaagcgaa gatgatagcg gagagaactc ttcttcgcag tcactctgtt120tctcagtcag tcccgcaacc aataacaact cgaactcgca gcagtgttct tcggagtgcc180agcgctcgct cgcactgcgt cggcacagca gcagcagcag caggccccgc gctcgctgca240ctcagcccgg gcaggagcaa cagctgctga gcagctgagg ccagctggct ggcggctcgc300ctcgcctcgc ctcgcgtcgc gtcgcgagag aaagcgatcg accaactgtc aatcgattat360tcgagtcctt cgagcgcttt atagggcact gattgatcac tcattgattc attgactcat420ttattctttg cgtggtcagc caaacggcgt tagcattggg caaagcgggt ctttgctttg480ctctaaaata gatttgctcg cgagagtacg tacttgcagg agtaggtagg ctctgcctag540tacctgggca tttgaatatt tgaacttcga acttcgttga gtatctgaat atttgaatat600ctgaatattt gaatttcgaa agtttgaata tttgaatatt tgaattttgg aatattggaa660tagctgggtt tggagataag acttactaag ctaagcgccg acgtaagagc ggcgagtaaa720tccacacaca agagagaggc agagagagag ggagggagac aactcgcgca ggcaagctga780gcccactgga cgcacggggc gcgtcccccc tgacgggcgc tctggtggtg gcgtgtttgg840gagggttttg catgcttgtg ataggggctc tggcgcgggc tctgtacggt gcttggagat900gcacgggcag ggcgagagag gggacgggtt cccgggaggc gctgcttgga ggtgctgaga960gggagggaga aggcgtgctt tgcgatgcgc ggggcgacct aggcgctgct gcgcggtgca1020gcagcaggga cctcggacgt gagtcgaagc cgtctgcaga ggagatggta gaagggccgc1080ggattggtag cagagaagag gaaatagaag aagaagaaga aatagaagaa gaagaaatag1140aagaagaaga aatagaagaa gaagaggagg acgggcaggc gggaaagatg gagaaaggac1200
tcgcggcggg aaaacaagag aatgtgaact tgggcttgaa ctttggtttg aatttgaatg1260tggagaacga ggggttgaat ttgagtttga atttgaaaga aaacttacgg aaagaaagtt1320tagttgaaag tgagaaagaa aaaaatgaga aagaaaaaga gaaagaaaaa gagaaagaaa1380aagagaaaga aaaagagaaa gaaaaagaga aagaaaaaga gaaagaaaaa gagaaagaaa1440aagagaaaga aaaagagaaa gaaaaagaga aagaaaaaga gaaagaaaaa gaagaagaaa1500aagaagaaga aaaagagaaa gaaaaagaga aagaaaaaga gaaagaaaaa gaagaaggag1560atttaaaaag ttgtttagtt gaaaaaggag aaggaggaag aagcagcgac agcggcagaa1620gaagaagtag ttgttgtaag aggggaacgg aggcagtagc agtggagcag gcggaggcga1680cagcaaacct cgaactcgac cccgtcgagc cgcagcaaga acaagagccc gaccaggtgg1740acgaggacga ggtccgcttg ttgtcaggaa caacagaagt tgcaggacta gccgagagtg1800ctaccactgc aattcttaga tccacagacg caagagcaga aaacttacaa ctgctcgcca1860caacacaaga accaccttca gatacaacca ggttcgagaa ctccacaagt ctagaagcag1920caacagctct agcagataat caaacaggtc cagaaaaagc tacgactaga agagaaatta1980tcgagtcgca acttgcaacc2000<210>79<211>4683<212>DNA<213>Ulkenia sp.
<400>79gcgagttata tctgtctaga aaacttggca tggctagcaa tttatgtcta gctattccat60acacacggta atgccagtag cctgttagtt atagctcttt tggttgttgt ctcacaatac120actgacatca gcagaacaaa atgaaagggg ccttggctac catgaaatca atacttcaaa180aggtctcttg gtttctttac tcgcatgtcg ctatttactt acattcctcg agtacataac240atatcataca tcaaagaaat taaaaagaaa acaaacattc aaatatgcat tactttccct300actgtactag taagtacgtt tctggtatta agttgttttt tctcaaaaga acaatgtgct360tacttgtaaa atccacagct gcttacttgt aagcctcaac tagttagtga tgtgattatc420ataaaatgtt cgacactgta cctcctttcc agctatcttc ctacacctcc tctgacgcag480gttgacggag gaggcgtggg ggttgattga agtgcaacac aacgttttgt ttaagatatt540ccttgccttg gccgactcca aatggatagc acagaagcct aatgataatt tgaattaatt600ttatttcgag cttatttaat gctcttatca gagtccgtag gtatctcttt tcctactaat660tgttgaaaaa ggatgttttg gacatagcag gtcatcatac tatttggttc catcaaattc720atatccattt ctttcgttca agtgcttccc ttcctactta ttatatatat tatatatcca780
taaatgtaaa agagacgatt acgaatactt tgcatacatg tatagcgaaa cagagatggt840agcaaaagtt caccttcact aatctaagaa tctctccacg tgggtaaaaa cttcagcagt900aagattgtaa atgatgtcca agaacaaaac gtcatgctag tccaggggtt actgagctaa960cgattaataa tgtttcgtag tcttcctaat tgcaccatca aaacttgtct gcacaagttt1020taaagtattg gagcctttac tgaagaatca gaggacatag atggggcacg ttcgccttga1080aaaaaatagt cttctttacc tgcatggtgt tacaaacaaa aacgagttga aaatagctgt1140gcaaggaggc aaacatgatt ggaaaagaaa aacgagggga cccttataca ggagggcgcc1200acatagtaga atgagtagat tgttagagta gggtacgctt tatgtgattg attgaatggg1260cgagtgaaag ttgctgtcaa ggttctaaac aaaaggatgt ttgagtttgt gagtattgtt1320tgcggcaaaa agattcagta gagagaaatg cacaaaaaga taatacgtgt gtagggcgat1380tatggaggca tgcatttggg ggaaatcatc gcatgcgcat gagtttctcc atctgccgaa1440tctttgcaaa ggcattttca agctccattt gcatagcgta ggcttgctgc tcaaactgag1500cgcgctgatg cgccagattt tcttcatgtc ttttgttcaa actacgctca agaccctcaa1560gagccgcaac cttgagcttg cgttcctttt gctgaatctc cataactctt cgtttcacct1620ggagctcaat ttctgcagca tccgtggtct ttgcagcggc ctgtgcgtct tgtgcggcct1680gtgcgttgtt tgcgagctcc tttcgcagct cctccatctc cgcgttcttt ttctcctcca1740tccatttggc accgagtttg gcagcttgat cgatgcggcc cttgagaact tcttcgttct1800cctcaagttc tgcgatacgc gcgtgtaagc cgaggatctc ctccgagaca gcctcgccat1860tgatcattat ttcacttccc gagtcttgaa tgacaacatc agccttggtg ccaggttcac1920cggtatctcg ctcgcaaccc tgctggcgca tagacagcat aaggcgcgca ttatcctcac1980gcagatcatc cacctgttct gataaaagtt tgactgcctg ctcaagatta cgggggttca2040cttcgtgaaa aatttcttga aggtctcgaa gctcagaaag cttggcagag caagtgtgca2100tcgctctgca ctttttaaga cgtgcaagtg catcatcaag tttggcatta tttaccttca2160tggaggcttc agctacttcg gcttcttcga ttacaatttt ctgcagctct acaacatcat2220ggccaattaa cttgcgatgc agctcggcaa tcaccccatg catcttttcg gtatggcctg2280gacgcgcctc atcctgcgtt cttcggatct cctcctctag ttctcgattt agacgaaggg2340ctggtccaag gggcgggtaa ttagcctgag tcaagccaag ctctgttgct agtccaaggc2400agtcggaaag tcgcagccgg tccctatcag aaacagcctt ttgcaagtct acgctcaaac2460gcacttcttg agccttgcgc accatcttcg gttctgcctg tcgcagaagt ttcgagtcgt2520agccagcttg ccacgctagc acgatggcac gcgcaagtga cctcagttga ccgctgttca2580tggcagactt gagcaacatt ttgatttgca caaatacctc atctgattca tcatcttcag2640
cttcctcaag ctctgcaggt gtcttgcgct ctccagagac ttgaagagca gggttcaaac2700cgccctccag gacctcgctc gcaagcgcct cctctgtctc agctttgcgc aatagcgcag2760cagcattctc cgccattgtg tttgtcactc acgagattaa tatcgttgcc agagtatacg2820gtaatgcgag ttaaggattc acagaatctc tcaaattaat cttttcacct aatgatatcc2880acaaaacgtt gcaatcgctc agcccaacga caagcgtgct tcttgtttta agactgcaac2940tgctcctttt tctattagtc aatatggacc gtcctccaaa cgtccagaaa atagcacaga3000atttaccagc agccgctgca gacaagaagt gcaagagagc aggcaagcaa gtgagggttt3060gagcaaatag gccaacctct ccacgcagaa ttctagggtc gcaaccggaa ctcacagtcc3120ttagaaaccg tgcgaagccc tgggctcaac ttcaatttgt ccacgggacc ttcagcaagc3180accaagctca gcagcgtgaa ggcaggcgct gaccacagtt tgagctcaga gggcttggtg3240tgcctcgcga ttgatattga agtcaattgc gcaggacggc agcaacggac caggtggtga3300agaaggtaat ctccagcgga gtgatgatgg agctcgaccg actactccgg aatcgaccag3360gggaggtgcg ggcgcccttc acaagcgggc gagaggcagg ggagagaagg ctcgactcca3420cgtcttgaag cgtgtacgtg tgcgcgctca cgcgtgcgac acgccggcaa gggcgcctta3480gtggcctgct gctgctgctg gtcgccacgc tgcgagccca agagatttga attgaactcg3540aagaaaataa ctatcattta tcaattccaa tcaatcaatg cattatgaag cacctctgaa3600gtgaactatt ctcctctcca atatacaaca aaaaacacac acagtgggtt ttaccctata3660acctattgtt ccgcgagcga tcaactactc tatagagcga atgaccagtt tttctttctt3720tctttctttc tttctttctt tctttctttc tttctttctt tctttctttc tttctttctg3780ttttcctatc taataacccc tttaatcgag gaaacctttc gatttaaaag gaaagctctg3840tctgtatata tctgttacag atactgctat catgccatgc agaaagaaac acaaaagaaa3900aacaaaagaa agagagaaag agagaaagaa agagagaaag aaagaaagaa agaaagaaag3960aagagctttt ctcaatcggt ttcctcatcg accgctcaca tatctacgat tgtggcaaag4020aaagaaagaa agaaagaagg aaagcctcag cagagtccgc acgaaagcct tcattgagcc4080accatgtcgt ggtccgctgc agtcagtgcc gcctctctgt gaattgagtg agtgagtgag4140tgagtgagtt ggttggttag ttagttagtg cctcttcagc tcaaagcctt tcacggtcgc4200tcttcgagcg tttgcttttt cataaacaaa taaacaaacc atcgaacgaa ccatcgaacg4260aacgaacaat ggtaccccag aatagacgga attaattgct aagtaaacca gtaacagtaa4320gttagtgttt ctgacctgag ccgttttctt tatttattcc tctcagctct gtgaagagaa4380tttgggatga aaagaaacgt ttttatttat ttaaaagttt agtaacaaga aaaacatggt4440ccctcttctt ccttcatgta aaaataagta agtaaaaaaa agaaaagaaa aaaaaaaaag4500
cttttaaagt agtaaagcga ggtagagata aaagttcttt ctcagggctc ctagtaggca4560cttaggaggt acgtctaaga ccgcctcgtg ggaagaaaag agaaaacaag aagagaaaag4620agagagagaa acagcgctga cccgagaggc tcatgcgcag agcccaaatc tgcccaactt4680tgg 4683<210>80<211>1848<212>PRT<213>Ulkenia sp.
<400>80Met Leu Val Ile Gly Ala Leu Ala Arg Ala Leu Tyr Gly Ala Trp Arg1 510 15Cys Thr Gly Arg Ala Arg Glu Gly Thr Gly Ser Arg Glu Ala Leu Leu20 25 30Gly Gly Ala Glu Arg Glu Gly Glu Gly Val Leu Cys Asp Ala Arg Gly35 40 45Asp Leu Gly Ala Ala Ala Arg Cys Ser Ser Arg Asp Leu Gly Arg Glu5055 60Ser Lys Pro Ser Ala Glu Glu Met Val Glu Gly Pro Arg Ile Gly Ser65 70 75 80Arg Glu Glu Glu Ile Glu Glu Glu Glu Glu Ile Glu Glu Glu Glu Ile85 90 95Glu Glu Glu Glu Ile Glu Glu Glu Glu Glu Asp Gly Gln Ala Gly Lys100 105 110Met Glu Lys Gly Leu Ala Ala Gly Lys Gln Glu Asn Val Asn Leu Gly115 120 125Leu Asn Phe Gly Leu Asn Leu Asn Val Glu Asn Glu Gly Leu Asn Leu130 135 140Ser Leu Asn Leu Lys Glu Asn Leu Arg Lys Glu Ser Leu Val Glu Ser145 150 155 160Glu Lys Glu Lys Asn Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu165 170 175Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu180 185 190Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu195 200 205Lys Glu Lys Glu Lys Glu Glu Glu Lys Glu Glu Glu Lys Glu Lys Glu210 215 220Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Glu Gly Asp Leu Lys Ser225 230 235 240Cys Leu Val Glu Lys Gly Glu Gly Gly Arg Ser Ser Asp Ser Gly Arg
245 250 255Arg Arg Ser Ser Cys Cys Lys Arg Gly Thr Glu Ala Val Ala Val Glu260 265 270Gln Ala Glu Ala Thr Ala Asn Leu Glu Leu Asp Pro Val Glu Pro Gln275 280 285Gln Glu Gln Glu Pro Asp Gln Val Asp Glu Asp Glu Val Arg Leu Leu290 295 300Ser Gly Thr Thr Glu Val Ala Gly Leu Ala Glu Ser Ala Thr Thr Ala305 310 315 320Ile Leu Arg Ser Thr Asp Ala Arg Ala Glu Asn Leu Gln Leu Leu Ala325 330 335Thr Thr Gln Glu Pro Pro Ser Asp Thr Thr Arg Phe Glu Asn Ser Thr340 345 350Ser Leu Glu Ala Ala Thr Ala Leu Ala Asp Asn Gln Thr Gly Pro Glu355 360 365Lys Ala Thr Thr Arg Arg Glu Ile Ile Glu Ser Gln Leu Ala Thr Met370 375 380Ala Thr Arg Val Lys Thr Asn Lys Lys Pro Cys Trp Glu Met Thr Lys385 390 395 400Glu Glu Leu Thr Ser Gly Lys Asn Val Val Phe Asp Tyr Asp Glu Leu405 410 415Leu Glu Phe Ala Glu Gly Asp Ile Ser Lys Val Phe Gly Pro Glu Phe420 425 430Ser Gln Ile Asp Gln Tyr Lys Arg Arg Val Arg Leu Pro Ala Arg Glu435 440 445Tyr Leu Leu Val Thr Arg Val Thr Leu Met Asp Ala Glu Val Asn Asn450 455 460Tyr Arg Val Gly Ala Arg Met Val Thr Glu Tyr Asp Leu Pro Val Asn465 470 475 480Gly Glu Leu Ser Glu Gly Gly Asp Cys Pro Trp Ala Val Leu Val Glu485 490 495Ser Gly Gln Cys Asp Leu Met Leu Ile Ser Tyr Met Gly Ile Asp Phe500 505 510Gln Asn Lys Ser Asp Arg Val Tyr Arg Leu Leu Asn Thr Thr Leu Thr515 520 525Phe Tyr Gly Val Ala Gln Glu Gly Glu Thr Leu Glu Tyr Asp Ile Arg530 535 540Val Thr Gly Phe Ala Lys Arg Leu Asp Gly Asp Ile Ser Met Phe Phe545 550 555 560Phe Glu Tyr Asp Cys Tyr Val Asn Gly Arg Leu Leu Ile Glu Met Arg565 570 575
Asp Gly Cys Ala Gly Phe Phe Thr Asn Glu Glu Leu Ala Ala Gly Lys580 585 590Gly Val Val Phe Thr Arg Ala Asp Leu Leu Ala Arg Glu Lys Thr Lys595 600 605Lys Gln Asp Ile Thr Pro Tyr Ala Ile Ala Pro Arg Leu Asn Lys Thr610 615 620Val Leu Asn Glu Thr Glu Met Gln Ser Leu Val Asp Lys Asn Trp Thr625 630 635 640Lys Val Phe Gly Pro Glu Asn Gly Met Asp Gln Ile Asn Tyr Lys Leu645 650 655Cys Ala Arg Lys Met Leu Met Ile Asp Arg Val Thr Lys Ile Asp Tyr660 665 670Thr Gly Gly Pro Tyr Gly Leu Gly Leu Leu Val Gly Glu Lys Ile Leu675 680 685Glu Arg Asp His Trp Tyr Phe Pro Cys His Phe Val Gly Asp Gln Val690 695 700Met Ala Gly Ser Leu Val Ser Asp Gly Cys Ser Gln Leu Leu Lys Met705 710 715 720Tyr Met Leu Trp Leu Gly Leu His Leu Lys Thr Gly Pro Phe Asp Phe725 730 735Arg Pro Val Asn Gly His Pro Asn Lys Val Arg Cys Arg Gly Gln Ile740 745 750Ser Pro His Lys Gly Lys Leu Val Tyr Val Met Glu Ile Lys Glu Met755 760 765Gly Tyr Asp Glu Ala Gly Asp Pro Tyr Ala Ile Ala Asp Val Asn Ile770 775 780Leu Asp Ile Asp Phe Glu Lys Gly Gln Thr Phe Asp Leu Ala Asn Leu785 790 795 800His Glu Tyr Gly Lys Gly Asp Leu Asn Lys Lys Ile Val Val Asp Phe805 810 815Lys Gly Ile Ala Leu Lys Leu Gln Lys Arg Ser Gly Pro Ala Val Val820 825 830Ala Pro Glu Lys Pro Leu Ala Leu Asn Lys Asp Leu Cys Ala Pro Ala835 840 845Val Glu Ala Ile Pro Glu His Ile Leu Lys Gly Asp Ala Leu Ala Pro850 855 860Asn Gln Met Thr Trp His Pro Met Ser Lys Ile Ala Gly Asn Pro Thr865 870 875 880Pro Ser Phe Ser Pro Ser Ala Tyr Pro Pro Arg Pro Ile Thr Phe Thr885 890 895Pro Phe Pro Gly Asn Lys Asn Asp Asn Asn His Val Pro Gly Glu Met
900 905 910Pro Leu Ser Trp Tyr Asn Met Ala Glu Phe Met Ala Gly Lys Val Ser915 920 925Leu Cys Leu Gly Pro Glu Phe Ala Lys Phe Asp Asp Ser Asn Thr Ser930 935 940Arg Ser Pro Ala Trp Asp Leu Ala Leu Val Thr Arg Val Val Ser Val945 950 955 960Ser Asp Met Glu Trp Val Gln Trp Lys Asn Val Asp Cys Asn Pro Ser965 970 975Lys Gly Thr Met Val Gly Glu Phe Asp Cys Pro Ile Asp Ala Trp Phe980 985 990Phe Gln Gly Ser Cys Asn Asp Gly His Met Pro Tyr Ser Ile Leu Met995 1000 1005Glu Ile Ala Leu Gln Thr Ser Gly Val Leu Thr Ser Val Leu Lys1010 1015 1020Ala Pro Leu Thr Met Glu Lys Lys Asp Ile Leu Phe Arg Asn Leu1025 1030 1035Asp Ala Asn Ala Glu Met Val Arg Ser Asp Ile Asp Leu Arg Gly1040 1045 1050Lys Thr Ile His Asn Leu Thr Lys Cys Thr Gly Tyr Ser Met Leu1055 1060 1065Gly Asp Met Gly Val His Arg Phe Ser Phe Glu Leu Ser Val Asp1070 1075 1080Gly Val Val Phe Tyr Lys Gly Thr Thr Ser Phe Gly Trp Phe Val1085 1090 1095Pro Glu Val Phe Ile Ser Gln Thr Gly Leu Asp Asn Gly Arg Arg1100 1105 1110Thr Gln Pro Trp His Ile Glu Ser Lys Val Pro Ser Ala Gln Val1115 1120 1125Leu Thr Tyr Asp Val Thr Pro Asn Gly Ala Gly Arg Thr Gln Leu1130 1135 1140Tyr Ala Asn Ala Pro Lys Gly Ala Gln Leu Thr Arg Arg Trp Asn1145 1150 1155Gln Cys Gln Tyr Leu Asp Thr Ile Asp Leu Val Val Ala Gly Gly1160 1165 1170Ser Ala Gly Leu Gly Tyr Gly His Gly Arg Lys Gln Val Asn Pro1175 1180 1185Lys Asp Trp Phe Phe Ser Cys His Phe Trp Phe Asp Ser Val Met1190 1195 1200Pro Gly Ser Leu Gly Val Glu Ser Met Phe Gln Leu Val Glu Ser1205 1210 1215
Ile Ala Val Lys Gln Asp Leu Ala Gly Lys Tyr Gly Ile Thr Asn1220 1225 1230Pro Thr Phe Ala His Ala Pro Gly Lys Ile Ser Trp Lys Tyr Arg1235 1240 1245Gly Gln Leu Thr Pro Thr Ser Lys Phe Met Asp Ser Glu Ala His1250 1255 1260Ile Val Ser Ile Glu Ala His Asp Gly Val Val Asp Ile Val Ala1265 1270 1275Asn Gly Asn Leu Trp Ala Asp Gly Leu Arg Val Tyr Asn Val Ser1280 1285 1290Asn Ile Arg Val Arg Ile Val Ala Gly Ala Ala Pro Ala Ala Ala1295 1300 1305Ala Ala Ala Ala Ala Val Ala Ala Pro Ala Ala Ala Pro Ala Pro1310 1315 1320Val Ala Ala Ser Gly Pro Ala Gln Thr Ile Thr Leu Lys Gln Leu1325 1330 1335Lys Ala Glu Leu Leu Asp Val Glu Lys Pro Leu Tyr Ile Ser Ser1340 1345 1350Ser Asn Gly Gln Val Lys Lys His Ala Asp Val Ala Gly Gly Gln1355 1360 1365Ala Thr Ile Val Gln Ala Cys Ser Leu Ser Asp Leu Gly Asp Glu1370 1375 1380Gly Phe Met Lys Thr Tyr Gly Val Val Ala Pro Leu Tyr Thr Gly1385 1390 1395Ala Met Ala Lys Gly Ile Ala Ser Ala Asp Leu Val Ile Ala Thr1400 1405 1410Gly Lys Arg Lys Ile Leu Gly Ser Phe Gly Ala Gly Gly Leu Pro1415 1420 1425Met His Ile Val Arg Ala Ala Val Glu Lys Ile Gln Ala Glu Leu1430 1435 1440Pro Asn Gly Pro Phe Ala Val Asn Leu Ile His Ser Pro Phe Asp1445 1450 1455Ser Asn Leu Glu Lys Gly Asn Val Asp Leu Phe Leu Glu Lys Gly1460 1465 1470Val Thr Val Val Glu Ala Ser Ala Phe Met Thr Leu Thr Pro Gln1475 1480 1485Val Val Arg Tyr Arg Ala Ala Gly Leu Ser Arg Asn Ala Asp Gly149 1495 1500Ser Ile Asn Ile Lys Asn Arg Ile Ile Gly Lys Val Ser Arg Thr1505 1510 1515Glu Leu Ala Glu Met Phe Ile Arg Pro Ala Pro Gln Asn Leu Leu
1520 1525 1530Asp Lys Leu Ile Gln Ser Gly Glu Ile Thr Lys Glu Gln Ala Glu1535 1540 1545Leu Ala Lys Leu Val Pro Val Ala Asp Asp Ile Ala Val Glu Ala1550 1555 1560Asp Ser Gly Gly His Thr Asp Asn Arg Pro Ile His Val Ile Leu1565 1570 1575Pro Leu Ile Ile Asn Leu Arg Asn Arg Leu His Lys Glu Cys Gly1580 1585 1590Tyr Pro Ala His Leu Arg Val Arg Val Gly Ala Gly Gly Gly Val1595 1600 1605Gly Cys Pro Gln Ala Ala Ala Ala Ala Leu Ala Met Gly Ala Ala1610 1615 1620Phe Leu Val Thr Gly Thr Val Asn Gln Val Ala Lys Gln Ser Gly1625 1630 1635Thr Cys Asp Asn Val Arg Lys Gln Leu Cys Met Ala Thr Tyr Ser1640 1645 1650Asp Val Cys Met Ala Pro Ala Ala Asp Met Phe Glu Glu Gly Val1655 1660 1665Lys Leu Gln Val Leu Lys Lys Gly Thr Met Phe Pro Ser Arg Ala1670 1675 1680Asn Lys Leu Tyr Glu Leu Phe Cys Lys Tyr Asp Ser Phe Glu Ser1685 1690 1695Met Pro Ala Thr Glu Leu Glu Arg Val Glu Lys Arg Ile Phe Gln1700 1705 1710Cys Pro Leu Ala Asp Val Trp Ala Glu Thr Ser Asp Phe Tyr Ile1715 1720 1725Asn Arg Leu His Asn Pro Glu Lys Ile Thr Arg Ala Glu Arg Asp1730 1735 1740Pro Lys Leu Lys Met Ser Leu Cys Phe Arg Trp Tyr Leu Gly Leu1745 1750 1755Ala Ser Arg Trp Ala Asn Thr Gly Glu Ala Gly Arg Val Met Asp1760 1765 1770Tyr Gln Val Trp Cys Gly Pro Ala Ile Gly Ala Phe Asn Asp Phe1775 1780 1785Ile Lys Gly Ser Tyr Leu Asp Pro Ala Val Ser Gly Glu Tyr Pro1790 1795 1800Asp Val Val Gln Ile Asn Leu Gln Ile Leu Arg Gly Ala Cys Tyr1805 1810 1815Leu Arg Arg Leu Asn Val Ile Arg Asn Asp Pro Arg Val Ser Ile1820 1825 1830
Glu Val Glu Asp Ala Glu Phe Val Tyr Glu Pro Thr Asn Ala Leu1835 1840 1845<210>81<211>18<212>DNA<213>Künstliche Sequenz<400>81ctcggcattg actccatc 18<210>82<211>18<212>DNA<213>Künstliche Sequenz<400>82GAGAATCTCG ACACGCTT 18<210>83<211>21<212>DNA<213>Künstliche Sequenz<400>83ATTACTCCTC TCTGCATCCG T 21<210>84<211>21<212>DNA<213>Künstliche Sequenz<400>84GCCGAAGACA GCATCAAACT C 21<210>85<211>21<212>DNA<213> Künstliche Sequenz<400>85GTCGAGAGTG GCCAGTGCGA T 21<210>86<211>21<212>DNA<213> Künstliche Sequenz<400>86AAAGTGGCAG GGAAAGTACC A 2權利要求
1.PUFA-PKS,其特征是它們a.包括在SEQ ID No.6(ORF 1),7(ORF 2),8和/或80(ORF 3)中所示氨基酸序列的至少其中一種,以及具有與它們有至少70%,優(yōu)選80%,特別優(yōu)選至少90%和更加特別優(yōu)選至少99%和最優(yōu)選100%序列同源性的同源序列,所述同源序列具有PUFA-PKS的至少一個結構域的生物學活性,或b.包括在SEQ ID No.32,34,45,58,59,60,61,72,74和/或77中所示氨基酸序列的至少其中一種,以及具有與它們有至少70%,優(yōu)選80%,特別優(yōu)選至少90%和更加特別優(yōu)選至少99%和最優(yōu)選100%序列同源性的同源序列,所述同源序列具有PUFA-PKS的至少一個結構域的生物學活性。
2.具有10個或更多ACP結構域的根據權利要求1的分離的PUFA-PKS。
3.根據任何一項在前權利要求,其特征是它包含與序列SEQ IDNo.6(ORF 1),7(ORF 2)和/或8和/或80(ORF 3)的至少500個直接連續(xù)氨基酸具有至少70%,優(yōu)選至少80%,特別優(yōu)選至少90%和更加特別優(yōu)選至少99%序列同源性的至少一種氨基酸序列,并且具有PUFA-PKS的至少一個結構域的生物學活性。
4.一種氨基酸序列,它與SEQ ID No.6(ORF 1),7(ORF 2)和/或8和/或80(ORF 3)的至少500個直接連續(xù)氨基酸具有至少70%,優(yōu)選至少80%,特別優(yōu)選至少90%和更加特別優(yōu)選至少99%的同一性,并且具有PUFA-PKS的至少一個結構域的生物學活性。
5.一種分離的DNA分子,其編碼根據任一項在前權利要求的氨基酸序列和與它完全互補的DNA。
6.根據權利要求5的分離的DNA分子,其特征是它與來自SEQ IDNo.3,4和5和/或9的至少500個直接連續(xù)核苷酸具有至少70%,優(yōu)選至少80%,特別優(yōu)選至少90%和更加特別優(yōu)選至少95%的同一性。
7.根據權利要求5或6的DNA分子,其特征是它編碼與序列SEQID No.6(ORF 1),7(ORF 2)和/或8和/或80(ORF 3)的至少500個直接連續(xù)氨基酸具有至少70%同源性的氨基酸序列。
8.包含根據權利要求5,6和/或7其中之一DNA分子的重組DNA分子,其與至少一種控制轉錄的DNA序列功能性連接,所述DNA序列優(yōu)選選自SEQ ID No.XX-YY(終止子/啟動子),或其來自至少500個核苷酸的部分以及它們的功能性變體。
9.包含根據權利要求8的重組DNA分子的重組宿主細胞。
10.根據權利要求9的重組宿主細胞,其內源性表達具有至少另一種PUFA-PKS結構域活性的根據權利要求1的PUFA-PKS。
11.包含重組DNA構建體的重組宿主細胞,其中控制翻譯的元件選自SEQ ID No.XX-YY(終止子/啟動子),或其來自至少500個核苷酸的部分以及它們的功能性變體。
12.一種生產含有PUFA,優(yōu)選DHA的油的方法,包括培養(yǎng)根據權利要求9或10的宿主細胞。
13.根據權利要求12的方法生產的油。
14.一種生產含有PUFA,優(yōu)選DHA的生物質量的方法,包括培養(yǎng)根據權利要求9或10的宿主細胞。
15.根據權利要求14的方法生產的生物質量。
16.根據權利要求15的重組生物質量,其包含根據權利要求8的核酸和/或根據權利要求1的氨基酸序列或與它們同源的至少50個連續(xù)氨基酸的部分。
17.包含PUFA-PKS的來自SEQ ID No.6,7,8和/或80的個別酶結構域用于生產人工多酮化合物的用途。
全文摘要
本發(fā)明涉及編碼特異于多酮化合物合酶(PKS)的序列的基因。由此合成的PKS特征在于其產生PUFAs(多不飽和脂肪酸)的酶能力。本發(fā)明還涉及鑒定相應的DNA序列,以及所述核苷酸序列用于產生重組和/或轉基因生物的應用。
文檔編號C12P7/64GK101087882SQ200580018878
公開日2007年12月12日 申請日期2005年4月8日 優(yōu)先權日2004年4月8日
發(fā)明者托馬斯·克伊, 馬庫斯·盧伊, 馬西亞斯·魯辛 申請人:努特諾瓦營養(yǎng)產品及食品成分有限公司