F226033

General Info

Members Datasets Scaffolds Average Seq Length
156 134 113 562

Family's Representative Sequence

Representative Sequence iso_pu_bacteria|2643221587|2643942180
Length 610
Sequence LRIGNASGFYGDRFDAVREMLTGGELDVLTGDYLAELTMLILGRDQLKDPSAGYAKTFLRQMEEGLGLAHERRVRIVTNAGGLNPAGLADALRALAAKLGLPTRIAHVEGDLLPVTGGALAANAYLGGAGIAACLAAGADVVVTGRVTDAALVTGPAQWHFGWGPEEYDRLAGAVVAGHVLECGTQATGGNYAFFREHDPRLLRRPGFPVAELHEDGSAVVTKHPGTGGLVDLGTVTAQLLYETAGARYPGPDVTTRLDTVRLTPEGPDRVRIDGVRGEAPPPTLKVGLSRLGGFRNEVVFVLTGLDIEAKAALLQEQIHDALAKNPPAETRWELVRTDRPDAPTEETASALLRLVVRDTDPEKVGRTLTGAAIELALASYPGFHVTAPPGRGAPYGVFETVYVPAAEVPHTAVLPDGTRREIPVPRRTRDLEPVPEPALPEPLRPGSTRCEPLGLVAGARSGDKGGDANVGVWVRTEEEWRWLAHALTVEKVRELLPETAGLVVTRHLLPRLRAVNFTIAGILGEGVASQARFDPQAKALGEWLRSRRLDIPEELLPESPGAGLSPEPPAEPLSPEPPAEPLSPEPPAAPLPPHTPAAPLPPDDPEEIQ

Samples

Sample ID Description Type Environment
1 2547132111 Streptomyces sp. TOR3209 Isolate Rhizosphere
2 2551306166 Nocardia tenerifensis NBRC 101015 Isolate Rhizosphere
3 2554235005 Streptomyces violaceusniger SPC6 Isolate Rhizosphere
4 2582581312 Streptomyces atratus OK008 Isolate Rhizosphere
5 2585427649 Amycolatopsis japonica MG417-CF17, DSM 44213 Isolate Unclassified
6 2643221548 Streptomyces sp. Root55 Isolate Unclassified
7 2643221578 Streptomyces sp. Root63 Isolate Unclassified
8 2643221587 Streptomyces sp. Root66D1 Isolate Unclassified
9 2643221670 Streptomyces sp. Root431 Isolate Unclassified
10 2643221673 Streptomyces sp. Root1295 Isolate Unclassified
11 2643221677 Streptomyces sp. Root1304 Isolate Unclassified
12 2643221682 Streptomyces sp. Root1319 Isolate Unclassified
13 2738541264 Mycobacterium sp. OK889 Isolate Unclassified
14 2738541356 Mycobacterium sp. OK887 Isolate Unclassified
15 2784132148 Streptomyces sp. E5N91 SAI-083 Isolate Unclassified
16 2795385472 Herbihabitans rhizosphaerae DSM 101727 Isolate Rhizosphere
17 2808606448 Streptomyces sp. 193411 Isolate Unclassified
18 2808606522 Amycolatopsis sp. BJA-103 Isolate Unclassified
19 2816332119 Kribbella amoyensis DSM 24683 Isolate Rhizosphere
20 2818991463 Streptomyces argenteolus 3259 Isolate Rhizosphere
21 2818991472 Kitasatospora viridis DSM 44826 Isolate Rhizosphere
22 2862507626 Streptomyces sp. NWU339 Isolate Unclassified
23 2863404153 Streptomyces scabiei SAI-025 (Annotation) (version 2) Isolate Unclassified
24 2866552031 Saccharopolyspora rhizosphaerae H219 Isolate Unclassified
25 2866612099 Amycolatopsis suaedae 8-3EHSu Isolate Unclassified
26 2867369537 Streptomyces sp. Z26 Isolate Unclassified
27 2867475112 Streptomyces sp. TM32 Isolate Unclassified
28 2875391855 Streptomyces cavourensis 1AS2a Isolate Rhizosphere
29 2899359706 Amycolatopsis anabasis EGI 650086 Isolate Unclassified
30 2899370129 Amycolatopsis alkalitolerans SYSUP0005 Isolate Stem Tuber
31 2902799365 Mycolicibacterium sp. P1-5 Isolate Unclassified
32 2915768154 Amycolatopsis pittospori PIP199 Isolate Unclassified
33 2918501144 Streptomyces sp. PvR006 Isolate Rhizosphere
34 2946045630 Streptomyces sp. W4I9-2 Isolate Rhizosphere
35 2966598605 Kitasatospora papulosa SLBN-177 Isolate Rhizosphere
36 2974315732 Rhodococcus sp. SORGH_AS 301 Isolate Unclassified
37 2984523437 Rhodococcus sp. SORGH_AS303 Isolate Aerial Root
38 2997451912 Streptomyces piniterrae jys28 Isolate Rhizosphere
39 3006493962 Streptomyces grisecoloratus TRM S81-3 Isolate Rhizosphere
40 3300001989 Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C5 Metagenome Rhizosphere
41 3300003578 Arabidopsis root microbial communities from the University of North Carolina, USA - metaT NBMF1_36_input_d2 (Metagenome Metatranscriptome, Counting Only) Metatranscriptome Unclassified
42 3300003792 Arabidopsis root microbial communities from North Carolina, USA - plate scrape MF_Cvi_mMS_r2 Metagenome Endosphere
43 3300005347 Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-3 metaG Metagenome Rhizosphere
44 3300005367 Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3 metaG Metagenome Rhizosphere
45 3300005436 Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG Metagenome Rhizosphere
46 3300005455 Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3 metaG Metagenome Rhizosphere
47 3300005981 Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S5T2R1 Metagenome Rhizosphere
48 3300006038 Populus endosphere microbial communities from Tennessee, USA - Endosphere MetaG P. deltoides DD176-5 Metagenome Endosphere
49 3300006048 Populus endosphere microbial communities from Tennessee, USA - Endosphere MetaG P. deltoides DD176-3 Metagenome Endosphere
50 3300006186 Populus endosphere microbial communities from Tennessee, USA - Endosphere MetaG P. TD hybrid TD303-4 Metagenome Endosphere
51 3300006844 Populus rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD2 Metagenome Rhizosphere
52 3300006846 Populus rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD4 Metagenome Rhizosphere
53 3300006847 Populus rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5 Metagenome Rhizosphere
54 3300006880 Populus rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3 Metagenome Rhizosphere
55 3300009147 Populus rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) (version 2) Metagenome Rhizosphere
56 3300009177 Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-4 metaG Metagenome Rhizosphere
57 3300021384 Root-associated microbial communities from Barbacenia macrantha in rupestrian grasslands, the National Park of Serra do Cipo, Brazil - RX_R9 Metagenome Unclassified
58 3300025302 Arabidopsis root microbial communities from the University of North Carolina, USA - plate scrape MF_Cvi_mMS (SPAdes) (version 2) Metagenome Endosphere
59 3300025303 Arabidopsis root microbial communities from North Carolina, USA - plate scrape MF_Cvi_mMS_r2 (SPAdes) (version 2) Metagenome Endosphere
60 3300025903 Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3 metaG (SPAdes) (version 2) Metagenome Rhizosphere
61 3300025904 Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C2 (SPAdes) (version 2) Metagenome Rhizosphere
62 3300026067 Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3 metaG (SPAdes) (version 2) Metagenome Rhizosphere
63 3300026116 Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C7-2 (SPAdes) (version 2) Metagenome Rhizosphere
64 3300028794 Populus trichocarpa ectomycorrhiza microbial communities from riparian zone in the Pacific Northwest, United States - 17_EM Metagenome Unclassified
65 3300030522 Populus trichocarpa ectomycorrhiza microbial communities from riparian zone in the Pacific Northwest, United States - 14_EM Metagenome Unclassified
66 3300031251 Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-16-21 metaG Metagenome Rhizosphere
67 3300031616 Populus trichocarpa ectomycorrhiza microbial communities from riparian zone in the Pacific Northwest, United States - 9_EM Metagenome Unclassified
68 3300031649 Populus trichocarpa ectomycorrhiza microbial communities from riparian zone in the Pacific Northwest, United States - 16_EM Metagenome Unclassified
69 3300031838 Populus trichocarpa ectomycorrhiza microbial communities from riparian zone in the Pacific Northwest, United States - 25_EM Metagenome Unclassified
70 3300031901 Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-C-2 Metagenome Rhizosphere
71 3300033179 Populus trichocarpa ectomycorrhiza microbial communities from riparian zone in the Pacific Northwest, United States - 7_EM Metagenome Unclassified
72 3300036712 Rhizosphere microbial communities from salt marsh grasses in Alabama, United States - S_170502SBrCrA Metagenome Rhizosphere
73 3300041410 Rhizosphere microbial communities from Sorghum plant, Scottsbluff, Nebraska, USA - SB0116DE14Z082817_5596 Metagenome Rhizosphere
74 3300041413 Rhizosphere microbial communities from Sorghum plant, Central City, Nebraska, USA - SB0710WE14Z080117_6839 Metagenome Rhizosphere
75 3300041512 White clover root microbial community from Lincoln, Canterbury, New Zealand - WCR18_11 MetaG Metagenome Unclassified
76 3300042005 Rhizosphere microbial communities from Sorghum plant, Central City, Nebraska, USA - CC0512LE14Z062817_5216 Metagenome Rhizosphere
77 3300044656 Rhizosphere microbial communities from millet plant in semiarid region near Thies, Senegal - COA1R Metagenome Rhizosphere
78 3300044658 Rhizosphere microbial communities from millet plant in semiarid region near Thies, Senegal - COC1R Metagenome Rhizosphere
79 3300044684 Rhizosphere microbial communities from millet plant in semiarid region near Thies, Senegal - COC4R Metagenome Rhizosphere
80 3300044693 Rhizosphere microbial communities from millet plant in semiarid region near Thies, Senegal - COC2R Metagenome Rhizosphere
81 3300044694 Rhizosphere microbial communities from millet plant in semiarid region near Thies, Senegal - CSC3R Metagenome Rhizosphere
82 3300044735 Rhizosphere microbial communities from millet plant in semiarid region near Thies, Senegal - CSA1R Metagenome Rhizosphere
83 3300044765 Rhizosphere microbial communities from millet plant in semiarid region near Thies, Senegal - COA2R Metagenome Rhizosphere
84 3300045049 Rhizosphere microbial communities from millet plant in semiarid region near Thies, Senegal - COC3R Metagenome Rhizosphere
85 3300045836 Rhizosphere microbial communities from millet plant in semiarid region near Thies, Senegal - CSC4R Metagenome Rhizosphere
86 3300045976 Rhizosphere microbial communities from millet plant in semiarid region near Thies, Senegal - CSA4R Metagenome Rhizosphere
87 3300046453 Rhizosphere soil microbial communities from poplar common garden site in Corvallis, Oregon, USA - BESC-388-Co3_8_57 rhizosphere Metagenome Rhizosphere
88 3300046455 Rhizosphere soil microbial communities from poplar common garden site in Clatskanie, Oregon, USA - BESC-307-CL1_26_33 rhizosphere Metagenome Rhizosphere
89 3300046459 Rhizosphere soil microbial communities from poplar common garden site in Clatskanie, Oregon, USA - BESC-448-CL3_88_32 rhizosphere Metagenome Rhizosphere
90 3300046461 Rhizosphere soil microbial communities from poplar common garden site in Clatskanie, Oregon, USA - BESC-845-CL3_80_19 rhizosphere Metagenome Rhizosphere
91 3300046476 Rhizosphere soil microbial communities from poplar common garden site in Clatskanie, Oregon, USA - BESC-905-CL1_32_20 rhizosphere Metagenome Rhizosphere
92 3300046531 Rhizosphere soil microbial communities from poplar common garden site in Clatskanie, Oregon, USA - GW-11047-CL2_41_30 rhizosphere Metagenome Rhizosphere
93 3300046557 Rhizosphere soil microbial communities from poplar common garden site in Clatskanie, Oregon, USA - BESC-388-CL1_25_34 rhizosphere Metagenome Rhizosphere
94 3300046663 Rhizosphere soil microbial communities from poplar common garden site in Clatskanie, Oregon, USA - BESC-833-CL3_77_6 rhizosphere Metagenome Rhizosphere
95 3300047319 Rhizosphere soil microbial communities from poplar common garden site in Clatskanie, Oregon, USA - GW-9591-CL1_34_16 rhizosphere Metagenome Rhizosphere
96 3300047321 Rhizosphere soil microbial communities from poplar common garden site in Clatskanie, Oregon, USA - GW-9591-CL3_98_5 rhizosphere Metagenome Rhizosphere
97 3300048089 Rhizosphere soil microbial communities from poplar common garden site in Clatskanie, Oregon, USA - BESC-351-CL3_84_27 rhizosphere Metagenome Rhizosphere
98 3300048903 Rhizoplane soil microbial communities from switchgrass plant in W.K. Kellogg Biological Station, Michigan, USA - CIR rhizoplane_7v unlabeled Metagenome Rhizoplane
99 3300048904 Rhizoplane soil microbial communities from switchgrass plant in W.K. Kellogg Biological Station, Michigan, USA - CIR rhizoplane_7w unlabeled Metagenome Rhizoplane
100 3300048905 Rhizoplane soil microbial communities from switchgrass plant in W.K. Kellogg Biological Station, Michigan, USA - CIR rhizoplane_1c N15 Metagenome Rhizoplane
101 3300048906 Rhizoplane soil microbial communities from switchgrass plant in W.K. Kellogg Biological Station, Michigan, USA - CIR rhizoplane_1d N15 Metagenome Rhizoplane
102 3300048909 Rhizoplane soil microbial communities from switchgrass plant in W.K. Kellogg Biological Station, Michigan, USA - CIR rhizoplane_3c N15 Metagenome Rhizoplane
103 3300048910 Rhizoplane soil microbial communities from switchgrass plant in W.K. Kellogg Biological Station, Michigan, USA - CIR rhizoplane_3d N15 Metagenome Rhizoplane
104 3300048912 Rhizoplane soil microbial communities from switchgrass plant in W.K. Kellogg Biological Station, Michigan, USA - KLW rhizoplane_8w unlabeled Metagenome Rhizoplane
105 3300048913 Rhizoplane soil microbial communities from switchgrass plant in W.K. Kellogg Biological Station, Michigan, USA - KLW rhizoplane_4c N15 Metagenome Rhizoplane
106 3300048914 Rhizoplane soil microbial communities from switchgrass plant in W.K. Kellogg Biological Station, Michigan, USA - KLW rhizoplane_4d N15 Metagenome Rhizoplane
107 3300048917 Rhizoplane soil microbial communities from switchgrass plant in W.K. Kellogg Biological Station, Michigan, USA - KLW rhizoplane_6c N15 Metagenome Rhizoplane
108 3300048919 Root microbial communities from switchgrass plant in W.K. Kellogg Biological Station, Michigan, USA - CIR root_7x unlabeled Metagenome Unclassified
109 3300048920 Root microbial communities from switchgrass plant in W.K. Kellogg Biological Station, Michigan, USA - CIR root_1e N15 Metagenome Unclassified
110 3300048921 Root microbial communities from switchgrass plant in W.K. Kellogg Biological Station, Michigan, USA - CIR root_1f N15 Metagenome Unclassified
111 3300048922 Root microbial communities from switchgrass plant in W.K. Kellogg Biological Station, Michigan, USA - CIR root_2e N15 Metagenome Unclassified
112 3300048923 Root microbial communities from switchgrass plant in W.K. Kellogg Biological Station, Michigan, USA - CIR root_2f N15 Metagenome Unclassified
113 3300048924 Root microbial communities from switchgrass plant in W.K. Kellogg Biological Station, Michigan, USA - CIR root_3e N15 Metagenome Unclassified
114 3300048925 Root microbial communities from switchgrass plant in W.K. Kellogg Biological Station, Michigan, USA - KLW root_8x unlabeled Metagenome Unclassified
115 3300048926 Root microbial communities from switchgrass plant in W.K. Kellogg Biological Station, Michigan, USA - KLW root_8y unlabeled Metagenome Unclassified
116 3300048927 Root microbial communities from switchgrass plant in W.K. Kellogg Biological Station, Michigan, USA - KLW root_4e N15 Metagenome Unclassified
117 3300048928 Root microbial communities from switchgrass plant in W.K. Kellogg Biological Station, Michigan, USA - KLW root_5e N15 Metagenome Unclassified
118 3300048929 Root microbial communities from switchgrass plant in W.K. Kellogg Biological Station, Michigan, USA - KLW root_6e N15 Metagenome Unclassified
119 3300049569 Sugarcane rhizosphere microbial communities from a greenhouse in University of Florida, Reddick, FL, USA - WT_TR_GHRAS_02 Metagenome Rhizosphere
120 3300049571 Sugarcane rhizosphere microbial communities from a greenhouse in University of Florida, Reddick, FL, USA - L1_TR_GHRAS_01 Metagenome Rhizosphere
121 3300049572 Sugarcane rhizosphere microbial communities from a greenhouse in University of Florida, Reddick, FL, USA - L1_TR_GHRAS_03 Metagenome Rhizosphere
122 3300049573 Sugarcane rhizosphere microbial communities from a greenhouse in University of Florida, Reddick, FL, USA - L2_TR_GHRAS_01 Metagenome Rhizosphere
123 3300049574 Sugarcane rhizosphere microbial communities from a greenhouse in University of Florida, Reddick, FL, USA - L2_TR_GHRAS_02 Metagenome Rhizosphere
124 3300049575 Sugarcane rhizosphere microbial communities from a greenhouse in University of Florida, Reddick, FL, USA - L2_TR_GHRAS_03 Metagenome Rhizosphere
125 3300049579 Sugarcane rhizosphere microbial communities from a greenhouse in University of Florida, Reddick, FL, USA - L4_TR_GHRAS_01 Metagenome Rhizosphere
126 3300049586 Sugarcane rhizosphere microbial communities from experimental field in University of Florida, Reddick, FL, USA - L1_T2_FRAS_01 Metagenome Rhizosphere
127 3300049823 Sugarcane rhizosphere microbial communities from a greenhouse in University of Florida, Reddick, FL, USA - L4_TR_GHRAS_02 Metagenome Rhizosphere
128 3300050507 Populus rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 re-annotation Metagenome Rhizosphere
129 3300050508 Populus rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD3 re-annotation Metagenome Rhizosphere
130 3300050509 Populus rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD4 re-annotation Metagenome Rhizosphere
131 8003314358 Amycolatopsis sp. MtRt-6 Isolate Unclassified
132 8025530807 Streptomyces sp. 4R-3d Isolate Unclassified
133 8054160619 Streptomyces rhizoryzae RS10V-4 Isolate Rhizosphere
134 8056667051 Streptomyces sichuanensis SCA3-4 Isolate Rhizosphere

Type Distribution

Type Percentage (%)
Metagenomes 71.79
Metatranscriptomes 0.64
Isolates 27.56

Biome Distribution

Category Percentage (%)
Aerial Root 0.64
Bulb 0
Endosphere 4.49
Nodule 0
Rhizoplane 8.97
Rhizosphere 50.64
Stem 0
Stem Tuber 0.64
Unclassified 34.62

Taxonomy

Scaffolds

Scaffold Dataset Taxonomy Length
1 JGI24739J22299_10011294 3300001989 Bacteria 3298
2 Ga0006562J51391_1126023 3300003578 Bacteria 2840
3 Ga0055540_1000022 3300003792 Bacteria 201257
4 Ga0070668_100047058 3300005347 Bacteria 3315
5 Ga0070667_100003452 3300005367 Bacteria 13466
6 Ga0070713_100032480 3300005436 Bacteria 4169
7 Ga0070663_100001719 3300005455 Bacteria 12149
8 Ga0081538_10004309 3300005981 Bacteria 13151
9 Ga0075365_10032187 3300006038 Bacteria 3369
10 Ga0075363_100021711 3300006048 Bacteria 3238
11 Ga0075369_10014528 3300006186 Bacteria 3147
12 Ga0075428_100012614 3300006844 Bacteria 9396
13 Ga0075430_100012560 3300006846 Bacteria 7210
14 Ga0075431_100017701 3300006847 Bacteria 7245
15 Ga0075429_100007070 3300006880 Bacteria 9743
16 Ga0114129_10004246 3300009147 Bacteria 20242
17 Ga0105248_10156224 3300009177 Bacteria 2574
18 Ga0213876_10043067 3300021384 Bacteria 2386
19 Ga0207426_1008947 3300025302 Bacteria 3994
20 Ga0207426_1014999 3300025302 Bacteria 2827
21 Ga0209051_1000033 3300025303 Bacteria 380540
22 Ga0207680_10019180 3300025903 Bacteria 3653
23 Ga0207647_10009910 3300025904 Bacteria 6748
24 Ga0207678_10000350 3300026067 Bacteria 41955
25 Ga0207678_10019508 3300026067 Bacteria 5957
26 Ga0207674_10043320 3300026116 Bacteria 4641
27 Ga0307515_10018851 3300028794 Bacteria 12458
28 Ga0307515_10037209 3300028794 Bacteria 7835
29 Ga0307512_10007441 3300030522 Bacteria 10867
30 Ga0265327_10011358 3300031251 Bacteria 6141
31 Ga0307508_10005976 3300031616 Bacteria 11480
32 Ga0307514_10030052 3300031649 Bacteria 4362
33 Ga0307518_10001260 3300031838 Bacteria 19022
34 Ga0307406_10049245 3300031901 Bacteria 2665
35 Ga0307507_10000084 3300033179 Bacteria 145642
36 Ga0307507_10065778 3300033179 Bacteria 3330
37 Ga0307507_10086531 3300033179 Bacteria 2716
38 Ga0307507_10099165 3300033179 Bacteria 2448
39 Ga0316584_0028503 3300036712 Bacteria 4119
40 Ga0439461_0002115 3300041410 Bacteria 3143
41 Ga0439465_0006370 3300041413 Bacteria 3748
42 Ga0451853_0094423 3300041512 Bacteria 3391
43 Ga0439448_0002757 3300042005 Bacteria 4806
44 Ga0466969_0055761 3300044656 Bacteria 1932
45 Ga0466972_0001801 3300044658 Bacteria 10499
46 Ga0466966_0003073 3300044684 Bacteria 11004
47 Ga0466966_0022467 3300044684 Bacteria 4138
48 Ga0466961_0003484 3300044693 Bacteria 9808
49 Ga0466961_0026847 3300044693 Bacteria 3703
50 Ga0466963_0083418 3300044694 Bacteria 2168
51 Ga0466968_0017562 3300044735 Bacteria 2861
52 Ga0466970_0023521 3300044765 Bacteria 3219
53 Ga0466959_0002093 3300045049 Bacteria 12628
54 Ga0466958_0001109 3300045836 Bacteria 12397
55 Ga0466958_0009276 3300045836 Bacteria 5479
56 Ga0466967_0018538 3300045976 Bacteria 5567
57 Ga0495627_016214 3300046453 Bacteria 2556
58 Ga0495603_0001078 3300046455 Bacteria 15811
59 Ga0495603_0015462 3300046455 Bacteria 4619
60 Ga0495629_0000980 3300046459 Bacteria 22904
61 Ga0495629_0024493 3300046459 Bacteria 4298
62 Ga0495641_0034812 3300046461 Bacteria 2378
63 Ga0495662_0014130 3300046476 Bacteria 3885
64 Ga0495665_0040681 3300046531 Bacteria 2475
65 Ga0495665_0070392 3300046531 Bacteria 1844
66 Ga0495622_0004969 3300046557 Bacteria 6160
67 Ga0495635_0010779 3300046663 Bacteria 6404
68 Ga0495674_0116432 3300047319 Bacteria 2261
69 Ga0495676_0001080 3300047321 Bacteria 23085
70 Ga0495676_0001331 3300047321 Bacteria 21198
71 Ga0495614_0000852 3300048089 Bacteria 12884
72 Ga0496100_0000190 3300048903 Bacteria 34149
73 Ga0496100_0001800 3300048903 Bacteria 10698
74 Ga0496101_0000676 3300048904 Bacteria 20580
75 Ga0496101_0003237 3300048904 Bacteria 10107
76 Ga0496102_0000011 3300048905 Bacteria 321716
77 Ga0496102_0011048 3300048905 Bacteria 7777
78 Ga0496103_0000032 3300048906 Bacteria 201453
79 Ga0496103_0003677 3300048906 Bacteria 9328
80 Ga0496106_0000733 3300048909 Bacteria 23630
81 Ga0496107_0003581 3300048910 Bacteria 10409
82 Ga0496109_0000305 3300048912 Bacteria 46298
83 Ga0496110_0011109 3300048913 Bacteria 7355
84 Ga0496111_0002087 3300048914 Bacteria 11920
85 Ga0496114_0001229 3300048917 Bacteria 19366
86 Ga0496116_0000072 3300048919 Bacteria 238158
87 Ga0496116_0013836 3300048919 Bacteria 6477
88 Ga0496117_0000039 3300048920 Bacteria 322143
89 Ga0496117_0032353 3300048920 Bacteria 3974
90 Ga0496118_0000035 3300048921 Bacteria 322143
91 Ga0496118_0017460 3300048921 Bacteria 6530
92 Ga0496119_0000035 3300048922 Bacteria 217007
93 Ga0496120_0000079 3300048923 Bacteria 160413
94 Ga0496121_0000004 3300048924 Bacteria 1139011
95 Ga0496121_0000090 3300048924 Bacteria 217920
96 Ga0496122_0001379 3300048925 Bacteria 39464
97 Ga0496123_0004493 3300048926 Bacteria 14614
98 Ga0496124_0000012 3300048927 Bacteria 512581
99 Ga0496125_0000003 3300048928 Bacteria 1189767
100 Ga0496126_0000001 3300048929 Bacteria 1139011
101 Ga0496126_0004697 3300048929 Bacteria 16140
102 Ga0501032_0021469 3300049569 Bacteria 4487
103 Ga0501034_0006540 3300049571 Bacteria 12521
104 Ga0501036_0055826 3300049572 Bacteria 3346
105 Ga0501037_0006367 3300049573 Bacteria 8633
106 Ga0501038_0006961 3300049574 Bacteria 10440
107 Ga0501039_0000997 3300049575 Bacteria 20593
108 Ga0501043_0004824 3300049579 Bacteria 10911
109 Ga0501070_0002474 3300049586 Bacteria 16190
110 Ga0501044_0023877 3300049823 Bacteria 6499
111 nmdc:mga05p37_6265_c1 3300050507 Bacteria 14012
112 nmdc:mga09592_5853_c1 3300050508 Bacteria 10029
113 nmdc:mga0qj67_52854_c1 3300050509 Bacteria 3216

MSA Aligner

Family Sequences

Sample Scaffold Protein Protein Length
1 3300049823 Ga0501044_0023877 Ga0501044_0023877_3328_5046 515
2 3300031616 Ga0307508_10005976 Ga0307508_100059763 535
3 3300028794 Ga0307515_10037209 Ga0307515_100372094 541
4 3300045976 Ga0466967_0018538 Ga0466967_0018538_2860_4560 541
5 3300049569 Ga0501032_0021469 Ga0501032_0021469_577_2205 541
6 3300049571 Ga0501034_0006540 Ga0501034_0006540_4450_6078 541
7 3300049572 Ga0501036_0055826 Ga0501036_0055826_581_2209 541
8 3300049573 Ga0501037_0006367 Ga0501037_0006367_577_2205 541
9 3300049574 Ga0501038_0006961 Ga0501038_0006961_2065_3693 541
10 3300049575 Ga0501039_0000997 Ga0501039_0000997_4175_5803 541
11 3300049579 Ga0501043_0004824 Ga0501043_0004824_6432_8060 541
12 3300049586 Ga0501070_0002474 Ga0501070_0002474_8136_9764 541
13 3300025303 Ga0209051_1000033 Ga0209051_10000338 544
14 3300025903 Ga0207680_10019180 Ga0207680_100191804 544
15 3300031251 Ga0265327_10011358 Ga0265327_100113583 544
16 3300041410 Ga0439461_0002115 Ga0439461_0002115_998_2698 544
17 3300041413 Ga0439465_0006370 Ga0439465_0006370_1427_3127 544
18 3300044694 Ga0466963_0083418 Ga0466963_0083418_517_2151 544
19 3300046531 Ga0495665_0070392 Ga0495665_0070392_177_1829 544
20 3300047319 Ga0495674_0116432 Ga0495674_0116432_555_2201 544
21 3300048903 Ga0496100_0000190 Ga0496100_0000190_7697_9364 544
22 3300048903 Ga0496100_0001800 Ga0496100_0001800_3915_5552 544
23 3300048904 Ga0496101_0000676 Ga0496101_0000676_8588_10225 544
24 3300048904 Ga0496101_0003237 Ga0496101_0003237_6319_7986 544
25 3300048905 Ga0496102_0000011 Ga0496102_0000011_314493_316130 544
26 3300048905 Ga0496102_0011048 Ga0496102_0011048_5110_6777 544
27 3300048906 Ga0496103_0000032 Ga0496103_0000032_194202_195839 544
28 3300048906 Ga0496103_0003677 Ga0496103_0003677_1979_3646 544
29 3300048909 Ga0496106_0000733 Ga0496106_0000733_12979_14646 544
30 3300048910 Ga0496107_0003581 Ga0496107_0003581_1146_2813 544
31 3300048912 Ga0496109_0000305 Ga0496109_0000305_7248_8915 544
32 3300048913 Ga0496110_0011109 Ga0496110_0011109_2822_4489 544
33 3300048914 Ga0496111_0002087 Ga0496111_0002087_2123_3790 544
34 3300048917 Ga0496114_0001229 Ga0496114_0001229_5785_7452 544
35 3300048919 Ga0496116_0000072 Ga0496116_0000072_230907_232544 544
36 3300048919 Ga0496116_0013836 Ga0496116_0013836_1538_3205 544
37 3300048920 Ga0496117_0000039 Ga0496117_0000039_5615_7252 544
38 3300048920 Ga0496117_0032353 Ga0496117_0032353_964_2631 544
39 3300048921 Ga0496118_0000035 Ga0496118_0000035_314892_316529 544
40 3300048921 Ga0496118_0017460 Ga0496118_0017460_1344_3011 544
41 3300048922 Ga0496119_0000035 Ga0496119_0000035_136765_138402 544
42 3300048923 Ga0496120_0000079 Ga0496120_0000079_95261_96898 544
43 3300048924 Ga0496121_0000004 Ga0496121_0000004_282804_284471 544
44 3300048924 Ga0496121_0000090 Ga0496121_0000090_29775_31412 544
45 3300048925 Ga0496122_0001379 Ga0496122_0001379_5764_7431 544
46 3300048926 Ga0496123_0004493 Ga0496123_0004493_552_2219 544
47 3300048927 Ga0496124_0000012 Ga0496124_0000012_7597_9264 544
48 3300048928 Ga0496125_0000003 Ga0496125_0000003_905297_906964 544
49 3300048929 Ga0496126_0000001 Ga0496126_0000001_282804_284471 544
50 3300048929 Ga0496126_0004697 Ga0496126_0004697_6368_8005 544
51 3300005981 Ga0081538_10004309 Ga0081538_100043095 545
52 iso_pu_bacteria 2547132111 2547408123 550
53 iso_pu_bacteria 2643221670 2644389851 550
54 iso_pu_bacteria 2784132148 2784588573 550
55 iso_pu_bacteria 2808606448 2809232186 550
56 iso_pu_bacteria 2918501144 2918505632 550
57 3300006186 Ga0075369_10014528 Ga0075369_100145282 551
58 3300041512 Ga0451853_0094423 Ga0451853_0094423_588_2264 551
59 iso_pu_bacteria 2899370129 2899372838 551
60 iso_pu_bacteria 3006493962 3006500768 551
61 iso_pu_bacteria 2643221548 2643764925 552
62 iso_pu_bacteria 2643221587 2643942180 552
63 iso_pu_bacteria 2643221677 2644429263 552
64 iso_pu_bacteria 2643221682 2644461235 552
65 iso_pu_bacteria 2818991463 2819695375 552
66 iso_pu_bacteria 2966598605 2966601614 552
67 3300031901 Ga0307406_10049245 Ga0307406_100492453 553
68 iso_pu_bacteria 2582581312 2585299329 553
69 iso_pu_bacteria 2862507626 2862511809 554
70 iso_pu_bacteria 2863404153 2863404605 554
71 iso_pu_bacteria 2643221578 2643902631 555
72 iso_pu_bacteria 2643221673 2644404861 555
73 iso_pu_bacteria 2866612099 2866614612 555
74 3300009177 Ga0105248_10156224 Ga0105248_101562242 556
75 3300046459 Ga0495629_0000980 Ga0495629_0000980_13969_15687 556
76 3300046557 Ga0495622_0004969 Ga0495622_0004969_3595_5313 556
77 3300047321 Ga0495676_0001080 Ga0495676_0001080_9720_11438 556
78 3300047321 Ga0495676_0001331 Ga0495676_0001331_14435_16150 556
79 3300048089 Ga0495614_0000852 Ga0495614_0000852_289_2004 556
80 iso_pu_bacteria 2554235005 2554257864 556
81 iso_pu_bacteria 2946045630 2946048841 556
82 3300003578 Ga0006562J51391_1126023 Ga0006562J51391_11260232 557
83 3300030522 Ga0307512_10007441 Ga0307512_100074417 557
84 3300046455 Ga0495603_0001078 Ga0495603_0001078_7107_8807 557
85 3300046459 Ga0495629_0024493 Ga0495629_0024493_1499_3199 557
86 3300046476 Ga0495662_0014130 Ga0495662_0014130_951_2651 557
87 3300046663 Ga0495635_0010779 Ga0495635_0010779_2291_4000 557
88 iso_pu_bacteria 2875391855 2875395780 557
89 iso_pu_bacteria 8003314358 8003323799 557
90 3300046453 Ga0495627_016214 Ga0495627_016214_73_1779 558
91 iso_pu_bacteria 8056667051 8056669628 558
92 iso_pu_bacteria 2795385472 2795793118 559
93 iso_pu_bacteria 2808606522 2809588621 559
94 iso_pu_bacteria 2899359706 2899368219 559
95 iso_pu_bacteria 2974315732 2974316138 559
96 iso_pu_bacteria 2984523437 2984524300 559
97 iso_pu_bacteria 8054160619 8054166067 559
98 3300025302 Ga0207426_1008947 Ga0207426_10089472 560
99 3300025302 Ga0207426_1014999 Ga0207426_10149992 560
100 iso_pu_bacteria 2585427649 2586062210 560
101 iso_pu_bacteria 2867475112 2867480618 560
102 iso_pu_bacteria 2902799365 2902799904 560
103 iso_pu_bacteria 2915768154 2915775322 560
104 iso_pu_bacteria 2997451912 2997457839 560
105 3300005347 Ga0070668_100047058 Ga0070668_1000470583 561
106 3300031649 Ga0307514_10030052 Ga0307514_100300522 561
107 3300033179 Ga0307507_10065778 Ga0307507_100657782 561
108 3300033179 Ga0307507_10086531 Ga0307507_100865312 561
109 3300033179 Ga0307507_10099165 Ga0307507_100991652 561
110 3300044658 Ga0466972_0001801 Ga0466972_0001801_6190_7947 561
111 iso_pu_bacteria 2738541264 2738668304 561
112 iso_pu_bacteria 2738541356 2739147374 561
113 iso_pu_bacteria 8025530807 8025538028 561
114 3300028794 Ga0307515_10018851 Ga0307515_100188513 562
115 3300005455 Ga0070663_100001719 Ga0070663_1000017194 563
116 3300006844 Ga0075428_100012614 Ga0075428_1000126144 563
117 3300006846 Ga0075430_100012560 Ga0075430_1000125602 563
118 3300006847 Ga0075431_100017701 Ga0075431_1000177014 563
119 3300006880 Ga0075429_100007070 Ga0075429_1000070706 563
120 3300009147 Ga0114129_10004246 Ga0114129_1000424612 563
121 3300026067 Ga0207678_10000350 Ga0207678_1000035035 563
122 3300031838 Ga0307518_10001260 Ga0307518_1000126010 563
123 3300033179 Ga0307507_10000084 Ga0307507_1000008430 563
124 3300044656 Ga0466969_0055761 Ga0466969_0055761_167_1858 563
125 3300044684 Ga0466966_0022467 Ga0466966_0022467_2353_4044 563
126 3300044693 Ga0466961_0026847 Ga0466961_0026847_820_2511 563
127 3300044765 Ga0466970_0023521 Ga0466970_0023521_645_2336 563
128 3300045049 Ga0466959_0002093 Ga0466959_0002093_7232_8923 563
129 3300045836 Ga0466958_0009276 Ga0466958_0009276_3569_5260 563
130 3300046455 Ga0495603_0015462 Ga0495603_0015462_446_2149 563
131 3300046461 Ga0495641_0034812 Ga0495641_0034812_572_2275 563
132 3300046531 Ga0495665_0040681 Ga0495665_0040681_732_2435 563
133 3300050507 nmdc:mga05p37_6265_c1 nmdc:mga05p37_6265_c1_11323_13017 563
134 3300050508 nmdc:mga09592_5853_c1 nmdc:mga09592_5853_c1_3944_5638 563
135 3300050509 nmdc:mga0qj67_52854_c1 nmdc:mga0qj67_52854_c1_858_2552 563
136 3300003792 Ga0055540_1000022 Ga0055540_10000228 564
137 3300005367 Ga0070667_100003452 Ga0070667_1000034525 564
138 3300005436 Ga0070713_100032480 Ga0070713_1000324803 564
139 3300006038 Ga0075365_10032187 Ga0075365_100321872 564
140 3300006048 Ga0075363_100021711 Ga0075363_1000217113 564
141 3300021384 Ga0213876_10043067 Ga0213876_100430672 564
142 3300026067 Ga0207678_10019508 Ga0207678_100195083 564
143 3300036712 Ga0316584_0028503 Ga0316584_0028503_393_2120 564
144 3300042005 Ga0439448_0002757 Ga0439448_0002757_1294_2994 564
145 3300044684 Ga0466966_0003073 Ga0466966_0003073_2806_4500 564
146 3300044693 Ga0466961_0003484 Ga0466961_0003484_331_2025 564
147 3300044735 Ga0466968_0017562 Ga0466968_0017562_163_1857 564
148 3300045836 Ga0466958_0001109 Ga0466958_0001109_4072_5766 564
149 iso_pu_bacteria 2551306166 2552111283 564
150 iso_pu_bacteria 2816332119 2816422395 564
151 iso_pu_bacteria 2818991472 2819746717 564
152 iso_pu_bacteria 2866552031 2866552361 564
153 iso_pu_bacteria 2867369537 2867371893 564
154 3300001989 JGI24739J22299_10011294 JGI24739J22299_100112942 569
155 3300025904 Ga0207647_10009910 Ga0207647_100099103 569
156 3300026116 Ga0207674_10043320 Ga0207674_100433203 569

Functional Annotation

PFAM ID Name Description Start Position End Position Accuracy

PF07287

AtuA

Acyclic terpene utilisation family protein AtuA

1

117

0.98

PF07287

AtuA

Acyclic terpene utilisation family protein AtuA

115

415

0.97

PF23544

452

551

0.91

Structural Annotation

Top 5 Hits

ID Description Score Start End
5wa4-assembly1.cif.gz_F pyridine synthase, tbtd, from thiomuracin biosynthesis bound to an n-terminal leader peptide fragment 0.6497 303 389
7m6u-assembly2.cif.gz_D-2 crystal structure of a circular permutation and computationally designed pro-enzyme of carboxypeptidase g2 0.6294 303 400
5zeb-assembly1.cif.gz_f m. smegmatis p/p state 70s ribosome structure 0.6182 303 412
6spf-assembly1.cif.gz_f pseudomonas aeruginosa 70s ribosome from an aminoglycoside resistant clinical isolate 0.6139 304 395
6v3b-assembly1.cif.gz_f cryo-em structure of the acinetobacter baumannii ribosome: 70s in empty state 0.612 303 407
ID Description Score Start End Superfamily
af_C0HF87_842_966_3.30.70.100 Alpha Beta;2-Layer Sandwich;Alpha-Beta Plaits; 0.6693 304 368 3.30.70.100
af_Q57905_62_208_3.40.50.1010 Alpha Beta;3-Layer(aba) Sandwich;Rossmann fold;5'-nuclease 0.643 16 99 3.40.50.1010
af_O14050_557_659_1.20.1280.170 Mainly Alpha;Up-down Bundle;Monooxygenase;Exocyst complex component Exo70 0.6136 301 368 1.20.1280.170
af_Q9C1X2_273_393_3.30.70.2190 Alpha Beta;2-Layer Sandwich;Alpha-Beta Plaits; 0.6118 304 406 3.30.70.2190
af_P90986_85_514_1.10.800.10 Mainly Alpha;Orthogonal Bundle;Phenylalanine Hydroxylase;Aromatic amino acid hydroxylase 0.6109 304 402 1.10.800.10
ID Description Score Start End GO Terms
AF-X8DWJ7-F1-model_v4 Exopolyphosphatase 0.9927 22 166
AF-A0A3B0THY3-F1-model_v4 Terpene utilization protein AtuA 0.9898 458 568
AF-A0A2N5WVH8-F1-model_v4 Exopolyphosphatase 0.9889 450 569
AF-Q08YS7-F1-model_v4 Exopolyphosphatase 0.988 44 569
AF-A0A378UX09-F1-model_v4 Exopolyphosphatase 0.988 22 176

Feature Viewer

pLDDT pTM Quality
89.37 0.89 High
Powered by Feature Viewer

Predicted Structure (AlphaFold2)

Powered by PDBe Molstar

Map