NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|194388850|dbj|BAG61442|]
View 

unnamed protein product [Homo sapiens]

Protein Classification

type II keratin( domain architecture ID 12177255)

type II keratin is an intermediate filament-forming protein that provides mechanical support and fulfills a variety of additional functions in epithelial cells

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Filament pfam00038
Intermediate filament protein;
145-458 7.81e-151

Intermediate filament protein;


:

Pssm-ID: 459643 [Multi-domain]  Cd Length: 313  Bit Score: 434.73  E-value: 7.81e-151
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  145 EEREQIKTLNNKFASFIDKVQFLEQQNKVLDTKWTLLQEQGTKTVRqNLEPLFEQYINNLRRQLDSIVGERGRLDSELRN 224
Cdd:pfam00038   1 NEKEQLQELNDRLASYIDKVRFLEQQNKLLETKISELRQKKGAEPS-RLYSLYEKEIEDLRRQLDTLTVERARLQLELDN 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  225 MQDLVEDLKNKYEDEVNKRTAAENEFVTLKKDVDAAYMNKVELQAKADTLTDEINFLRALYDAELSQMQTHISDTSVVLS 304
Cdd:pfam00038  80 LRLAAEDFRQKYEDELNLRTSAENDLVGLRKDLDEATLARVDLEAKIESLKEELAFLKKNHEEEVRELQAQVSDTQVNVE 159
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  305 MDNNRNLDLDSIIAEVKAQYEEIAQRSRAEAESWYQTKYEELQVTAGRHGDDLRNTKQEIAEINRMIQRLRSEIDHVKKQ 384
Cdd:pfam00038 160 MDAARKLDLTSALAEIRAQYEEIAAKNREEAEEWYQSKLEELQQAAARNGDALRSAKEEITELRRTIQSLEIELQSLKKQ 239
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 194388850  385 CASLQAAIADAEQRGEMALKDAKNKLEGLEDALQKAKQDLARLLKEYQELMNVKLALDVEIATYRKLLEGEECR 458
Cdd:pfam00038 240 KASLERQLAETEERYELQLADYQELISELEAELQETRQEMARQLREYQELLNVKLALDIEIATYRKLLEGEECR 313
Keratin_2_head pfam16208
Keratin type II head;
17-142 1.52e-26

Keratin type II head;


:

Pssm-ID: 465068 [Multi-domain]  Cd Length: 156  Bit Score: 105.51  E-value: 1.52e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   17 GFSANSARLPGVSRSGFSSISVSRSRGSGGLGGACGGAGFGSRSLYGLGGSKRISI----------------GGGSCAIS 80
Cdd:pfam16208   1 GFSSCSAVVPSRSRRSYSSVSSSRRGGGGGGGGGGGGGGFGSRSLYNLGGSKSISIsvagggsrpgsgfgfgGGGGGGFG 80
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 194388850   81 GGYGSRAGGSYGFGGAGSGFGLGGFGGPG--------------FPVCPPGGIQEVTVNQSLLTPLNLQIDPAIQRV 142
Cdd:pfam16208  81 GGFGGGGGGGFGGGGGFGGGFGGGGYGGGgfggggfggrggfgGPPCPPGGIQEVTVNQSLLQPLNLEIDPEIQRV 156
 
Name Accession Description Interval E-value
Filament pfam00038
Intermediate filament protein;
145-458 7.81e-151

Intermediate filament protein;


Pssm-ID: 459643 [Multi-domain]  Cd Length: 313  Bit Score: 434.73  E-value: 7.81e-151
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  145 EEREQIKTLNNKFASFIDKVQFLEQQNKVLDTKWTLLQEQGTKTVRqNLEPLFEQYINNLRRQLDSIVGERGRLDSELRN 224
Cdd:pfam00038   1 NEKEQLQELNDRLASYIDKVRFLEQQNKLLETKISELRQKKGAEPS-RLYSLYEKEIEDLRRQLDTLTVERARLQLELDN 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  225 MQDLVEDLKNKYEDEVNKRTAAENEFVTLKKDVDAAYMNKVELQAKADTLTDEINFLRALYDAELSQMQTHISDTSVVLS 304
Cdd:pfam00038  80 LRLAAEDFRQKYEDELNLRTSAENDLVGLRKDLDEATLARVDLEAKIESLKEELAFLKKNHEEEVRELQAQVSDTQVNVE 159
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  305 MDNNRNLDLDSIIAEVKAQYEEIAQRSRAEAESWYQTKYEELQVTAGRHGDDLRNTKQEIAEINRMIQRLRSEIDHVKKQ 384
Cdd:pfam00038 160 MDAARKLDLTSALAEIRAQYEEIAAKNREEAEEWYQSKLEELQQAAARNGDALRSAKEEITELRRTIQSLEIELQSLKKQ 239
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 194388850  385 CASLQAAIADAEQRGEMALKDAKNKLEGLEDALQKAKQDLARLLKEYQELMNVKLALDVEIATYRKLLEGEECR 458
Cdd:pfam00038 240 KASLERQLAETEERYELQLADYQELISELEAELQETRQEMARQLREYQELLNVKLALDIEIATYRKLLEGEECR 313
Keratin_2_head pfam16208
Keratin type II head;
17-142 1.52e-26

Keratin type II head;


Pssm-ID: 465068 [Multi-domain]  Cd Length: 156  Bit Score: 105.51  E-value: 1.52e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   17 GFSANSARLPGVSRSGFSSISVSRSRGSGGLGGACGGAGFGSRSLYGLGGSKRISI----------------GGGSCAIS 80
Cdd:pfam16208   1 GFSSCSAVVPSRSRRSYSSVSSSRRGGGGGGGGGGGGGGFGSRSLYNLGGSKSISIsvagggsrpgsgfgfgGGGGGGFG 80
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 194388850   81 GGYGSRAGGSYGFGGAGSGFGLGGFGGPG--------------FPVCPPGGIQEVTVNQSLLTPLNLQIDPAIQRV 142
Cdd:pfam16208  81 GGFGGGGGGGFGGGGGFGGGFGGGGYGGGgfggggfggrggfgGPPCPPGGIQEVTVNQSLLQPLNLEIDPEIQRV 156
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
145-440 2.84e-11

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 66.62  E-value: 2.84e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   145 EEREQIKTLNNKFASFIDKVQFLEQQNKVLDTKWTLLQEQGTKTVRQNLEplFEQYINNLRRQLDSIVGERGRLDSELRN 224
Cdd:TIGR02168  674 ERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEE--LSRQISALRKDLARLEAEVEQLEERIAQ 751
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   225 MQDLVEDLKNKYEDEVNKRTAAENEFVTLKKdvdaaymNKVELQAKADTLTDEINFLRALYD---AELSQMQTHISDTSV 301
Cdd:TIGR02168  752 LSKELTELEAEIEELEERLEEAEEELAEAEA-------EIEELEAQIEQLKEELKALREALDelrAELTLLNEEAANLRE 824
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   302 VLSMDNNR-------NLDLDSIIAEVKAQYEEIAQ----------RSRAEAESWYQtKYEELQVTAGRHGDDLRNTKQEI 364
Cdd:TIGR02168  825 RLESLERRiaaterrLEDLEEQIEELSEDIESLAAeieeleelieELESELEALLN-ERASLEEALALLRSELEELSEEL 903
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   365 AEINRMIQRLRSEIDHVKKQCASLQAAIADAEQR-----------GEMALKDAKNKLEGLEDALQKAKQDLARLLKEYQE 433
Cdd:TIGR02168  904 RELESKRSELRRELEELREKLAQLELRLEGLEVRidnlqerlseeYSLTLEEAEALENKIEDDEEEARRRLKRLENKIKE 983

                   ....*..
gi 194388850   434 LMNVKLA 440
Cdd:TIGR02168  984 LGPVNLA 990
PRK09039 PRK09039
peptidoglycan -binding protein;
286-426 9.92e-08

peptidoglycan -binding protein;


Pssm-ID: 181619 [Multi-domain]  Cd Length: 343  Bit Score: 54.20  E-value: 9.92e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 286 DAELSQMQTHISDTSVVLSMDNNRNLDLDSIIAEVKAQYEEiAQRSRAEAESWYQTKYEELQVTAGRHGD---DLRNTKQ 362
Cdd:PRK09039  52 DSALDRLNSQIAELADLLSLERQGNQDLQDSVANLRASLSA-AEAERSRLQALLAELAGAGAAAEGRAGElaqELDSEKQ 130
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 194388850 363 EIAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQRGemalKDAKNKLEG----LEDALQKAKQDLAR 426
Cdd:PRK09039 131 VSARALAQVELLNQQIAALRRQLAALEAALDASEKRD----RESQAKIADlgrrLNVALAQRVQELNR 194
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
198-473 3.66e-06

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 49.63  E-value: 3.66e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 198 EQYIN-NLRRQLDSIVGERGRLDSELRNMQDLVEDLKNKYEDevnkrTAAENEFVTLKKDVDAAYMNKVELQAKADTLTD 276
Cdd:COG3206  159 EAYLEqNLELRREEARKALEFLEEQLPELRKELEEAEAALEE-----FRQKNGLVDLSEEAKLLLQQLSELESQLAEARA 233
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 277 EINFLRALYDAELSQMQTHISDTSVVLSmdnnrnldlDSIIAEVKAQYEEIaqrsraeaeswyQTKYEELQVTAGRHGDD 356
Cdd:COG3206  234 ELAEAEARLAALRAQLGSGPDALPELLQ---------SPVIQQLRAQLAEL------------EAELAELSARYTPNHPD 292
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 357 LRNTKQEIAEINRMIQRLRSEIDhvkkqcASLQAAIADAEQRgemalkdaKNKLEGLEDALQKAKQDLARLLKEYQELMN 436
Cdd:COG3206  293 VIALRAQIAALRAQLQQEAQRIL------ASLEAELEALQAR--------EASLQAQLAQLEARLAELPELEAELRRLER 358
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|
gi 194388850 437 vklalDVEIA--TYRKLLEG-EECRLNgEGVGQVNVSVVQ 473
Cdd:COG3206  359 -----EVEVAreLYESLLQRlEEARLA-EALTVGNVRVID 392
ClyA_Cry6Aa-like cd22656
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ...
216-433 2.10e-03

Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.


Pssm-ID: 439154 [Multi-domain]  Cd Length: 309  Bit Score: 40.43  E-value: 2.10e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 216 GRLDSELRNMQDLVEDLKNKYEDEVNKRTAAE-NEFV-TLKKDVDaaymnkvELQAKADTLTDEIN-FLRAL--YDAELS 290
Cdd:cd22656   87 GTIDSYYAEILELIDDLADATDDEELEEAKKTiKALLdDLLKEAK-------KYQDKAAKVVDKLTdFENQTekDQTALE 159
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 291 QMQTHISDtsvVLSMDNNRNL--DLDSIIAEVKAQYEEIAQRSRA---EAESWYQTKYEELQV------TAGRHGDDLRN 359
Cdd:cd22656  160 TLEKALKD---LLTDEGGAIArkEIKDLQKELEKLNEEYAAKLKAkidELKALIADDEAKLAAalrliaDLTAADTDLDN 236
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 194388850 360 TKQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQrgemALKDAKNKLEGLEDALQKAKqDLARLLKEYQE 433
Cdd:cd22656  237 LLALIGPAIPALEKLQGAWQAIATDLDSLKDLLEDDIS----KIPAAILAKLELEKAIEKWN-ELAEKADKFRQ 305
Spc7 smart00787
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ...
355-459 8.73e-03

Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.


Pssm-ID: 197874 [Multi-domain]  Cd Length: 312  Bit Score: 38.46  E-value: 8.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   355 DDLRNTKQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQRGEMALKDAKNKLEGLEDALQKAKQDLARLLKEYQEL 434
Cdd:smart00787 158 EDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQEL 237
                           90       100       110
                   ....*....|....*....|....*....|..
gi 194388850   435 -------MNVKLALDVEIATYRKLLegEECRL 459
Cdd:smart00787 238 eskiedlTNKKSELNTEIAEAEKKL--EQCRG 267
 
Name Accession Description Interval E-value
Filament pfam00038
Intermediate filament protein;
145-458 7.81e-151

Intermediate filament protein;


Pssm-ID: 459643 [Multi-domain]  Cd Length: 313  Bit Score: 434.73  E-value: 7.81e-151
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  145 EEREQIKTLNNKFASFIDKVQFLEQQNKVLDTKWTLLQEQGTKTVRqNLEPLFEQYINNLRRQLDSIVGERGRLDSELRN 224
Cdd:pfam00038   1 NEKEQLQELNDRLASYIDKVRFLEQQNKLLETKISELRQKKGAEPS-RLYSLYEKEIEDLRRQLDTLTVERARLQLELDN 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  225 MQDLVEDLKNKYEDEVNKRTAAENEFVTLKKDVDAAYMNKVELQAKADTLTDEINFLRALYDAELSQMQTHISDTSVVLS 304
Cdd:pfam00038  80 LRLAAEDFRQKYEDELNLRTSAENDLVGLRKDLDEATLARVDLEAKIESLKEELAFLKKNHEEEVRELQAQVSDTQVNVE 159
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  305 MDNNRNLDLDSIIAEVKAQYEEIAQRSRAEAESWYQTKYEELQVTAGRHGDDLRNTKQEIAEINRMIQRLRSEIDHVKKQ 384
Cdd:pfam00038 160 MDAARKLDLTSALAEIRAQYEEIAAKNREEAEEWYQSKLEELQQAAARNGDALRSAKEEITELRRTIQSLEIELQSLKKQ 239
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 194388850  385 CASLQAAIADAEQRGEMALKDAKNKLEGLEDALQKAKQDLARLLKEYQELMNVKLALDVEIATYRKLLEGEECR 458
Cdd:pfam00038 240 KASLERQLAETEERYELQLADYQELISELEAELQETRQEMARQLREYQELLNVKLALDIEIATYRKLLEGEECR 313
Keratin_2_head pfam16208
Keratin type II head;
17-142 1.52e-26

Keratin type II head;


Pssm-ID: 465068 [Multi-domain]  Cd Length: 156  Bit Score: 105.51  E-value: 1.52e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   17 GFSANSARLPGVSRSGFSSISVSRSRGSGGLGGACGGAGFGSRSLYGLGGSKRISI----------------GGGSCAIS 80
Cdd:pfam16208   1 GFSSCSAVVPSRSRRSYSSVSSSRRGGGGGGGGGGGGGGFGSRSLYNLGGSKSISIsvagggsrpgsgfgfgGGGGGGFG 80
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 194388850   81 GGYGSRAGGSYGFGGAGSGFGLGGFGGPG--------------FPVCPPGGIQEVTVNQSLLTPLNLQIDPAIQRV 142
Cdd:pfam16208  81 GGFGGGGGGGFGGGGGFGGGFGGGGYGGGgfggggfggrggfgGPPCPPGGIQEVTVNQSLLQPLNLEIDPEIQRV 156
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
145-440 2.84e-11

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 66.62  E-value: 2.84e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   145 EEREQIKTLNNKFASFIDKVQFLEQQNKVLDTKWTLLQEQGTKTVRQNLEplFEQYINNLRRQLDSIVGERGRLDSELRN 224
Cdd:TIGR02168  674 ERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEE--LSRQISALRKDLARLEAEVEQLEERIAQ 751
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   225 MQDLVEDLKNKYEDEVNKRTAAENEFVTLKKdvdaaymNKVELQAKADTLTDEINFLRALYD---AELSQMQTHISDTSV 301
Cdd:TIGR02168  752 LSKELTELEAEIEELEERLEEAEEELAEAEA-------EIEELEAQIEQLKEELKALREALDelrAELTLLNEEAANLRE 824
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   302 VLSMDNNR-------NLDLDSIIAEVKAQYEEIAQ----------RSRAEAESWYQtKYEELQVTAGRHGDDLRNTKQEI 364
Cdd:TIGR02168  825 RLESLERRiaaterrLEDLEEQIEELSEDIESLAAeieeleelieELESELEALLN-ERASLEEALALLRSELEELSEEL 903
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   365 AEINRMIQRLRSEIDHVKKQCASLQAAIADAEQR-----------GEMALKDAKNKLEGLEDALQKAKQDLARLLKEYQE 433
Cdd:TIGR02168  904 RELESKRSELRRELEELREKLAQLELRLEGLEVRidnlqerlseeYSLTLEEAEALENKIEDDEEEARRRLKRLENKIKE 983

                   ....*..
gi 194388850   434 LMNVKLA 440
Cdd:TIGR02168  984 LGPVNLA 990
PRK09039 PRK09039
peptidoglycan -binding protein;
286-426 9.92e-08

peptidoglycan -binding protein;


Pssm-ID: 181619 [Multi-domain]  Cd Length: 343  Bit Score: 54.20  E-value: 9.92e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 286 DAELSQMQTHISDTSVVLSMDNNRNLDLDSIIAEVKAQYEEiAQRSRAEAESWYQTKYEELQVTAGRHGD---DLRNTKQ 362
Cdd:PRK09039  52 DSALDRLNSQIAELADLLSLERQGNQDLQDSVANLRASLSA-AEAERSRLQALLAELAGAGAAAEGRAGElaqELDSEKQ 130
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 194388850 363 EIAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQRGemalKDAKNKLEG----LEDALQKAKQDLAR 426
Cdd:PRK09039 131 VSARALAQVELLNQQIAALRRQLAALEAALDASEKRD----RESQAKIADlgrrLNVALAQRVQELNR 194
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
128-417 1.91e-07

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 54.29  E-value: 1.91e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   128 LTPLNLQIDPAIQRVRAEEREqIKTLNNKFASFIDKVQFLEQQNKVLDTKWTLLQEQgTKTVRQNLEpLFEQYINNLRRQ 207
Cdd:TIGR02168  241 LEELQEELKEAEEELEELTAE-LQELEEKLEELRLEVSELEEEIEELQKELYALANE-ISRLEQQKQ-ILRERLANLERQ 317
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   208 LDSIVGERGRLDSELRNMQDLVEDLKNKYEDEVNKRTAAENEFVTLKKDVDAAYMNKVELQAKADTLTDEINFLR---AL 284
Cdd:TIGR02168  318 LEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLElqiAS 397
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   285 YDAELSQMQTHISDTSVVLSMDNNRNLDLDSII--AEVKAQYEEIAQRSRAEaeswyqtkyEELQVTAGRHGDDLRNTKQ 362
Cdd:TIGR02168  398 LNNEIERLEARLERLEDRRERLQQEIEELLKKLeeAELKELQAELEELEEEL---------EELQEELERLEEALEELRE 468
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 194388850   363 EIAEINRMIQRLRSEIDHVKKQCASLQAAIADAE--QRGEMALKDAKNKLEGLEDAL 417
Cdd:TIGR02168  469 ELEEAEQALDAAERELAQLQARLDSLERLQENLEgfSEGVKALLKNQSGLSGILGVL 525
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
204-453 2.76e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 53.53  E-value: 2.76e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   204 LRRQLDSIVGERGRLDSELRNMQDLVEDLKNKYEDEVNKRTAAENEFVTLKKDVDAAYMNKVELQAKADTLTDEINFLRA 283
Cdd:TIGR02169  679 LRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKS 758
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   284 L---YDAELSQMQTHISdtSVVLSMDNNRNLDLDSIIAEVKAQYEEI-AQRSRAEAeswyqtKYEELQVTAGRHGDDLRN 359
Cdd:TIGR02169  759 ElkeLEARIEELEEDLH--KLEEALNDLEARLSHSRIPEIQAELSKLeEEVSRIEA------RLREIEQKLNRLTLEKEY 830
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   360 TKQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQRgemaLKDAKNKLEGLEDALQKAKQDLARLLKEYQELMNVKL 439
Cdd:TIGR02169  831 LEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEE----LEELEAALRDLESRLGDLKKERDELEAQLRELERKIE 906
                          250
                   ....*....|....
gi 194388850   440 ALDVEIATYRKLLE 453
Cdd:TIGR02169  907 ELEAQIEKKRKRLS 920
PRK01156 PRK01156
chromosome segregation protein; Provisional
135-456 4.94e-07

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 52.60  E-value: 4.94e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 135 IDP-AIQRVRAEEREQIKTLNNKFASFIDKVQFLEQQNKVLDTKWTLLQEQ------GTKTVRQNLEPLFEQYINNLRR- 206
Cdd:PRK01156 402 IDPdAIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELSRNMEMLNGQsvcpvcGTTLGEEKSNHIINHYNEKKSRl 481
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 207 --QLDSIVGERGRLDSELRNMQDLVEDLKNKyedEVNKRTAAENEFVTLKKDVDAAYMNKVELQAKADTLTDEINFLRAL 284
Cdd:PRK01156 482 eeKIREIEIEVKDIDEKIVDLKKRKEYLESE---EINKSINEYNKIESARADLEDIKIKINELKDKHDKYEEIKNRYKSL 558
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 285 YDAELSQMQTHISDTSVVLSmdnnrNLDLDSIIA---EVKAQYEEIAQRSRaEAESWYQTKYEELQVTAGRHGDD---LR 358
Cdd:PRK01156 559 KLEDLDSKRTSWLNALAVIS-----LIDIETNRSrsnEIKKQLNDLESRLQ-EIEIGFPDDKSYIDKSIREIENEannLN 632
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 359 NTKQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQRGEMALkDAKNKLEGLEDALQKAKQDLARLLKEYQELMNVK 438
Cdd:PRK01156 633 NKYNEIQENKILIEKLRGKIDNYKKQIAEIDSIIPDLKEITSRIN-DIEDNLKKSRKALDDAKANRARLESTIEILRTRI 711
                        330
                 ....*....|....*...
gi 194388850 439 LALDVEIATYRKLLEGEE 456
Cdd:PRK01156 712 NELSDRINDINETLESMK 729
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
174-434 7.41e-07

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 52.37  E-value: 7.41e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   174 LDTKWTLLQEQGTKTVRqnleplFEQYINNLRR-QLDSIVGERGRLDSELRNMQDLVEDLKNKYEDEVNKRTAAENEFVT 252
Cdd:TIGR02168  198 LERQLKSLERQAEKAER------YKELKAELRElELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEE 271
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   253 LKKDVDAAYMNKVELQAKADTLTDEINFL---RALYDAELSQMQTHISDTSVVLSMDNNRNLDLDSIIAEVKAQYEEIAQ 329
Cdd:TIGR02168  272 LRLEVSELEEEIEELQKELYALANEISRLeqqKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKE 351
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   330 RSRA------EAESWYQ---TKYEELQVTAGRHGDDLRNTKQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQR-G 399
Cdd:TIGR02168  352 ELESleaeleELEAELEeleSRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKlE 431
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 194388850   400 EMALKDAKNKLEGLEDALQKAKQDLARLLKEYQEL 434
Cdd:TIGR02168  432 EAELKELQAELEELEEELEELQEELERLEEALEEL 466
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
187-434 7.46e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 52.38  E-value: 7.46e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   187 KTVRQNLEpLFEQYINNLRRQLDSIVGERgrldSELRNMQDLVEDL-KNKYEDEVNKRTAAENEFVTLKKDVDAAYMNKV 265
Cdd:TIGR02169  180 EEVEENIE-RLDLIIDEKRQQLERLRRER----EKAERYQALLKEKrEYEGYELLKEKEALERQKEAIERQLASLEEELE 254
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   266 ELQAKADTLTDEINFLRALydaeLSQMQTHISDtsvvlsMDNNRNLDLDSIIAEVKAqyeEIAQRSRAEAEswYQTKYEE 345
Cdd:TIGR02169  255 KLTEEISELEKRLEEIEQL----LEELNKKIKD------LGEEEQLRVKEKIGELEA---EIASLERSIAE--KERELED 319
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   346 LQVTAGRHGDDLRNTKQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQRGE----------MALKDAKNKLEGLED 415
Cdd:TIGR02169  320 AEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEevdkefaetrDELKDYREKLEKLKR 399
                          250
                   ....*....|....*....
gi 194388850   416 ALQKAKQDLARLLKEYQEL 434
Cdd:TIGR02169  400 EINELKRELDRLQEELQRL 418
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
201-434 1.09e-06

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 51.61  E-value: 1.09e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   201 INNLRRQLDSIVGERGRLDSELRNMQDLVEDLKNKYE------DEVNKRTAA--ENEFVTLKKDVDAAYMNKVELQAKAD 272
Cdd:TIGR02169  232 KEALERQKEAIERQLASLEEELEKLTEEISELEKRLEeieqllEELNKKIKDlgEEEQLRVKEKIGELEAEIASLERSIA 311
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   273 TLTDEINFL---RALYDAELSQMQTHISDTSVVLSMDNNRNLDLDSIIAEVKAQYEEIAQR----------------SRA 333
Cdd:TIGR02169  312 EKERELEDAeerLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAEleevdkefaetrdelkDYR 391
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   334 EAESWYQTKYEELQVTAGRHGDDLRNTKQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQrgemalkdaknKLEGL 413
Cdd:TIGR02169  392 EKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEW-----------KLEQL 460
                          250       260
                   ....*....|....*....|.
gi 194388850   414 EDALQKAKQDLARLLKEYQEL 434
Cdd:TIGR02169  461 AADLSKYEQELYDLKEEYDRV 481
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
262-462 1.43e-06

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 51.21  E-value: 1.43e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   262 MNKVELQAKADT----LTDEINFL-RALYDAELSQMQTHISDTSVVLSMDNNRNLDLDSIIAEVKAQYEEIaQRSRAEAE 336
Cdd:TIGR02168  202 LKSLERQAEKAErykeLKAELRELeLALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEEL-RLEVSELE 280
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   337 SwyqtKYEELQvtagrhgDDLRNTKQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQRGEMA---LKDAKNKLEGL 413
Cdd:TIGR02168  281 E----EIEELQ-------KELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELaeeLAELEEKLEEL 349
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 194388850   414 EDALQKAKQDLARLLKEYQELMNVKLALDVEIATYRK---LLEGEECRLNGE 462
Cdd:TIGR02168  350 KEELESLEAELEELEAELEELESRLEELEEQLETLRSkvaQLELQIASLNNE 401
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
214-460 2.04e-06

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 50.83  E-value: 2.04e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   214 ERGRLDSELRNMQDLVEDLKNKYEDEVNKRTAAENEFVTLKKDVDAAYMNKVELQAKADTLTDEINFLRALYDAELSQMQ 293
Cdd:TIGR02168  678 EIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELT 757
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   294 THISDTSVVLSMDNNRNLDLDSIIAEVKAQYEEIAQRSRA-----EAESWYQTKYEELQVTAGRHGDDLRNTKQEIAEIN 368
Cdd:TIGR02168  758 ELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEElkalrEALDELRAELTLLNEEAANLRERLESLERRIAATE 837
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   369 RMIQRLRSEIDHVKKQCASLQAAIADAEQRGEMALKDAKNKLE----------GLEDALQKAKQDLARLLKEYQELMNVK 438
Cdd:TIGR02168  838 RRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNerasleealaLLRSELEELSEELRELESKRSELRREL 917
                          250       260
                   ....*....|....*....|..
gi 194388850   439 LALDVEIATYRKLLEGEECRLN 460
Cdd:TIGR02168  918 EELREKLAQLELRLEGLEVRID 939
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
198-473 3.66e-06

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 49.63  E-value: 3.66e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 198 EQYIN-NLRRQLDSIVGERGRLDSELRNMQDLVEDLKNKYEDevnkrTAAENEFVTLKKDVDAAYMNKVELQAKADTLTD 276
Cdd:COG3206  159 EAYLEqNLELRREEARKALEFLEEQLPELRKELEEAEAALEE-----FRQKNGLVDLSEEAKLLLQQLSELESQLAEARA 233
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 277 EINFLRALYDAELSQMQTHISDTSVVLSmdnnrnldlDSIIAEVKAQYEEIaqrsraeaeswyQTKYEELQVTAGRHGDD 356
Cdd:COG3206  234 ELAEAEARLAALRAQLGSGPDALPELLQ---------SPVIQQLRAQLAEL------------EAELAELSARYTPNHPD 292
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 357 LRNTKQEIAEINRMIQRLRSEIDhvkkqcASLQAAIADAEQRgemalkdaKNKLEGLEDALQKAKQDLARLLKEYQELMN 436
Cdd:COG3206  293 VIALRAQIAALRAQLQQEAQRIL------ASLEAELEALQAR--------EASLQAQLAQLEARLAELPELEAELRRLER 358
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|
gi 194388850 437 vklalDVEIA--TYRKLLEG-EECRLNgEGVGQVNVSVVQ 473
Cdd:COG3206  359 -----EVEVAreLYESLLQRlEEARLA-EALTVGNVRVID 392
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
127-384 4.36e-06

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 49.68  E-value: 4.36e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   127 LLTPLNLQIDPAIQRVRAEEREQIKTLNNKFASFIDKVQFLEQQNKVLDTKWTLLQEQGTKTVRQnleplfeqyINNLRR 206
Cdd:TIGR02169  273 LLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAE---------IEELER 343
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   207 QLDSIVGERGRLDSELRNMQDLVEDLKNKYEDEVNKRTAAENEFVTLKKDVDAAYMNKVELQAKADTLTDEINFLralyD 286
Cdd:TIGR02169  344 EIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRL----S 419
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   287 AELSQMQTHISdtsvvlsmdnnrnlDLDSIIAEVKAQYEEIAQRSRAEaeswyQTKYEELQVTAGRHGDDLRNTKQEIAE 366
Cdd:TIGR02169  420 EELADLNAAIA--------------GIEAKINELEEEKEDKALEIKKQ-----EWKLEQLAADLSKYEQELYDLKEEYDR 480
                          250
                   ....*....|....*...
gi 194388850   367 INRMIQRLRSEIDHVKKQ 384
Cdd:TIGR02169  481 VEKELSKLQRELAEAEAQ 498
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
140-427 6.73e-06

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 49.02  E-value: 6.73e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   140 QRVRAEEREQIKTLNNKFASFIDKVQFLEQQN-------KVLDTKWTLLQEQGTKTVRQNLE-----PLFEQYINNLRRQ 207
Cdd:pfam01576  425 ERQRAELAEKLSKLQSELESVSSLLNEAEGKNiklskdvSSLESQLQDTQELLQEETRQKLNlstrlRQLEDERNSLQEQ 504
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   208 LDSIVGERGRLDSELRNMQDLVEDLKNKYEDEVNKRTAAENEFVTLKKDVDAAYMNKVELQAKADTLtdeinflralyDA 287
Cdd:pfam01576  505 LEEEEEAKRNVERQLSTLQAQLSDMKKKLEEDAGTLEALEEGKKRLQRELEALTQQLEEKAAAYDKL-----------EK 573
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   288 ELSQMQTHISDTSVVLsmDNNRNL---------DLDSIIAEVK---AQYEEiaQRSRAEAESwyqTKYEELQVTAGRHGD 355
Cdd:pfam01576  574 TKNRLQQELDDLLVDL--DHQRQLvsnlekkqkKFDQMLAEEKaisARYAE--ERDRAEAEA---REKETRALSLARALE 646
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 194388850   356 DLRNTKQEIAEINRMIQ----RLRSEIDHVKKQCASLQAAIADAEQrgemALKDAKNKLEGLEDALQKAKQDLARL 427
Cdd:pfam01576  647 EALEAKEELERTNKQLRaemeDLVSSKDDVGKNVHELERSKRALEQ----QVEEMKTQLEELEDELQATEDAKLRL 718
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
191-456 7.24e-06

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 48.91  E-value: 7.24e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 191 QNLEPLFEQyINNLRRQLDSIVGERGRLDSELRNMQDLVEDLKNKYEdEVNKRTAAENEfvtLKKDVDaAYMNKVELQAK 270
Cdd:PRK03918 231 KELEELKEE-IEELEKELESLEGSKRKLEEKIRELEERIEELKKEIE-ELEEKVKELKE---LKEKAE-EYIKLSEFYEE 304
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 271 ADTLTDEINFLRALYDAELSQMQTHISDtsvvLSMDNNRNLDLDSIIAEVKAQYEEIAQRSRAeaeswyqtkYEELQVTA 350
Cdd:PRK03918 305 YLDELREIEKRLSRLEEEINGIEERIKE----LEEKEERLEELKKKLKELEKRLEELEERHEL---------YEEAKAKK 371
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 351 GRhgddLRNTKQEIA-----EINRMIQRLRSEIDHVKKQCASLQAAIADAEQRGE------MALKDAKNKL--------- 410
Cdd:PRK03918 372 EE----LERLKKRLTgltpeKLEKELEELEKAKEEIEEEISKITARIGELKKEIKelkkaiEELKKAKGKCpvcgrelte 447
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|....*.
gi 194388850 411 EGLEDALQKAKQDLARLLKEYQELMNVKLALDVEIATYRKLLEGEE 456
Cdd:PRK03918 448 EHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKES 493
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
201-420 2.94e-05

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 46.36  E-value: 2.94e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 201 INNLRRQLDSIVGERGRLDSELRNMQDLVEDLKNKYEDEVNKRTAAENEFVTLKKDVDaaymnkvELQAKADTLTDEI-N 279
Cdd:COG3883   18 IQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIA-------EAEAEIEERREELgE 90
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 280 FLRALYDAELSqmqthISDTSVVLSMDN-----NRNLDLDSIIAEVKAQYEEI--AQRSRAEAESWYQTKYEELQVTAGR 352
Cdd:COG3883   91 RARALYRSGGS-----VSYLDVLLGSESfsdflDRLSALSKIADADADLLEELkaDKAELEAKKAELEAKLAELEALKAE 165
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 194388850 353 HGDDLRNTKQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQRGEMALKDAKNKLEGLEDALQKA 420
Cdd:COG3883  166 LEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAA 233
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
356-437 5.44e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 45.53  E-value: 5.44e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 356 DLRNTKQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQR---GEMALKDAKNKLEGLEDALQKAKQDLARLLKEYQ 432
Cdd:COG4942   28 ELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRiraLEQELAALEAELAELEKEIAELRAELEAQKEELA 107

                 ....*
gi 194388850 433 ELMNV 437
Cdd:COG4942  108 ELLRA 112
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
355-434 6.59e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 45.14  E-value: 6.59e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 355 DDLRNTKQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQRgemaLKDAKNKLEGLEDALQKAKQDLARLLKEYQEL 434
Cdd:COG4942   20 DAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERR----IAALARRIRALEQELAALEAELAELEKEIAEL 95
46 PHA02562
endonuclease subunit; Provisional
162-430 6.64e-05

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 45.78  E-value: 6.64e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 162 DKVQFLEQQNKVLDTKWTLLQEQgTKTvrqnleplFEQYINNLRRQLDSIVgergrldSELRNMQDLVEDLKNKYEDEVN 241
Cdd:PHA02562 174 DKIRELNQQIQTLDMKIDHIQQQ-IKT--------YNKNIEEQRKKNGENI-------ARKQNKYDELVEEAKTIKAEIE 237
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 242 KRTAAENEFVTLKKDVDAAY----MNKVELQAKADTLTDEINFLRAlYDAELSQMQThISDTsvvlsmdnnrnldlDSII 317
Cdd:PHA02562 238 ELTDELLNLVMDIEDPSAALnklnTAAAKIKSKIEQFQKVIKMYEK-GGVCPTCTQQ-ISEG--------------PDRI 301
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 318 AEVKAQYEEIAQRSRAEaeswyQTKYEELQVTAGRHgDDLRNTKQE----IAEINRMIQRLRSEIDHVKKQCASLQAAIa 393
Cdd:PHA02562 302 TKIKDKLKELQHSLEKL-----DTAIDELEEIMDEF-NEQSKKLLElknkISTNKQSLITLVDKAKKVKAAIEELQAEF- 374
                        250       260       270
                 ....*....|....*....|....*....|....*..
gi 194388850 394 daeqrgemalKDAKNKLEGLEDALQKAKQDLARLLKE 430
Cdd:PHA02562 375 ----------VDNAEELAKLQDELDKIVKTKSELVKE 401
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
214-434 8.13e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 45.14  E-value: 8.13e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 214 ERGRLDSELRNMQDLVEDLKNKYEDEVNKRTAAENEFVTLKKDVDAAYMNKVELQAKADTLTDEINFLRalydAELSQMQ 293
Cdd:COG4942   28 ELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELR----AELEAQK 103
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 294 THISDTSVVLSMDNNRNldldsiIAEVKAQYEEIAQRSRAEAesWYQTKYEELQvtagRHGDDLRNTKQEIAEINRMIQR 373
Cdd:COG4942  104 EELAELLRALYRLGRQP------PLALLLSPEDFLDAVRRLQ--YLKYLAPARR----EQAEELRADLAELAALRAELEA 171
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 194388850 374 LRSEIDHVKKQCASLQAAIADAEQRGEMALKDAKNKLEGLEDALQKAKQDLARLLKEYQEL 434
Cdd:COG4942  172 ERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARL 232
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
312-450 9.44e-05

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 45.29  E-value: 9.44e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  312 DLDSIIAEVKAQYEEIAQRsRAEAEswyqTKYEELQVT-AGRHGDDLRNTKQEIAEINRMIQRLRSEIDHVKKQCASLQA 390
Cdd:COG4913   299 ELRAELARLEAELERLEAR-LDALR----EELDELEAQiRGNGGDRLEQLEREIERLERELEERERRRARLEALLAALGL 373
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  391 AIADAEQRGEMALKDAKNKLEGLEDALQKAKQDLARLLKEYQELMNVKLALDVEIATYRK 450
Cdd:COG4913   374 PLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLER 433
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
355-461 1.17e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 45.29  E-value: 1.17e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  355 DDLRNTKQEIAEINRMIQRLRSEIDHVKKQC----------------ASLQAAIADAEQRGEmALKDAKNKLEGLEDALQ 418
Cdd:COG4913   617 AELAELEEELAEAEERLEALEAELDALQERRealqrlaeyswdeidvASAEREIAELEAELE-RLDASSDDLAALEEQLE 695
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 194388850  419 KAKQDLARLLKEYQELMNVKLALDVEIATYRKLLEGEECRLNG 461
Cdd:COG4913   696 ELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEA 738
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
205-436 1.32e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 44.91  E-value: 1.32e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  205 RRQLDSIVGERGRLDSELRNMQDLVEDLKNKYEDEVNKRTAAENEFVTLKKDVDAAymnkvELQAKADTLTDEInflral 284
Cdd:COG4913   609 RAKLAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVA-----SAEREIAELEAEL------ 677
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  285 ydaelsqmqthisdtsvvlsmdnnRNLDLDS-IIAEVKAQYEEiAQRSRAEAESwyqtKYEELQVTAGRHgddlrntKQE 363
Cdd:COG4913   678 ------------------------ERLDASSdDLAALEEQLEE-LEAELEELEE----ELDELKGEIGRL-------EKE 721
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 194388850  364 IAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQRGEMALKDAKNKLEGLEDALQKAKQDLARLLKEYQELMN 436
Cdd:COG4913   722 LEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELRENLEERIDALRARLNRAEEELERAMR 794
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
312-453 1.53e-04

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 43.37  E-value: 1.53e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 312 DLDSIIAEVKAQYEEI-AQRSRAEAE-SWYQTKYEELQVTAGRHGDDLRNTKQEIAEINRMIQRLRSEIDHVK--KQCAS 387
Cdd:COG1579   14 ELDSELDRLEHRLKELpAELAELEDElAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRnnKEYEA 93
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 194388850 388 LQAAIADAEQR---GEMALKDAKNKLEGLEDALQKAKQDLARLLKEYQELmnvKLALDVEIATYRKLLE 453
Cdd:COG1579   94 LQKEIESLKRRisdLEDEILELMERIEELEEELAELEAELAELEAELEEK---KAELDEELAELEAELE 159
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
246-450 3.21e-04

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 43.28  E-value: 3.21e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 246 AENEFVTLKKDVDAAYMNKVELQAKADTLTDEINFLRALYD---AELSQMQTHISDTsvvlsmdnnrNLDLDSIIAEVKA 322
Cdd:COG3883   14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNelqAELEALQAEIDKL----------QAEIAEAEAEIEE 83
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 323 QYEEIAQRSRAEAESWYQTKYEELQVTAGRHGDDLRNtkqeIAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQRGEmA 402
Cdd:COG3883   84 RREELGERARALYRSGGSVSYLDVLLGSESFSDFLDR----LSALSKIADADADLLEELKADKAELEAKKAELEAKLA-E 158
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|..
gi 194388850 403 LKDAKNKLEGLEDALQKAKQD----LARLLKEYQELMNVKLALDVEIATYRK 450
Cdd:COG3883  159 LEALKAELEAAKAELEAQQAEqealLAQLSAEEAAAEAQLAELEAELAAAEA 210
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
317-447 3.27e-04

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 43.77  E-value: 3.27e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 317 IAEVKAQYEEIAQRSRAEAEswyqtKYEELQVTAGRHGDDLRNTKQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADAE 396
Cdd:COG1196  262 LAELEAELEELRLELEELEL-----ELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELE 336
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....
gi 194388850 397 QR---GEMALKDAKNKLEGLEDALQKAKQDLARLLKEYQELMNVKLALDVEIAT 447
Cdd:COG1196  337 EEleeLEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLE 390
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
139-438 5.88e-04

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 42.79  E-value: 5.88e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  139 IQRVRAEEREQIKTLNNKFASfidkvqfLEQQNKVLDTKWTLLQE-----------QGTKTVRQNLEPLFEQYINNLRRQ 207
Cdd:pfam05483 386 LQKKSSELEEMTKFKNNKEVE-------LEELKKILAEDEKLLDEkkqfekiaeelKGKEQELIFLLQAREKEIHDLEIQ 458
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  208 LDSIVGERGRLDSElrnmqdlVEDLKNKYEDEVNKRTAaenefvtLKKDVDAAYMNKVELQAKADTLTDEINFLRALYDA 287
Cdd:pfam05483 459 LTAIKTSEEHYLKE-------VEDLKTELEKEKLKNIE-------LTAHCDKLLLENKELTQEASDMTLELKKHQEDIIN 524
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  288 ELSQMQTHISDTSVVLSMDNNRNLDLDSIIAEVKAQYEEIAQRSRAEAESWYQTKYEELQVTAGRHGDDLR--NTKQEIA 365
Cdd:pfam05483 525 CKKQEERMLKQIENLEEKEMNLRDELESVREEFIQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILENKcnNLKKQIE 604
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 194388850  366 EINRMIQRLRSEIDHVKKQcaslqaaiADAEQRGEMALKDAKNKLEgLEdaLQKAKQDLARLLKEYQELMNVK 438
Cdd:pfam05483 605 NKNKNIEELHQENKALKKK--------GSAENKQLNAYEIKVNKLE-LE--LASAKQKFEEIIDNYQKEIEDK 666
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
145-419 6.99e-04

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 42.73  E-value: 6.99e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   145 EEREQIKTLNNKFASFIDKVQFLE-------------QQNKVLDTkWTLLQEQGTKTVRQNLEPLFEQYiNNLRRQLDSI 211
Cdd:TIGR01612  693 EDKAKLDDLKSKIDKEYDKIQNMEtatvelhlsnienKKNELLDI-IVEIKKHIHGEINKDLNKILEDF-KNKEKELSNK 770
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   212 VGERGRLDSELRNMQDLVEDLKNKYEDEVNKRTAAENEfvtLKKDVDAAYMNKVELQAKADTLTDEINFLRALYDAELSQ 291
Cdd:TIGR01612  771 INDYAKEKDELNKYKSKISEIKNHYNDQINIDNIKDED---AKQNYDKSKEYIKTISIKEDEIFKIINEMKFMKDDFLNK 847
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   292 MQTHIsdtsvvlSMDNNRNLDLDSiiaeVKAQYEEIAQRSRAEAESWYQTKYEelqvtagrhgDDLRNTKQEIAEINRMI 371
Cdd:TIGR01612  848 VDKFI-------NFENNCKEKIDS----EHEQFAELTNKIKAEISDDKLNDYE----------KKFNDSKSLINEINKSI 906
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*...
gi 194388850   372 QRLRSEIDHVKKQCASLQAAIADAEqrgemALKDAKNKLEGLEDALQK 419
Cdd:TIGR01612  907 EEEYQNINTLKKVDEYIKICENTKE-----SIEKFHNKQNILKEILNK 949
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
201-457 7.01e-04

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 42.72  E-value: 7.01e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 201 INNLRRQLDSIVGERGRLDSELRNMQDLVEDLKNKYEDEVNKRTaaenEFVTLKKDVDAAYMNKVELQAKADTLTDEINF 280
Cdd:PRK02224 208 LNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERRE----ELETLEAEIEDLRETIAETEREREELAEEVRD 283
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 281 LRALYDaELSQMQTHISDTSVVLSMDNN----RNLDLDSIIAEVKAQYEEI---AQRSRAEAESW------YQTKYEELQ 347
Cdd:PRK02224 284 LRERLE-ELEEERDDLLAEAGLDDADAEaveaRREELEDRDEELRDRLEECrvaAQAHNEEAESLredaddLEERAEELR 362
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 348 VTAGRHGDDLRNTKQEIAEINRMIQRLRSEIDhvkkqcaSLQAAIADAeqrgEMALKDAKNKLEGLEDALQKAKQDLARL 427
Cdd:PRK02224 363 EEAAELESELEEAREAVEDRREEIEELEEEIE-------ELRERFGDA----PVDLGNAEDFLEELREERDELREREAEL 431
                        250       260       270
                 ....*....|....*....|....*....|
gi 194388850 428 LKEYQELMNVklaldveIATYRKLLEGEEC 457
Cdd:PRK02224 432 EATLRTARER-------VEEAEALLEAGKC 454
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
319-456 7.64e-04

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 42.62  E-value: 7.64e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 319 EVKAQYEEIAQRSRAEAESWYQTKYEELQVTAGRHGDDLRNTKQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQR 398
Cdd:COG1196  217 ELKEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAE 296
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*...
gi 194388850 399 gemaLKDAKNKLEGLEDALQKAKQDLARLLKEYQELMNVKLALDVEIATYRKLLEGEE 456
Cdd:COG1196  297 ----LARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAE 350
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
312-459 9.28e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 42.21  E-value: 9.28e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  312 DLDSIIAEVKAQYEEIAQRSRA----EAESWYQTKYEELQVTAGRHGD---DLRNTKQEIAEINRMIQRLRSEIDHVKKQ 384
Cdd:COG4913   628 EAEERLEALEAELDALQERREAlqrlAEYSWDEIDVASAEREIAELEAeleRLDASSDDLAALEEQLEELEAELEELEEE 707
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  385 CASLQAAIADAEQRgemaLKDAKNKLEGLEDALQKA-----KQDLARLLKEYQELM------NVKLALDVEIATYRKLLE 453
Cdd:COG4913   708 LDELKGEIGRLEKE----LEQAEEELDELQDRLEAAedlarLELRALLEERFAAALgdaverELRENLEERIDALRARLN 783

                  ....*.
gi 194388850  454 GEECRL 459
Cdd:COG4913   784 RAEEEL 789
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
317-450 9.42e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 42.06  E-value: 9.42e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 317 IAEVKAQYEEIAQRsRAEAESWYQTKYEELQVTAGRHgdDLRNTKQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADAE 396
Cdd:COG4717  390 ALEQAEEYQELKEE-LEELEEQLEELLGELEELLEAL--DEEELEEELEELEEELEELEEELEELREELAELEAELEQLE 466
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....
gi 194388850 397 QRGEmalkdaknkLEGLEDALQKAKQDLARLLKEYQELMNVKLALDVEIATYRK 450
Cdd:COG4717  467 EDGE---------LAELLQELEELKAELRELAEEWAALKLALELLEEAREEYRE 511
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
145-458 9.57e-04

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 42.26  E-value: 9.57e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   145 EEREQIKTLNNKFASFIDKVQFLEQQnkvldtkWTLLQEQGTKTvrQNLEPLFEQYINNLRRQLDSIVGERGRLDSELR- 223
Cdd:TIGR00618  553 SERKQRASLKEQMQEIQQSFSILTQC-------DNRSKEDIPNL--QNITVRLQDLTEKLSEAEDMLACEQHALLRKLQp 623
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   224 --NMQDLVEDLKNKYEDEVNKRTAAENEFVTLKKDvdaaymnKVELQAKADTLTDEINFLRALydAELSQMQTHISDTSV 301
Cdd:TIGR00618  624 eqDLQDVRLHLQQCSQELALKLTALHALQLTLTQE-------RVREHALSIRVLPKELLASRQ--LALQKMQSEKEQLTY 694
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   302 VLSMDNNRNLDLDSI---IAEVKAQYEEIAQRSRA---------EAESWYQTKYEELQVTAGRHG--DDLRNTKQEIAEI 367
Cdd:TIGR00618  695 WKEMLAQCQTLLRELethIEEYDREFNEIENASSSlgsdlaareDALNQSLKELMHQARTVLKARteAHFNNNEEVTAAL 774
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   368 NRM--IQRLRSEIDHVKKQCASLQAAIADAEQRGEMALKDAKNKLEGLEDALQKAKQDLARLLKEyqelmnvKLALDVEI 445
Cdd:TIGR00618  775 QTGaeLSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEEEQFLSRLEE-------KSATLGEI 847
                          330
                   ....*....|...
gi 194388850   446 AtyRKLLEGEECR 458
Cdd:TIGR00618  848 T--HQLLKYEECS 858
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
119-298 1.24e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 41.82  E-value: 1.24e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  119 QEVTVNQSLLTPLNLQIDpaiQRVRAEEREQIKTLNNKFASFIDKVQFLEQQNKVLDTKWTLLQEQGTKTVRQNLEPLfE 198
Cdd:COG4913   269 ERLAELEYLRAALRLWFA---QRRLELLEAELEELRAELARLEAELERLEARLDALREELDELEAQIRGNGGDRLEQL-E 344
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  199 QYINNLRRQLDSIVGERGRLDSELRNMQ----DLVEDLKNKYEDEVNKRTAAENEFVTLKKDVDAAYMNKVELQAKADTL 274
Cdd:COG4913   345 REIERLERELEERERRRARLEALLAALGlplpASAEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELREL 424
                         170       180
                  ....*....|....*....|....*..
gi 194388850  275 TDEINFLRA---LYDAELSQMQTHISD 298
Cdd:COG4913   425 EAEIASLERrksNIPARLLALRDALAE 451
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
188-434 1.39e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 40.29  E-value: 1.39e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 188 TVRQNLEPLFEqyINNLRRQLDSIVGERGRLDSELRNMQDLVEDLKNKYEDEVNKRTAAENEFVTLKKDVDaaymnkvEL 267
Cdd:COG1579    1 AMPEDLRALLD--LQELDSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIE-------EV 71
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 268 QAKADTLTDEINFLRalydaelsqmqthisdtsvvlsmdNNRnlDLDSIIAEVkaqyeEIAQRSRAEAEswyqtkyeelq 347
Cdd:COG1579   72 EARIKKYEEQLGNVR------------------------NNK--EYEALQKEI-----ESLKRRISDLE----------- 109
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 348 vtagrhgddlrntkQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQRGEMALKDAKNKLEGLEDALQKAKQDL-AR 426
Cdd:COG1579  110 --------------DEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAEREELAAKIpPE 175

                 ....*...
gi 194388850 427 LLKEYQEL 434
Cdd:COG1579  176 LLALYERI 183
DUF1351 pfam07083
Protein of unknown function (DUF1351); This family consists of several bacterial and phage ...
310-445 1.53e-03

Protein of unknown function (DUF1351); This family consists of several bacterial and phage proteins of around 230 residues in length. The function of this family is unknown.


Pssm-ID: 429283 [Multi-domain]  Cd Length: 210  Bit Score: 40.05  E-value: 1.53e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  310 NLDLDSIIAEVKAQYEEIAQRSRAEAEswyqtKYEELQVTAgrhgDDLRNTKQEIAEINRMIQRLRSEIDHVKKQCAslq 389
Cdd:pfam07083   1 ELSVTQKPAAISFNFEELETYVDGIVA-----KYEGLVVTE----DTVKEAKKERAELNKIAKALDDKRKEVKKQYS--- 68
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 194388850  390 AAIADAEQRG---EMALKDAKNKL----EGLEDALQKAKQDLARLLK-EYQELMNVKLAlDVEI 445
Cdd:pfam07083  69 EPYDEFEAKIkelVAKIKEAIDPIdeqiKAFEEKEKDAKRQLVKALIsELAEEYGVPLE-EIEI 131
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
139-429 1.77e-03

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 41.58  E-value: 1.77e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   139 IQRVRAEEREQIKTLNNK---FASFIDKVQFLEQQNKVLD----------TKWTLLQEQGTKTVRQNLEPLFEQYINNLR 205
Cdd:TIGR01612  598 INKLKLELKEKIKNISDKneyIKKAIDLKKIIENNNAYIDelakispyqvPEHLKNKDKIYSTIKSELSKIYEDDIDALY 677
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   206 RQLDSIVGErgrldSELRNMQD--LVEDLKNKYEDEVNKRTAAENEfvTLKKDVDAAYMNKVELQAKA--------DTLT 275
Cdd:TIGR01612  678 NELSSIVKE-----NAIDNTEDkaKLDDLKSKIDKEYDKIQNMETA--TVELHLSNIENKKNELLDIIveikkhihGEIN 750
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   276 DEINFLRALYDAELSQMQTHISDTSvvlsmdnNRNLDLD---SIIAEVKAQYEE---IAQRSRAEAESWYQTKYEELQVT 349
Cdd:TIGR01612  751 KDLNKILEDFKNKEKELSNKINDYA-------KEKDELNkykSKISEIKNHYNDqinIDNIKDEDAKQNYDKSKEYIKTI 823
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   350 AGRHGD------DLRNTKQEI-AEINRMI-------QRLRSEIDHVKKQCASLQAAIADAEQRG-EMALKDAKNKLEGLE 414
Cdd:TIGR01612  824 SIKEDEifkiinEMKFMKDDFlNKVDKFInfennckEKIDSEHEQFAELTNKIKAEISDDKLNDyEKKFNDSKSLINEIN 903
                          330
                   ....*....|....*
gi 194388850   415 DALQKAKQDLARLLK 429
Cdd:TIGR01612  904 KSIEEEYQNINTLKK 918
ClyA_Cry6Aa-like cd22656
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ...
216-433 2.10e-03

Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.


Pssm-ID: 439154 [Multi-domain]  Cd Length: 309  Bit Score: 40.43  E-value: 2.10e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 216 GRLDSELRNMQDLVEDLKNKYEDEVNKRTAAE-NEFV-TLKKDVDaaymnkvELQAKADTLTDEIN-FLRAL--YDAELS 290
Cdd:cd22656   87 GTIDSYYAEILELIDDLADATDDEELEEAKKTiKALLdDLLKEAK-------KYQDKAAKVVDKLTdFENQTekDQTALE 159
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 291 QMQTHISDtsvVLSMDNNRNL--DLDSIIAEVKAQYEEIAQRSRA---EAESWYQTKYEELQV------TAGRHGDDLRN 359
Cdd:cd22656  160 TLEKALKD---LLTDEGGAIArkEIKDLQKELEKLNEEYAAKLKAkidELKALIADDEAKLAAalrliaDLTAADTDLDN 236
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 194388850 360 TKQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQrgemALKDAKNKLEGLEDALQKAKqDLARLLKEYQE 433
Cdd:cd22656  237 LLALIGPAIPALEKLQGAWQAIATDLDSLKDLLEDDIS----KIPAAILAKLELEKAIEKWN-ELAEKADKFRQ 305
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
148-433 2.94e-03

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 40.39  E-value: 2.94e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  148 EQIKTLNNKFASFIDKVQFLEQQNKVLDTKWTLLQEQgTKTVRQNLEPLfEQYINNLRRQLDSIVGERGRLDSELRNMQD 227
Cdd:TIGR04523  75 NKIKILEQQIKDLNDKLKKNKDKINKLNSDLSKINSE-IKNDKEQKNKL-EVELNKLEKQKKENKKNIDKFLTEIKKKEK 152
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  228 LVEDLKNKYEDEVNKRTAAENEFVTLKKDvdaaymnKVELQAKADTLTDEINFLRALydaeLSQMQTHISdtsvvlsmdn 307
Cdd:TIGR04523 153 ELEKLNNKYNDLKKQKEELENELNLLEKE-------KLNIQKNIDKIKNKLLKLELL----LSNLKKKIQ---------- 211
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  308 nRNLDLDSIIAEVKAQYEEIaqrsraeaeswyQTKYEELQvtagrhgDDLRNTKQEIAEINRMIQRLRSEIDHVKKQcas 387
Cdd:TIGR04523 212 -KNKSLESQISELKKQNNQL------------KDNIEKKQ-------QEINEKTTEISNTQTQLNQLKDEQNKIKKQ--- 268
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*.
gi 194388850  388 LQAAIADAEQrgemalkdAKNKLEGLEDALQKAKQDLARLLKEYQE 433
Cdd:TIGR04523 269 LSEKQKELEQ--------NNKKIKELEKQLNQLKSEISDLNNQKEQ 306
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
355-468 2.97e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 40.44  E-value: 2.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   355 DDLRNTKQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQ-------RGEMALKD---AKNKLEGLEDALQKAKQDL 424
Cdd:TIGR02169  674 AELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRkigeiekEIEQLEQEeekLKERLEELEEDLSSLEQEI 753
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 194388850   425 ARLLKEYQELMNVKLALDVEIATYRKLLEGEECRLNGEGVGQVN 468
Cdd:TIGR02169  754 ENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQ 797
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
136-439 6.45e-03

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 39.25  E-value: 6.45e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 136 DPAIQRVRAEEREQIKTLNNKFASFIDKVQFLEQQNKVLDTKWTLLQEQGTKTVRQNLEplFEQYINNLRRQLDSIVGER 215
Cdd:PRK02224 323 DEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVED--RREEIEELEEEIEELRERF 400
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 216 GRLDSELRNMQDLVEDLKNKYEDEVNKRTAAENEFVTLKKDVDAAYM-----------NKVELQAKADTLTDEINFLRAL 284
Cdd:PRK02224 401 GDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVEEAEAlleagkcpecgQPVEGSPHVETIEEDRERVEEL 480
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 285 yDAELSQMQTHISDTSVVL--------------SMDNNRNlDLDSIIAEVKAQYEE---IAQRSRAEAESwYQTKYEELQ 347
Cdd:PRK02224 481 -EAELEDLEEEVEEVEERLeraedlveaedrieRLEERRE-DLEELIAERRETIEEkreRAEELRERAAE-LEAEAEEKR 557
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 348 VTAGRHGDDLRNTKQEIAEINRMIQRLRSEIDHVKKqCASLQAAIADAEQR-----------GEM---------ALKDAK 407
Cdd:PRK02224 558 EAAAEAEEEAEEAREEVAELNSKLAELKERIESLER-IRTLLAAIADAEDEierlrekrealAELnderrerlaEKRERK 636
                        330       340       350
                 ....*....|....*....|....*....|....
gi 194388850 408 NKLEGL--EDALQKAKQDLARlLKEYQELMNVKL 439
Cdd:PRK02224 637 RELEAEfdEARIEEAREDKER-AEEYLEQVEEKL 669
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
364-459 6.86e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 39.28  E-value: 6.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   364 IAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQRGEmALKDAKNKLEGLeDALQKAKQDLA--RLLKEYQELMNVKLAL 441
Cdd:TIGR02169  165 VAEFDRKKEKALEELEEVEENIERLDLIIDEKRQQLE-RLRREREKAERY-QALLKEKREYEgyELLKEKEALERQKEAI 242
                           90
                   ....*....|....*...
gi 194388850   442 DVEIATYRKLLEGEECRL 459
Cdd:TIGR02169  243 ERQLASLEEELEKLTEEI 260
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
317-427 6.93e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 39.51  E-value: 6.93e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850  317 IAEVKAQYEEIaQRSRAEAESWY-QTKYEELQvtagrhgDDLRNTKQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADA 395
Cdd:COG4913   264 YAAARERLAEL-EYLRAALRLWFaQRRLELLE-------AELEELRAELARLEAELERLEARLDALREELDELEAQIRGN 335
                          90       100       110
                  ....*....|....*....|....*....|..
gi 194388850  396 EQRgemALKDAKNKLEGLEDALQKAKQDLARL 427
Cdd:COG4913   336 GGD---RLEQLEREIERLERELEERERRRARL 364
YqiK COG2268
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
317-428 7.47e-03

Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];


Pssm-ID: 441869 [Multi-domain]  Cd Length: 439  Bit Score: 39.09  E-value: 7.47e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 317 IAEVKAQYEEIAQRSRAEAESWYQTKYEELQVTAGRHGDDLRNTKQ-EIAEINRMIQRLRSEIDHVKKQCASLQAAIADA 395
Cdd:COG2268  246 LAKKKAEERREAETARAEAEAAYEIAEANAEREVQRQLEIAEREREiELQEKEAEREEAELEADVRKPAEAEKQAAEAEA 325
                         90       100       110
                 ....*....|....*....|....*....|....*.
gi 194388850 396 EQRGEMALKDAKNKLEGLE---DALQKAKQDLARLL 428
Cdd:COG2268  326 EAEAEAIRAKGLAEAEGKRalaEAWNKLGDAAILLM 361
PRK12704 PRK12704
phosphodiesterase; Provisional
318-433 8.33e-03

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 38.99  E-value: 8.33e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850 318 AEVKAQYEEIAQRSRAEAE-----SWYQTKYEELQVTAGRHGDDLRNTKQEIAEINRMIQRLRSEIDHVKKQCASLQAAI 392
Cdd:PRK12704  58 ALLEAKEEIHKLRNEFEKElrerrNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELI 137
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*
gi 194388850 393 ADAEQRGE----MALKDAKNKLegLEDALQKAKQDLARLLKEYQE 433
Cdd:PRK12704 138 EEQLQELErisgLTAEEAKEIL--LEKVEEEARHEAAVLIKEIEE 180
Spc7 smart00787
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ...
355-459 8.73e-03

Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.


Pssm-ID: 197874 [Multi-domain]  Cd Length: 312  Bit Score: 38.46  E-value: 8.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   355 DDLRNTKQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADAEQRGEMALKDAKNKLEGLEDALQKAKQDLARLLKEYQEL 434
Cdd:smart00787 158 EDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQEL 237
                           90       100       110
                   ....*....|....*....|....*....|..
gi 194388850   435 -------MNVKLALDVEIATYRKLLegEECRL 459
Cdd:smart00787 238 eskiedlTNKKSELNTEIAEAEKKL--EQCRG 267
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
206-434 9.25e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 38.90  E-value: 9.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   206 RQLDSIVGERGRLDSELRNMQDLVEDLKNKYEDEVNKRTAAENEFVTLKKDVDAAYMNKVELQAKADTLTDEI------- 278
Cdd:TIGR02169  716 RKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLshsripe 795
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   279 --NFLRALyDAELSQMQTHISDTSVVLSMDNNRNLDLDSIIAEVKAQYEEIAQR--SRAEAESWYQTKYEELQvtagrhg 354
Cdd:TIGR02169  796 iqAELSKL-EEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQikSIEKEIENLNGKKEELE------- 867
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 194388850   355 DDLRNTKQEIAEINRMIQRLRSEIDHVKKQCASLQAAIADAeqrgEMALKDAKNKLEGLEDALQKAKQDLARLLKEYQEL 434
Cdd:TIGR02169  868 EELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEEL----EAQIEKKRKRLSELKAKLEALEEELSEIEDPKGED 943
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH