|
Name |
Accession |
Description |
Interval |
E-value |
| Arm_APC_u3 |
pfam16629 |
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ... |
606-893 |
1.70e-166 |
|
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known. :
Pssm-ID: 435476 Cd Length: 293 Bit Score: 514.52 E-value: 1.70e-166
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 606 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRSKQRHKQSLYGDYVFDTNRHDDN-- 683
Cdd:pfam16629 1 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRNKQRHKQNVYSEYVLDSGRHDDSvc 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 684 RSDNFNTGNMTVLSPYLNTTVLPSSSSSRG--SLDSSRSEKDRSLERERGIGLGNYHPATENPGTSSKR-GLQISTTAAQ 760
Cdd:pfam16629 81 RSDNFNTGNVTVLSPYLNTTVLPSSSSRDSrgNAESSRSEKDRSLDRERGAGLSNFHPATENSGNSSKRiGMQISTTAAQ 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 761 IAKVMEEVSAIHTSQEDRSSGSTTELHCVTDERNALRRSSAAHTHSNTYNFTKSENSNRTCSMPYAKLEYKRSSNDSLNS 840
Cdd:pfam16629 161 IAKVMEEVSSMHISQEDRSSGSTSDMHCMQDDRNSIRRSSTAHPHSNVYSFNKSESSNRPCPMPYMKMEYKRASNDSLNS 240
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|...
gi 1238280178 841 VSSSDGYGKRGQMKPSIESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDG 893
Cdd:pfam16629 241 VSSSDGYGKRGQMKPSVESYSEDDEGKFCSYGKYPADLAHKIHSANHMDDNDG 293
|
|
| APC_basic |
pfam05956 |
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ... |
2097-2442 |
8.45e-109 |
|
APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules. :
Pssm-ID: 428690 [Multi-domain] Cd Length: 336 Bit Score: 351.10 E-value: 8.45e-109
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2097 SISRGRTMIHIPGVRNSSSSTSPVSKKGPPLKTPASKSPSEGQTATTS-PRGAKPSVKSELSPVARQTSQIGGSSKAPSR 2175
Cdd:pfam05956 1 VVFRGRTVIYMPGVKESQPSTSPPPKKTPPKTDAPAKNPNLGQQRSRSlHRLGKPSELADLSPPKRSATPPARISKAPSS 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2176 SGSRDSTPSRPAQQPLSRPIQSPGRNSISPGRNgisppnKLSQLPRTSSPSTASTKSSGSgKMSYTSPGRQMSQQNLTKQ 2255
Cdd:pfam05956 81 GSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRN------KLSPLPKTKSPARASTKKSGS-HKTQKSPVRIPFMQTPTKQ 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2256 TGLSKNASSI------PRSESASKGLNqmnngNGANKKVELSRMSSTKSSGSESDRSerpVLVRQSTFIKEAPSPTLRRK 2329
Cdd:pfam05956 154 TGLPRNPSPLvtnqpePRSESASKGLR-----SLPGKRLDLVRMSSARSSGSESDRS---GFLRQLTFIKESPSLLLRRR 225
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2330 LEESASfESLSPSSRPASPTRSQaqtpvlsPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEYNDGRPAKRHDIARSHSES 2409
Cdd:pfam05956 226 LELSAS-ESLSPSSQPASPRRSR-------PGLPAVFLCSSRCQELKGWRKQPPNPNSRAEPSDRPLTRRRPPRRTSSES 297
|
330 340 350
....*....|....*....|....*....|...
gi 1238280178 2410 PSRLPInRSGTWKREHSKHSSSLPRVSTWRRTG 2442
Cdd:pfam05956 298 PSRLPV-RNGTWKRETFKRYSSLPHINVWRRTG 329
|
|
| EB1_binding |
pfam05937 |
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the ... |
2544-2717 |
4.50e-89 |
|
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the microtubule-associating protein EB-1. At the C-terminus of the alignment is also a pfam00595 binding domain. A short motif in the middle of the region appears to be found in the APC2 proteins. :
Pssm-ID: 399141 Cd Length: 174 Bit Score: 287.66 E-value: 4.50e-89
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2544 RSGRSPTGNTPPVIDSVSEKANPNIKDSKDNQAKQNVGNGSVPMRTVGLENRLNSFIQVDAPDQKGTEIKPGQNNPVPVS 2623
Cdd:pfam05937 1 RSGRSPTGNTPPVIDSVPEKGIKDEKDSKDPQAKQNMGNGNVPVRTVGLENRLNSFIQSDSPDKKGTETKPLQNNPVPTP 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2624 ETNESSIVERTPFSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVNNNTKKRDSKTDSTESSGTQ 2703
Cdd:pfam05937 81 ETNENPVSERTPFSSSSSSKHSSPSGAVAARVTPFNYNPSPRKSSADSSSARPSQIPTPVNNSTKKRDSKTESTDSSGNQ 160
|
170
....*....|....
gi 1238280178 2704 SPKRHSGSYLVTSV 2717
Cdd:pfam05937 161 SPKRHSGSYLVTSV 174
|
|
| APC_u5 |
pfam16630 |
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of ... |
910-1009 |
2.30e-56 |
|
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of natively unstructured sequence lying between the first and the second 15-residue beta-catenin binding motifs, APC_15aa, pfam05972, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known. :
Pssm-ID: 406923 Cd Length: 100 Bit Score: 190.89 E-value: 2.30e-56
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 910 LNSGRQSPSQNERWARPKHIIEDEIKQSEQRQSRNQSTTYPVYTESTDDKHLKFQPHFGQQECVSPYRSRGANGSETNRV 989
Cdd:pfam16630 1 LNSGRQSPSQNERWARPKHIIEDEMKQSEQRQPRSQSTTYPVYTESGDDKHMKFQPRFGQQECVSPFRSRGSNGSEQSRV 80
|
90 100
....*....|....*....|
gi 1238280178 990 GSNHGINQNVSQSLCQEDDY 1009
Cdd:pfam16630 81 GSSHGINQKVSQSLCQVDDY 100
|
|
| APC_u14 |
pfam16635 |
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively ... |
1620-1713 |
1.53e-46 |
|
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively unstructured sequence lying between the second SAMP pfam05924, and the fifth creatine-rich region, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known. :
Pssm-ID: 435479 Cd Length: 94 Bit Score: 162.70 E-value: 1.53e-46
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 1620 IMDQVQQASASSSAPNKNQLDGKKKKPTSPVKPIPQNTEYRTRVRKNADSKNNLNAERVFSDNKDSKKQNLKNNSKVFND 1699
Cdd:pfam16635 1 IMDQIQQASAASSGGSKSQQDGEKKKPTSPVKPMPQSSEYRARVRKNTESKNNLNSERSYPDNKESKKQNLKNNSRDFND 80
|
90
....*....|....
gi 1238280178 1700 KLPNNEDRVRGSFA 1713
Cdd:pfam16635 81 KLPNNEERTRGSFA 94
|
|
| APC_u9 |
pfam16633 |
Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of ... |
1157-1242 |
2.01e-35 |
|
Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of natively unstructured sequence lying between the first and second APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known. :
Pssm-ID: 435478 Cd Length: 89 Bit Score: 130.76 E-value: 2.01e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 1157 AEDEI-GCNQTTQEADSANTLQIAEIKEKIGTRSAEDPVSEVPAVSQHPRTKSSRLQGSSLS-SESARHKAVEFSSGAKS 1234
Cdd:pfam16633 2 AEDEIeGRDQATRSTDNYNTLQITELKENSGAVSTEQTVSEVPSSSQHIRTKPNRLQASNLSpSDSSRHKAVEFSSGAKS 81
|
....*...
gi 1238280178 1235 PSKSGAQT 1242
Cdd:pfam16633 82 PSKSGAQT 89
|
|
| APC_u15 |
pfam16636 |
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of ... |
1747-1821 |
5.55e-31 |
|
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of natively unstructured sequence lying between the fifth and sixth creatine-rich, APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known. :
Pssm-ID: 435480 Cd Length: 81 Bit Score: 117.65 E-value: 5.55e-31
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1238280178 1747 DLSREKAELRKAKENKESEAKVTSHTELTSNQQSANKTQAIAKQPINRGQPKPILQKQSTFPQSSKDIPDRGAAT 1821
Cdd:pfam16636 7 DLSREKAELRKGKETKETETKVTSHIEQPSNQQSTNRTQACQKHPPNRGQPKPLLQKQTTFPQSSKDIPDRGAAT 81
|
|
| APC_rep |
pfam18797 |
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ... |
282-340 |
9.57e-31 |
|
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region. :
Pssm-ID: 465870 Cd Length: 74 Bit Score: 116.88 E-value: 9.57e-31
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*....
gi 1238280178 282 HLGTKIRAYCETCWEWQEAHEPGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAM 340
Cdd:pfam18797 16 HLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
|
|
| APC_u13 |
pfam16634 |
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively ... |
1536-1589 |
2.03e-24 |
|
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively unstructured sequence lying between the fourth creatine-rich region, APC_crr, pfam05923, and the SAMP pfam05924, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known. :
Pssm-ID: 406927 Cd Length: 54 Bit Score: 97.94 E-value: 2.03e-24
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 1238280178 1536 IESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQGGKTSSVTIPELDD 1589
Cdd:pfam16634 1 IESPPNELANAESTGTGAESAEFEKRDTIPTEGRSTDDAQRGKKSNITTSALDD 54
|
|
| Suppressor_APC |
pfam11414 |
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ... |
102-182 |
2.73e-22 |
|
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils. :
Pssm-ID: 463275 Cd Length: 82 Bit Score: 93.09 E-value: 2.73e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 102 SRESTGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLPLTE-NFSLQTDMTRRQLEYEARQIRVAMEEQLG 180
Cdd:pfam11414 1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGtYFDYGSDAQQERLEFLLARIQEVNRCLGG 80
|
..
gi 1238280178 181 TC 182
Cdd:pfam11414 81 LI 82
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1511-1534 |
1.36e-07 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin. :
Pssm-ID: 461781 Cd Length: 24 Bit Score: 49.30 E-value: 1.36e-07
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
523-563 |
4.40e-07 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin. :
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 48.19 E-value: 4.40e-07
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1238280178 523 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 563
Cdd:smart00185 1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1907-1926 |
3.38e-05 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin. :
Pssm-ID: 461782 Cd Length: 22 Bit Score: 42.58 E-value: 3.38e-05
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
386-427 |
5.02e-04 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin. :
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 39.72 E-value: 5.02e-04
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 1238280178 386 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLSW 427
Cdd:smart00185 1 DDENKQAVVDA-GGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
565-605 |
6.69e-04 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats. :
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 39.36 E-value: 6.69e-04
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1238280178 565 NPKDQEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMA 605
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1590-1611 |
1.28e-03 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin. :
Pssm-ID: 461782 Cd Length: 22 Bit Score: 38.34 E-value: 1.28e-03
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1245-1267 |
3.35e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin. :
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.97 E-value: 3.35e-03
|
| PTZ00449 super family |
cl33186 |
104 kDa microneme/rhoptry antigen; Provisional |
1165-1575 |
4.56e-03 |
|
104 kDa microneme/rhoptry antigen; Provisional The actual alignment was detected with superfamily member PTZ00449:
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 42.75 E-value: 4.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 1165 QTTQEAD---SANTLQIAEIKEKIGTRSAEDPvsEVPAVSQHPRTKSSRLQGSSLSSESARHKAVEFSSGAKSPSKSGAQ 1241
Cdd:PTZ00449 480 QFTQEIKkliKKSKKKLAPIEEEDSDKHDEPP--EGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEV 557
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 1242 TPKSPP--EHYVQETPLMFSRCT----SVSSLDSFESRSIASSVQ-SEPCSGMVSGIISPSDLPDSPGQTMPPSRSKTPP 1314
Cdd:PTZ00449 558 GKKPGPakEHKPSKIPTLSKKPEfpkdPKHPKDPEEPKKPKRPRSaQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPP 637
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 1315 PPPQTAQTKR-EVPK----NKAPTAEKRESGPK------QAAVNAAVQRVQvlpdadtllhfaTESTPDGFSCSSSLSAL 1383
Cdd:PTZ00449 638 PPQRPSSPERpEGPKiiksPKPPKSPKPPFDPKfkekfyDDYLDAAAKSKE------------TKTTVVLDESFESILKE 705
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 1384 SLDEPFIQKDVELRIMPPVQENDNG------NETESEQPKESNENQEKEAEKTIDSEKDLLDDSDDDDIEILEECIISAM 1457
Cdd:PTZ00449 706 TLPETPGTPFTTPRPLPPKLPRDEEfpfepiGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAE 785
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 1458 PTKSSRKAKKPAQTASKLPPPVARKPSqlpvyklLPSQNRLQPQKHVSFTPGDDMPRVYCVE--GTPINFSTATSLSDL- 1534
Cdd:PTZ00449 786 TGEPDEAMKRPDSPSEHEDKPPGDHPS-------LPKKRHRLDGLALSTTDLESDAGRIAKDasGKIVKLKRSKSFDDLt 858
|
410 420 430 440
....*....|....*....|....*....|....*....|.
gi 1238280178 1535 TIESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQ 1575
Cdd:PTZ00449 859 TVEEAEEMGAEARKIVVDDDGTEADDEDTHPPEEKHKSEVR 899
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1131-1148 |
6.34e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin. :
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.20 E-value: 6.34e-03
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Arm_APC_u3 |
pfam16629 |
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ... |
606-893 |
1.70e-166 |
|
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435476 Cd Length: 293 Bit Score: 514.52 E-value: 1.70e-166
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 606 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRSKQRHKQSLYGDYVFDTNRHDDN-- 683
Cdd:pfam16629 1 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRNKQRHKQNVYSEYVLDSGRHDDSvc 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 684 RSDNFNTGNMTVLSPYLNTTVLPSSSSSRG--SLDSSRSEKDRSLERERGIGLGNYHPATENPGTSSKR-GLQISTTAAQ 760
Cdd:pfam16629 81 RSDNFNTGNVTVLSPYLNTTVLPSSSSRDSrgNAESSRSEKDRSLDRERGAGLSNFHPATENSGNSSKRiGMQISTTAAQ 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 761 IAKVMEEVSAIHTSQEDRSSGSTTELHCVTDERNALRRSSAAHTHSNTYNFTKSENSNRTCSMPYAKLEYKRSSNDSLNS 840
Cdd:pfam16629 161 IAKVMEEVSSMHISQEDRSSGSTSDMHCMQDDRNSIRRSSTAHPHSNVYSFNKSESSNRPCPMPYMKMEYKRASNDSLNS 240
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|...
gi 1238280178 841 VSSSDGYGKRGQMKPSIESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDG 893
Cdd:pfam16629 241 VSSSDGYGKRGQMKPSVESYSEDDEGKFCSYGKYPADLAHKIHSANHMDDNDG 293
|
|
| APC_basic |
pfam05956 |
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ... |
2097-2442 |
8.45e-109 |
|
APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.
Pssm-ID: 428690 [Multi-domain] Cd Length: 336 Bit Score: 351.10 E-value: 8.45e-109
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2097 SISRGRTMIHIPGVRNSSSSTSPVSKKGPPLKTPASKSPSEGQTATTS-PRGAKPSVKSELSPVARQTSQIGGSSKAPSR 2175
Cdd:pfam05956 1 VVFRGRTVIYMPGVKESQPSTSPPPKKTPPKTDAPAKNPNLGQQRSRSlHRLGKPSELADLSPPKRSATPPARISKAPSS 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2176 SGSRDSTPSRPAQQPLSRPIQSPGRNSISPGRNgisppnKLSQLPRTSSPSTASTKSSGSgKMSYTSPGRQMSQQNLTKQ 2255
Cdd:pfam05956 81 GSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRN------KLSPLPKTKSPARASTKKSGS-HKTQKSPVRIPFMQTPTKQ 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2256 TGLSKNASSI------PRSESASKGLNqmnngNGANKKVELSRMSSTKSSGSESDRSerpVLVRQSTFIKEAPSPTLRRK 2329
Cdd:pfam05956 154 TGLPRNPSPLvtnqpePRSESASKGLR-----SLPGKRLDLVRMSSARSSGSESDRS---GFLRQLTFIKESPSLLLRRR 225
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2330 LEESASfESLSPSSRPASPTRSQaqtpvlsPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEYNDGRPAKRHDIARSHSES 2409
Cdd:pfam05956 226 LELSAS-ESLSPSSQPASPRRSR-------PGLPAVFLCSSRCQELKGWRKQPPNPNSRAEPSDRPLTRRRPPRRTSSES 297
|
330 340 350
....*....|....*....|....*....|...
gi 1238280178 2410 PSRLPInRSGTWKREHSKHSSSLPRVSTWRRTG 2442
Cdd:pfam05956 298 PSRLPV-RNGTWKRETFKRYSSLPHINVWRRTG 329
|
|
| EB1_binding |
pfam05937 |
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the ... |
2544-2717 |
4.50e-89 |
|
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the microtubule-associating protein EB-1. At the C-terminus of the alignment is also a pfam00595 binding domain. A short motif in the middle of the region appears to be found in the APC2 proteins.
Pssm-ID: 399141 Cd Length: 174 Bit Score: 287.66 E-value: 4.50e-89
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2544 RSGRSPTGNTPPVIDSVSEKANPNIKDSKDNQAKQNVGNGSVPMRTVGLENRLNSFIQVDAPDQKGTEIKPGQNNPVPVS 2623
Cdd:pfam05937 1 RSGRSPTGNTPPVIDSVPEKGIKDEKDSKDPQAKQNMGNGNVPVRTVGLENRLNSFIQSDSPDKKGTETKPLQNNPVPTP 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2624 ETNESSIVERTPFSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVNNNTKKRDSKTDSTESSGTQ 2703
Cdd:pfam05937 81 ETNENPVSERTPFSSSSSSKHSSPSGAVAARVTPFNYNPSPRKSSADSSSARPSQIPTPVNNSTKKRDSKTESTDSSGNQ 160
|
170
....*....|....
gi 1238280178 2704 SPKRHSGSYLVTSV 2717
Cdd:pfam05937 161 SPKRHSGSYLVTSV 174
|
|
| APC_u5 |
pfam16630 |
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of ... |
910-1009 |
2.30e-56 |
|
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of natively unstructured sequence lying between the first and the second 15-residue beta-catenin binding motifs, APC_15aa, pfam05972, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 406923 Cd Length: 100 Bit Score: 190.89 E-value: 2.30e-56
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 910 LNSGRQSPSQNERWARPKHIIEDEIKQSEQRQSRNQSTTYPVYTESTDDKHLKFQPHFGQQECVSPYRSRGANGSETNRV 989
Cdd:pfam16630 1 LNSGRQSPSQNERWARPKHIIEDEMKQSEQRQPRSQSTTYPVYTESGDDKHMKFQPRFGQQECVSPFRSRGSNGSEQSRV 80
|
90 100
....*....|....*....|
gi 1238280178 990 GSNHGINQNVSQSLCQEDDY 1009
Cdd:pfam16630 81 GSSHGINQKVSQSLCQVDDY 100
|
|
| APC_u14 |
pfam16635 |
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively ... |
1620-1713 |
1.53e-46 |
|
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively unstructured sequence lying between the second SAMP pfam05924, and the fifth creatine-rich region, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435479 Cd Length: 94 Bit Score: 162.70 E-value: 1.53e-46
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 1620 IMDQVQQASASSSAPNKNQLDGKKKKPTSPVKPIPQNTEYRTRVRKNADSKNNLNAERVFSDNKDSKKQNLKNNSKVFND 1699
Cdd:pfam16635 1 IMDQIQQASAASSGGSKSQQDGEKKKPTSPVKPMPQSSEYRARVRKNTESKNNLNSERSYPDNKESKKQNLKNNSRDFND 80
|
90
....*....|....
gi 1238280178 1700 KLPNNEDRVRGSFA 1713
Cdd:pfam16635 81 KLPNNEERTRGSFA 94
|
|
| APC_u9 |
pfam16633 |
Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of ... |
1157-1242 |
2.01e-35 |
|
Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of natively unstructured sequence lying between the first and second APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435478 Cd Length: 89 Bit Score: 130.76 E-value: 2.01e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 1157 AEDEI-GCNQTTQEADSANTLQIAEIKEKIGTRSAEDPVSEVPAVSQHPRTKSSRLQGSSLS-SESARHKAVEFSSGAKS 1234
Cdd:pfam16633 2 AEDEIeGRDQATRSTDNYNTLQITELKENSGAVSTEQTVSEVPSSSQHIRTKPNRLQASNLSpSDSSRHKAVEFSSGAKS 81
|
....*...
gi 1238280178 1235 PSKSGAQT 1242
Cdd:pfam16633 82 PSKSGAQT 89
|
|
| APC_u15 |
pfam16636 |
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of ... |
1747-1821 |
5.55e-31 |
|
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of natively unstructured sequence lying between the fifth and sixth creatine-rich, APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435480 Cd Length: 81 Bit Score: 117.65 E-value: 5.55e-31
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1238280178 1747 DLSREKAELRKAKENKESEAKVTSHTELTSNQQSANKTQAIAKQPINRGQPKPILQKQSTFPQSSKDIPDRGAAT 1821
Cdd:pfam16636 7 DLSREKAELRKGKETKETETKVTSHIEQPSNQQSTNRTQACQKHPPNRGQPKPLLQKQTTFPQSSKDIPDRGAAT 81
|
|
| APC_rep |
pfam18797 |
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ... |
282-340 |
9.57e-31 |
|
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.
Pssm-ID: 465870 Cd Length: 74 Bit Score: 116.88 E-value: 9.57e-31
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*....
gi 1238280178 282 HLGTKIRAYCETCWEWQEAHEPGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAM 340
Cdd:pfam18797 16 HLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
|
|
| APC_u13 |
pfam16634 |
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively ... |
1536-1589 |
2.03e-24 |
|
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively unstructured sequence lying between the fourth creatine-rich region, APC_crr, pfam05923, and the SAMP pfam05924, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 406927 Cd Length: 54 Bit Score: 97.94 E-value: 2.03e-24
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 1238280178 1536 IESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQGGKTSSVTIPELDD 1589
Cdd:pfam16634 1 IESPPNELANAESTGTGAESAEFEKRDTIPTEGRSTDDAQRGKKSNITTSALDD 54
|
|
| Suppressor_APC |
pfam11414 |
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ... |
102-182 |
2.73e-22 |
|
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.
Pssm-ID: 463275 Cd Length: 82 Bit Score: 93.09 E-value: 2.73e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 102 SRESTGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLPLTE-NFSLQTDMTRRQLEYEARQIRVAMEEQLG 180
Cdd:pfam11414 1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGtYFDYGSDAQQERLEFLLARIQEVNRCLGG 80
|
..
gi 1238280178 181 TC 182
Cdd:pfam11414 81 LI 82
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1511-1534 |
1.36e-07 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 49.30 E-value: 1.36e-07
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
523-563 |
4.40e-07 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 48.19 E-value: 4.40e-07
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1238280178 523 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 563
Cdd:smart00185 1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
523-563 |
5.98e-07 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 47.83 E-value: 5.98e-07
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1238280178 523 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 563
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2125-2438 |
1.32e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.56 E-value: 1.32e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2125 PPLKTPASKsPSEGQTATTSPRGAKPSVKSE----LSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGR 2200
Cdd:PHA03247 2703 PPPPTPEPA-PHALVSATPLPPGPAAARQASpalpAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR 2781
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2201 NSISPGRNGISPPNKLSQLPRTSSPSTASTKSSGSGKMSYTSPGrqmsqqnltkqTGLSKNASSIPRSESASKGLNQ--M 2278
Cdd:PHA03247 2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPpsL 2850
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2279 NNGNGANKKVELSRMSSTKSSGSESDRSERPVLVRqstfikeAPSPTLRRKLEESA-SFESLSPSSRPASPTRSQAQTPV 2357
Cdd:PHA03247 2851 PLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRR-------LARPAVSRSTESFAlPPDQPERPPQPQAPPPPQPQPQP 2923
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2358 LSPSLPDMSLSTHSSVQAggwrKLPPNLSPTieyNDGRPAKRHDIARSHSESPSRLPINRSGTWK----RE------HSK 2427
Cdd:PHA03247 2924 PPPPQPQPPPPPPPRPQP----PLAPTTDPA---GAGEPSGAVPQPWLGALVPGRVAVPRFRVPQpapsREapasstPPL 2996
|
330
....*....|.
gi 1238280178 2428 HSSSLPRVSTW 2438
Cdd:PHA03247 2997 TGHSLSRVSSW 3007
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1907-1926 |
3.38e-05 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.
Pssm-ID: 461782 Cd Length: 22 Bit Score: 42.58 E-value: 3.38e-05
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
109-241 |
1.81e-04 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 47.07 E-value: 1.81e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 109 LEELEKERSLLLADLDKEEKEKDWY--YAQLQNLTKRIDSLP---------LTENFSLQTDM---------TRRQLEYEA 168
Cdd:COG4717 104 LEELEAELEELREELEKLEKLLQLLplYQELEALEAELAELPerleeleerLEELRELEEELeeleaelaeLQEELEELL 183
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1238280178 169 RQIRVAMEEQLgtcQDMEKRAQRRIARIQQIEKDILRIRQLLQsQATEAERSSQNKHETGsHDAERQNEGQGV 241
Cdd:COG4717 184 EQLSLATEEEL---QDLAEELEELQQRLAELEEELEEAQEELE-ELEEELEQLENELEAA-ALEERLKEARLL 251
|
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
386-427 |
5.02e-04 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 39.72 E-value: 5.02e-04
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 1238280178 386 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLSW 427
Cdd:smart00185 1 DDENKQAVVDA-GGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
386-426 |
5.77e-04 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 39.36 E-value: 5.77e-04
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1238280178 386 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLS 426
Cdd:pfam00514 1 SPENKQAVIEA-GAVPPLVRLLSSPDEEVQEEAAWALSNLA 40
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
565-605 |
6.69e-04 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 39.36 E-value: 6.69e-04
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1238280178 565 NPKDQEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMA 605
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1590-1611 |
1.28e-03 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.
Pssm-ID: 461782 Cd Length: 22 Bit Score: 38.34 E-value: 1.28e-03
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
95-235 |
1.95e-03 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 43.89 E-value: 1.95e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 95 RRGFVNGSRESTGY--------LEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLpltenfslqtdmtrRQLEY 166
Cdd:TIGR02168 657 PGGVITGGSAKTNSsilerrreIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQL--------------RKELE 722
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1238280178 167 EARQIRVAMEEQLGTcqdMEKRAQRRIARIQQIEKDILRIRQLLQSQATEAERSSQNKHETGSHDAERQ 235
Cdd:TIGR02168 723 ELSRQISALRKDLAR---LEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELE 788
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1245-1267 |
3.35e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.97 E-value: 3.35e-03
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
1165-1575 |
4.56e-03 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 42.75 E-value: 4.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 1165 QTTQEAD---SANTLQIAEIKEKIGTRSAEDPvsEVPAVSQHPRTKSSRLQGSSLSSESARHKAVEFSSGAKSPSKSGAQ 1241
Cdd:PTZ00449 480 QFTQEIKkliKKSKKKLAPIEEEDSDKHDEPP--EGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEV 557
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 1242 TPKSPP--EHYVQETPLMFSRCT----SVSSLDSFESRSIASSVQ-SEPCSGMVSGIISPSDLPDSPGQTMPPSRSKTPP 1314
Cdd:PTZ00449 558 GKKPGPakEHKPSKIPTLSKKPEfpkdPKHPKDPEEPKKPKRPRSaQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPP 637
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 1315 PPPQTAQTKR-EVPK----NKAPTAEKRESGPK------QAAVNAAVQRVQvlpdadtllhfaTESTPDGFSCSSSLSAL 1383
Cdd:PTZ00449 638 PPQRPSSPERpEGPKiiksPKPPKSPKPPFDPKfkekfyDDYLDAAAKSKE------------TKTTVVLDESFESILKE 705
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 1384 SLDEPFIQKDVELRIMPPVQENDNG------NETESEQPKESNENQEKEAEKTIDSEKDLLDDSDDDDIEILEECIISAM 1457
Cdd:PTZ00449 706 TLPETPGTPFTTPRPLPPKLPRDEEfpfepiGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAE 785
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 1458 PTKSSRKAKKPAQTASKLPPPVARKPSqlpvyklLPSQNRLQPQKHVSFTPGDDMPRVYCVE--GTPINFSTATSLSDL- 1534
Cdd:PTZ00449 786 TGEPDEAMKRPDSPSEHEDKPPGDHPS-------LPKKRHRLDGLALSTTDLESDAGRIAKDasGKIVKLKRSKSFDDLt 858
|
410 420 430 440
....*....|....*....|....*....|....*....|.
gi 1238280178 1535 TIESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQ 1575
Cdd:PTZ00449 859 TVEEAEEMGAEARKIVVDDDGTEADDEDTHPPEEKHKSEVR 899
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1131-1148 |
6.34e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.20 E-value: 6.34e-03
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Arm_APC_u3 |
pfam16629 |
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately ... |
606-893 |
1.70e-166 |
|
Armadillo-associated region on APC; Arm_APC_u3 is a semi-unstructured region lying immediately downstream of the armadillo fold before the beta-catenin binding motifs, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435476 Cd Length: 293 Bit Score: 514.52 E-value: 1.70e-166
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 606 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRSKQRHKQSLYGDYVFDTNRHDDN-- 683
Cdd:pfam16629 1 NRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPKASHRNKQRHKQNVYSEYVLDSGRHDDSvc 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 684 RSDNFNTGNMTVLSPYLNTTVLPSSSSSRG--SLDSSRSEKDRSLERERGIGLGNYHPATENPGTSSKR-GLQISTTAAQ 760
Cdd:pfam16629 81 RSDNFNTGNVTVLSPYLNTTVLPSSSSRDSrgNAESSRSEKDRSLDRERGAGLSNFHPATENSGNSSKRiGMQISTTAAQ 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 761 IAKVMEEVSAIHTSQEDRSSGSTTELHCVTDERNALRRSSAAHTHSNTYNFTKSENSNRTCSMPYAKLEYKRSSNDSLNS 840
Cdd:pfam16629 161 IAKVMEEVSSMHISQEDRSSGSTSDMHCMQDDRNSIRRSSTAHPHSNVYSFNKSESSNRPCPMPYMKMEYKRASNDSLNS 240
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|...
gi 1238280178 841 VSSSDGYGKRGQMKPSIESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDG 893
Cdd:pfam16629 241 VSSSDGYGKRGQMKPSVESYSEDDEGKFCSYGKYPADLAHKIHSANHMDDNDG 293
|
|
| APC_basic |
pfam05956 |
APC basic domain; This region of the APC family of proteins is known as the basic domain. It ... |
2097-2442 |
8.45e-109 |
|
APC basic domain; This region of the APC family of proteins is known as the basic domain. It contains a high proportion of positively charged amino acids and interacts with microtubules.
Pssm-ID: 428690 [Multi-domain] Cd Length: 336 Bit Score: 351.10 E-value: 8.45e-109
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2097 SISRGRTMIHIPGVRNSSSSTSPVSKKGPPLKTPASKSPSEGQTATTS-PRGAKPSVKSELSPVARQTSQIGGSSKAPSR 2175
Cdd:pfam05956 1 VVFRGRTVIYMPGVKESQPSTSPPPKKTPPKTDAPAKNPNLGQQRSRSlHRLGKPSELADLSPPKRSATPPARISKAPSS 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2176 SGSRDSTPSRPAQQPLSRPIQSPGRNSISPGRNgisppnKLSQLPRTSSPSTASTKSSGSgKMSYTSPGRQMSQQNLTKQ 2255
Cdd:pfam05956 81 GSSRDSTPSRPPQKKLTSPSQSPGRLPGSGGRN------KLSPLPKTKSPARASTKKSGS-HKTQKSPVRIPFMQTPTKQ 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2256 TGLSKNASSI------PRSESASKGLNqmnngNGANKKVELSRMSSTKSSGSESDRSerpVLVRQSTFIKEAPSPTLRRK 2329
Cdd:pfam05956 154 TGLPRNPSPLvtnqpePRSESASKGLR-----SLPGKRLDLVRMSSARSSGSESDRS---GFLRQLTFIKESPSLLLRRR 225
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2330 LEESASfESLSPSSRPASPTRSQaqtpvlsPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEYNDGRPAKRHDIARSHSES 2409
Cdd:pfam05956 226 LELSAS-ESLSPSSQPASPRRSR-------PGLPAVFLCSSRCQELKGWRKQPPNPNSRAEPSDRPLTRRRPPRRTSSES 297
|
330 340 350
....*....|....*....|....*....|...
gi 1238280178 2410 PSRLPInRSGTWKREHSKHSSSLPRVSTWRRTG 2442
Cdd:pfam05956 298 PSRLPV-RNGTWKRETFKRYSSLPHINVWRRTG 329
|
|
| EB1_binding |
pfam05937 |
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the ... |
2544-2717 |
4.50e-89 |
|
EB-1 Binding Domain; This region, found at the C-terminus of the APC proteins, binds the microtubule-associating protein EB-1. At the C-terminus of the alignment is also a pfam00595 binding domain. A short motif in the middle of the region appears to be found in the APC2 proteins.
Pssm-ID: 399141 Cd Length: 174 Bit Score: 287.66 E-value: 4.50e-89
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2544 RSGRSPTGNTPPVIDSVSEKANPNIKDSKDNQAKQNVGNGSVPMRTVGLENRLNSFIQVDAPDQKGTEIKPGQNNPVPVS 2623
Cdd:pfam05937 1 RSGRSPTGNTPPVIDSVPEKGIKDEKDSKDPQAKQNMGNGNVPVRTVGLENRLNSFIQSDSPDKKGTETKPLQNNPVPTP 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2624 ETNESSIVERTPFSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVNNNTKKRDSKTDSTESSGTQ 2703
Cdd:pfam05937 81 ETNENPVSERTPFSSSSSSKHSSPSGAVAARVTPFNYNPSPRKSSADSSSARPSQIPTPVNNSTKKRDSKTESTDSSGNQ 160
|
170
....*....|....
gi 1238280178 2704 SPKRHSGSYLVTSV 2717
Cdd:pfam05937 161 SPKRHSGSYLVTSV 174
|
|
| APC_u5 |
pfam16630 |
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of ... |
910-1009 |
2.30e-56 |
|
Unstructured region on APC between 1st and 2nd catenin-bdg motifs; APC_u5 is a short region of natively unstructured sequence lying between the first and the second 15-residue beta-catenin binding motifs, APC_15aa, pfam05972, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 406923 Cd Length: 100 Bit Score: 190.89 E-value: 2.30e-56
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 910 LNSGRQSPSQNERWARPKHIIEDEIKQSEQRQSRNQSTTYPVYTESTDDKHLKFQPHFGQQECVSPYRSRGANGSETNRV 989
Cdd:pfam16630 1 LNSGRQSPSQNERWARPKHIIEDEMKQSEQRQPRSQSTTYPVYTESGDDKHMKFQPRFGQQECVSPFRSRGSNGSEQSRV 80
|
90 100
....*....|....*....|
gi 1238280178 990 GSNHGINQNVSQSLCQEDDY 1009
Cdd:pfam16630 81 GSSHGINQKVSQSLCQVDDY 100
|
|
| APC_u14 |
pfam16635 |
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively ... |
1620-1713 |
1.53e-46 |
|
Unstructured region on APC between SAMP and APC_crr; APC_u14 is a short region of natively unstructured sequence lying between the second SAMP pfam05924, and the fifth creatine-rich region, APC_crr, pfam05923, on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435479 Cd Length: 94 Bit Score: 162.70 E-value: 1.53e-46
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 1620 IMDQVQQASASSSAPNKNQLDGKKKKPTSPVKPIPQNTEYRTRVRKNADSKNNLNAERVFSDNKDSKKQNLKNNSKVFND 1699
Cdd:pfam16635 1 IMDQIQQASAASSGGSKSQQDGEKKKPTSPVKPMPQSSEYRARVRKNTESKNNLNSERSYPDNKESKKQNLKNNSRDFND 80
|
90
....*....|....
gi 1238280178 1700 KLPNNEDRVRGSFA 1713
Cdd:pfam16635 81 KLPNNEERTRGSFA 94
|
|
| APC_u9 |
pfam16633 |
Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of ... |
1157-1242 |
2.01e-35 |
|
Unstructured region on APC between 1st two creatine-rich regions; APC_u9 is a short region of natively unstructured sequence lying between the first and second APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435478 Cd Length: 89 Bit Score: 130.76 E-value: 2.01e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 1157 AEDEI-GCNQTTQEADSANTLQIAEIKEKIGTRSAEDPVSEVPAVSQHPRTKSSRLQGSSLS-SESARHKAVEFSSGAKS 1234
Cdd:pfam16633 2 AEDEIeGRDQATRSTDNYNTLQITELKENSGAVSTEQTVSEVPSSSQHIRTKPNRLQASNLSpSDSSRHKAVEFSSGAKS 81
|
....*...
gi 1238280178 1235 PSKSGAQT 1242
Cdd:pfam16633 82 PSKSGAQT 89
|
|
| APC_u15 |
pfam16636 |
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of ... |
1747-1821 |
5.55e-31 |
|
Unstructured region on APC between APC_crr regions 5 and 6; APC_u15 is a short region of natively unstructured sequence lying between the fifth and sixth creatine-rich, APC_crr, pfam05923, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 435480 Cd Length: 81 Bit Score: 117.65 E-value: 5.55e-31
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1238280178 1747 DLSREKAELRKAKENKESEAKVTSHTELTSNQQSANKTQAIAKQPINRGQPKPILQKQSTFPQSSKDIPDRGAAT 1821
Cdd:pfam16636 7 DLSREKAELRKGKETKETETKVTSHIEQPSNQQSTNRTQACQKHPPNRGQPKPLLQKQTTFPQSSKDIPDRGAAT 81
|
|
| APC_rep |
pfam18797 |
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo ... |
282-340 |
9.57e-31 |
|
Adenomatous polyposis coli (APC) repeat; Adenomatous polyposis coli contains an armadillo repeat and uses its highly conserved surface groove to recognize the APC-binding region (ABR) of Asef. This entry represents a single repeat unit of the Armadillo region.
Pssm-ID: 465870 Cd Length: 74 Bit Score: 116.88 E-value: 9.57e-31
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*....
gi 1238280178 282 HLGTKIRAYCETCWEWQEAHEPGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAM 340
Cdd:pfam18797 16 HLLEQIRAYCETCWDWQESHSRGPEGDSNPMPSPIEHQICPAICALMKLSFDEEHRHAM 74
|
|
| APC_u13 |
pfam16634 |
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively ... |
1536-1589 |
2.03e-24 |
|
Unstructured region on APC between APC_crr and SAMP; APC_u13 is a short region of natively unstructured sequence lying between the fourth creatine-rich region, APC_crr, pfam05923, and the SAMP pfam05924, domains on APC or adenomatous polyposis coli proteins in higher eukaryotes. The function is not known.
Pssm-ID: 406927 Cd Length: 54 Bit Score: 97.94 E-value: 2.03e-24
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 1238280178 1536 IESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQGGKTSSVTIPELDD 1589
Cdd:pfam16634 1 IESPPNELANAESTGTGAESAEFEKRDTIPTEGRSTDDAQRGKKSNITTSALDD 54
|
|
| Suppressor_APC |
pfam11414 |
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has ... |
102-182 |
2.73e-22 |
|
Adenomatous polyposis coli tumour suppressor protein; The tumour suppressor protein, APC, has a nuclear export activity as well as many different intracellular functions. The structure consists of three alpha-helices forming two separate antiparallel coiled coils.
Pssm-ID: 463275 Cd Length: 82 Bit Score: 93.09 E-value: 2.73e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 102 SRESTGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLPLTE-NFSLQTDMTRRQLEYEARQIRVAMEEQLG 180
Cdd:pfam11414 1 DYNMLKRMKQLEQEKDVLLQGLEMVERARDWYQQQLQEVQERQKYLGANGtYFDYGSDAQQERLEFLLARIQEVNRCLGG 80
|
..
gi 1238280178 181 TC 182
Cdd:pfam11414 81 LI 82
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1511-1534 |
1.36e-07 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 49.30 E-value: 1.36e-07
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
523-563 |
4.40e-07 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 48.19 E-value: 4.40e-07
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1238280178 523 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 563
Cdd:smart00185 1 DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
523-563 |
5.98e-07 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 47.83 E-value: 5.98e-07
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1238280178 523 NEDHRQILRENNCLQTLLQHLKSHSLTIVSNACGTLWNLSA 563
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2125-2438 |
1.32e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.56 E-value: 1.32e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2125 PPLKTPASKsPSEGQTATTSPRGAKPSVKSE----LSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGR 2200
Cdd:PHA03247 2703 PPPPTPEPA-PHALVSATPLPPGPAAARQASpalpAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR 2781
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2201 NSISPGRNGISPPNKLSQLPRTSSPSTASTKSSGSGKMSYTSPGrqmsqqnltkqTGLSKNASSIPRSESASKGLNQ--M 2278
Cdd:PHA03247 2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPpsL 2850
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2279 NNGNGANKKVELSRMSSTKSSGSESDRSERPVLVRqstfikeAPSPTLRRKLEESA-SFESLSPSSRPASPTRSQAQTPV 2357
Cdd:PHA03247 2851 PLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRR-------LARPAVSRSTESFAlPPDQPERPPQPQAPPPPQPQPQP 2923
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2358 LSPSLPDMSLSTHSSVQAggwrKLPPNLSPTieyNDGRPAKRHDIARSHSESPSRLPINRSGTWK----RE------HSK 2427
Cdd:PHA03247 2924 PPPPQPQPPPPPPPRPQP----PLAPTTDPA---GAGEPSGAVPQPWLGALVPGRVAVPRFRVPQpapsREapasstPPL 2996
|
330
....*....|.
gi 1238280178 2428 HSSSLPRVSTW 2438
Cdd:PHA03247 2997 TGHSLSRVSSW 3007
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
2130-2275 |
7.53e-06 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 51.71 E-value: 7.53e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2130 PASKSPSEGQTATTSPRGAKPSVKSELSPVA----RQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGRNSISP 2205
Cdd:PHA03307 278 PSSRPGPASSSSSPRERSPSPSPSSPGSGPApsspRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPP 357
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2206 GRNGISPPNKLSQlPRTSSPSTASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSIPRSESASKGL 2275
Cdd:PHA03307 358 PPADPSSPRKRPR-PSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAF 426
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1907-1926 |
3.38e-05 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.
Pssm-ID: 461782 Cd Length: 22 Bit Score: 42.58 E-value: 3.38e-05
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
2123-2418 |
5.27e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 49.01 E-value: 5.27e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2123 KGPPLKTPASKSPSEGQTATTSPRGAKPSVKSELSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQP---LSRPIQSPG 2199
Cdd:PHA03307 101 AREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASsrqAALPLSSPE 180
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2200 RNSISPGRNGISPPNKLSQLPRTSSP----STASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSIPRSE----SA 2271
Cdd:PHA03307 181 ETARAPSSPPAEPPPSTPPAAASPRPprrsSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENEcplpRP 260
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2272 SKGLNQMNNGNGANKKVELSRMSSTKSSGSESDRSERPVLVRQStfiKEAPSPTLRRKLEESASFESLSPSSRPASPTRS 2351
Cdd:PHA03307 261 APITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPG---SGPAPSSPRASSSSSSSRESSSSSTSSSSESSR 337
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1238280178 2352 QAQTPVLSPSLPDMSLSTHSSVQAGGWRKLPPNLSPTIEY---NDGRP---AKRHDIARSH--SESPSRLPINRS 2418
Cdd:PHA03307 338 GAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSpaaSAGRPtrrRARAAVAGRArrRDATGRFPAGRP 412
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
109-241 |
1.81e-04 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 47.07 E-value: 1.81e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 109 LEELEKERSLLLADLDKEEKEKDWY--YAQLQNLTKRIDSLP---------LTENFSLQTDM---------TRRQLEYEA 168
Cdd:COG4717 104 LEELEAELEELREELEKLEKLLQLLplYQELEALEAELAELPerleeleerLEELRELEEELeeleaelaeLQEELEELL 183
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1238280178 169 RQIRVAMEEQLgtcQDMEKRAQRRIARIQQIEKDILRIRQLLQsQATEAERSSQNKHETGsHDAERQNEGQGV 241
Cdd:COG4717 184 EQLSLATEEEL---QDLAEELEELQQRLAELEEELEEAQEELE-ELEEELEQLENELEAA-ALEERLKEARLL 251
|
|
| ARM |
smart00185 |
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form ... |
386-427 |
5.02e-04 |
|
Armadillo/beta-catenin-like repeats; Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Pssm-ID: 214547 [Multi-domain] Cd Length: 41 Bit Score: 39.72 E-value: 5.02e-04
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 1238280178 386 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLSW 427
Cdd:smart00185 1 DDENKQAVVDA-GGLPALVELLKSEDEEVVKEAAWALSNLSS 41
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
386-426 |
5.77e-04 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 39.36 E-value: 5.77e-04
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1238280178 386 DVANKATLCSMkGCMRALVAQLKSESEDLQQVIASVLRNLS 426
Cdd:pfam00514 1 SPENKQAVIEA-GAVPPLVRLLSSPDEEVQEEAAWALSNLA 40
|
|
| Arm |
pfam00514 |
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form ... |
565-605 |
6.69e-04 |
|
Armadillo/beta-catenin-like repeat; Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Pssm-ID: 425727 [Multi-domain] Cd Length: 41 Bit Score: 39.36 E-value: 6.69e-04
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1238280178 565 NPKDQEALWDMGAVSMLKNLIHSKHKMIAMGSAAALRNLMA 605
Cdd:pfam00514 1 SPENKQAVIEAGAVPPLVRLLSSPDEEVQEEAAWALSNLAA 41
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
110-237 |
9.02e-04 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 44.93 E-value: 9.02e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 110 EELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLpltenfslqtDMTRRQLEYEARQIRVAMEEQLGTCQDMEKRA 189
Cdd:COG1196 221 ELKELEAELLLLKLRELEAELEELEAELEELEAELEEL----------EAELAELEAELEELRLELEELELELEEAQAEE 290
|
90 100 110 120
....*....|....*....|....*....|....*....|....*...
gi 1238280178 190 QRRIARIQQIEKDILRIRQLLQSQATEAERSSQNKHETGSHDAERQNE 237
Cdd:COG1196 291 YELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEE 338
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
2128-2388 |
9.25e-04 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 44.57 E-value: 9.25e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2128 KTPASKSPSEGQTATTSPRGAKPsvkselSPVARQTSQIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGRNSISPgr 2207
Cdd:pfam17823 151 RANASAAPRAAIAAASAPHAASP------APRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHP-- 222
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2208 ngiSPPNKLSQLPrTSSPStASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSiPRSESASKGLNQMNNGNGANKk 2287
Cdd:pfam17823 223 ---AAGTALAAVG-NSSPA-AGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGD-PHARRLSPAKHMPSDTMARNP- 295
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2288 veLSRMSSTKSSGSESDRSERPVLvrqSTFIKEAPSPTlRRKLEESASFESLSPSSRPASPTRSQAQTPVLSPsLPDMSL 2367
Cdd:pfam17823 296 --AAPMGAQAQGPIIQVSTDQPVH---NTAGEPTPSPS-NTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASP-VPVLHT 368
|
250 260
....*....|....*....|.
gi 1238280178 2368 STHSSVQAGGWRKLPPNLSPT 2388
Cdd:pfam17823 369 SMIPEVEATSPTTQPSPLLPT 389
|
|
| SAMP |
pfam05924 |
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1590-1611 |
1.28e-03 |
|
SAMP Motif; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). This motif binds axin.
Pssm-ID: 461782 Cd Length: 22 Bit Score: 38.34 E-value: 1.28e-03
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
95-235 |
1.95e-03 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 43.89 E-value: 1.95e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 95 RRGFVNGSRESTGY--------LEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLpltenfslqtdmtrRQLEY 166
Cdd:TIGR02168 657 PGGVITGGSAKTNSsilerrreIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQL--------------RKELE 722
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1238280178 167 EARQIRVAMEEQLGTcqdMEKRAQRRIARIQQIEKDILRIRQLLQSQATEAERSSQNKHETGSHDAERQ 235
Cdd:TIGR02168 723 ELSRQISALRKDLAR---LEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELE 788
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
109-234 |
2.05e-03 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 43.90 E-value: 2.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 109 LEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSL------------PLTENFSLQTDMTRRQLEYEARQIRVAME 176
Cdd:TIGR02169 232 KEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIeqlleelnkkikDLGEEEQLRVKEKIGELEAEIASLERSIA 311
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*....
gi 1238280178 177 EqlgtCQDMEKRAQRRIARIQ-QIEKDILRIRQLLQSQATEAERSSQNKHETGSHDAER 234
Cdd:TIGR02169 312 E----KERELEDAEERLAKLEaEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEEL 366
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1245-1267 |
3.35e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.97 E-value: 3.35e-03
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
1165-1575 |
4.56e-03 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 42.75 E-value: 4.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 1165 QTTQEAD---SANTLQIAEIKEKIGTRSAEDPvsEVPAVSQHPRTKSSRLQGSSLSSESARHKAVEFSSGAKSPSKSGAQ 1241
Cdd:PTZ00449 480 QFTQEIKkliKKSKKKLAPIEEEDSDKHDEPP--EGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEV 557
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 1242 TPKSPP--EHYVQETPLMFSRCT----SVSSLDSFESRSIASSVQ-SEPCSGMVSGIISPSDLPDSPGQTMPPSRSKTPP 1314
Cdd:PTZ00449 558 GKKPGPakEHKPSKIPTLSKKPEfpkdPKHPKDPEEPKKPKRPRSaQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPP 637
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 1315 PPPQTAQTKR-EVPK----NKAPTAEKRESGPK------QAAVNAAVQRVQvlpdadtllhfaTESTPDGFSCSSSLSAL 1383
Cdd:PTZ00449 638 PPQRPSSPERpEGPKiiksPKPPKSPKPPFDPKfkekfyDDYLDAAAKSKE------------TKTTVVLDESFESILKE 705
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 1384 SLDEPFIQKDVELRIMPPVQENDNG------NETESEQPKESNENQEKEAEKTIDSEKDLLDDSDDDDIEILEECIISAM 1457
Cdd:PTZ00449 706 TLPETPGTPFTTPRPLPPKLPRDEEfpfepiGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDIHAE 785
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 1458 PTKSSRKAKKPAQTASKLPPPVARKPSqlpvyklLPSQNRLQPQKHVSFTPGDDMPRVYCVE--GTPINFSTATSLSDL- 1534
Cdd:PTZ00449 786 TGEPDEAMKRPDSPSEHEDKPPGDHPS-------LPKKRHRLDGLALSTTDLESDAGRIAKDasGKIVKLKRSKSFDDLt 858
|
410 420 430 440
....*....|....*....|....*....|....*....|.
gi 1238280178 1535 TIESPPNELAAGEGVRGGAQSGEFEKRDTIPTEGRSTDEAQ 1575
Cdd:PTZ00449 859 TVEEAEEMGAEARKIVVDDDGTEADDEDTHPPEEKHKSEVR 899
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
109-223 |
4.89e-03 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 42.06 E-value: 4.89e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 109 LEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLpltenfslqtdmtrrqleyeARQIRvAMEEQLgtcQDMEKR 188
Cdd:COG4942 29 LEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAAL--------------------ARRIR-ALEQEL---AALEAE 84
|
90 100 110
....*....|....*....|....*....|....*
gi 1238280178 189 AQRRIARIQQIEKDILRIRQLLQSQATEAERSSQN 223
Cdd:COG4942 85 LAELEKEIAELRAELEAQKEELAELLRALYRLGRQ 119
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
2126-2442 |
5.05e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 42.47 E-value: 5.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2126 PLKTPASKSPSEGQTATTSPRGAKPSVKSELSPVARQTSQiGGSSKAPSRSGsrDSTPSRPAQQPLSRPIQSPGRNSISP 2205
Cdd:PHA03307 63 DRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAR-EGSPTPPGPSS--PDPPPPTPPPASPPPSPAPDLSEMLR 139
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2206 GRNGISPPNKLSQLPRTSSPS-TASTKSSGSGKM---------SYTSPGRQMSQQNLTKQTGLSKNASSIPRSESASKGL 2275
Cdd:PHA03307 140 PVGSPGPPPAASPPAAGASPAaVASDAASSRQAAlplsspeetARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASS 219
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2276 NQMNNGNGANKKVELSRMSSTKSSGSESD---RSERPVLVRQSTfikeaPSPTLRRKLEESASFESLSPSSRPASPTRSQ 2352
Cdd:PHA03307 220 PAPAPGRSAADDAGASSSDSSSSESSGCGwgpENECPLPRPAPI-----TLPTRIWEASGWNGPSSRPGPASSSSSPRER 294
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2353 AqtPVLSPSLPDMSLSTHSSVQAGGWRKLPP-NLSPTIEYNDGRPAKRHDIARSHSESPS----RLPINRSGTWKREHSK 2427
Cdd:PHA03307 295 S--PSPSPSSPGSGPAPSSPRASSSSSSSREsSSSSTSSSSESSRGAAVSPGPSPSRSPSpsrpPPPADPSSPRKRPRPS 372
|
330
....*....|....*
gi 1238280178 2428 HSSSLPRVSTWRRTG 2442
Cdd:PHA03307 373 RAPSSPAASAGRPTR 387
|
|
| APC_r |
pfam05923 |
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis ... |
1131-1148 |
6.34e-03 |
|
APC repeat; This short region is found repeated in the mid region of the adenomatous polyposis proteins (APCs). In the human protein many cancer-linked SNPs are found near the first three occurrences of the motif. These repeats bind beta-catenin.
Pssm-ID: 461781 Cd Length: 24 Bit Score: 36.20 E-value: 6.34e-03
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2124-2414 |
8.51e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.85 E-value: 8.51e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2124 GPPLKTPASKSPSEGQTATTSPRGAKPSvkselsPVARQTSqIGGSSKAPSRSGSRDSTPSRPAQQPLSRPIQSPGRNSI 2203
Cdd:PHA03247 2603 DDRGDPRGPAPPSPLPPDTHAPDPPPPS------PSPAANE-PDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQ 2675
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2204 SPgrngiSPPNKLSQ--LPRTSSPSTASTKSSGSGKMSYTSPGRQMSQQNLTKQTGLSKNASSIPRSESASKGLNQMNNG 2281
Cdd:PHA03247 2676 AS-----SPPQRPRRraARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPAT 2750
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1238280178 2282 NGANKKVE--------LSRMSSTKSSGSESDRSERPVLVRQSTFIKEAPSPT--LRRKLEESASFESLSPSSRPAS---- 2347
Cdd:PHA03247 2751 PGGPARPArppttagpPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWdpADPPAAVLAPAAALPPAASPAGplpp 2830
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1238280178 2348 PTRSQAQTPVLSPSLPDMSLSTHSSVQAGG--WRKLPPNLSPTIEYNDGRPAKRHDIARSHSESPSRLP 2414
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFA 2899
|
|
|