Entry: PS50818

General information about the entry
Entry name INTEIN_C_TER
Accession number PS50818
Entry type MATRIX
Date MAY-2002 (CREATED); MAY-2002 (DATA UPDATE); MAR-2013 (INFO UPDATE).
PROSITE Documentation PDOC00687
Name and characterization of the entry
DescriptionIntein C-terminal splicing motif profile.
Matrix / Profile
/GENERAL_SPEC: ALPHABET='ABCDEFGHIKLMNPQRSTVWYZ'; LENGTH=22;
/DISJOINT: DEFINITION=PROTECT; N1=3; N2=20;
/NORMALIZATION: MODE=1; FUNCTION=LINEAR; R1=0.8533; R2=0.02263959; TEXT='NScore';
/CUT_OFF: LEVEL=0; SCORE=290; N_SCORE=7.4; MODE=1; TEXT='!';
/CUT_OFF: LEVEL=-1; SCORE=249; N_SCORE=6.5; MODE=1; TEXT='?';
/DEFAULT: M0=-8; D=-20; I=-20; B0=-60; B1=-60; E0=-60; E1=-60; MI=-105; MD=-105; IM=-105; DM=-105;

                A   B   C   D   E   F   G   H   I   K   L   M   N   P   Q   R   S   T   V   W   Y   Z
/I:         B0=0; B1=0; BI=-105; BD=-105;
/M: SY='Y'; M=-15,-11,-29, -9, -6,  3,-23, -5,-14, -1,-13,-10,-10, -2, -9,  0,-12, -8,-15, -5, 12, -9;
/M: SY='V'; M=  1,-25,-13,-27,-26, -1,-24,-27, 20,-18,  7,  6,-24,-26,-25,-17, -8,  1, 34,-27, -9,-26;
/M: SY='Y'; M=-19,-17,-28,-18,-17, 29,-28,  9, -2,-11, -1, -1,-16,-27,-11, -9,-18,-10, -9, 17, 55,-17;
/M: SY='D'; M=-13, 27, -1, 33,  5,-29,-12, -6,-30, -8,-25,-23, 15,-17, -6,-12,  4, -3,-22,-41,-21, -1;
/M: SY='L'; M= -6,-25,-20,-27,-22,  3,-30,-24, 23,-25, 26, 13,-24,-25,-21,-21,-18, -4, 20,-23, -3,-23;
/M: SY='T'; M= -1,  1,-18, -1,  4,-16,-10,-10,-14, -7,-16,-13,  3,-12, -3, -9, 11, 12, -9,-28,-11,  0;
/M: SY='V'; M=  3,-18, -3,-21,-22, -8,-21,-25, 10,-18,  1,  0,-17,-24,-22,-19, -4,  4, 23,-30,-13,-22;
/I:         I=-5; MD=-27;
/M: SY='E'; M=-10,  0,-25,  7, 15,-17,-19, -7,-21,  2,-19,-15, -5, 11,  0, -6, -6, -9,-21,-21, -9,  6; D=-5;
/I:         I=-5; MD=-27;
/M: SY='N'; M= -2, 10,-17,  7,  0,-18,  5, -1,-18, -2,-18,-12, 13,-11, -3, -3,  2, -3,-16,-22,-13, -2; D=-5;
/I:         I=-5; MI=-27; MD=-27; IM=-27; DM=-27;
/M: SY='H'; M=-12,  0,-26,  1,  5,-22, -6, 42,-25, -8,-18, -6,  5,-16,  7, -4, -4,-13,-24,-27,  0,  4; D=-5;
/I:         I=-5; DM=-27;
/M: SY='N'; M= -5, 14,-19,  2, -5,-10, -8, -2,-13, -4,-15,-11, 27,-20, -5, -2,  3,  0,-17,-30,-13, -5;
/M: SY='F'; M=-16,-25,-24,-30,-25, 40,-25, -8,  5,-22,  9,  3,-20,-29,-25,-17,-20,-10,  0,  8, 37,-25;
/I:         I=-6; MD=-32;
/M: SY='V'; M= -6,-27,-18,-30,-25,  7,-30,-20, 24,-24, 18, 12,-23,-26,-23,-20,-16, -4, 25,-21, -1,-25; D=-6;
/I:         I=-6; MI=-32; IM=-32; DM=-32;
/M: SY='A'; M= 15,-14,-14,-20,-15, -7,-14,-20,  1,-15, -2, -3,-12,-17,-13,-17,  4,  8,  8,-23,-11,-14;
/I:         I=-5; MD=-25;
/M: SY='N'; M= -1,  8,-22,  1, -3,-17,  1, -4,-18, -4,-20,-14, 15,-15, -5, -3,  1, -6,-17,-22,-12, -5; D=-5;
/I:         I=-5; MD=-25;
/M: SY='G'; M= -5, -1,-21, -5, -7,-13, 14, -8,-19, -9,-17,-12,  6,-16, -7, -8,  2, -5,-17,-19,-11, -7; D=-5;
/I:         I=-5; MI=0; MD=-25; IM=0; DM=-25;
/M: SY='I'; M= -6,-28,-15,-34,-27,  9,-30,-27, 27,-27, 16, 11,-22,-25,-24,-25,-17, -8, 22,-20, -1,-27;
/M: SY='V'; M= -6,-28,-20,-31,-25,  8,-30,-24, 22,-23, 16, 11,-25,-23,-24,-21,-17, -5, 23,-15,  0,-25;
/M: SY='V'; M=  5,-15, -9,-20,-18, -7,-21,-23,  7,-16,  1, -1,-14,-20,-18,-17,  3, 14, 18,-29,-11,-18;
/M: SY='H'; M=-12, -3,-28, -3, -3,-20, -5, 62,-30, -7,-21, -5,  6,-19,  3, -2, -7,-17,-27,-26,  7, -3;
/M: SY='N'; M=-10, 35,-21, 17,  3,-23, -3, 10,-20,  1,-29,-17, 52,-19,  8,  1,  9, -1,-30,-37,-19,  5;
/M: SY='C'; M= -3,-11, 63,-18,-18,-18,-20,-23,-24,-21,-21,-18, -8,-27,-18,-21,  9,  8, -8,-44,-24,-18;
/I:         E0=0; E1=0; IE=-105; DE=-105;
Numerical results
  • UniProtKB/Swiss-Prot release number: 2013_05, total number of sequence entries in that release: 540052.
  • Total number of hits in UniProtKB/Swiss-Prot: 106 hits in 94 different sequences
  • Number of hits on proteins that are known to belong to the set under consideration: 106 hits in 94 different sequences
  • Number of hits on proteins that could potentially belong to the set under consideration: 0 hits in 0 different sequences
  • Number of false hits (on unrelated proteins): 0 hits in 0 different sequences
  • Number of known missed hits: 8
  • Number of partial sequences which belong to the set under consideration, but which are not hit by the pattern or profile because they are partial (fragment) sequences: 0
  • Precision (true hits / (true hits + false positives)): 100.00 %
  • Recall (true hits / (true hits + false negatives)): 92.98 %
Comments
  • MATRIX_TYPE: protein_domain
  • Scaling database: UniProtKB/Swiss-Prot: reversed
  • Matrix author: CJA_Sigrist
  • Taxonomic range: Archaea, Bacteriophages, Eukaryotes, Prokaryotes (Bacteria), Eukaryotic viruses
  • Maximum known number of repetitions of the pattern in a single protein: 3
  • VERSION: 1
Cross-references
UniProtKB/Swiss-Prot True positive hits:
DNAB_GUITH  (O78411), DNAB_MYCBO  (P59966), DNAB_MYCTU  (P71715), 
DNAB_NOSS1  (Q8YZA1), DNAB_PORPU  (P51333), DNAB_PORYE  (Q1XDF3), 
DNAB_RHOMR  (O30477), DNAB_SYNY3  (Q55418), DP2L_METHJ  (Q2FSF9), 
DP2L_METMJ  (A3CXE7), DP2L_PYRAB  (Q9V2F4), DP2L_PYRHO  (O57861), 
DP2L_PYRKO  (Q5JET0), DPO3A_SYNY3 (P74750), DPOL_BPAPS  (Q9T1Q3), 
DPOL_CEV01  (A7U6F1), DPOL_METJA  (Q58295), DPOL_MIMIV  (Q5UQR0), 
DPOL_PYRHO  (O59610), DPOL_PYRKO  (P77933), DPOL_PYRSD  (Q51334), 
DPOL_THEAG  (O33845), DPOL_THEFM  (P74918), DPOL_THEG8  (Q9HH84), 
DPOL_THEHY  (Q9HH05), DPOL_THELI  (P30317), EF2_METKA   (Q8TXJ4), 
GLMS_METJA  (Q58815), GYRA_MYCFV  (Q49166), GYRA_MYCGO  (Q49467), 
GYRA_MYCKA  (Q49608), GYRA_MYCLE  (Q57532), GYRA_MYCMA  (O33149), 
GYRA_MYCXE  (P72065), GYRB_SYNY3  (P77966), HELS_METJA  (Q58524), 
HELS_PYRKO  (Q5JGV6), IF2P_METJA  (Q57710), IF2P_PYRAB  (Q9UZK7), 
IF2P_PYRFU  (Q8U1R8), IF2P_PYRHO  (O58822), IF2P_PYRKO  (Q5JGR9), 
LONB_PYRAB  (Q9UYC6), LONB_PYRHO  (O58221), NRDEB_BACSU (O31875), 
NRDEB_BPSPB (O64173), PPSA_METJA  (Q57962), RADA_PYRHO  (O58001), 
RADA_PYRKO  (Q5JET4), RECA_MYCBO  (P0A5U5), RECA_MYCCI  (Q9F417), 
RECA_MYCFA  (Q9F416), RECA_MYCFV  (Q9F415), RECA_MYCGS  (Q9F414), 
RECA_MYCLE  (P35901), RECA_MYCSH  (Q9F410), RECA_MYCTH  (Q9F407), 
RECA_MYCTU  (P0A5U4), RFCS_HALWD  (Q18E75), RFCS_METJA  (Q58817), 
RFCS_METKA  (Q8TZC4), RFCS_PYRAB  (Q9V2G4), RFCS_PYRFU  (Q8U4J3), 
RFCS_PYRHO  (O57852), RFCS_PYRKO  (Q5JHP2), RGYR_METJA  (Q58907), 
RGYR_PYRHO  (O58530), RGYR_PYRKO  (Q6F598), RIR2_AQUAE  (O67475), 
RPC2_DICDI  (Q54IZ9), RPOA1_METJA (Q58445), RPOA2_METJA (Q58446), 
RTCB_METJA  (Q58095), RTCB_METKA  (Q8TUS2), RTCB_PYRAB  (Q9V168), 
RTCB_PYRFU  (Q8U0H4), RTCB_PYRHO  (O59245), TF2B_METJA  (Q58192), 
TOP1_PYRFU  (O73954), VATA_PYRAB  (Q9UXU7), VATA_PYRFU  (Q8U4A6), 
VATA_PYRHO  (O57728), VATA_THEAC  (Q9P997), VATA_THEVO  (Q97CQ0), 
VATA_YEAST  (P17255), VATB_METKA  (Q8TUT0), Y043_METJA  (Q60348), 
Y1054_METJA (Q58454), Y1461_MYCTU (P67125), Y1496_MYCBO (P67126), 
Y450_MYCPN  (Q50362), Y593_MYCLE  (Q49689), Y781_METJA  (Q58191), 
Y832_METJA  (Q58242)
False negative hits (sequences which belong to the set under consideration, but which have not been picked up by the pattern or profile):
CLPP_CHLMO  (P42379), DNAB_MYCIT  (Q9F5P4), DNAB_MYCLE  (P46394), 
DP2L_HALMA  (Q5UZ40), DP2L_HALSA  (Q9HMX8), DP2L_HALWD  (Q18ER3), 
RIR1_IIV6   (O55716), VATA_CANTR  (P38078)
Retrieve an alignment of UniProtKB/Swiss-Prot true positive hits:

[Clustal format, color, condensed view] [Clustal format, color] [Clustal format, plain text] [Fasta format]

Retrieve the sequence logo from the alignment

PDB
[Detailed view]
1AM2; 1MI8; 2CW7; 2CW8; 2IN0; 2IN9; 2L8L; 2LQM; 4E2T; 4E2U;

View entry in original PROSITE format
View entry in raw text format (no links)


If you would like to retrieve all the Swiss-Prot entries referenced in the DR lines of this entry (with the exception of false positive hits) , you can enter a file name. These entries will then be saved to a file under this name. (Please note that this temporary file will only be kept for 1 week.)

File name:

Format: Swiss-Prot Fasta

or