National Center for Biotechnology Information (NCBI)
Welcome to the blast2 network service.
PEPTIDE SEQUENCE DATABASES
nr Non-redundant GenBank CDS translations+PDB+SwissProt+PIR
month All new or revised GenBank CDS translation+PDB+SwissProt+PIR
sequences released in the last 30 days
pdb PDB protein sequences
yeast Yeast (Saccharomyces cerevisiae) protein sequences.
kabat Kabat Sequences of Proteins of Immunological Interest
alu * Translations of Select Alu Repeats from REPBASE
swissprot SwissProt sequences
NUCLEOTIDE SEQUENCE DATABASES
nr Non-redundant GenBank+EMBL+DDBJ+PDB sequences (but no EST's or
STS's)
month All new or revised GenBank+EMBL+DDBJ+PDB sequences released in
the last 30 days
yeast Yeast (Saccharomyces cerevisiae) genomic nucleotide sequences.
est + Non-redundant Database of GenBank+EMBL+DDBJ EST Division
sts + Non-redundant Database of GenBank+EMBL+DDBJ STS Division
pdb PDB nucleotide sequences
vector Vector subset of GenBank
mito * Database of mitochondrial sequences, Rel. 1.0, July 1995
gss Genome Survey Sequences (includes single-pass genomic data, exon-
trapped sequences, and Alu PCR primers.
kabat Kabat Sequences of Nucleic Acid of Immunological Interest
epd Eukaryotic Promotor Database
alu *+ Select Alu Repeats from REPBASE
BLASTX 1.4.11 [24-Nov-97] [Build 24-Nov-97]
Reference: Gish, Warren and David J. States (1993). Identification of
protein coding regions by database similarity search. Nat. Genet. 3:266-72.
Altschul, Stephen F., Warren Gish, Webb Miller, Eugene W. Myers, and David J.
Lipman (1990). Basic local alignment search tool. J. Mol. Biol. 215:403-10.
Notice: statistical significance is estimated under the assumption that
the equivalent of one entire reading frame in the query sequence codes for
protein and that significant alignments will involve only coding reading
frames.
Query= tmpseq_1
(297 letters)
Translating both strands of query sequence in all 6 reading frames
Database: Non-redundant GenBank CDS
translations+PDB+SwissProt+SPupdate+PIR
293,041 sequences; 87,997,542 total letters.
Searching..................................................done
Smallest
Sum
Reading High Probability
Sequences producing High-scoring Segment Pairs: Frame Score P(N) N
pir||I40767 catalase - Campylobacter jejuni /pir||S719... +1 148 3.8e-12 1
gi|148036 (M17939) tox protein DT-201 [Artificial ge... +1 138 9.1e-11 1
pir||A55092 catalase (EC 1.11.1.6) CAT-2 - maize (frag... +1 138 9.1e-11 1
gi|994736 (M18327) LacOPZ-alpha peptide from pUC9; p... +2 131 8.4e-10 1
gi|145811 (M19035) D-serine deaminase activator [Esc... -2 110 1.8e-08 2
pir||I40767 catalase - Campylobacter jejuni pir||S71937 catalase -
Campylobacter jejuni gi|984737 (X85130) catalase [Campylobacter jejuni]
Length = 507
Plus Strand HSPs:
Score = 148 (67.8 bits), Expect = 3.8e-12, P = 3.8e-12
Identities = 25/26 (96%), Positives = 25/26 (96%), Frame = +1
Query: 76 WRNHGHICFLCEIVIRSQFHTTYEPE 153
WRNHGH CFLCEIVIRSQFHTTYEPE
Sbjct: 481 WRNHGHRCFLCEIVIRSQFHTTYEPE 506
gi|148036 (M17939) tox protein DT-201 [Artificial gene] gi|209498
(M21873) tox/synthetic protein [Artificial gene]
Length = 38
Plus Strand HSPs:
Score = 138 (63.2 bits), Expect = 9.1e-11, P = 9.1e-11
Identities = 24/25 (96%), Positives = 24/25 (96%), Frame = +1
Query: 79 RNHGHICFLCEIVIRSQFHTTYEPE 153
RNHGH CFLCEIVIRSQFHTTYEPE
Sbjct: 13 RNHGHSCFLCEIVIRSQFHTTYEPE 37
pir||A55092 catalase (EC 1.11.1.6) CAT-2 - maize (fragment)
Length = 493
Plus Strand HSPs:
Score = 138 (63.2 bits), Expect = 9.1e-11, P = 9.1e-11
Identities = 24/25 (96%), Positives = 24/25 (96%), Frame = +1
Query: 79 RNHGHICFLCEIVIRSQFHTTYEPE 153
RNHGH CFLCEIVIRSQFHTTYEPE
Sbjct: 468 RNHGHSCFLCEIVIRSQFHTTYEPE 492
gi|994736 (M18327) LacOPZ-alpha peptide from pUC9; putative [cloning
vectors] gi|994738 (M18328) LacOPZ-alpha peptide from pUC9;
putative [cloning vectors] gi|994740 (M18329) LacOPZ-alpha peptide
from pUC9; putative [cloning vectors]
Length = 93
Plus Strand HSPs:
Score = 131 (60.0 bits), Expect = 8.4e-10, P = 8.4e-10
Identities = 27/30 (90%), Positives = 28/30 (93%), Frame = +2
Query: 74 LGVIMVISVSCVKLLSAHNSTQHMSRKXKV 163
LGVIMVI+VSCVKLLSAHNSTQH SRK KV
Sbjct: 64 LGVIMVIAVSCVKLLSAHNSTQHTSRKHKV 93
gi|145811 (M19035) D-serine deaminase activator [Escherichia coli]
Length = 249
Minus Strand HSPs:
Score = 110 (50.4 bits), Expect = 1.8e-08, Sum P(2) = 1.8e-08
Identities = 23/36 (63%), Positives = 25/36 (69%), Frame = -2
Query: 200 CELAHSLGTPGLTLYXSGSYVVWNCERITISHRKQI 93
CELAHSLG TL V WNCERITISHRK++
Sbjct: 153 CELAHSLGPDFHTLCFRLLCVCWNCERITISHRKRL 188
Score = 43 (19.7 bits), Expect = 1.8e-08, Sum P(2) = 1.8e-08
Identities = 7/10 (70%), Positives = 8/10 (80%), Frame = -1
Query: 39 HXCR*TIEDP 10
H CR T+EDP
Sbjct: 198 HACRSTLEDP 207
Parameters:
V=5
Lambda K H
0.318 0.135 0.401
Cutoff to enter 2nd pass: >= 45 ( 0.0 bits)
E S T1 T2 X1 X2 W Gap
10.0 61 12 12 -16 -22 40 50
Database: Non-redundant GenBank CDS translations+PDB+SwissProt+SPupdate+PIR
Posted date: Mar 5, 1998 9:47 AM
# of letters in database: 87,997,542
# of sequences in database: 293,041
Number of Hits to DB: 1st pass: 43509662, 2nd pass: 103123
Number of Sequences: 1st pass: 293041, 2nd pass: 343
Number of extensions: 1st pass: 728775, 2nd pass: 84295
Number of successful extensions: 1st pass: 343, 2nd pass: 633
Number of sequences better than 10: 24