National Center for Biotechnology Information (NCBI)
Welcome to the BLAST2 network service.
PEPTIDE SEQUENCE DATABASES
nr Non-redundant GenBank CDS translations+PDB+SwissProt+PIR
month All new or revised GenBank CDS translations+PDB+SwissProt+PIR
sequences released in the last 30 days
pdb PDB protein sequences
yeast Yeast (Saccharomyces cerevisiae) protein sequences.
kabat Kabat Sequences of Proteins of Immunological Interest
alu * Translations of Select Alu Repeats from REPBASE
swissprot SwissProt sequences
NUCLEOTIDE SEQUENCE DATABASES
nr Non-redundant GenBank+EMBL+DDBJ+PDB sequences
(but no EST, STS, GSS, or HTGS sequences)
month All new or revised GenBank+EMBL+DDBJ+PDB sequences released in
the last 30 days
yeast Yeast (Saccharomyces cerevisiae) genomic nucleotide sequences.
est + Non-redundant Database of GenBank+EMBL+DDBJ EST Division
sts + Non-redundant Database of GenBank+EMBL+DDBJ STS Division
htgs High Throughput Genomic Sequences
pdb PDB nucleotide sequences
vector Vector subset of GenBank
mito * Database of mitochondrial sequences, Rel. 1.0, July 1995
gss Genome Survey Sequences (includes single-pass genomic data,
exon-trapped sequences, and Alu PCR primers).
kabat Kabat Sequences of Nucleic Acid of Immunological Interest
epd Eukaryotic Promoter Database
alu *+ Select Alu Repeats from REPBASE
BLASTX 1.4.11 [24-Nov-97] [Build 24-Nov-97]
Reference: Gish, Warren and David J. States (1993). Identification of
protein coding regions by database similarity search. Nat. Genet. 3:266-72.
Altschul, Stephen F., Warren Gish, Webb Miller, Eugene W. Myers, and David J.
Lipman (1990). Basic local alignment search tool. J. Mol. Biol. 215:403-10.
Notice: statistical significance is estimated under the assumption that
the equivalent of one entire reading frame in the query sequence codes for
protein and that significant alignments will involve only coding reading
frames.
Query= tmpseq_1
(430 letters)
Translating both strands of query sequence in all 6 reading frames
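
Note: the six-frame translation named in the line above can be sketched as follows. This is a minimal illustration, not the BLASTX implementation. It assumes the standard genetic code (NCBI translation table 1), and the function names are hypothetical. The helper plus_frame_of() reflects the numbering the alignments in this report appear to follow, where a plus-strand frame +k begins at query nucleotide k (for example, query position 3 is reported as Frame = +3).

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the report does not state which genetic code was used.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an uppercase DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate complete codons only; unknown codons become 'X'."""
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def six_frames(dna):
    """Map frame number (+1..+3, -1..-3) to its conceptual translation."""
    dna = dna.upper()
    rc = reverse_complement(dna)
    frames = {}
    for offset in range(3):
        frames[offset + 1] = translate(dna[offset:])
        frames[-(offset + 1)] = translate(rc[offset:])
    return frames


def plus_frame_of(query_start):
    """Plus-strand frame implied by a 1-based query start coordinate."""
    return (query_start - 1) % 3 + 1


if __name__ == "__main__":
    for frame, protein in sorted(six_frames("ATGGCCTAA").items()):
        print(f"{frame:+d} {protein}")
```

Stop codons are kept as "*" in the translated query, which is how they appear in the Query lines of the alignments in this report.
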
Database: Non-redundant GenBank CDS
translations+PDB+SwissProt+SPupdate+PIR
329,726 sequences; 100,504,245 total letters.
Searching..................................................done
                                                                      Smallest
                                                                         Sum
                                                      Reading   High Probability
Sequences producing High-scoring Segment Pairs:         Frame  Score  P(N)   N
gi|208131 (M77169) beta-galactosidase alpha-peptide [...   +3     52  0.74   2
gi|3322926 (AE001237) T. pallidum predicted coding reg...  +3     66  0.80   1
gi|434648 (U03991) beta-galactosidase alpha peptide [...   +3     52  0.82   2
gi|987050 (X65335) lacZ gene product [unidentified cl...   +2     55  0.93   2
gi|215564 (M64097) E10 [Bacteriophage mu]                  +1     64  0.95   1
gi|208131 (M77169) beta-galactosidase alpha-peptide [Cloning vector]
gi|3132861 (U90554) beta-galactosidase alpha peptide [Shuttle vector pJIR1456]
gi|3132864 (U90555) beta-galactosidase alpha peptide [Shuttle vector pJIR1457]
Length = 107
Plus Strand HSPs:
Score = 52 (23.8 bits), Expect = 1.3, Sum P(2) = 0.74
Identities = 10/10 (100%), Positives = 10/10 (100%), Frame = +3
Query:   3 LESTCRHASL 32
           LESTCRHASL
Sbjct:  15 LESTCRHASL 24
Score = 42 (19.2 bits), Expect = 1.3, Sum P(2) = 0.74
Identities = 9/27 (33%), Positives = 13/27 (48%), Frame = +2
Query: 101 WR*FKYSPIWHSTDISYRARLSSGTAC 181
           WR  +Y  + H   IS+R   +  T C
Sbjct:  77 WRLMRYFLLTHLCGISHRIWCTLSTIC 103
gi|3322926 (AE001237) T. pallidum predicted coding region TP0622
[Treponema pallidum]
Length = 593
Plus Strand HSPs:
Score = 66 (30.2 bits), Expect = 1.6, P = 0.80
Identities = 20/57 (35%), Positives = 25/57 (43%), Frame = +3
Query:  87 SAHLAGANLNTRQYGTPLTSVTEHGSLQVLRVTLLSDVSLNSYSSIFYDLFRKKNSF 257
           SA   G   +   Y   L SV E   L +LR +LLSD          Y+ +RKK  F
Sbjct: 504 SASARGTLRSIFSYYEALLSVHEEERLSLLRASLLSDPRNGRTLFALYEWYRKKKDF 560
gi|434648 (U03991) beta-galactosidase alpha peptide [Cloning vector pUC1918]
Length = 125
Plus Strand HSPs:
Score = 52 (23.8 bits), Expect = 1.7, Sum P(2) = 0.82
Identities = 10/10 (100%), Positives = 10/10 (100%), Frame = +3
Query:   3 LESTCRHASL 32
           LESTCRHASL
Sbjct:  33 LESTCRHASL 42
Score = 42 (19.2 bits), Expect = 1.7, Sum P(2) = 0.82
Identities = 9/27 (33%), Positives = 13/27 (48%), Frame = +2
Query: 101 WR*FKYSPIWHSTDISYRARLSSGTAC 181
           WR  +Y  + H   IS+R   +  T C
Sbjct:  95 WRLMRYFLLTHLCGISHRIWCTLSTIC 121
gi|987050 (X65335) lacZ gene product [unidentified cloning vector]
Length = 209
Plus Strand HSPs:
Score = 55 (25.2 bits), Expect = 2.7, Sum P(2) = 0.93
Identities = 13/20 (65%), Positives = 13/20 (65%), Frame = +2
Query:   2 SRVDLQACKLN*AAAQHRLD 61
           SRVDLQACKL  A    R D
Sbjct: 117 SRVDLQACKLALAVVLQRRD 136
Score = 42 (19.2 bits), Expect = 2.7, Sum P(2) = 0.93
Identities = 9/27 (33%), Positives = 13/27 (48%), Frame = +2
Query: 101 WR*FKYSPIWHSTDISYRARLSSGTAC 181
           WR  +Y  + H   IS+R   +  T C
Sbjct: 179 WRLMRYFLLTHLCGISHRIWCTLSTIC 205
gi|215564 (M64097) E10 [Bacteriophage mu]
Length = 66
Plus Strand HSPs:
Score = 64 (29.3 bits), Expect = 3.1, P = 0.95
Identities = 15/31 (48%), Positives = 19/31 (61%), Frame = +1
Query: 154 STALFRYCVLPC*VMCPSTHILPSSMIYFGR 246
           ST L  Y  L C V+    ++LPSSM+YF R
Sbjct:  11 STLLRAYGRLTCGVLAEKMNMLPSSMVYFLR 41
Parameters:
V=5
Lambda     K      H
 0.318   0.135  0.401
Cutoff to enter 2nd pass: >= 46 ( 0.0 bits)
   E     S   T1   T2   X1   X2    W  Gap
10.0    62   12   12  -16  -22   40   50
Database: Non-redundant GenBank CDS translations+PDB+SwissProt+SPupdate+PIR
Posted date: Oct 14, 1998 7:57 AM
# of letters in database: 100,504,245
# of sequences in database: 329,726
Number of Hits to DB: 1st pass: 76342576, 2nd pass: 431446
Number of Sequences: 1st pass: 329726, 2nd pass: 623
Number of extensions: 1st pass: 1421512, 2nd pass: 355912
Number of successful extensions: 1st pass: 623, 2nd pass: 1668
Number of sequences better than 10: 10
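
Note: the bit scores and probabilities printed in this report can be related to the raw scores and Expect values with the short sketch below. It is an observation about the numbers shown here, not documented program behavior. The bit values appear consistent with Lambda * Score / ln 2, using Lambda = 0.318 from the Parameters block and truncating to one decimal place (later BLAST versions also subtract ln K, which is not done here). The P values appear consistent with the Poisson relation P = 1 - exp(-E). The function names are hypothetical.

```python
import math

LAMBDA = 0.318  # "Lambda" value from the Parameters block of this report


def bit_score(raw_score, lam=LAMBDA):
    """Raw score expressed in bits, assuming bits = lambda * S / ln 2."""
    return lam * raw_score / math.log(2)


def truncated_bits(raw_score, lam=LAMBDA):
    """Bit score cut to one decimal place, as the report seems to print it."""
    return math.floor(bit_score(raw_score, lam) * 10) / 10


def p_from_expect(expect):
    """Probability of at least one chance hit, given an Expect value."""
    return 1.0 - math.exp(-expect)


if __name__ == "__main__":
    for raw in (52, 42, 66, 55, 64):
        print(f"Score = {raw} ({truncated_bits(raw)} bits)")
    for expect in (1.6, 1.7, 2.7, 3.1):
        print(f"Expect = {expect}, P = {p_from_expect(expect):.2f}")
```

Because the Expect values in the report are themselves rounded, a P value recomputed from them can differ in the last digit. For example, Expect = 1.3 gives about 0.73 here, while the report prints 0.74.
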