National Center for Biotechnology Information (NCBI)

welcome to the blast2 network service.


PEPTIDE SEQUENCE DATABASES

 nr        Non-redundant GenBank CDS translations+PDB+SwissProt+PIR
 month     All new or revised GenBank CDS translation+PDB+SwissProt+PIR
           sequences released in the last 30 days
 pdb       PDB protein sequences
 yeast     Yeast (Saccharomyces cerevisiae) protein sequences.
 kabat     Kabat Sequences of Proteins of Immunological Interest
 alu *     Translations of Select Alu Repeats from REPBASE
 swissprot SwissProt sequences


NUCLEOTIDE SEQUENCE DATABASES

 nr       Non-redundant GenBank+EMBL+DDBJ+PDB sequences
          (but no EST, STS, GSS, or HTGS sequences) 
 month    All new or revised GenBank+EMBL+DDBJ+PDB sequences released in
          the last 30 days
 yeast    Yeast (Saccharomyces cerevisiae) genomic nucleotide sequences.
 est +    Non-redundant Database of GenBank+EMBL+DDBJ EST Division
 sts +    Non-redundant Database of GenBank+EMBL+DDBJ STS Division
 htgs     High Throughput Genomic Sequences 
 pdb      PDB nucleotide sequences
 vector   Vector subset of GenBank
 mito *   Database of mitochondrial sequences, Rel. 1.0, July 1995
 gss      Genome Survey Sequences (includes single-pass genomic data, exon-
          trapped sequences, and Alu PCR primers.
 kabat    Kabat Sequences of Nucleic Acid of Immunological Interest
 epd      Eukaryotic Promotor Database
 alu *+   Select Alu Repeats from REPBASE



BLASTX 1.4.11 [24-Nov-97] [Build 24-Nov-97]

Reference:  Gish, Warren and David J. States (1993).  Identification of
protein coding regions by database similarity search.  Nat. Genet. 3:266-72.
Altschul, Stephen F., Warren Gish, Webb Miller, Eugene W. Myers, and David J.
Lipman (1990).  Basic local alignment search tool.  J. Mol. Biol. 215:403-10.

Notice:  statistical significance is estimated under the assumption that
the equivalent of one entire reading frame in the query sequence codes for
protein and that significant alignments will involve only coding reading
frames.

Query=  tmpseq_1
        (430 letters)

  Translating both strands of query sequence in all 6 reading frames

Database:  Non-redundant GenBank CDS
           translations+PDB+SwissProt+SPupdate+PIR
           329,726 sequences; 100,504,245 total letters.
Searching..................................................done

                                                                     Smallest
                                                                       Sum
                                                     Reading  High  Probability
Sequences producing High-scoring Segment Pairs:        Frame Score  P(N)      N

gi|208131  (M77169) beta-galactosidase alpha-peptide [... +3    52  0.74      2
gi|3322926 (AE001237) T. pallidum predicted coding reg... +3    66  0.80      1
gi|434648  (U03991) beta-galactosidase alpha peptide [... +3    52  0.82      2
gi|987050  (X65335) lacZ gene product [unidentified cl... +2    55  0.93      2
gi|215564  (M64097) E10 [Bacteriophage mu]                +1    64  0.95      1

 

gi|208131 (M77169) beta-galactosidase alpha-peptide [Cloning vector]
            gi|3132861 (U90554) beta-galactosidase alpha peptide [Shuttle
            vector pJIR1456] gi|3132864 (U90555) beta-galactosidase alpha
            peptide [Shuttle vector pJIR1457]
            Length = 107

  Plus Strand HSPs:

 Score = 52 (23.8 bits), Expect = 1.3, Sum P(2) = 0.74
 Identities = 10/10 (100%), Positives = 10/10 (100%), Frame = +3

Query:     3 LESTCRHASL 32
             LESTCRHASL
Sbjct:    15 LESTCRHASL 24

 Score = 42 (19.2 bits), Expect = 1.3, Sum P(2) = 0.74
 Identities = 9/27 (33%), Positives = 13/27 (48%), Frame = +2

Query:   101 WR*FKYSPIWHSTDISYRARLSSGTAC 181
             WR  +Y  + H   IS+R   +  T C
Sbjct:    77 WRLMRYFLLTHLCGISHRIWCTLSTIC 103

 

gi|3322926 (AE001237) T. pallidum predicted coding region TP0622
            [Treponema pallidum]
            Length = 593

  Plus Strand HSPs:

 Score = 66 (30.2 bits), Expect = 1.6, P = 0.80
 Identities = 20/57 (35%), Positives = 25/57 (43%), Frame = +3

Query:    87 SAHLAGANLNTRQYGTPLTSVTEHGSLQVLRVTLLSDVSLNSYSSIFYDLFRKKNSF 257
             SA   G   +   Y   L SV E   L +LR +LLSD          Y+ +RKK  F
Sbjct:   504 SASARGTLRSIFSYYEALLSVHEEERLSLLRASLLSDPRNGRTLFALYEWYRKKKDF 560

 

gi|434648 (U03991) beta-galactosidase alpha peptide [Cloning vector pUC1918]
            Length = 125

  Plus Strand HSPs:

 Score = 52 (23.8 bits), Expect = 1.7, Sum P(2) = 0.82
 Identities = 10/10 (100%), Positives = 10/10 (100%), Frame = +3

Query:     3 LESTCRHASL 32
             LESTCRHASL
Sbjct:    33 LESTCRHASL 42

 Score = 42 (19.2 bits), Expect = 1.7, Sum P(2) = 0.82
 Identities = 9/27 (33%), Positives = 13/27 (48%), Frame = +2

Query:   101 WR*FKYSPIWHSTDISYRARLSSGTAC 181
             WR  +Y  + H   IS+R   +  T C
Sbjct:    95 WRLMRYFLLTHLCGISHRIWCTLSTIC 121

 

gi|987050 (X65335) lacZ gene product [unidentified cloning vector]
            Length = 209

  Plus Strand HSPs:

 Score = 55 (25.2 bits), Expect = 2.7, Sum P(2) = 0.93
 Identities = 13/20 (65%), Positives = 13/20 (65%), Frame = +2

Query:     2 SRVDLQACKLN*AAAQHRLD 61
             SRVDLQACKL  A    R D
Sbjct:   117 SRVDLQACKLALAVVLQRRD 136

 Score = 42 (19.2 bits), Expect = 2.7, Sum P(2) = 0.93
 Identities = 9/27 (33%), Positives = 13/27 (48%), Frame = +2

Query:   101 WR*FKYSPIWHSTDISYRARLSSGTAC 181
             WR  +Y  + H   IS+R   +  T C
Sbjct:   179 WRLMRYFLLTHLCGISHRIWCTLSTIC 205

 

gi|215564 (M64097) E10 [Bacteriophage mu]
            Length = 66

  Plus Strand HSPs:

 Score = 64 (29.3 bits), Expect = 3.1, P = 0.95
 Identities = 15/31 (48%), Positives = 19/31 (61%), Frame = +1

Query:   154 STALFRYCVLPC*VMCPSTHILPSSMIYFGR 246
             ST L  Y  L C V+    ++LPSSM+YF R
Sbjct:    11 STLLRAYGRLTCGVLAEKMNMLPSSMVYFLR 41


Parameters:

  V=5

  Lambda     K      H
     0.318   0.135   0.401

  Cutoff to enter 2nd pass: >= 46 ( 0.0 bits)

  E     S     T1     T2     X1     X2     W     Gap
  10.0      62      12      12      -16      -22      40      50

  Database:  Non-redundant GenBank CDS translations+PDB+SwissProt+SPupdate+PIR
    Posted date:  Oct 14, 1998  7:57 AM
  # of letters in database:  100,504,245
  # of sequences in database:  329,726



  Number of Hits to DB: 1st pass: 76342576, 2nd pass: 431446
  Number of Sequences: 1st pass: 329726, 2nd pass: 623
  Number of extensions: 1st pass: 1421512, 2nd pass: 355912
  Number of successful extensions: 1st pass: 623, 2nd pass: 1668
  Number of sequences better than 10: 10