National Center for Biotechnology Information (NCBI)

Welcome to the BLAST2 network service.


PEPTIDE SEQUENCE DATABASES

 nr        Non-redundant GenBank CDS translations+PDB+SwissProt+PIR
 month     All new or revised GenBank CDS translations+PDB+SwissProt+PIR
           sequences released in the last 30 days
 pdb       PDB protein sequences
 yeast     Yeast (Saccharomyces cerevisiae) protein sequences.
 kabat     Kabat Sequences of Proteins of Immunological Interest
 alu *     Translations of Select Alu Repeats from REPBASE
 swissprot SwissProt sequences


NUCLEOTIDE SEQUENCE DATABASES

 nr       Non-redundant GenBank+EMBL+DDBJ+PDB sequences
          (but no EST, STS, GSS, or HTGS sequences) 
 month    All new or revised GenBank+EMBL+DDBJ+PDB sequences released in
          the last 30 days
 yeast    Yeast (Saccharomyces cerevisiae) genomic nucleotide sequences.
 est +    Non-redundant Database of GenBank+EMBL+DDBJ EST Division
 sts +    Non-redundant Database of GenBank+EMBL+DDBJ STS Division
 htgs     High Throughput Genomic Sequences 
 pdb      PDB nucleotide sequences
 vector   Vector subset of GenBank
 mito *   Database of mitochondrial sequences, Rel. 1.0, July 1995
 gss      Genome Survey Sequences (includes single-pass genomic data,
          exon-trapped sequences, and Alu PCR primers).
 kabat    Kabat Sequences of Nucleic Acid of Immunological Interest
 epd      Eukaryotic Promoter Database
 alu *+   Select Alu Repeats from REPBASE



BLASTX 1.4.11 [24-Nov-97] [Build 24-Nov-97]

Reference:  Gish, Warren and David J. States (1993).  Identification of
protein coding regions by database similarity search.  Nat. Genet. 3:266-72.
Altschul, Stephen F., Warren Gish, Webb Miller, Eugene W. Myers, and David J.
Lipman (1990).  Basic local alignment search tool.  J. Mol. Biol. 215:403-10.

Notice:  statistical significance is estimated under the assumption that
the equivalent of one entire reading frame in the query sequence codes for
protein and that significant alignments will involve only coding reading
frames.

Query=  tmpseq_1
        (365 letters)

  Translating both strands of query sequence in all 6 reading frames

Database:  Non-redundant GenBank CDS
           translations+PDB+SwissProt+SPupdate+PIR
           329,726 sequences; 100,504,245 total letters.
Searching..................................................done

                                                                     Smallest
                                                                       Sum
                                                     Reading  High  Probability
Sequences producing High-scoring Segment Pairs:        Frame Score  P(N)      N

gnl|PID|e323030      (Z97050) hypothetical protein Rv0... -2    64  0.91      1
sp|P43963|Y205_HAEIN HYPOTHETICAL PROTEIN HI0205 PRECU... -2    62  0.99      1
gnl|PID|e318951      (Z49542) ORF YJR041c [Saccharomyc... +2    51  0.9998    2
sp|P47108|YJ11_YEAST HYPOTHETICAL 135.1 KD PROTEIN IN ... +2    51  0.9999    2
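
The summary table above has a free-text description followed by four numeric columns (reading frame, high score, smallest sum probability P(N), and N). A minimal sketch of reading those rows, assuming the four rightmost whitespace-separated fields are always those columns; the function name `parse_hit_line` and the dictionary keys are illustrative, not part of BLAST:

```python
def parse_hit_line(line):
    """Split one summary row, taking the four numeric columns from the right."""
    # The description may contain spaces, so split from the right-hand side.
    description, frame, score, p_value, n = line.rsplit(None, 4)
    identifier, _, title = description.partition(" ")
    return {
        "id": identifier,
        "title": title.strip(),
        "frame": int(frame),      # int() accepts a leading "+" or "-"
        "score": int(score),
        "p": float(p_value),
        "n": int(n),
    }


# The four rows of the table above, copied verbatim.
SUMMARY = """\
gnl|PID|e323030      (Z97050) hypothetical protein Rv0... -2    64  0.91      1
sp|P43963|Y205_HAEIN HYPOTHETICAL PROTEIN HI0205 PRECU... -2    62  0.99      1
gnl|PID|e318951      (Z49542) ORF YJR041c [Saccharomyc... +2    51  0.9998    2
sp|P47108|YJ11_YEAST HYPOTHETICAL 135.1 KD PROTEIN IN ... +2    51  0.9999    2
"""

hits = [parse_hit_line(row) for row in SUMMARY.splitlines() if row.strip()]
for hit in hits:
    print(hit["id"], hit["frame"], hit["score"], hit["p"], hit["n"])
```

Splitting from the right avoids having to guess where the truncated description ends.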

 

gnl|PID|e323030 (Z97050) hypothetical protein Rv0191 [Mycobacterium
            tuberculosis]
            Length = 413

  Minus Strand HSPs:

 Score = 64 (29.3 bits), Expect = 2.4, P = 0.91
 Identities = 15/37 (40%), Positives = 19/37 (51%), Frame = -2

Query:   307 TPQILAYLMVASIAVFEMEVVAVLPAGSLSPFLRNER 197
             TP+I   L V + A F      +LP G+LS   RN R
Sbjct:    16 TPRIATQLSVLACAAFIYVTAEILPVGALSAIARNLR 52

 

sp|P43963|Y205_HAEIN HYPOTHETICAL PROTEIN HI0205 PRECURSOR pir||H64003
            hypothetical protein HI0205 - Haemophilus influenzae (strain Rd
            KW20) gi|1573169 (U32705) H. influenzae predicted coding region
            HI0205 [Haemophilus influenzae Rd]
            Length = 257

  Minus Strand HSPs:

 Score = 62 (28.4 bits), Expect = 4.4, P = 0.99
 Identities = 15/51 (29%), Positives = 25/51 (49%), Frame = -2

Query:   319 DKHLTPQILAYLMVASIAVFEMEVVAVLPAGSLSPFLRNERC**FVRXLIN 167
             DK+LTP+ L YL    I   +  ++  +    L  FL N++   ++R   N
Sbjct:    92 DKNLTPKFLDYLYFEPINTVDANLIQEMKKNLLVSFLANDQAKIYIRQTDN 142

 

gnl|PID|e318951 (Z49542) ORF YJR041c [Saccharomyces cerevisiae]
            Length = 1120

  Plus Strand HSPs:

 Score = 51 (23.4 bits), Expect = 8.7, Sum P(2) = 1.0
 Identities = 10/17 (58%), Positives = 14/17 (82%), Frame = +2

Query:    65 DNLLSQLVSFLKAIFAL 115
             D LL++ VSF+KA FA+
Sbjct:   114 DKLLTRSVSFIKAFFAI 130

 Score = 46 (21.1 bits), Expect = 8.7, Sum P(2) = 1.0
 Identities = 10/46 (21%), Positives = 26/46 (56%), Frame = +1

Query:   178 DEQIINISHFLRMDLNYQQEELLQLPSRIPLSMLPSDMRVSVVSNV 315
             +E  I  +   ++  +Y Q   L+   +IP+  +  ++RV++++N+
Sbjct:   605 EETNITYALINKLASSYHQTFALEALIQIPIQCINKNVRVALINNL 650

 

sp|P47108|YJ11_YEAST HYPOTHETICAL 135.1 KD PROTEIN IN GEF1-NUP85
            INTERGENIC REGION pir||S57060 probable membrane protein YJR041c -
            yeast (Saccharomyces cerevisiae) gi|1015693 (Z49541) ORF YJR041c
            [Saccharomyces cerevisiae] gi|1197069 (L36344) ORF; putative
            [Saccharomyces cerevisiae]
            Length = 1174

  Plus Strand HSPs:

 Score = 51 (23.4 bits), Expect = 9.1, Sum P(2) = 1.0
 Identities = 10/17 (58%), Positives = 14/17 (82%), Frame = +2

Query:    65 DNLLSQLVSFLKAIFAL 115
             D LL++ VSF+KA FA+
Sbjct:   114 DKLLTRSVSFIKAFFAI 130

 Score = 46 (21.1 bits), Expect = 9.1, Sum P(2) = 1.0
 Identities = 10/46 (21%), Positives = 26/46 (56%), Frame = +1

Query:   178 DEQIINISHFLRMDLNYQQEELLQLPSRIPLSMLPSDMRVSVVSNV 315
             +E  I  +   ++  +Y Q   L+   +IP+  +  ++RV++++N+
Sbjct:   605 EETNITYALINKLASSYHQTFALEALIQIPIQCINKNVRVALINNL 650
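
The Frame values reported for the HSPs above can be checked against the query coordinates. The following is a sketch under an assumed convention, inferred only from the four distinct HSPs in this report rather than from BLAST documentation: plus-strand frames count the offset from query position 1, and minus-strand frames count the offset from the last query position (365 here). The function names are illustrative.

```python
def reading_frame(query_start, query_end, query_length):
    """Return the signed frame (+1..+3 or -1..-3) implied by HSP coordinates."""
    if query_start <= query_end:
        # Plus strand: coordinates ascend along the query.
        return (query_start - 1) % 3 + 1
    # Minus strand: coordinates descend, so measure from the end of the query.
    return -((query_length - query_start) % 3 + 1)


def translated_length(query_start, query_end):
    """Number of residues covered by a nucleotide span (three bases each)."""
    return (abs(query_end - query_start) + 1) // 3


QUERY_LENGTH = 365  # "(365 letters)" in this report

# (query start, query end, reported frame, aligned residues) from the HSPs above
hsps = [
    (307, 197, -2, 37),
    (319, 167, -2, 51),
    (65, 115, +2, 17),
    (178, 315, +1, 46),
]

for start, end, frame, residues in hsps:
    print(start, end, reading_frame(start, end, QUERY_LENGTH),
          translated_length(start, end))
```

Under this assumed convention, each computed frame and length agrees with the Frame and Identities denominators printed for the corresponding HSP.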


Parameters:

  V=5

  Lambda     K      H
     0.318   0.135   0.401

  Cutoff to enter 2nd pass: >= 45 ( 0.0 bits)

       E     S    T1    T2    X1    X2     W   Gap
    10.0    62    12    12   -16   -22    40    50

  Database:  Non-redundant GenBank CDS translations+PDB+SwissProt+SPupdate+PIR
    Posted date:  Oct 14, 1998  7:57 AM
  # of letters in database:  100,504,245
  # of sequences in database:  329,726



  Number of Hits to DB: 1st pass: 57156328, 2nd pass: 299276
  Number of Sequences: 1st pass: 329726, 2nd pass: 628
  Number of extensions: 1st pass: 937880, 2nd pass: 252448
  Number of successful extensions: 1st pass: 628, 2nd pass: 1518
  Number of sequences better than 10: 4
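
The bit scores and P-values printed in this report appear consistent with two simple relations: bits = Lambda * S / ln 2, and P = 1 - exp(-E). The sketch below checks them against the values shown above. It assumes the printed Lambda (0.318) is rounded, so computed bit scores may differ from the printed ones in the last digit; the function names are illustrative and this is not taken from the BLAST source code.

```python
import math

LAMBDA = 0.318  # from the "Lambda K H" line; assumed to be a rounded value


def bit_score(raw_score, lam=LAMBDA):
    """Convert a raw score to bits, assuming bits = lambda * S / ln 2."""
    return lam * raw_score / math.log(2)


def p_from_expect(expect):
    """Poisson probability of at least one chance hit, given Expect."""
    return 1.0 - math.exp(-expect)


# (raw score, bits as printed) for the HSPs in this report
for raw, printed_bits in [(64, 29.3), (62, 28.4), (51, 23.4), (46, 21.1)]:
    print(raw, printed_bits, bit_score(raw))

# (Expect, P as printed in the summary table)
for expect, printed_p in [(2.4, 0.91), (4.4, 0.99), (8.7, 0.9998), (9.1, 0.9999)]:
    print(expect, printed_p, p_from_expect(expect))
```

With Expect values near the cutoff E = 10.0, P approaches 1, which is why none of the four reported hits is statistically significant.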