National Center for Biotechnology Information (NCBI)
welcome to the blast2 network service.
PEPTIDE SEQUENCE DATABASES
nr Non-redundant GenBank CDS translations+PDB+SwissProt+PIR
month All new or revised GenBank CDS translation+PDB+SwissProt+PIR
sequences released in the last 30 days
pdb PDB protein sequences
yeast Yeast (Saccharomyces cerevisiae) protein sequences.
kabat Kabat Sequences of Proteins of Immunological Interest
alu * Translations of Select Alu Repeats from REPBASE
swissprot SwissProt sequences
NUCLEOTIDE SEQUENCE DATABASES
nr Non-redundant GenBank+EMBL+DDBJ+PDB sequences
(but no EST, STS, GSS, or HTGS sequences)
month All new or revised GenBank+EMBL+DDBJ+PDB sequences released in
the last 30 days
yeast Yeast (Saccharomyces cerevisiae) genomic nucleotide sequences.
est + Non-redundant Database of GenBank+EMBL+DDBJ EST Division
sts + Non-redundant Database of GenBank+EMBL+DDBJ STS Division
htgs High Throughput Genomic Sequences
pdb PDB nucleotide sequences
vector Vector subset of GenBank
mito * Database of mitochondrial sequences, Rel. 1.0, July 1995
gss Genome Survey Sequences (includes single-pass genomic data, exon-
trapped sequences, and Alu PCR primers.
kabat Kabat Sequences of Nucleic Acid of Immunological Interest
epd Eukaryotic Promotor Database
alu *+ Select Alu Repeats from REPBASE
BLASTX 1.4.11 [24-Nov-97] [Build 24-Nov-97]
Reference: Gish, Warren and David J. States (1993). Identification of
protein coding regions by database similarity search. Nat. Genet. 3:266-72.
Altschul, Stephen F., Warren Gish, Webb Miller, Eugene W. Myers, and David J.
Lipman (1990). Basic local alignment search tool. J. Mol. Biol. 215:403-10.
Notice: statistical significance is estimated under the assumption that
the equivalent of one entire reading frame in the query sequence codes for
protein and that significant alignments will involve only coding reading
frames.
Query= tmpseq_1
(365 letters)
Translating both strands of query sequence in all 6 reading frames
Database: Non-redundant GenBank CDS
translations+PDB+SwissProt+SPupdate+PIR
329,726 sequences; 100,504,245 total letters.
Searching..................................................done
Smallest
Sum
Reading High Probability
Sequences producing High-scoring Segment Pairs: Frame Score P(N) N
gnl|PID|e323030 (Z97050) hypothetical protein Rv0... -2 64 0.91 1
sp|P43963|Y205_HAEIN HYPOTHETICAL PROTEIN HI0205 PRECU... -2 62 0.99 1
gnl|PID|e318951 (Z49542) ORF YJR041c [Saccharomyc... +2 51 0.9998 2
sp|P47108|YJ11_YEAST HYPOTHETICAL 135.1 KD PROTEIN IN ... +2 51 0.9999 2
gnl|PID|e323030 (Z97050) hypothetical protein Rv0191 [Mycobacterium
tuberculosis]
Length = 413
Minus Strand HSPs:
Score = 64 (29.3 bits), Expect = 2.4, P = 0.91
Identities = 15/37 (40%), Positives = 19/37 (51%), Frame = -2
Query: 307 TPQILAYLMVASIAVFEMEVVAVLPAGSLSPFLRNER 197
TP+I L V + A F +LP G+LS RN R
Sbjct: 16 TPRIATQLSVLACAAFIYVTAEILPVGALSAIARNLR 52
sp|P43963|Y205_HAEIN HYPOTHETICAL PROTEIN HI0205 PRECURSOR pir||H64003
hypothetical protein HI0205 - Haemophilus influenzae (strain Rd
KW20) gi|1573169 (U32705) H. influenzae predicted coding region
HI0205 [Haemophilus influenzae Rd]
Length = 257
Minus Strand HSPs:
Score = 62 (28.4 bits), Expect = 4.4, P = 0.99
Identities = 15/51 (29%), Positives = 25/51 (49%), Frame = -2
Query: 319 DKHLTPQILAYLMVASIAVFEMEVVAVLPAGSLSPFLRNERC**FVRXLIN 167
DK+LTP+ L YL I + ++ + L FL N++ ++R N
Sbjct: 92 DKNLTPKFLDYLYFEPINTVDANLIQEMKKNLLVSFLANDQAKIYIRQTDN 142
gnl|PID|e318951 (Z49542) ORF YJR041c [Saccharomyces cerevisiae]
Length = 1120
Plus Strand HSPs:
Score = 51 (23.4 bits), Expect = 8.7, Sum P(2) = 1.0
Identities = 10/17 (58%), Positives = 14/17 (82%), Frame = +2
Query: 65 DNLLSQLVSFLKAIFAL 115
D LL++ VSF+KA FA+
Sbjct: 114 DKLLTRSVSFIKAFFAI 130
Score = 46 (21.1 bits), Expect = 8.7, Sum P(2) = 1.0
Identities = 10/46 (21%), Positives = 26/46 (56%), Frame = +1
Query: 178 DEQIINISHFLRMDLNYQQEELLQLPSRIPLSMLPSDMRVSVVSNV 315
+E I + ++ +Y Q L+ +IP+ + ++RV++++N+
Sbjct: 605 EETNITYALINKLASSYHQTFALEALIQIPIQCINKNVRVALINNL 650
sp|P47108|YJ11_YEAST HYPOTHETICAL 135.1 KD PROTEIN IN GEF1-NUP85
INTERGENIC REGION pir||S57060 probable membrane protein YJR041c -
yeast (Saccharomyces cerevisiae) gi|1015693 (Z49541) ORF YJR041c
[Saccharomyces cerevisiae] gi|1197069 (L36344) ORF; putative
[Saccharomyces cerevisiae]
Length = 1174
Plus Strand HSPs:
Score = 51 (23.4 bits), Expect = 9.1, Sum P(2) = 1.0
Identities = 10/17 (58%), Positives = 14/17 (82%), Frame = +2
Query: 65 DNLLSQLVSFLKAIFAL 115
D LL++ VSF+KA FA+
Sbjct: 114 DKLLTRSVSFIKAFFAI 130
Score = 46 (21.1 bits), Expect = 9.1, Sum P(2) = 1.0
Identities = 10/46 (21%), Positives = 26/46 (56%), Frame = +1
Query: 178 DEQIINISHFLRMDLNYQQEELLQLPSRIPLSMLPSDMRVSVVSNV 315
+E I + ++ +Y Q L+ +IP+ + ++RV++++N+
Sbjct: 605 EETNITYALINKLASSYHQTFALEALIQIPIQCINKNVRVALINNL 650
Parameters:
V=5
Lambda K H
0.318 0.135 0.401
Cutoff to enter 2nd pass: >= 45 ( 0.0 bits)
E S T1 T2 X1 X2 W Gap
10.0 62 12 12 -16 -22 40 50
Database: Non-redundant GenBank CDS translations+PDB+SwissProt+SPupdate+PIR
Posted date: Oct 14, 1998 7:57 AM
# of letters in database: 100,504,245
# of sequences in database: 329,726
Number of Hits to DB: 1st pass: 57156328, 2nd pass: 299276
Number of Sequences: 1st pass: 329726, 2nd pass: 628
Number of extensions: 1st pass: 937880, 2nd pass: 252448
Number of successful extensions: 1st pass: 628, 2nd pass: 1518
Number of sequences better than 10: 4