NNSSP - Prediction of protein secondary sturcture by combining nearest-neighbor algorithms and multiply sequence alignments
(Version 2. 10.5.94)
Department of Cell Biology, Baylor College of Medicine
Analysis of amino acid sequences is available through WWW (NEW VERSION WITH PROBABILITIES OF A- and B- PREDICTIONS) or by sending your file containing a sequence (a sequence format described below) to service@bchs.uh.edu or services@bioinformatics.weizmann.ac.il with the subject line "nnssp".
Examples: mail -s nnssp service@bchs.uh.edu < test.seq
mail -s nnssp services@bioinformatics.weizmann.ac.il < test.seq
where test.seq a file with the sequence.
(*) Yi T-M., Lander E.S. (1993) Protein secondary structure prediction using nearest-neighbor methods. J.Mol.Biol.,232:1117-1129. (**) Bowie J.U., Luthy R., Eisenberg D. (1991) A method to identify protein sequences that fold into a known three-dimensional structure. Science, 253, 164-170.)
SEE ALSO "ssp" program of this server.
Submitting sequences via email:
For email submission the sequences must have the following format:
a) if you send one sequence:
1 line - sequence name
2 line - number 1 in format I5
3 and subsequent lines - amino acid sequence
Sequence length must be less than 2000 amino acids !!!
for example:
ADENYLATE KINASE
1
RLLRAIMGAPGSGKGTVSSRITKHFELKHLSSGDLLRDNMLRGTEIGVLA
KTFIDQGKLIPDDVMTRLVLHELKNLTQYNWLLDGFPRTLPQAEALDRAY
QIDTVINLNVPFEVIKQRLTARWIHPGSGRVYNIEFNPPKTMGIDDLTGE
PLVQREDDRPETVVK............
(Restrict the line length to 75 characters).
b) if you send multiple aligned sequences
1 line - sequence name
2 line - number of aligned sequences and length of protein
3 and subsequent lines - aligned sequences in format 60a1
(where 3-d line is empty or with numbers as well as other lines
which separate parts of aligned sequences)
The number of aligned sequences must be less than 250 !!!
You can use multiple sequence alignments variant only by EMAIL !!!!!
for example:
ACTINOXANTHIN
5 107
10 20 30 40 50 60 (numbers not
APAFSVSPASGASDGQSVSVSVAAAGETYYIAQaAPVGGQDAaNPATATSFTTDASGAAS necessary)
APAFSVSPASGLSDGQSVSVSGAAAGETYYIAQCAPVGGQDACNPATATSFTTDASGAAS
APTATVTPSSGLSDGTVVKVAGAgaGTAYDVGQCAWVdgVLACNPADFSSVTADANGSAS
APGVTVTPATGLSNGQTVTVSATgpGTVYHVGQCAVvpGVIGCDATTSTDVTADAAGKIT
ATPKSSSGGAGASTGSGTSSAAVTSgaASSAQQSGLQGATGAGGGSSSTPGTQPGSGAGG
70 80 90 100
FSFTVRKSYAGQTPSGTPVGSVDbATDAbNLGAGNSGLNLGHVALTF
FSFV-RKSYAGZTPSGTPVGSVDCATDACNLGAGNSGLNLGHVALTF
TSLTVRRSFEGFLFDGTRWGTVDCTTAACQVGLSDAAGNGpgVAISF
AQLKVHSSFQAVvaNGTPWGTVNCKVVSCSAGLGSDSGEGAAQAITF
AIAARPVSAMGGtpPHTVPGSTNTTTTAMAGGVGGPgaNPNAAALM-
(you can use small letters for Cys amino acids, if you want)
Alignment MUST be without deletions in the 1-st (query) sequence!!!
Example of NNSSP output:
This output contains probabilities (Pa and Pb) of a and b structures in 0-9
scale. Probability of c is approximately 10 - Pa - Pb.
ADENYLATE KINASE ISOENZYME-3, /GTP:AMP$
L= 214 SS content: a- 0.43 b= 0.05 c= 0.52
10 20 30 40 50
PredSS aaaaaaa aaaaaa aaaaaaaa aa
AA seq RLLRAIMGAPGSGKGTVSSRITKHFELKHLSSGDLLRDNMLRGTEIGVLA
Prob a 99888651000001112244545422211111346775554221332335
Prob b 00001221000001134422321222233221001110010101134443
60 70 80 90 100
PredSS aaaa aaaaaaaaaaaaaaaa aaaaaaaaa
AA seq KTFIDQGKLIPDDVMTRLVLHELKNLTQYNWLLDGFPRTLPQAEALDRAY
Prob a 54543201110346789888877545553334210001113588888875
Prob b 22221001210001111000000000111233410101110000000011
110 120 130 140 150
PredSS bb aaaaaaaa bb bbbb
AA seq QIDTVINLNVPFEVIKQRLTARWIHPGSGRVYNIEFNPPKTMGIDDLTGE
Prob a 32111111111466766643321110001100000000000111111111
Prob b 12135643321222110122245531001478764210013333211101
160 170 180 190 200
PredSS aaaaaaaaaaaaaaaaaaaaaaa bbb a
AA seq PLVQREDDRPETVVKRLKAYEAQTEPVLEYYRKKGVLETFSGTETNKIWP
Prob a 23433211146788999997765577888886621121111111123335
Prob b 12321000001110000000000000000000101365542111111221
210
PredSS aaaaaaa
AA seq HVYAFLQTKLPQRS
Prob a 46687764210111
Prob b 22211110110001
Reference:
Salamov A.A., Solovyev V.V. Prediction of protein secondary sturcture by combining nearest-neighbor algorithms and multiply sequence alignments. J.Mol.Biol.,1995, 247, 11-15.
Victor V.Solovyev, Department of Cell Biology, Baylor College of Medicine
solovyev@cmb.bcm.tmc.edu
Last modified: Wed Oct 20 17:25:12 CDT 1999