Nnessy
kNN search Maximum likelihood prediction Results Availability

Determining class-membership probabilities with nearest-neighbor search

Spencer Krieger and John Kececioglu
March, 2020


Overview

Key to our core approach for secondary structure prediction is estimating class membership probabilities for the residues of the protein: at each position i in the protein, for each structure class c, the probability that class c is the true secondary structure state of the residue at position i. We estimate these probabilities by a form of nearest neighbor classification, using words of a fixed length l extracted from the amino acid sequence P of the protein.


Overlapping words

Using our template database of amino acid words we find nearest neighbor words for each fixed-length word from the input sequence. These nearest-neighbors have known secondary structure, which we use to estimate the structure state probabilities at each position of the input sequence. An illustration of this is shown at the top of this page.


Videos

The following video was presented at ISMB 2020 and gives more detailed information on Nnessy:
A shorter version was presented at SCS 2020:
Terms of Use
Nnessy is free for noncommercial use, and comes with neither warranty nor guarantee. Nnessy cannot be redistributed in any form without consent of the authors. If you wish to use Nnessy for commercial purpose, you must first obtain the permission from all authors. All noteworthy uses of Nnessy should cite the related paper.

Citation
Noteworthy uses of Nnessy should cite the following publication: Spencer Krieger and John Kececioglu, “Boosting the accuracy of protein secondary structure prediction through nearest neighbor search and method hybridization”, Proceedings of the 28th Conference on Intelligent Systems for Molecular Biology (ISMB 2020).

Funding
Research supported by the US National Science Foundation through grant CCF-1617192.

Contact
Spencer Krieger
spencer.krieger@gmail.com

Department of Computer Science
University of Arizona
754 Gould-Simpson
1040 E. 4th Street
Tucson, AZ 85721, USA

Last Updated: 2022-09-12 15:28:31 -0700