Celiac Database Search

Peptide Exact Match

Candidate proteins derived from members of the Pooideae subfamily of grasses are screened as query sequences against the 1041 peptides using an exact match algorithm. This is the primary screening tool and the one most likely to identify a protein that poses a risk of eliciting celiac disease (CD). However, an exact match to a 9 amino acid (AA) peptide can also occur in some non-Pooideae, non-gluten proteins. Peptide 68 is an example: it matches a fungal protein, but it also has multiple matches to alpha-2 gliadin of wheat. Review the matches and the output table, looking for multiple matches and at the lengths of the matched peptides. Also use FASTA to assess overall protein similarity. If questions remain, search your protein against NCBI Protein using BLASTP to look for similarities to other proteins.
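The exact match screen described above can be illustrated with a minimal Python sketch. It is not the site's implementation: the function name `find_exact_matches` and the peptide IDs and sequences in the usage note are hypothetical placeholders, not entries from the peptide set. The sketch reports every occurrence of each peptide in the query, so that multiple matches and matched peptide lengths can be reviewed.

```python
def find_exact_matches(protein, peptides):
    """Return (peptide_id, peptide, start) for every exact occurrence.

    `peptides` maps a peptide ID to its sequence. `start` is the 1-based
    position of the match in the query protein. Overlapping occurrences
    are reported separately.
    """
    protein = protein.upper()
    hits = []
    for peptide_id, peptide in peptides.items():
        peptide = peptide.upper()
        start = protein.find(peptide)
        while start != -1:
            hits.append((peptide_id, peptide, start + 1))
            # Resume one residue later so overlapping matches are found.
            start = protein.find(peptide, start + 1)
    return hits
```

For example, `find_exact_matches("AAQQPFPAAQQPFP", {"demo1": "QQPFP"})` reports two matches of the placeholder peptide, at positions 3 and 10. A query with several hits to the same peptide, or hits to several peptides, deserves closer review than a single short match.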

Full FASTA

Candidate protein sequences may also be screened as query sequences against the set of 76 full-length CD-eliciting proteins using FASTA version 35.04. A FASTA alignment with greater than 45% overall identity that covers at least one-half of the length of one or more of the 76 CD-eliciting proteins, with an E score less than 1 x 10^-16 (shown as 1e-016 in the tabular FASTA output), would represent a potential risk of eliciting CD. The matched protein should be screened further using biological systems such as a T cell stimulation test with proven CD T cell clones or lines, or in vivo challenges.
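The three criteria above (identity, coverage of the database protein, and E score) can be combined into a single check. The following Python sketch is only an illustration under stated assumptions: the `FastaHit` record and its field names are hypothetical, and the caller is assumed to have already extracted the values from the FASTA output and to know the length of the matched database protein.

```python
from dataclasses import dataclass


@dataclass
class FastaHit:
    """One alignment result; field names are hypothetical."""
    subject_id: str          # identifier of the matched CD-eliciting protein
    percent_identity: float  # overall identity, in percent (0-100)
    alignment_length: int    # residues of the subject covered by the alignment
    subject_length: int      # full length of the matched protein
    evalue: float            # E score reported by FASTA


def flags_potential_risk(hit, min_identity=45.0, min_coverage=0.5,
                         max_evalue=1e-16):
    """Return True when a hit meets all three criteria stated in the text."""
    coverage = hit.alignment_length / hit.subject_length
    return (hit.percent_identity > min_identity
            and coverage >= min_coverage
            and hit.evalue < max_evalue)
```

An E score written as `1e-016` in tabular output parses directly with `float("1e-016")`, so no special handling of the padded exponent is needed. A hit that passes this check is a candidate for the biological follow-up described above, not a confirmed result.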

Sequence Entry

A sequence in FASTA format begins with a single-line description, followed by one or more lines of sequence data. The description line is distinguished from the sequence data by a greater-than (">") symbol in the first column.

>gi|167018|gb|AAA32943.1| C-hordein storage protein [Hordeum vulgare]
FPQPQEPFPQQPQQPFPLQPQQPFPQQPQQPFPQPQ
QPFRQQAELIIPQQPQQPFPLQPHQPYTQQTIWSMV
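The format described above can be read with a short parser. This Python sketch is an illustration rather than the code used by the website, and the function name `parse_fasta` is an assumption. It treats any line starting with ">" as a description line, joins the sequence lines that follow, and drops whitespace inside sequence lines.

```python
def parse_fasta(text):
    """Parse FASTA text into a list of (description, sequence) tuples."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            # A new description line closes the previous record, if any.
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:].strip(), []
        elif header is not None:
            # Remove any spaces or tabs inside the sequence line.
            chunks.append("".join(line.split()))
        else:
            raise ValueError("sequence data found before a description line")
    if header is not None:
        records.append((header, "".join(chunks)))
    return records
```

Applied to the C-hordein example, the parser returns one record whose description starts with `gi|167018|` and whose sequence is the joined residue string with no spaces or line breaks.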

Note: We do not observe or log any protein sequences submitted through this website.