Proteins Technologies: Expanding the Rare Disease Discovery Layer
ncORFs and “Peptideins”: Expanding the Human Proteome
Deutsch et al.’s Nature paper expands the map of the human proteome beyond canonical gene annotation. The TransCODE Consortium analyzed 7,264 non-canonical open reading frames (ncORFs) and demonstrated that approximately 25% produce detectable peptides across more than 95,000 proteomics experiments. The authors also introduced ORBL (ORF Relative Branch Length), a metric designed to evaluate the evolutionary relevance of ncORF-derived products and distinguish potentially functional microproteins from biological “noise.”
Why This Matters for Sponsors
For rare oncology, the translational relevance is relatively direct. HLA-presented peptideins and other non-canonical peptides may create entirely new classes of antigens for:
- immunotherapy
- cancer vaccines
- biomarker discovery
These targets may remain invisible to conventional proteogenomic workflows.
Potential Relevance for Lysosomal Storage Disorders
For metabolic lysosomal storage disorders (LSDs), the connection remains more speculative — but highly interesting. LSDs often demonstrate substantial phenotypic variability even among patients carrying the same pathogenic mutation. In Fabry disease, for example, sisters with the same GLA mutation may experience dramatically different disease severity. ncORF-derived peptides could potentially explain part of these modifier effects through:
- immune presentation
- proteostasis regulation
- stress-response pathways
- tissue-specific biology
Pumpkinseed and deSIPHR: Beyond Conventional Proteomics
Pumpkinseed’s deSIPHR platform addresses a different challenge entirely: protein detection sensitivity.
Reference-free, single-molecule protein sequencing could become critically important in settings where LC-MS/MS lacks sufficient sensitivity, particularly for:
- tissue-specific biomarkers
- ultra-low-abundance proteins
- previously unannotated protein products
Why This Matters in Rare Disease
In many rare diseases, key pathogenic proteins are expressed:
- at extremely low concentrations
- only in specific tissues
- or within highly specialized cellular populations
In these settings, conventional proteomics workflows often lack sufficient depth for meaningful biomarker discovery.
Join Illia Petrov’s subscriber chat
Strategic Takeaway
In the near term, this is not about therapy. It is about discovery infrastructure. Together, ncORF biology and single-molecule protein sequencing shift the central question from:
“What known proteins are altered?” to:
“What biology are current tools failing to detect altogether?”
The caveat remains important: neither ncORF-derived peptideins nor de novo single-molecule protein sequencing is yet a validated therapeutic pathway for LSDs. But both technologies may become increasingly important for:
- target discovery
- biomarker identification
- patient stratification
- phenotype variability analysis
- next-generation rare disease biology programs
