Author: Zhang, Yuanliang
Title: Profiling of the sORF-encoded peptides and discovery of their potential biological function using mass spectrometry
Advisors: Zhao, Qian (ABCT)
Degree: Ph.D.
Year: 2025
Subject: Peptides
Mass spectrometry
Proteomics
Hong Kong Polytechnic University -- Dissertations
Department: Department of Applied Biology and Chemical Technology
Pages: xx, 177 pages : color illustrations
Language: English
Abstract: The discovery of non-canonical proteins, such as small open reading frame-encoded peptides (sORF-encoded peptides, SEPs), alternative proteins (AltProts), and cryptic immunopeptides, has challenged the traditional boundaries of proteomics, revealing a hidden layer of the proteome. These biomolecules, often overlooked due to their small size, low abundance, and origins in genomic regions previously considered non-coding, challenge traditional proteomic paradigms. To overcome detection barriers, we developed advanced mass spectrometry-based approaches, integrating data-independent acquisition (DIA) proteomics, immunopeptidomics, and CRISPR-based functional screening. These methods, applied across three distinct studies corresponding to the core chapters of this thesis, provide novel insights into the biological roles of non-canonical proteins and establish robust tools for their characterization.
First in Chapter 2, we optimized a DIA-based proteomics workflow to enhance AltProt detection in mouse cardiac development. Recognizing the limitations of conventional data-dependent acquisition, we employed computationally predicted spectral libraries to improve sensitivity, achieving a twofold increase in AltProt identifications with reduced missing values. Applied to embryonic and adult mouse heart tissues, this approach identified over 50 differentially expressed AltProts enriched in pathways linked to cellular differentiation. Through targeted validation using Western blotting and parallel reaction monitoring, we confirmed ASDURF, an upstream open reading frame (uORF)-derived protein, as a regulator of cardiomyocyte maturation, highlighting the developmental importance of AltProts and establishing a reliable pipeline for their study in other biological contexts.
Next in Chapter 3, we addressed the challenges of identifying MHC-presented peptides from non-canonical sources by developing a Pseudo-DIA Library Search Strategy for immunopeptidomics. This method, combining unrestricted DIA searches with in silico predicted libraries, increased immunopeptide detection by up to 3.8-fold compared to standard approaches. In human cell lines, we identified cryptic peptides and neoantigens encoded by sORFs, assessing their immunogenic potential for personalized cancer immunotherapy. To facilitate broader adoption, we created a user-friendly graphical interface for generating spectral libraries from MSFragger outputs, validated across independent datasets. This work expands the scope of immunopeptidomics and supports the development of targeted cancer therapies.
Finally in Chapter 4, we conducted a large-scale analysis of over 36,000 microproteins across 86 public mass spectrometry datasets, revealing their predominant origins in long non-coding RNAs and frequent proximity to oncogenes. Conservation analyses across 100 species indicated diverse evolutionary pressures, with mammalian-specific patterns suggesting adaptive roles. Using a CRISPR knockout library in three cancer cell lines, we demonstrated that several microproteins independently regulate proliferation, distinct from their associated canonical proteins, as evidenced by comparisons with public DepMap data. These findings position microproteins as potential therapeutic targets in oncology.
Collectively, this thesis advances our understanding of the proteome's complexity by uncovering the functional roles of non-canonical proteins in development and disease. The methodologies developed provide practical tools for future studies, while the biological insights open new avenues for precision medicine and developmental biology research.
Rights: All rights reserved
Access: open access

Files in This Item:
File Description SizeFormat 
8611.pdfFor All Users12.51 MBAdobe PDFView/Open


Copyright Undertaking

As a bona fide Library user, I declare that:

  1. I will abide by the rules and legal ordinances governing copyright regarding the use of the Database.
  2. I will use the Database for the purpose of my research or private study only and not for circulation or further reproduction or any other purpose.
  3. I agree to indemnify and hold the University harmless from and against any loss, damage, cost, liability or expenses arising from copyright infringement or unauthorized usage.

By downloading any item(s) listed above, you acknowledge that you have read and understood the copyright undertaking as stated above, and agree to be bound by all of its terms.

Show full item record

Please use this identifier to cite or link to this item: https://theses.lib.polyu.edu.hk/handle/200/14157