Open Access Article
Sara Hamzelou
*a,
Damien Belobrajdicabc,
Henri Brooka,
Wei Chend,
Fletcher Hitena,
Kim Lee Change,
Diane Purcellf,
Utpal Bosed,
Simone Osborned,
Cassandra Peggd and
Michelle L. Colgrave*dg
aCSIRO Health & Biosecurity, Adelaide, SA 5000, Australia
bAlliance for Research in Exercise, Nutrition and Activity (ARENA), University of South Australia, Adelaide, SA 5001, Australia
cMedicine and Public Health, Health Flinders University, Adelaide, SA 5042, Australia
dCSIRO Agriculture and Food, St Lucia, QLD 4036, Australia
eCSIRO Environment, Hobart, Tasmania 7004, Australia
fCSIRO NCMI, Hobart, Tasmania 7004, Australia
gAustralian Research Council Centre of Excellence for Innovations in Peptide and Protein Science, School of Science, Edith Cowan University, Joondalup, WA 6027, Australia
First published on 8th May 2026
Microalgae are a sustainable protein source, yet few species are used for human nutrition. This study evaluated eight species for protein quality, digestibility, and peptide profiles after simulated gastrointestinal digestion using INFOGEST. Amino acid content, and intestinal-phase peptide mixtures were analysed by mass spectrometry. The methanol precipitation method was used to separate digested (supernatant) and undigested (pellet) peptides following in vitro INFOGEST digestion. The pellet was further separated by molecular weight into 3–10 kDa and >10 kDa fractions to isolate the lower molecular weight peptides from the higher ones. This separation method enabled the identification of potential digested peptides within the 3–10 kDa fraction that may still be present in the pellet. Cell-based and in silico analysis showed the 3–10 kDa fraction contained permeable peptides, highlighting 80% methanol's limitation in recovering all digestible peptides in the INFOGEST protocol. However, discrepancies between in silico predictions and in vitro permeability revealed that machine learning approaches alone are insufficient, underscoring the critical importance of experimental validation. The >10 kDa fraction predominantly contained structural proteins for photosynthesis, indicating digestion resistance. Pandorina sp. and Glaucocystis nostochinearum exhibited essential amino acid-rich profiles, especially their high Lysine content which is a limiting amino acid in most plant-derived proteins. These findings reveal digested microalgae's peptide composition and potential as functional food ingredients.
While numerous studies have focused on maximising protein extraction from microalgae to increase yield,1 less attention has been given to assessing the quality and nutritional value of these proteins. Protein quality is primarily determined by the profile of essential amino acids (EAA) and their ileal digestibility, factors that directly influence the nutritional value of microalgae. Advances in mass spectrometry, particularly in the emerging field of digestomics, now enable the detailed identification and quantification of peptide digests and amino acids generated during protein digestion.4
Nutrient absorption—including that of proteins, carbohydrates, lipids, and minerals—occurs primarily in the small intestine, which plays a central role in human digestion.5 While the most accurate assessment of protein digestibility is obtained through ileal measurements in humans, this approach is often limited by practical and ethical constraints. Animal models, like ileum-fistulated pigs and rats, are frequently used but are expensive and present ethical considerations. As an alternative, in vitro digestion protocols provide a reproducible, validated, and non-invasive method for simulating human digestive processes, which is particularly valuable when evaluating novel food sources.6,7 This aligns with FAO/WHO recommendations to develop in vitro methods that accurately predict protein digestibility and bioavailability in humans.8 The static in vitro digestion model, developed by the European Cooperation in Science and Technology (COST) Action INFOGEST in 2014, closely mimics human physiology and is now widely accepted as a method for studying protein digestibility in food.6,9
The current research on microalgae proteins emphasizes the characterisation of permeable peptides and the quantification of free amino acids, aiming to identify health-promoting bioactive peptides and EAAs beneficial for human nutrition. However, profiling the peptides that remain undigested after simulated digestion is also valuable, as it can reveal structural protein domains that are resistant to gastrointestinal digestion. Thus, analysis of bioactive peptides after the small intestinal digestion process is important, as it provides an indication of those peptides that have the potential to be absorbed in the small intestine or pass into the large intestine, where they can influence the microbiome.10,11
In the present study, we investigated the protein digestibility and peptide profiles—digested and undigested—of eight different species of microalgae. These include green species (Chlamydomonas sp., Desmodesmus arthrodesmiformis, Pandorina sp., Nannochloropsis australis to blue-green consisting of Snowella sp., Glaucocystis Nostochinearum, Arthrospira platensis) and a red species (Rhodomonas salina).
While some microalgae are already approved for human consumption, the incorporation of new species or derived products (e.g., protein isolates) into food systems may require comprehensive regulatory evaluation, such as under the European Union “Novel Food” framework.12 Among the species investigated in this study, Arthrospira platensis is well established for human consumption and is classified as Generally Recognised As Safe (GRAS), having been approved by the United States Food and Drug Administration (FDA) as a dietary supplement since 2003.12,13 Nannochloropsis sp. has attracted increasing attention due to their high nutritional value, particularly as sources of essential fatty acids and bioactive peptides.14 Similarly, Rhodomonas salina is widely utilised in aquaculture as a high-value live feed due to its favourable lipid composition and phycoerythrin content,15 and is emerging as a promising candidate for human food applications, particularly for its nutritional and colour-enhancing properties. Pandorina sp., Glaucocystis nostochinearum, Desmodesmus arthrodesmiformis, Chlamydomonas sp., and Snowella sp. are not currently established as food sources However, these species were selected for this study due to their biochemical diversity and potential as novel sources of functional compounds. All strains were obtained from a controlled culture collection, ensuring consistency and suitability for exploratory biochemical and digestion studies.
In vitro digestion assays were conducted on freeze-dried, disrupted algal cells, followed by mass spectrometric identification of the peptides generated during intestinal digestion. To our knowledge, this is the first study to comprehensively map the peptidome (digestome) of digested microalgal samples.
:
12 h light/dark cycle. The incubation temperatures were set at 15 °C, 20 °C, or 25 °C (Table 1), depending on each strain's natural environment.
| Row | Taxonomic class | ANACC identifier | Microalgae species | Growth temperature | Source | Culture media |
|---|---|---|---|---|---|---|
| a Modified liquid medium.16b Guillard's f/2 marine microalgal growth medium.17c GSe medium, a variation of G with the addition of 1 × 10−8 mol L−1 selenium.18 | ||||||
| 1 | Cyanophyceae | CS-697 | Arthrospira platensis | 25 | Taiwan | 1/2 MLAa–1/2 F/2b |
| 2 | Chlorophyceae | CS-977 | Chlamydomonas sp. | 20 | Australia (Tasmania) | MLA freshwater |
| 3 | Chlorophyceae | CS-950 | Desmodesmus arthrodesmiformis | 20 | Australia (Victoria) | MLA freshwater |
| 4 | Glaucophyceae | CS-777 | Glaucocystis Nostochinearum | 20 | Australia (Western Australia) | MLA ground pool water |
| 5 | Eustigmatophyceae | CS-416 | Nannochloropsis australis | 20 | Australia (Tasmania) | F2 Marine |
| 6 | Chlorophyceae | CS-946 | Pandorina sp. | 15 | Australia (Victoria) | MLA freshwater |
| 7 | Cryptophyceae | CS-174 | Rhodomonas salina | 20 | USA (Connecticut) | GSe marinec |
| 8 | Cryptophyceae | CS-1224 | Snowella sp. | 15 | Australia (Tasmania) | MLA freshwater |
Biomass was harvested by centrifugation at 3400g for 20 min at 20 °C. The resulting supernatant was discarded, and the pellet was resuspended and washed with sterile 3.15% (w/v) ammonium formate solution and freeze-dried for analysis.
Portions equivalent to 40 mg of protein were subjected to the in vitro gastrointestinal INFOGEST method.20 The enzyme activities and bile concentrations were determined according to the standardized protocol described by Minekus et al., 2014 in vitro.20
In brief, the samples were dissolved in 1 mL of water, and mixed with 1 mL of simulated salivary fluid (pH 7, 37 °C, 300 U mL−1 amylase) for 2 min, then combined with 2 mL of simulated gastric fluid (pH 3, containing 2000 U mL−1 pepsin) and incubated for 120 min at 37 °C. Afterwards, 4 mL of simulated intestinal juice (pH 7) containing pancreatin (100 U trypsin activity per mL) and bile (10 mmol L−1) were added and incubated for 120 min at 37 °C with constant gentle mixing.
Sample separation into digested and undigested fractions was performed using the method described by7 with minor modifications.
Peptide samples were separated into digested and undigested fractions by precipitation with methanol to a final concentration of 80% (v/v) at −20 °C for 1 h, followed by centrifugation at 2000g for 10 min at 4 °C. The supernatants were collected, and the pellets were washed using 80% (v/v) methanol, with repeated centrifugation between washes. Both pellets and supernatant (considered as undigestible and digested fraction respectively) were dried and stored at −20 °C for further analysis.
Supernatants were resuspended in 500 µL of 25% (v/v) acetonitrile containing 1% (v/v) formic acid, filtered (0.2 µm nylon) and transferred to vials for LC-MS/MS analysis.
Pellet fractions were resuspended in 8 M urea, 100 mM Tris-HCl (pH 8.8) and processed to obtain two size fractions: 3–10 kDa and >10 kDa. To obtain the 3–10 kDa fraction, first urea was removed from the samples using Amicon® Ultra Centrifugal Filter, 3 kDa MWCO (Merck Millipore, Bayswater, Vic, Australia) by inverting the filter units and spinning down samples into recovery tube at 1000g for 2 min. The concentrated samples were then loaded on Amicon® Ultra Centrifugal Filter, 10 kDa MWCO (Merck Millipore, Bayswater, Vic, Australia) and the flow through were collected. Peptide samples then desalted using Pierce™ peptide desalting spin columns (Pierce Biotechnology, Rockford, IL, USA). The >10 kDa fractions were obtained by using filter-aided sample preparation (FASP)14 by transferring the resuspended pellet in urea to 10 kDa MWCO filters (Merck Millipore, Bayswater, Vic, Australia). All peptide concentrations were determined by micro-BCA assay (Pierce Biotechnology, Rockford, IL, USA) according to the manufacturer's instructions and expressed as mg mL−1. Measurements were performed in triplicate for each sample across three independent biological replicates.
The 3–10 kDa and >10 kDa fractions (0.25 μg peptide) were analysed using a ZenoTOF 7600 mass spectrometer (SCIEX Redwood City, CA, USA) coupled to an ACQUITY UPLC M-Class system (Waters, Massachusetts, USA). Peptides were separated on a C18 column (300 µm × 150 mm, 1.8 µm, 100 Å, Waters) at 40 °C at a flow rate of 5 μL min−1. Peptides were eluted using a 30 min gradient with the same buffers described above, starting with 3% to 35% buffer B for 20 min followed 35–80% buffer B for 2 min and then holding for 1 min at 80% buffer B. The column was then equilibrated with 3% buffer B for 5 min. The source parameters were set at follows: spray voltage at 5000 V; curtain gas at 35; collisional activated dissociation (CAD) gas at 7; and ion source gases 1 and 2 at 20 psi and 15 psi, respectively. TOF MS scans were acquired across a mass range of m/z 350–1750; with accumulation time of 0.2 seconds, a decluttering potential of 80 V, and a collision energy of 10 V. The top 20 precursors were selected for MS2 with charge states between 1–4 and excluding precursors for 5 s after two occurrences. Parameters included a mass range of m/z 100–1500; dynamic collision-induced dissociation (CID), Zeno pulsing enabled, an accumulation time of 0.025 seconds, and summation of 8-time bins.
532 protein sequences) was compiled from UniProt KB and NCBI entries from all microalgal species (September 2024). The database included Arthrospira platensis (13
102 protein sequences), Chlamydomonas sp. (116
001 protein sequences), Snowella sp. (4285 protein sequences), Nannochloropsis sp. (25
580 protein sequences), Glaucocystis sp., (420 protein sequences), and Chlorophyceae (331
001 after removing the sequence duplicates) downloaded from UniProt KB. The database also included Rhodomonas sp. (79
984 protein sequences) and translated Desmodesmus reference genome (210
998 protein sequences) downloaded from NCBI. The later was created from the contigs derived from the Desmodesmus reference genome assembly ASM744998v2 to predict open reading frames (ORFs) with a minimum cut-off of 100 bp using CLC Main Workbench version 24.0.2.
Mass spectrometry data processing and peptide identification for all three fractions were performed via Peaks Studio version 11.0 software (Bioinformatics Solutions, Waterloo, ON, Canada) using our custom database for >10 kDa fraction. Searches were performed with nonspecific (no-enzyme) digestion, a precursor mass tolerance of 20 ppm, and a fragment ion tolerance of 0.2 Da. Cysteine carbamidomethylation was set as fixed modification for >10 kDa fraction and methionine oxidation and deamidation were chosen as variable modifications. Only de novo sequenced peptides with an average local confidence (ALC) score of ≥80% were retained. De novo sequence tags were used by PEAKS DB to be searched against our database. False discovery rate (FDR) analysis was set to 1% for peptide-spectrum matches (PSMs).
:
1) grown in a 96-well black plate for 7 days without changing growth media in three replicates. On day 7, growth media was removed and replaced with Hanks’ Balanced Salt Solution (HBSS) for 2 h at 37 °C. After removing HBSS, cells were incubated with 100 μL of peptides which were synthesised by Mimotopes Pty. Ltd diluted in HBSS for 2 h. The Caco-2: HT-29 cells were then washed once in HBSS, and cell viability was measured using 100 μL of 0.2% CyQUANT in HBSS. Cells in HBSS were used as control of viable cells. The plates were incubated for 1 h at 37 °C, and fluorescent was measured at 485/530 nm (excitation/emission).
:
1) cells were seeded at 4 × 104 cells per well in a growth medium on transwell inserts with polycarbonate membranes (0.4 μm pore size, surface area 0.33 cm2) (Corning, NY, USA). Cells were cultivated for 21 days with growth medium changed twice per week.
After cells were washed with HBSS, monolayer integrity was monitored by transepithelial electrical resistance measurement (TEER) using a Millicell ERS-2 Voltohmmeter (EMD Millipore, Billerica, MA). The growth media was removed and replaced with HBSS for both apical and basolateral chambers for 2 h at 37 °C. HBSS was removed from apical chamber and replaced with peptides (1000 μM in HBSS) and incubated for 2 h at 37 °C. Samples from apical and basolateral chambers were collected and stored at −80 until the further analysis using LC-MS/MS. TEERs were measured at 2, 24, and 48 h post-assay.
Multiple reaction monitoring (MRM) targeted peptides (e.g., FFGLV, WAMLG, GSGPLP, WTNGPL, LVGAEV, and hydrolysed forms MLG, NGPL). Transitions were analysed in Skyline (version 24.1.0.414), with manual curation.21 Quantified peak areas (mean of three replicates) were log2-transformed prior to statistical analysis (t-test, p < 0.05).
To further identify novel potential bioactive peptides, several additional online platforms employing machine learning-based approaches were used. The PeptideRanker tool (https://bioware.ucd.ie/~compass/biowareweb/Server_pages/help/peptideranker/help.php) was applied to prioritise peptides with a high probability of bioactivity, retaining those with scores above 0.8 (where 0.0 indicates a low probability and 1.0 indicates a high probability of bioactivity).23 Other complementary machine learning-based tools were also used to provide more precise functional annotation and assess different types of therapeutic potential, including CSM-peptides (https://biosig.lab.uq.edu.au/csm_peptides/)24 and PepNet (https://liulab.top/PepNet/server).25 Using PepNet, which employs deep learning techniques, anti-inflammatory peptides (AIPs) were further predicted and confirmed.
To improve specificity, dipeptides and tripeptides were excluded, and only peptides with a minimum length of four amino acids were considered for further analysis. The selected peptides were subsequently evaluated for sequence diversity and synthetic feasibility, leading to the identification of five peptides (FFGLV, WAMLG, GSGPLP, WTNGPL, and LPVP) and a negative control (LVGAEV) for experimental validation.
| Row | ANACC identifier | Microalgae species | EAA | Non-EAA | Total EAA | Total | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| His | Ile | Leu | Lys | Met | Phe | Thr | Val | Ala | Arg | Asx | Glx | Gly | Pro | Ser | Tyr | |||||
| Values are presented in g per 100 g dried material. The results for Asp and Glu include contribution of Asn and Gln respectively, as they converted during hydrolysis. Glx represents both Gln and Glu and Asx represents Asp as well as Asn. Trp and Cys were not measured in this study. | ||||||||||||||||||||
| 1 | CS946 | Pandorina sp. | 2.2 | 1.2 | 2.4 | 4.4 | 0.6 | 2.7 | 2.5 | 1.6 | 2.7 | 6.6 | 4.7 | 4.6 | 2.2 | 1.5 | 2.3 | 1.4 | 17.6 | 43.6 |
| 2 | CS777 | Glaucocystis Nostochinearum | 1.7 | 0.8 | 1.8 | 3.3 | 0.4 | 1.7 | 1.9 | 1.2 | 2.7 | 6.7 | 3.6 | 3.8 | 1.8 | 1.3 | 1.6 | 1.0 | 12.7 | 35.0 |
| 3 | CS950 | Desmodesmus arthrodesmiformis | 1.7 | 1.0 | 1.8 | 3.2 | 0.5 | 1.9 | 2.0 | 1.4 | 2.3 | 3.9 | 3.6 | 3.8 | 1.9 | 1.2 | 1.9 | 1.4 | 13.5 | 33.4 |
| 4 | CS416 | Nannochloropsis australis | 1.4 | 0.8 | 1.7 | 2.7 | 0.5 | 1.7 | 1.9 | 1.1 | 1.9 | 3.1 | 3.0 | 3.7 | 1.6 | 2.1 | 1.7 | 1.2 | 11.8 | 30.1 |
| 5 | CS174 | Rhodomonas salina | 1.2 | 0.7 | 1.3 | 2.3 | 0.6 | 1.5 | 1.6 | 1.0 | 1.7 | 2.7 | 2.7 | 2.8 | 1.3 | 0.8 | 1.5 | 1.7 | 10.2 | 25.4 |
| 6 | CS697 | Arthrospira platensis | 1.1 | 0.9 | 1.3 | 1.9 | 0.4 | 1.3 | 1.5 | 1.1 | 1.5 | 2.6 | 2.5 | 3.0 | 1.2 | 0.7 | 1.5 | 1.2 | 9.5 | 23.7 |
| 7 | CS977 | Chlamydomonas sp. | 1.2 | 0.8 | 1.3 | 2.0 | 0.5 | 1.4 | 1.5 | 1.0 | 1.6 | 2.5 | 2.3 | 2.5 | 1.2 | 1.0 | 1.4 | 1.3 | 9.6 | 23.5 |
| 8 | CS1224 | Snowella sp. | 1.0 | 0.7 | 1.1 | 1.7 | 0.4 | 1.1 | 1.3 | 0.8 | 1.3 | 2.0 | 1.9 | 2.2 | 1.0 | 0.7 | 1.3 | 0.6 | 8.0 | 19.2 |
The supernatant fraction contained relatively few unique peptides across the eight microalgae species studied. The highest peptide count was observed in Rhodomonas salina (27 peptides), followed by Pandorina sp. (19 peptides, Fig. 1a). Peptide masses in the supernatant primarily ranged between 246.1 and 649.3 (m/z) (SI Table S1) and were predominantly singly charged (Fig. 1c). Peptide lengths spanned 2 to 7 amino acids; notably, only one peptide with 7 amino acids was identified in Arthrospira platensis (Fig. 1b).
As anticipated, a larger diversity and number of peptides were detected in the pellet fractions. Generally, the 3–10 kDa fraction contained more unique peptides than the >10 kDa fraction for most species. However, Arthrospira platensis and Desmodesmus arthrodesmiformis showed the reverse pattern, with more peptides detected in the >10 kDa fraction (Fig. 1a). Peptide mass distributions ranged from 350.6–1206.1 Da in the 3–10 kDa fraction and from 404.2–2837.3 in the >10 kDa fraction (SI Table S1). Most peptides in the >10 kDa fraction (93.1%) carried a +2 charge, while the 3–10 kDa fraction contained primarily singly charged peptides, with only 24.5% doubly charged (SI Table S1 and Fig. 1d).
Peptide sequence lengths in the 3–10 kDa fraction ranged from 4 to 23 amino acids, with a modal length of 5 amino acids. In the >10 kDa fraction, sequences spanned 4 to 26 amino acids, with a predominant length of 8 amino acids (Fig. 1c).
A total of six peptides (17%) were common to both the supernatant and the 3–10 kDa fraction, but none were found in both the supernatant and >10 kDa fraction. Additionally, 95 peptides were shared between the 3–10 kDa (8.3% of total) and >10 kDa (14%; Fig. 1b). Peptides and proteins in >10 kDa fraction were further identified via database search against a custom in-house microalgae protein database.
The instability index, which reflects the predicted peptide stability, also differed significantly between the three fractions (p < 0.05). Only one-third of the peptides in the supernatant fraction exhibited an instability index below 40, the threshold for stability,27 compared to 78% in the 3–10 kDa fraction and 76% in the >10 kDa fraction. This indicates that most peptides in the supernatant are less stable than those in the pellet fractions.
Most peptides in the supernatant fraction (62%) had an isoelectric point of 6.1, while a broader distribution of isoelectric points was seen in both pellet fractions (Fig. 1e). These analyses guided the selection of candidate peptides for further investigation of bioactivity and bioavailability.
Among all identified peptides, only three with previously documented bioactivity were identified from the digesta of supernatant and 3–10 kDa. Due to the use of de novo sequencing, leucine (L) in these sequences may represent leucine or its isomer, isoleucine (I).28 The peptide LPVP (a known dipeptidyl peptidase IV inhibitor, DPP IV inhibitor) was present in the supernatant of all eight microalgal species. In the 3–10 kDa fraction, LPPV (a PEP inhibitor) and LLAP (another DPP IV inhibitor) were found (BIOPEP-UWM database). LPPV was present in all species other than Arthrospira platensis, while LLAP was present in five of the species: Arthrospira platensis, Chlamydomonas sp., Glaucocystis Nostochinearum, Pandorina sp. and Rhodomonas salina. Overall, these findings indicate that bioactive peptides were broadly conserved across species, with minor differences in their distribution.
Following the identification of potential bioactive peptides, a stepwise filtering pipeline was employed to prioritise peptide candidates for synthesis. Initial screening retained peptides with a PeptideRanker bioactivity scores above 80%. The refined set was further evaluated based on physicochemical parameters, with emphasis on stability and functionality under small intestinal conditions. Those peptides were retained with high stability (instability index < 40), moderate hydrophobicity, and an isoelectric point between 6.0 and 6.4, approximating the pH range of the small intestine.29 Finally, peptides were manually assessed for sequence diversity and synthetic feasibility, resulting in the selection of five peptides along with a negative control for experimental validation. Of eight peptides meeting these criteria, four (FFGLV, WAMLG, GSGPLP, and WTNGPL) as well as LPVP and LVGAEV were synthesised for experimental validation. Notably, all the bioactive peptide candidates were present in the 3–10 kDa fraction in Snowella sp. Interestingly, LVGAEV was predicted to have a low probability of bioactivity by PeptideRanker (5%) and was not classified as anti-inflammatory by CSM-peptides or PepNet yet showed binding affinity to the active site of human inducible nitric oxide synthase (1NSI) in molecular docking studies.
The interactions between Val6 of the LVGAEV peptide with Ala282 and Arg266, both of which are located near the substrate access channel, suggesting potential influence on enzymatic activity. Moreover, the interaction between Ala4 with H4B600, a residue adjacent to the cofactor binding site, potentially contributes to conformational stabilization (SI Fig. S1). Hydrogen bonding was also observed between Glu5 with Thr121 and Tyr492, residues located near the heme group. Two hydrogen bonds of Gly3 with Arg3888, with interaction distances of 3.2 Å and 2.6 Å, reinforcing the binding of the peptide within the active pocket. Interestingly, the only peptide that showed anti-inflammatory properties was LVGAEV based on in vitro cell assay (Fig. 2a).
The potential allergenicity of proteins in the undigested fraction was assessed using AlgPred 2.0, a comprehensive web-based tool that integrates machine learning with motif, BLAST, and IgE epitope mapping approaches for allergen prediction.30 Two allergenic peptide motifs—NNACEAFLCNCDR and DNVEGFNCER—were detected, with machine learning (ML) scores of 0.4 and 0.35, respectively. The motif-emerging and with classes-identification (MERCI) score for both motifs were 0.5, indicating their presence matches experimentally validated IgE epitopes. These findings suggest that digestion-resistant proteins in microalgae may possess peptide sequences with allergenic potential, warranting further validation in allergenicity risk assessments.
Downstream processing techniques prior to digestibility analysis or use of crude protein rather than whole biomass, have been shown to significantly enhance protein digestibility.33,34 Pre-treatment methods, such as cell wall disruption by ball-milling, were employed in this study to maximise protein accessibility in whole algal biomass prior to in vitro digestion, an approach that aimed to reflect consumption of whole biomass in vivo.
In protein digestibility studies, 80% (v/v) methanol precipitation is commonly used to distinguish undigested proteins and high molecular weight peptides (pellet) from low molecular weight peptides and free amino acids (supernatant) (Santos-Hernández et al., 2018; Sousa et al., 2023).7,35 However, our findings reveal that methanol precipitation is not fully effective to separate digested from undigested fractions. Size fractionation demonstrated that digested peptides remained in the pellet, likely as a result of their hydrophobicity or low solubility which causes them to aggregate and precipitate.36 This indicates that using 80% (v/v) methanol precipitation may underestimate the pool of potentially digestible and permeable peptides. Sampling at the end of the in vitro digestibility assay (intestinal phase) allowed us to capture peptides after multiple and combined enzymatic degradation, thus explaining the predominance of smaller peptides (mostly di- to pentapeptide) and free amino acids.37 To enhance detection, we optimised liquid chromatography-mass spectrometry (LC-MS) method with a pentafluorophenyl (PFP) column for better retention and separation of short peptides, instead of the C18 column which is commonly used in proteomics studies. De novo interpretation of MS/MS spectra was used to identify the peptides as most were too short for confident identification through database searching and spectral matching algorithms.37
Characterisation of the undigested fraction (>10 kDa) revealed a set of proteins and peptides that were resistant to digestion. Several proteins identified in the >10 kDa pellet fraction were associated with photosynthetic processes (GO:0015979), indicating that structural or membrane-associated proteins involved in photosynthesis may be more resistant to enzymatic digestion. The identification of proteins such as ribulose bisphosphate carboxylase and chlorophyll a–b binding protein across multiple species suggests a conserved resistance among photosynthetic machinery to gastrointestinal digestion. Resistance of photosynthetic protein complexes such as chlorophyll a–b binding proteins, and photosystem I (PSI) and II (PSII) in spinach (Spinacia oleracea) leaves to digestive enzymes even after extended exposure to pancreatic juice has been demonstrated previously.38
Interestingly, literature suggests that delayed gastrointestinal digestion of thylakoid-rich plants—such as those containing photosynthetic protein complexes—can promote satiety and contribute to appetite regulation, potentially aiding in obesity prevention.35,38,39 While plant thylakoids contain lipids, pigments, and proteins, the resistance of specific proteins (e.g., chlorophyll a–b binding protein, PSI, PSII) to digestion may contribute to satiety, suggesting that similar proteins in microalgae may act in a similar way.
On the other hand, proteins or peptides that resist gastrointestinal digestion are more likely to reach the gut immune system intact and potentially elicit allergic responses.40 Our in silico analysis of peptides from >10 kDa fraction identified two putative allergenic motifs of NNACEAFLCNCDR and DNVEGFNCER in the undigested fraction. While these motifs overlapped with experimentally validated IgE epitopes, suggesting potential allergenicity, this will require immunological and clinical validation. For bioactive peptide screening, this study integrated several approaches: physicochemical property analysis, in silico bioactivity prediction, molecular docking, and experimental validation.41,42 Initial filtering of excluded peptide sequences predicted to be unstable or poorly soluble under physiological conditions, focusing on those with greater functional potential. Our findings revealed that machine learning-based prediction tools alone are insufficient to reliably identify putative bioactive peptides. The results obtained from different prediction tools may vary; for example, the peptide EALPE which was assigned a low bioactivity probability (0.1%) by PeptideRanker—a widely used software tool41—was predicted by PepNet to be a putative antimicrobial with a probability of 94.4%. This inconsistency highlights a major limitation of relying on a single predictive model and underscores the importance of using multiple, complementary algorithms when screening for functional peptides.
The limitations of in silico prediction were further illustrated by LVGAEV, a peptide that was not predicted as bioactive peptide based on PeptideRanker but demonstrated significant in vitro anti-inflammatory activity, as shown by the reduction of nitric oxide (NO) production in LPS-stimulated RAW264.7 cells. Using molecular docking showed LVGAEV interacts with the oxygenase domain of human iNOS, suggesting this peptide's anti-inflammatory effect is mechanistically plausible. This example demonstrates how structure-based computational tools can complement machine learning by elucidating target interactions that explain observed bioactivity.
In contrast, peptides predicted to be anti-inflammatory (FFGLV and WAMLG) did not reduce NO production in the in vitro model. These discrepancies highlight the gap between computational predictions and actual biological activity, reinforcing the necessity of experimental confirmation.
The results from the transwell permeability assays further illustrate the complexities of peptide bioactivity. Peptide hydrolysis, poor solubility, or aggregation within the cell culture medium could diminish the effective concentration at intracellular targets, limiting measurable bioactivity despite favourable in silico scores.43 Therefore, the functional impact of food-derived peptides depends not only on their predicted properties and target interactions but also on their bioavailability and stability in biological contexts. This may highlight the importance of animal digestive models to more effectively evaluate the bioavailability, functionality and safety of potential bioactive peptides from novel foods such as microalgae.
Our findings on algal bioactive peptides demonstrate that a multi-pronged screening pipeline—incorporating physicochemical profiling, using diverse computational tools, structure-based docking, and experimental validation—enables a more robust characterisation and selection of peptides with nutritional and therapeutic potential.
To our knowledge, this is the first study to analyse both digested and undigested peptide fractions in microalgae enabling the identification of digestion-resistant peptides that might otherwise be overlooked. By examining the pellet fraction—typically considered the undigested fraction following in vitro digestion using INFOGEST protocols—we uncovered additional permeable and bioactive peptides, broadening the nutritional and functional landscape of microalgae-derived proteins. This work also represents the first in-depth peptide mapping across multiple microalgae species.
Our findings highlight the complexity of food-derived peptide bioactivity and support the need for a comprehensive strategy that combines multiple predictive models (rather than relying on one model), physicochemical screening, structural modelling, and in vitro biological validation to more reliably identify and characterise functional peptides. Notably, discrepancies between in silico predictions and in vitro outcomes—such as the lack of anti-inflammatory activity of predicted peptides FFGLV and WAMLG—demonstrate that computational approaches alone are insufficient and must be complemented by experimental validation. Future research should focus on in vivo validation of identified peptides and their physiological effects to further identify the potential of microalgae as a sustainable, functional food to support health, wellbeing and disease prevention.
| AAA | Aromatic amino acids |
| ANACC | Australian National Algae Culture Collection |
| COST | Cooperation in Science and Technology |
| EAA | Essential amino acids |
| FASP | Filter-aided sample preparation |
| FDR | False discovery rate |
| GO | Gene ontology |
| HBSS | Hanks’ balanced salt solution |
| iNOS | Inducible nitric oxide synthase |
| LC-MS | Liquid chromatography-mass spectrometry |
| MRM | Multiple reaction monitoring |
| NO | Nitric oxide |
| PSMs | Peptide-spectrum matches |
| SAA | Sulfur amino acid |
| TEER | Transepithelial electrical resistance measurement |
| This journal is © The Royal Society of Chemistry 2026 |