Skip to content

Free amino acids in African indigenous vegetables: Analysis with improved HILIC-UHPLC-QqQ-MS/MS and interactive machine learning

Notifications You must be signed in to change notification settings

yuanboFaith/AfricanVegetables_AminoAcids

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

59 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Free amino acids in African indigenous vegetables: Analysis with improved hydrophilic interaction ultra-high performance liquid chromatography tandem mass spectrometry and interactive machine learning

See original article published in J. Chroma A.

Abstract

A hydrophilic interaction (HILIC) ultra-high performance liquid chromatography (UHPLC) with triple quadrupole tandem mass spectrometry (MS/MS) method was developed and validated for the quantification of 21 free amino acids (AAs). Compared to published reports, our method renders collectively improved sensitivity with lower limit of quantification (LLOQ) at 0.5 ~ 42.19 ng/mL with 0.3 μL injection volume (or equivalently 0.15 ~ 12.6 pg injected on column), robust linear range from LLOQ up to 3521 ~ 5720 ng/mL (or 1056 ~ 1716 pg on column) and a high throughput with total time of 6 min per sample, as well as easier experimental setup, less maintenance and higher adaptation flexibility. Ammonium formate in the mobile phase, though commonly used in HILIC, was found unnecessary in our experimental setup, and its removal from mobile phase was key for significant improvement in sensitivity (4 ~ 74 times higher than with 5 mM ammonium formate). Addition of 10 (or up to100 mM) hydrochloric acid (HCl) in the sample diluent was crucial to keep response linearity for basic amino acids of histidine, lysine and arginine. Different HCl concentration (10 ~ 100 mM) in sample diluent also excreted an effect on detection sensitivity, and it is of importance to keep the final prepared sample and calibrators in the same HCl level. Leucine and isoleucine were distinguished using different transitions. Validated at seven concentration levels, accuracy was bound within 75 ~ 125%, matrix effect generally within 90~110%, and precision error mostly below 2.5%.

Using this newly developed method, the free amino acids were then quantified in a total of 544 African indigenous vegetables (AIVs) samples from African nightshades (AN), Ethiopian mustards (EM), amaranths (AM) and spider plants (SP), comprising a total of 8 identified species and 43 accessions, cultivated and harvested in USA, Kenya and Tanzania over several years, 2013 ~ 2018.

The AN, EM, AM and SP were distinguished based on free AAs profile using machine learning methods (ML) including principle component analysis, discriminant analysis, naïve Bayes, elastic net-regularized logistic regression, random forest and support vector machine, with prediction accuracy achieved at ca. 83~97% on the test set (train/test ratio at 7/3). An R Shiny based interactive ML platform was constructed for modeling train-test simulation and category prediction of unknown AIV sample(s). This new method presents a robust and rapid approach to quantifying free amino acids in plants for use in evaluating plants, biofortification, botanical authentication, safety, adulteration and with applications to nutrition, health and food product development.

R Script Reference

This repository records the R script developed for data analysis and visualization and machine learning in the original publication (see code and output). The R code has been developed with reference to R for Data Science (2e), and the official documentation of tidyverse, and DataBrewer.co. See breakdown of modules below:

  • Data visualization with ggplot2 (tutorial of the fundamentals; and data viz. gallery).

  • Data wrangling with the following packages: tidyr: transform (e.g., pivoting) the dataset into tidy structure; dplyr: the basic tools to work with data frames; stringr: work with strings; regular expression: search and match a string pattern; purrr: functional programming (e.g., iterating functions across elements of columns); and tibble: work with data frames in the modern tibble structure.

Follow me. Keep updated with my latest research

Medium Logo X Logo DataBrewer Logo Harvard Public Health Logo ORCID Logo LinkedIn Logo Academia Logo Google Scholar

About

Free amino acids in African indigenous vegetables: Analysis with improved HILIC-UHPLC-QqQ-MS/MS and interactive machine learning

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published