Welcome to LipidAnalyst!

LipidAnalyst: A Comprehensive Tool for Lipidomics Data Analysis

LipidAnalyst is an interactive web application designed for the analysis of lipidomics data. It provides a user-friendly interface for uploading, processing, normalizing, and analyzing lipidomics datasets.

Key Features:

  • Data Upload: Easily upload lipidomics data, metadata and internal standards in CSV/TSV/XLS/XLSX format.
  • Data Filtering: Apply filters to remove low-quality, low-abundance, and low-variance features.
  • Missing Value Imputation: Choose from various imputation methods to handle missing data.
  • Lipid Parsing: Automatically parse lipid names into structured components.
  • Normalization: Normalize data using internal standards or other user-defined methods.
  • Statistical Analysis: Perform T-tests, ANOVA, and correlation analyses with visualization options.
  • Interactive Visualizations: Explore your data with Differential Mean Lipid Heatmaps, PCA plots, boxplots, DSPC networks, and more.


For detailed instructions and help, please refer to the tutorial document .


Developed in Pennathur Lab, University of Michigan.
For support or inquiries, please contact: lipidanalyst-requests@umich.edu

Upload Lipidomics Data

Please select the correct data format before uploading the file. After adjusting any settings, make sure to reload your data so that the software uses the correct parameters.

Download Example Data

Lipidomics Data Preview

Loading...

Upload Metadata (Group information)

Download Example Data

Metadata Preview

Loading...

Upload Internal Standard

Please select the correct data format before uploading the file. After adjusting any settings, make sure to reload your data so that the software uses the correct parameters.

Please select the correct data format before uploading the file.

Download Example Data

Internal Standard Preview

Loading...

Data Filtering Overview

Data filtering is a crucial preprocessing step in lipidomics data analysis. It helps to eliminate noise and irrelevant features, ensuring that subsequent analyses focus on meaningful biological variations.

This section provides options to apply various filters to your lipidomics data. You can choose to enable or disable each filter and set specific thresholds according to your analysis needs.

Low Quality Filter

Low Quality Filter removes features with a high percentage of missing values across samples.

Low Abundance Filter

Low Abundance Filter removes features with a mean lower than a specified percentile across all feature means.

Low Variance Filter

Low Variance Filter removes features with low variability across samples, which would not be useful for distinguishing between different conditions or groups.

Filtered Data Preview



Download CSV
Loading...

Missing Value Imputation Guide

Missing values are common in lipidomics datasets and can arise from several sources, including:

  • Technical limitations of the mass spectrometer, such as detection thresholds or signal suppression,
  • Variability introduced during sample extraction, handling, or instrument runs,
  • True biological absence of a lipid in certain samples or conditions.

Understanding the pattern of missingness is essential for selecting an appropriate imputation strategy. Different types of missingness can reflect different underlying causes and may require tailored handling to avoid introducing bias into downstream analyses such as statistical testing, or network construction.

We categorize missingness into two major types:

  • Group-level missingness: A lipid is almost completely missing within one experimental group (e.g., all disease group samples have NA). This often suggests true biological absence or very low abundance. In these cases, methods such as Limit of Detection (LoD) imputation is generally more appropriate.
  • General missingness: Values are sporadically missing across samples but not confined to a single group. This pattern typically reflects technical noise or stochastic signal dropout. For this scenario, K-Nearest Neighbors (KNN) imputation(sample-wise) is recommended as it leverages similarity among samples to estimate reasonable values.

You can view the Missing value heatmap to identify these patterns and decide on the best imputation approach for your data.

Group-level Missingness Summary

Group Level Missing means features that are missing in a large proportion of samples within a specific group. This type of missingness can usually mean features are truly absent in the specific group, and we suggest to use Limit of Detection (LoD) 1/5 minimum value of the specific feature to impute the missingness here.

Imputation Settings

This panel allows you to handle general missing values in your lipidomics data. You can choose to skip imputation (if no missingness) or select from various imputation methods to fill in missing data points.



Imputed Data Preview

Download CSV
Loading...

Combine duplicated lipids or lipids with different adducts

In lipidomics data, it is common to encounter duplicated lipid entries that represent the lipid species with same shorthand name but differ in their adduct forms (e.g., [M+H]+, [M+Na]+, [M+K]+). Combining these duplicated lipids into a single representative entry can help streamline data analysis and interpretation. This process involves selecting criteria to identify and merge these duplicates based on their lipid names and adduct types.

Combine Duplicated lipids

If you have duplicated lipids with different adducts, you can combine them into one lipid by selecting the criteria below.

Combined Data Preview

Download CSV
Loading...

Lipid Parsing Controls

Chain annotation convention
The biological meaning of each chain field depends on lipid class:

  • Sphingolipids (e.g., SM, Cer, dhCer, LacCer, HexCer): Chain1 = sphingoid base (long-chain base, LCB), Chain2 = N-acyl fatty acid (amide-linked acyl chain), Chain3 = not applicable.
  • Phospholipids (e.g., PC, PE, PG, PI, PS,LPC, LPE, PE(O), PE(P), PA): Chain1 and Chain2 refer to fatty acyl chains at Sn1 and Sn2 carbons of the glycerol back bone, respectively. Chain3 = not applicable.
  • Glycerolipids (e.g., MAG, TAG, DAG): Chain1 to Chain3 refer to fatty acyl chains at Sn1 to Sn3 carbons of the glycerol back bone, respectively.For MAG, chain2 and chain3 are not applicable.
  • Cholesteryl esters (CE): Chain1 = esterified fatty acid chain, Chain2 and Chain3 = not applicable.
Note: Chain fields are stored uniformly for all lipid classes; interpretation is lipid-class–dependent.

The parsing table is editable. Double-click a cell to make changes. If any lipid information is incorrect, you may revise it directly in the table.

Use the search box in each column to filter the rows that you want to view or edit.


If you want to edit multiple points in the parse table, we suggest download the parse table as a CSV file, make changes in Excel, and upload the updated parse table using the file input below.

Parsed Lipid Names

Download CSV
Loading...

Lipid Class Pie Plot

Loading...

Lipid Class Boxplot

Loading...

PCA Plot

Loading...

Sample Boxplot

Loading...

Barplot of lipid species

Loading...

Barplot of group distribution

Loading...

Internal Standard Selection

Normalization by internal standards is a widely used technique in lipidomics data analysis to correct for technical variability and improve the accuracy of quantification. Internal standards are compounds that are chemically similar to the target analytes but are not naturally present in the sample. By adding known quantities of internal standards to each sample prior to analysis, researchers can account for variations in sample preparation, instrument performance, and other factors that may affect the measured signal intensity.

Updated Parsed Table with Internal Standard

Download CSV
Loading...

Normalization Messages


                    

Normalized Data Preview

Download CSV
Loading...

Normalization by a user-defined constant value

Normalization by a constant value involves adjusting the lipid abundances by dividing each data entry by a user-defined constant.
This method is useful when there is a known factor that affects all samples equally, such as dilution factor, weight, or volume.
By applying this normalization, researchers can standardize the lipid measurements across samples, facilitating more accurate comparisons and analyses.

Metadata Supplement

Normalization by metadata involves adjusting lipid abundances based on relevant metadata information.
This method helps to account for variations in sample characteristics, experimental conditions, or other factors that may influence lipid measurements, such as cell counts and protein concentration.
By normalizing the lipid data using appropriate metadata, researchers can improve the accuracy and reliability of their analyses, leading to more meaningful biological interpretations.

Normalized Data Preview

Download CSV
Loading...

Normalization

Quantile normalization is a technique that makes the distributions of values across multiple samples identical by aligning their quantiles, helping to reduce technical variation and enable fair comparisons.

Data Transformation

Data Scaling

Normalized Data Preview

Download CSV
Loading...

Boxplot Settings

We don't support boxplot if your expression data has more than 80 lipids.

Pixel size settings for downloading Boxplot

Boxplot Output

Plot Settings

Pixel size settings for downloading plots

Plot Settings

Pixel size settings for downloading plots

Differential Mean Lipid Heatmap Parameter

Pixel size settings for downloading Heatmap

Font and Color Settings

Download the Plots for the Selected Lipid Class Download Plots of All Lipid Classes Following same requirements as ZIP

Differential Mean Lipid Heatmap Output

Volcano Plot Settings

Fold change was defined as the difference in mean lipid abundance between groups rather than a ratio. This is because some normalized and transformed values included negative numbers, making ratio-based fold change inappropriate.

Pixel size settings for downloading plots

Volcano Plot Output

Volcano Plot Data

PCA Plot Settings

Pixel size settings for downloading PCA

PCA Plot Output

Heatmap Settings

Heatmap Output

Lipidomics Mean Calculator

Mean Calculated Data Preview

Download CSV
Loading...

T-test Settings

T-test Results

Download CSV
Loading...

T-test PCA Plot

Loading...
Download T-test PCA Plot (.png)

ANOVA Settings

ANOVA Results

Download CSV
Loading...

ANOVA PCA Plot

Loading...
Download ANOVA PCA Plot (.png)

Correlation

DSPC Network Settings

DSPC (Debiased Sparse Partial Correlation) is a statistical framework designed to infer sparse and interpretable molecular networks from high-dimensional omics data such as lipidomics. It estimates partial correlations while correcting for the bias introduced by regularization (e.g., graphical lasso), allowing more reliable detection of direct molecular relationships.

Download DSPC Data (.csv)
Loading...

DSPC Network Output

Loading...

PLS-DA Model Overview

Partial Least Squares Discriminant Analysis (PLS-DA) is a supervised multivariate statistical method used to model the relationship between lipidomic data and predefined categorical groups.
PLS-DA identifies latent variables that maximize the covariance between the lipid features and the group labels, enabling effective classification and discrimination of samples based on their lipid profiles.
This technique is particularly useful for biomarker discovery, as it highlights the most relevant features contributing to group separation while handling complex, high-dimensional datasets typical in lipidomics studies.

PLS-DA Model Settings

PLS-DA Model Output

Loading...

Loading...
Download VIP Plot (.png)

OPLS-DA Model Overview

Orthogonal Partial Least Squares Discriminant Analysis (OPLS-DA) is a supervised multivariate statistical method used to identify differences between predefined groups in complex datasets.

Unlike PLS-DA, OPLS-DA explicitly separates the variation in the data into:

  • Predictive components: directly associated with the group labels.
  • Orthogonal components: capture systematic variation unrelated to group differences (e.g., technical variation or biological heterogeneity).

OPLS-DA Model Settings

OPLS-DA Model Output

Loading...


Loading...
Download VIP Plot (.png)

Random Forest Model Settings

Random Forest Model Output

Loading...
Download Importance Plot (.html)

Download All Generated Files

Thank you for using our lipidomics data analysis tool! If you have any feedback or encounter any issues, please don't hesitate to contact us.

For support, please contact: lipidanalyst-requests@umich.edu

Click the button below to download all output files as a zip archive.

Download All Data (ZIP) Download All Plots (ZIP)