<i class="fa-solid fa-magnifying-glass" style="margin-right: 5px;"></i> LipidAnalyst

Welcome to LipidAnalyst!

LipidAnalyst: A Comprehensive Tool for Lipidomics Data Analysis

LipidAnalyst is an interactive web application designed for the analysis of lipidomics data. It provides a user-friendly interface for uploading, processing, normalizing, and analyzing lipidomics datasets.

Key Features:

Data Upload: Easily upload lipidomics data, metadata and internal standards in CSV/TSV/XLS/XLSX format.
Data Filtering: Apply filters to remove low-quality, low-abundance, and low-variance features.
Missing Value Imputation: Choose from various imputation methods to handle missing data.
Lipid Parsing: Automatically parse lipid names into structured components.
Normalization: Normalize data using internal standards or other user-defined methods.
Statistical Analysis: Perform T-tests, ANOVA, and correlation analyses with visualization options.
Interactive Visualizations: Explore your data with Differential Mean Lipid Heatmaps, PCA plots, boxplots, DSPC networks, and more.

For detailed instructions and help, please refer to the tutorial document .

Developed in Pennathur Lab, University of Michigan.
For support or inquiries, please contact: lipidanalyst-requests@umich.edu

Upload Lipidomics Data

Please select the correct data format before uploading the file. After adjusting any settings, make sure to reload your data so that the software uses the correct parameters.

Data format:

Lipidomics Data File (CSV/TSV/XLS/XLSX)

Browse...

Download Example Data

Lipidomics Data Preview

Upload Metadata (Group information)

Metadata File (CSV/TSV/XLS/XLSX)

Browse...

Download Example Data

Select Grouping Variable:

Metadata Preview

Upload Internal Standard

Please select the correct data format before uploading the file. After adjusting any settings, make sure to reload your data so that the software uses the correct parameters.

Skip Internal Standard File

Please select the correct data format before uploading the file.

Data format:

Internal Standard File (CSV/TSV/XLS/XLSX)

Browse...

Download Example Data

Internal Standard Preview

Data Filtering Overview

Data filtering is a crucial preprocessing step in lipidomics data analysis. It helps to eliminate noise and irrelevant features, ensuring that subsequent analyses focus on meaningful biological variations.

This section provides options to apply various filters to your lipidomics data. You can choose to enable or disable each filter and set specific thresholds according to your analysis needs.

Low Quality Filter

Low Quality Filter removes features with a high percentage of missing values across samples.

Enable Low Quality Filter

Filter bottom features with more than x% missing value(s):

Low Abundance Filter

Low Abundance Filter removes features with a mean lower than a specified percentile across all feature means.

Enable Low Abundance Filter

Use summary statistic:

Filter bottom X% features:

Low Variance Filter

Low Variance Filter removes features with low variability across samples, which would not be useful for distinguishing between different conditions or groups.

Enable Low Variance Filter

Variance method:

Interquantile range (IQR)

Standard deviation (SD)

Median absolute deviation (MAD)

Relative standard deviation (RSD = SD/mean)

Non-parametric RSD (MAD/median)

Filter bottom X% features:

Filtered Data Preview

Download CSV

Missing Value Imputation Guide

Missing values are common in lipidomics datasets and can arise from several sources, including:

Technical limitations of the mass spectrometer, such as detection thresholds or signal suppression,
Variability introduced during sample extraction, handling, or instrument runs,
True biological absence of a lipid in certain samples or conditions.

Understanding the pattern of missingness is essential for selecting an appropriate imputation strategy. Different types of missingness can reflect different underlying causes and may require tailored handling to avoid introducing bias into downstream analyses such as statistical testing, or network construction.

We categorize missingness into two major types:

Group-level missingness: A lipid is almost completely missing within one experimental group (e.g., all disease group samples have NA). This often suggests true biological absence or very low abundance. In these cases, methods such as Limit of Detection (LoD) imputation is generally more appropriate.
General missingness: Values are sporadically missing across samples but not confined to a single group. This pattern typically reflects technical noise or stochastic signal dropout. For this scenario, K-Nearest Neighbors (KNN) imputation(sample-wise) is recommended as it leverages similarity among samples to estimate reasonable values.

You can view the Missing value heatmap to identify these patterns and decide on the best imputation approach for your data.

Group-level Missingness Summary

Group Level Missing means features that are missing in a large proportion of samples within a specific group. This type of missingness can usually mean features are truly absent in the specific group, and we suggest to use Limit of Detection (LoD) 1/5 minimum value of the specific feature to impute the missingness here.

Skip imputation of group level missingness

Threshold for group-level missingness (Missing for X % in one group):

Imputation Method:

Number of Nearest Neighbours:

Imputation Settings

This panel allows you to handle general missing values in your lipidomics data. You can choose to skip imputation (if no missingness) or select from various imputation methods to fill in missing data points.

Skip imputation

Imputation Method:

Number of Nearest Neighbours:

Imputed Data Preview

Download CSV

Combine duplicated lipids or lipids with different adducts

In lipidomics data, it is common to encounter duplicated lipid entries that represent the lipid species with same shorthand name but differ in their adduct forms (e.g., [M+H]+, [M+Na]+, [M+K]+). Combining these duplicated lipids into a single representative entry can help streamline data analysis and interpretation. This process involves selecting criteria to identify and merge these duplicates based on their lipid names and adduct types.

Combine Duplicated lipids

If you have duplicated lipids with different adducts, you can combine them into one lipid by selecting the criteria below.

Skip combine duplicated lipids

Combined Data Preview

Download CSV

Lipid Parsing Controls

Chain annotation convention
The biological meaning of each chain field depends on lipid class:

Sphingolipids (e.g., SM, Cer, dhCer, LacCer, HexCer): Chain1 = sphingoid base (long-chain base, LCB), Chain2 = N-acyl fatty acid (amide-linked acyl chain), Chain3 = not applicable.
Phospholipids (e.g., PC, PE, PG, PI, PS,LPC, LPE, PE(O), PE(P), PA): Chain1 and Chain2 refer to fatty acyl chains at Sn1 and Sn2 carbons of the glycerol back bone, respectively. Chain3 = not applicable.
Glycerolipids (e.g., MAG, TAG, DAG): Chain1 to Chain3 refer to fatty acyl chains at Sn1 to Sn3 carbons of the glycerol back bone, respectively.For MAG, chain2 and chain3 are not applicable.
Cholesteryl esters (CE): Chain1 = esterified fatty acid chain, Chain2 and Chain3 = not applicable.

Note: Chain fields are stored uniformly for all lipid classes; interpretation is lipid-class–dependent.

The parsing table is editable. Double-click a cell to make changes. If any lipid information is incorrect, you may revise it directly in the table.

Use the search box in each column to filter the rows that you want to view or edit.

If you want to edit multiple points in the parse table, we suggest download the parse table as a CSV file, make changes in Excel, and upload the updated parse table using the file input below.

Updated parse table Data File (CSV)

Browse...

Cancel uploading the updated parse table and keep the original parse table

Parsed Lipid Names

Download CSV

Lipid Class Pie Plot

Lipid Class Boxplot

PCA Plot

Sample Boxplot

Barplot of lipid species

Barplot of group distribution

Internal Standard Selection

Normalization by internal standards is a widely used technique in lipidomics data analysis to correct for technical variability and improve the accuracy of quantification. Internal standards are compounds that are chemically similar to the target analytes but are not naturally present in the sample. By adding known quantities of internal standards to each sample prior to analysis, researchers can account for variations in sample preparation, instrument performance, and other factors that may affect the measured signal intensity.

Skip normalization by internal standard

Updated Parsed Table with Internal Standard

Download CSV

Normalization Messages

Normalized Data Preview

Download CSV

Normalization by a user-defined constant value

Normalization by a constant value involves adjusting the lipid abundances by dividing each data entry by a user-defined constant.
This method is useful when there is a known factor that affects all samples equally, such as dilution factor, weight, or volume.
By applying this normalization, researchers can standardize the lipid measurements across samples, facilitating more accurate comparisons and analyses.

Skip normalization by supplement coeffient

Normalized by a constant value (e.g. Dilution factor, Weight, Volume):

Metadata Supplement

Normalization by metadata involves adjusting lipid abundances based on relevant metadata information.
This method helps to account for variations in sample characteristics, experimental conditions, or other factors that may influence lipid measurements, such as cell counts and protein concentration.
By normalizing the lipid data using appropriate metadata, researchers can improve the accuracy and reliability of their analyses, leading to more meaningful biological interpretations.

Skip normalization by metadata

Select a column from the uploaded Metadata as the Supplement Data:

Normalized Data Preview

Download CSV

Boxplot Settings

Boxplot for:

We don't support boxplot if your expression data has more than 80 lipids.

Pixel size settings for downloading Boxplot

Width (in inches):

Height (in inches):

Select the dots per inch (DPI) for downloading plot:

Boxplot Output

Download Boxplot

Plot Settings

Select lipid class for comparison:

Filter by number of double bonds

Select range for number of double bonds:

Filter by total carbon number

Select range for total carbon number:

Enable an additional range selection for number of double bonds

Select range for number of double bonds:

Enable an additional range selection for total carbon number

Select range for total carbon number:

Select the plot type:

Select the statistical methods to compare difference (Only for two group comparison):

Show individual data points

Pixel size settings for downloading plots

Width (in inches):

Height (in inches):

Select the dots per inch (DPI) for downloading plot:

Plot Output

Download the Plot for the Selected Lipid Class Download Plots for All Lipid Class

Plot Settings

Select a lipid:

Select plot type:

Statistical method:

Show individual data points

Pixel size settings for downloading plots

Width (in inches):

Height (in inches):

Select the dots per inch (DPI) for downloading plot:

Plot Output

Download the Plot for the Selected Lipid

Differential Mean Lipid Heatmap Parameter

Select the lipid class of interest

Show numbers inside the heatmap

Select Label font size for Heatmap:

Heatmap range (min):

Heatmap range (max):

For lipids that has multiple chains, select a criteria to sequence lipids on the x axis

Pixel size settings for downloading Heatmap

Width (in inches):

Height (in inches):

Select the dots per inch (DPI):

Font and Color Settings

Select the color code for the plot:

Select title font size for Heatmap:

Select x axis label font size for Heatmap:

Select y axis label font size for Heatmap:

Select x axis title font size for Heatmap:

Select y axis title font size for Heatmap:

Select legend title font size for Heatmap:

Select legend text font size for Heatmap:

Download the Plots for the Selected Lipid Class Download Plots of All Lipid Classes Following same requirements as ZIP

Differential Mean Lipid Heatmap Output

Volcano Plot Settings

Select Grouping Variable:

Select Comparison Variable:

P value Threshold:

Fold Change Threshold:

Fold change was defined as the difference in mean lipid abundance between groups rather than a ratio. This is because some normalized and transformed values included negative numbers, making ratio-based fold change inappropriate.

Adjusted p value on the Y axis

Pixel size settings for downloading plots

Width (in inches):

Height (in inches):

Select the dots per inch (DPI) for downloading plot:

Volcano Plot Output

Download Volcano Plot (.png)

Volcano Plot Data

Download Volcano Data (.csv)

PCA Plot Settings

3D PCA Plot

Pixel size settings for downloading PCA

Width (in inches):

Height (in inches):

Select the dots per inch (DPI):

PCA Plot Output

Download PCA plot

Heatmap Settings

Cluster Group Labels

Cluster Lipid Features

Heatmap Output

Download Heatmap

Lipidomics Mean Calculator

Mean Calculated Data Preview

Download CSV

T-test Settings

Select First Group Variable for T-test:

Select Second Group Variable for T-test:

Select T-test method:

T-test Results

Download CSV

T-test PCA Plot

Select:

Download T-test PCA Plot (.png)

ANOVA Settings

ANOVA Results

Download CSV

ANOVA PCA Plot

Select:

Download ANOVA PCA Plot (.png)

Correlation

Select the color for the plot:

Enable correlation threshold

Select the correlation threshold. Correlations within the range (e.g., -0.9 to 0.9) will be filtered out:

Filter correlations that are not significantly different to zero by Fisher z-test

Correlation heatmap

Download Zip Files for Correlations Table Download Zip Files for Correlation Heatmap

DSPC Network Settings

DSPC (Debiased Sparse Partial Correlation) is a statistical framework designed to infer sparse and interpretable molecular networks from high-dimensional omics data such as lipidomics. It estimates partial correlations while correcting for the bias introduced by regularization (e.g., graphical lasso), allowing more reliable detection of direct molecular relationships.

Max nodes per cluster

Minimum nodes to keep in one cluster

Download DSPC Data (.csv)

DSPC Network Output

PLS-DA Model Overview

Partial Least Squares Discriminant Analysis (PLS-DA) is a supervised multivariate statistical method used to model the relationship between lipidomic data and predefined categorical groups.
PLS-DA identifies latent variables that maximize the covariance between the lipid features and the group labels, enabling effective classification and discrimination of samples based on their lipid profiles.
This technique is particularly useful for biomarker discovery, as it highlights the most relevant features contributing to group separation while handling complex, high-dimensional datasets typical in lipidomics studies.

PLS-DA Model Settings

Number of Permutations:

Number of Cross-validations:

PLS-DA Model Output

Download Overview (.png)

Download Permutation (.png)

Select Color Scheme for VIP Plot:

Download VIP Plot (.png)

Download VIP Table (.csv)

OPLS-DA Model Overview

Orthogonal Partial Least Squares Discriminant Analysis (OPLS-DA) is a supervised multivariate statistical method used to identify differences between predefined groups in complex datasets.

Unlike PLS-DA, OPLS-DA explicitly separates the variation in the data into:

Predictive components: directly associated with the group labels.
Orthogonal components: capture systematic variation unrelated to group differences (e.g., technical variation or biological heterogeneity).

OPLS-DA Model Settings

Number of Permutations:

Number of Cross-validations:

OPLS-DA Model Output

Download Overview (.png)

Download Permutation (.png)

Download Score Plot (.png)

Download Outlier Plot (.png)

Select Color Scheme for VIP Plot:

Download VIP Plot (.png)

Download VIP Table (.csv)

Random Forest Model Settings

Number of Trees:

Data Partition (Training Set %):

Random Forest Model Output

Download Importance Table (.csv)

Select Color Scheme for Importance Plot:

Number of Top Important Features to Display:

Download Importance Plot (.html)

Download All Generated Files

Thank you for using our lipidomics data analysis tool! If you have any feedback or encounter any issues, please don't hesitate to contact us.

For support, please contact: lipidanalyst-requests@umich.edu

Click the button below to download all output files as a zip archive.

Download All Data (ZIP) Download All Plots (ZIP)

Welcome to LipidAnalyst!

Upload Lipidomics Data

Lipidomics Data Preview

Upload Metadata (Group information)

Metadata Preview

Upload Internal Standard

Internal Standard Preview

Data Filtering Overview

Low Quality Filter

Low Abundance Filter

Low Variance Filter

Filtered Data Preview

Missing Value Imputation Guide

Group-level Missingness Summary

Imputation Settings

Imputed Data Preview

Combine duplicated lipids or lipids with different adducts

Combine Duplicated lipids

Combined Data Preview

Lipid Parsing Controls

Parsed Lipid Names

Lipid Class Pie Plot

Lipid Class Boxplot

PCA Plot

Sample Boxplot

Barplot of lipid species

Barplot of group distribution

Internal Standard Selection

Updated Parsed Table with Internal Standard

Normalization Messages

Normalized Data Preview

Normalization by a user-defined constant value

Metadata Supplement

Normalized Data Preview

Normalization

Data Transformation

Data Scaling

Normalized Data Preview

Boxplot Settings

Pixel size settings for downloading Boxplot

Boxplot Output

Plot Settings

Pixel size settings for downloading plots

Plot Output

Plot Settings

Pixel size settings for downloading plots

Plot Output

Differential Mean Lipid Heatmap Parameter

Pixel size settings for downloading Heatmap

Font and Color Settings

Differential Mean Lipid Heatmap Output

Volcano Plot Settings

Pixel size settings for downloading plots

Volcano Plot Output

Volcano Plot Data

PCA Plot Settings

Pixel size settings for downloading PCA

PCA Plot Output

Heatmap Settings

Heatmap Output

Lipidomics Mean Calculator

Mean Calculated Data Preview

T-test Settings

T-test Results

T-test PCA Plot

ANOVA Settings

ANOVA Results

ANOVA PCA Plot

Correlation

Correlation heatmap

DSPC Network Settings

DSPC Network Output

PLS-DA Model Overview

PLS-DA Model Settings

PLS-DA Model Output

OPLS-DA Model Overview

OPLS-DA Model Settings

OPLS-DA Model Output

Random Forest Model Settings

Random Forest Model Output