Streamlining LCMS data processing from Agilent Chemstation

I'm working on developing a suite of in vitro ADME assays (specifically forced degradation right now) most of which center on using HPLC-MS. The assays themselves are well developed and I can run them efficiently but I'm facing a serious bottleneck at the data processing stage.

I run these assays on an Agilent Infinity 1290, processing with Agilent CDS Chemstation Edition. The forced degradation assays typically run like this:

  1. Set up a sequence where DMSO stocks are diluted with a variety of buffer conditions (e.g., 0.1 N HCl with 20% ACN at 0.1 mg/mL).
  2. Run the sequence to generate t0 data. The three pieces of data that are collected at this stage are retention time, % area, and m/z of base peak in corresponding ESI positive MS trace. These three pieces of data are associated with a peak that we will track as the assay continues. Collect this data for all peaks with area > some arbitrary value.
  3. Using the command scheduler, schedule sequences for 2, 4, 8, 16, and 24 hours from t0.
  4. As time points are taken, populate an excel spreadsheet in a table like the one below.
m/z RT t0 t2 t4 t8 t16 t24
395 1.016 100 100 99.1 97.5 94.9 92.3
162 1.095 0 0 0.9 1.8 3.8 5.7
377 1.153 0 0 0 0.6 1.3 1.9

Here are the issues I'm encountering in this process:

  1. Each timepoint is run as a new iteration of the forced degradation sequence. Ideally, all timepoints for a given sample would be exported into a new sequence (or some other meta-organization data structure) so that I could easily follow the degradation. Presently, I have to flip back and forth between sequences looking for the same sample name before I can populate the excel spreadsheet. This adds time and requires more focus than I'd like. It is possible to load signals from other sequences into a new result set as I'm learning right now, but I'm still not sure how to do this automatically.
  2. I have to manually identify and track peaks across timepoints, making judgement calls about whether peaks are real or if they're erroneous. Further, I have to confirm that the peaks in later time points are the same as peaks from earlier timepoints. Ideally, I could use automation to follow peaks over time and populate the table with relevant data.
  3. I'd like to generate stacked plots of chromatograms across timepoints for a given sample. Chemstation has some tools for this but they're extremely clunky and I'm never fully convinced that I've figured them out even when I make good looking plots.

These are my main thoughts about improving my data processing and analysis. Hopefully someone out there has encountered this situation before and knows how to help.

Was this helpful?