Gas chromatography-mass spectrometry (GCMS) is the analytical tool that fragrance reconstructors depend on most. When you want to understand what a commercial fragrance is actually made of — which materials, at roughly what proportions — a GCMS report is where you start. Understanding how to read one is one of the most practically useful skills a home perfumer can develop.
What GCMS actually does
GCMS separates a fragrance sample into its individual chemical components, then identifies and quantifies each one. The gas chromatography step vaporises the sample and passes it through a column with a carrier gas; different molecules travel at different speeds depending on their physical properties and emerge at different times — their retention times. The mass spectrometry step then identifies each compound by its molecular mass and fragmentation pattern, matched against a reference library.
The result is a list of compounds — typically dozens, sometimes hundreds — each with a name, a CAS number, and a percentage figure representing how much of the sample it constitutes.
What a GCMS report looks like
A typical report contains at minimum:
- Compound name — the common or IUPAC name of the aroma material
- CAS number — the Chemical Abstracts Service registry number, which uniquely identifies the compound regardless of trade name
- Peak area % — the proportion of the total detector response attributed to that compound
Some reports add retention time, a confidence score (how closely the detected mass spectrum matched the reference library), and whether the compound is a flagged allergen under current IFRA or EU cosmetics regulations.
The key thing to understand about percentages
The percentages in a GCMS report are peak area percentages — they reflect the detector's response to each compound, not its weight or olfactory contribution. Different compounds produce different detector responses at the same concentration, so peak area % is not the same as weight %. It is a proxy, and a useful one, but it comes with a meaningful caveat: a compound at 5% by area might be 3% or 8% by weight depending on its molecular structure and detector sensitivity.
This is one reason a professional reconstruction involves judgement as well as data. The GCMS report gives you a starting list; you still need to smell your way to the right formula.
Identifying the compounds
For a home perfumer, the CAS number is the most important column. Trade names like "Iso E Super" or "Ambroxan" will not appear in a GCMS library — only the chemical names and CAS numbers of their active components. Once you have a CAS, cross-referencing it against a supplier catalogue or a resource like the Good Scents Company database tells you what it smells like and where to buy it.
Some common aroma chemicals and their CAS numbers worth learning:
- Ambroxan: 6790-58-5
- Iso E Super (main isomer): 54464-57-2
- Hedione: 24851-98-7
- Linalool: 78-70-6
- Coumarin: 91-64-5
- Dihydromyrcenol: 18479-58-8
As you build familiarity, a GCMS report becomes more readable — you recognise the common compounds immediately and can focus on the less familiar ones.
What GCMS cannot tell you
GCMS reports have real limitations that affect how you use them.
Trace materials are often invisible. Compounds present below roughly 0.01–0.05% of the sample may fall below the detection threshold. Some highly potent materials — certain indoles, musks, or pyrazines — contribute enormous olfactory impact at concentrations too small for reliable detection. These are the compounds that complete a reconstruction but do not appear in the report.
Natural extracts are composites. A fragrance containing rose absolute contains dozens of compounds from that single ingredient. The GCMS report breaks them down individually, which can make a simple formula look alarmingly complex. Recognising clusters of compounds that originate from a single natural material is a skill that develops with experience.
Co-elution happens. Two compounds with similar properties can emerge from the column at nearly the same time, making them difficult to distinguish. Reports from lower-resolution instruments sometimes misidentify or aggregate these.
The percentages are relative, not absolute. A GCMS report tells you that compound A represents 12% of detected material and compound B represents 4%. It does not tell you what the total concentration of the sample was. You cannot derive an exact formula from a GCMS report alone — you derive a starting ratio, which you then refine by smell.
How ScentFormulas uses GCMS data
Every formula in the ScentFormulas catalogue is built from GCMS-informed starting points. The data establishes the structural framework — which material families are present and in what rough proportions — and from there each formula is refined by smell to produce a working reconstruction. The result reflects both analytical data and sensory judgement: not just a list of detected peaks, but a formula that actually performs on skin.
Where to find GCMS reports
Third-party GCMS data for commercial fragrances is published by independent analysts and shared through fragrance communities. Searching "[fragrance name] GCMS" or "[fragrance name] chromatography analysis" will often surface something useful, though quality varies significantly. Some resources are paywalled; others are freely shared by hobbyists.
The skill of reading that data and translating it into a buildable formula is what separates an interesting analytical exercise from a wearable reconstruction.