Open Access Article
Róbert
Arató
*a,
Derrick
Quarles
Jr
b,
Gabriella
Obbágy
a,
Zsolt
Dallos
a,
Miklós
Arató
c,
Phillip
Gopon
a and
Frank
Melcher
a
aMontanuniversität Leoben, Chair of Geology and Economic Geology, Peter-Tunner-Straße 5, 8700 Leoben, Austria. E-mail: robert.arato@unileoben.ac.at
bElemental Scientific, 7277 World Communications Drive, Omaha, NE 68122, USA
cDepartment of Probability Theory and Statistics, Eötvös Loránd University, Pázmány Péter Sétány 1/C, 1117 Budapest, Hungary
First published on 8th August 2025
Graphite is a critical raw material for sustainable energy technologies, and establishing its traceability is crucial for ensuring responsible sourcing in the future. This study presents maps acquired on a comprehensive set of natural graphite concentrates via Laser-induced Breakdown Spectroscopy (LIBS). LIBS generates multi-elemental data at an unprecedented speed even from samples with non-ideal ablation characteristics, such as pressed graphite pellets. The generated data is used for constructing elemental maps to shed light on the chemical distribution of elements as well as for multivariate classification. Natural graphite concentrates exhibit inhomogeneous chemical composition. As such, the graphite concentrate LIBS-fingerprint is a heterogeneous mixture of LIBS signals from pure graphite and mineral impurities, which either represent crystal intergrowth with graphite, or they are adsorbed on graphite flakes as a result of natural or artificial processes. The observed chemical heterogeneity serves as a prominent fingerprint of individual deposits, although the heterogeneity is also omnipresent between different samples of the same deposit. The generated multivariate dataset is well suited for multivariate data analysis. Random forest classifiers show a robust performance across a broad range of hyperparameters, achieving over 90% classification accuracy. The heterogeneity of the concentrates presents a significant challenge for classification, regardless of the analytical and classification approach used. The addition of chemically different samples to the same classification group (i.e., graphite deposit) does not necessarily hinder correct classification and renders the routine application of the method possible.
Material fingerprinting refers to the use of analytical techniques to capture unique, intrinsic chemical or physical characteristics of a material, enabling its identification and traceability. For a technique/method to be routinely used in material traceability based on intrinsic material fingerprint, it needs to fulfil most/all of the following criteria: (i) high differentiating power (ii) simple usage (iii) high sample throughput (iv) simple accessibility and (v) low cost. For (i), multi-parameter output and a large number of repeated measurements (e.g., spectrometry data) provides notable advantages over the very precise measurement of few parameters (e.g., isotopic ratios). (ii) Includes the complexity of sample preparation and analysis, which is inherently interconnected with (iii), where the time needed for sample preparation and analysis is critical. (ii) Is again connected to (iv) and (v), where the versatility and availability of different instrumental setups and trained personnel is decisive.
Due to its chemical and thermal resistance, the chemical analysis of graphite is particularly challenging. Complete dissolution of graphite is nearly impossible under standard laboratory conditions, and in situ analyses (e.g., by LA-ICP-MS) are challenging due to its flakey/brittle structure. Accordingly, the knowledge of the chemical composition of graphite is currently limited. There is a need to determine which impurities—such as other mineral phases, chemical substitutions, or adsorbed elements—are present in natural graphite concentrates and how these are spatially distributed. Key questions include the nature and origin of these impurities and how effectively they can be used to fingerprint graphite deposits for traceability. Currently, no routine methodology exists to distinguish between natural graphite deposits and address these needs. The present study takes a step toward filling this knowledge gap.
In this study, we present laser induced breakdown spectroscopy (LIBS) data, acquired on a series of pressed natural graphite concentrate pellets from important graphite producers worldwide. We apply LIBS-mapping on these pellets, to understand the spatial and multivariate distribution of elements.
The chemical heterogeneity of graphite concentrates is evaluated by considering both local elemental anomalies and universally present elements. Given that the samples originate from natural sources but may have been altered during processing steps such as grinding and flotation, potential sources of chemical inhomogeneity are also examined. Individual elemental maps are used to assess the spatial distribution and grouping of elements within the concentrates. Using this comprehensive dataset, strategies for extracting the most informative sections of the acquired data are presented, and the challenges associated with within-deposit heterogeneity are discussed.
| Sample code | Country | Region/city (deposit/s) | Type | Processing method | Metamorphic faciese | Ref. |
|---|---|---|---|---|---|---|
| a Pedra Azul, Itapecerica, Salto da Divisa. b Vatomina & Sahamamy Sahasoa. c Microcrystalline. d Chemical treatment. e Amp = amphibolite, gran = granulite, gsch = greenschist; ref = relevant publication about the deposit or region. | ||||||
| 3B | Brazil | Minas Gerais (3 depositsa) | Flake | Grinding & flotation | Amp–gran | 7 |
| 21C | Brazil | Minas Gerais (3 depositsa) | Flake | Grinding & flotation | Amp–gran | |
| 21N | Brazil | Minas Gerais (3 depositsa) | Flake | Grinding & flotation | Amp–gran | |
| 22C | Brazil | Minas Gerais (3 depositsa) | Flake | Grinding & flotation | Amp–gran | |
| 6B | China | Inner Mongolia | Flake | Grinding & flotation + ctd | Granulite | 8 |
| 7B | China | Heilongjian | Flake | Grinding & flotation | Granulite | |
| 4A | China | Hunan (Lutang deposit) | mcc | Grinding | Contact | 9 |
| 23A | China | Hunan (Lutang deposit) | mc | Grinding | Contact | |
| 21A | China | Shandong | Flake | Grinding & flotation | Gsch–amp–gran | 10 |
| 22A | China | Shandong | Flake | Grinding & flotation | Gsch–amp–gran | |
| 13B | Germany | Passau (Kropfmühl) | Flake | Grinding & flotation | Amp–gran | 11 |
| 5A | Korea | Uncertain | mc | Grinding | Contact | |
| 9B | Korea | Uncertain | mc | Grinding & flotation + ct | Contact | |
| 11B | Madagascar | Brickaville (2 depositsb) | Flake | Grinding & flotation | Granulite | 12 |
| 21D | Madagascar | Brickaville (2 depositsb) | Flake | Grinding & flotation | Granulite | |
| 22D | Madagascar | Brickaville (2 depositsb) | Flake | Grinding & flotation | Granulite | |
| 23D | Madagascar | Brickaville (2 depositsb) | Flake | Grinding & flotation | Granulite | |
| 1B | Mozambique | Cabo Delgado (Balama) | Flake | Grinding & flotation | Amp–gran | 13 |
| 21B | Mozambique | Cabo Delgado (Balama) | Flake | Grinding & flotation | Amp–gran | |
| 21J | Mozambique | Cabo Delgado (Balama) | Flake | Grinding & flotation | Amp–gran | |
| 22B | Mozambique | Cabo Delgado (Balama) | Flake | Grinding & flotation | Amp–gran | |
| 23B | Mozambique | Cabo Delgado (Balama) | Flake | Grinding & flotation | Amp–gran | |
| 23E | Namibia | Karas (Aukam) | Vein | Grinding & flotation | Hydrothermal | 14 |
| 2B | Norway | Skaland (Traelen) | Flake | Grinding & flotation | Amphibolite | 15 |
| 15B | Norway | Skaland (Traelen) | Flake | Grinding & flotation | Amphibolite | |
| 22G | Norway | Skaland (Traelen) | Flake | Grinding & flotation | Amphibolite | |
| 23G | Norway | Skaland (Traelen) | Flake | Grinding & flotation | Amphibolite | |
| 5B | Russia | Chelyabinsk (Taiginka) | Flake | Grinding & flotation | Amp–gran | 16 |
| 21E | Russia | Chelyabinsk (Taiginka) | Flake | Grinding & flotation | Amp–gran | |
| 22E | Russia | Chelyabinsk (Taiginka) | Flake | Grinding & flotation | Amp–gran | |
| 4B | Ukraine | Kirovograd (Zavallia) | Flake | Grinding & flotation | Granulite | 17 |
| 22F | Ukraine | Kirovograd (Zavallia) | Flake | Grinding & flotation | Granulite | |
| 23F | Ukraine | Kirovograd (Zavallia) | Flake | Grinding & flotation | Granulite | |
Samples of several 100 grams were obtained from each deposit. These were thoroughly mixed, and ca. 250 mg of flakes were extracted from different parts of the bags by a small spatula and loaded into a conventional hydraulic pellet press without a binding matter. The 13 mm dies were loaded by 4 tons for two minutes for each pellet. The pellets were then mounted on glass slides, carefully levelled with the top of the sample drawer and placed in the laser-ablation chamber. Altogether, 33 pressed natural flake graphite pellets were analyzed in the study.
In this study, we present results collected with an ESLumen LIBS coupled to an ImageGEO193 laser-ablation system, comprising a 193 nm excimer laser and a two-volume (TwoVol3) ablation chamber (Fig. 1). The laser is delivered to the sample through an XYR beam aperture and for LIBS analysis, the emitted light from the laser-induced plasma is collected by an optical fiber inserted directly into the laser-ablation chamber (see Manard et al., 2022)27 for details. The fiber optics consist of a 5-channel spectrometer with fixed-grating Czerny-Turner design, covering a spectral range from 188 nm to 1099 nm via a Complementary Metal-Oxide Semiconductor (CMOS) detector.
The LIBS fiber was calibrated by using NIST SRM 612 glass for maximum intensity across the LIBS spectrum. Alignment consists of an X–Y adjustment of the LIBS fiber via adjustment screws (Fig. 1C) and the Z-position of the sample is fixed in order to always keep the same depth into the analytical cup within the ablation chamber.
The LIBS maps were acquired over an area of 1 mm × 1 mm on one to five samples from each deposit, by ablating with 100 μm × 100 μm rectangular spots and 90 μm overlap in X and Y directions, corresponding to a nominal 10 μm × 10 μm X–Y resolution. By this approach, high resolution 2D maps could be constructed, while maintaining high signal intensity due to larger spot size. 100 Hz laser frequency and ca. 13 J cm−2 laser fluence was applied in He atmosphere, flushed at a 1000 ml min−1 rate. The described settings resulted in a scan speed of 1000 μm s−1, which resulted in the acquisition of a 1 mm × 1 mm map within 100 seconds. All measurements were acquired within a single day.
LIBS spectra were processed in the ‘iolite’ v4 software (Elemental Scientific Lasers).29,30 Data processing within iolite included atomic emission line identification, intensity integration within selected spectral windows, normalization and the creation of 2D elemental images. Spectral windows for the characteristic optical emission lines of individual elements were manually set. Finally, to avoid missing important emission lines, the ‘Interesting Feature Finder’ was applied and further lines were added to the integration set.31 LIBS spectra were integrated by selecting the left and right integration windows around the targeted emission lines. All integrated data was background-corrected via the ‘rolling ball’ background subtraction approach.32
The presented maps show raw spectra, while for data analysis, normalized spectra were used. To normalize each spectrum, each individual integrated emission line was divided by the total emission intensity over the entire spectral range between 188 nm and 1099 nm. Each map comprises all acquired data except the upper and lower 99th percentile of relative intensities. The ‘Polar’ colormap is used for all maps without smoothing/filter.
There are two general approaches when applying multivariate methods to LIBS data (see Brunnbauer et al., 2023 for a review):33 (i) classification based on selected emission lines or (ii) classification based on entire spectra. While (i) has the advantage of being computationally less intensive, (ii) is capable of capturing the variance of the entire multivariate dataset. In this study, we tested both approaches.
Multivariate classification was applied on the same dataset, as is used for constructing the elemental maps. Raw intensities (variables) normalized to the total intensity were considered in each shot. Shot data were averaged for each line, and outliers exceeding two standard deviations from the mean were excluded. Data was not scaled, whereas labels were numerically encoded when necessary. The classifiers were trained on 80% of all individual shots, whereas 20% of the shots were held out of training for testing classification accuracy. To test for group-wise classification accuracy and potential misclassification phenomena, a confusion matrix is used.
The model's sensitivity was tested with several hyperparameters, which are important to consider when optimizing the performance of the algorithm. Most importantly, these parameters include the number of trees in the forest, the maximum depth of each tree as well as the minimum number of observations to split an internal node in the tree (min_samples_split) and the minimum number of samples to be at a leaf node (min_samples_leaf). First, randomized large intervals are searched for each hyperparameter. Then a grid search on a narrower range is based on the results of the previous step. For randomized search and grid search, a ten-fold cross-validation was applied. To test the model's robustness on unknown samples, the classifier's performance is tested by leaving out one sample from each group (deposit), and using all other samples for training and iterating through all such combinations. This approach is essentially similar to GroupKFold cross-validation in scikit-learn with the advantage that the generalization potential of the classifier is tested by leaving out entire maps from the training, thereby treating them completely unknown for the classifier.
000 shots per map are made, while a complete LIBS spectrum between 188 and 1098 nm is recorded at every shot (Fig. 2).
Anomalously high intensities at the edges of the acquired maps were removed to avoid eventual cross-contamination effects. As a result, over 300
000 spectra were evaluated in this work. In these spectra 32 characteristic emission lines of H, Li, B, C, O, Na, Mg, Al, Si, K, Ca, V, Cr, Mn, Fe, Cu, Zn, Rb, Sr and Ba were identified. Most concentrates showed a number of elements besides C (Fig. 2).
From the maps, it is evident that the elements are not uniformly distributed, and instead show anomalies of spatially correlated elements (Fig. 3). While certain elements show a very narrow intensity range close to background level, the most abundant elements are present in almost all individual spectra (Fig. 2 and 3).
![]() | ||
| Fig. 3 (A–J) Maps of simultaneously acquired, selected wavelengths of the same area on a graphite pellet from sample 2B. Intensities are relative over the intensity range of each individual map. The corresponding element and its selected wavelength (in nm) are noted in the top left corner. (K) Overlap of three selected elements on an RGB scale to represent variability in silicate composition. (L) Three-dimensional map of the same are on the example of Si (height of the map) and Al (color of the map). Refer to ESI SI1† for all other maps. | ||
Based on the spatial correlation of silicon with oxygen and other elements, it is apparent that graphite flakes are intermixed with abundant silicate minerals (Fig. 2 and 3). Exceptions in terms of mineral contents include samples 6B and 9B where silicate minerals are virtually absent.
By extracting absolute intensities underlying the elemental maps, common and highly variable elements can be detected, as they are generally characterized by high mean and % RSD (Table 2). As these measures refer to the entire population, they do not necessarily mean that the same elements can be best used for separating between different groups (e.g., deposits). For this, the ratio of the between-group variance and the within-group variance (i.e., ‘Separation’) is used.
| Variable | Mean | Std | % RSD | Median | Min. | Max. | Separation |
|---|---|---|---|---|---|---|---|
| a AllLight = total sum intensity. | |||||||
| A1308 | 878 | 613 | 70 | 691 | 87 | 8994 | 5522 |
| A1309 | 1263 | 1125 | 89 | 874 | 130 | 15 121 |
6719 |
| A1394 | 1207 | 1376 | 114 | 697 | 49 | 19 440 |
8533 |
| A1396 | 2105 | 2395 | 114 | 1238 | 59 | 32 020 |
9319 |
| Ba455 | 285 | 206 | 72 | 270 | 0 | 11 849 |
1444 |
| C247 | 5616 | 2535 | 45 | 5004 | 385 | 16 334 |
5554 |
| Ca393 | 1299 | 1718 | 132 | 677 | 83 | 40 914 |
15 413
|
| Cr357 | 216 | 89 | 41 | 203 | 23 | 2998 | 2732 |
| Cr427 | 208 | 99 | 48 | 192 | 22 | 3637 | 1428 |
| Cu324 | 102 | 42 | 41 | 98 | 0 | 1839 | 3137 |
| Fe253 | 273 | 99 | 36 | 261 | 43 | 4537 | 3967 |
| H656 | 426 | 162 | 38 | 419 | 0 | 4233 | 4385 |
| K766 | 2752 | 3779 | 137 | 1183 | 0 | 74 142 |
15 540
|
| K770 | 1704 | 2423 | 142 | 739 | 0 | 48 847 |
15 069 |
| Li610 | 267 | 208 | 78 | 279 | 0 | 6835 | 6489 |
| Li670 | 462 | 332 | 72 | 435 | 0 | 8114 | 9195 |
| Mg285 | 1128 | 534 | 47 | 1012 | 202 | 19 818 |
4622 |
| Mn259 | 712 | 574 | 81 | 563 | 119 | 16 115 |
3795 |
| Na589 | 1325 | 2348 | 177 | 524 | 0 | 77 780 |
16 043
|
| O777 | 981 | 623 | 64 | 841 | 0 | 17 129 |
2790 |
| Rb780 | 277 | 94 | 34 | 272 | 0 | 1854 | 5812 |
| Si288 | 990 | 921 | 93 | 675 | 31 | 18 852 |
4860 |
| Sr407 | 194 | 86 | 44 | 181 | 19 | 4433 | 6061 |
| V437 | 222 | 75 | 34 | 214 | 30 | 5334 | 2387 |
| Zn202 | 676 | 135 | 20 | 665 | 219 | 1652 | 6642 |
| Zn206 | 188 | 59 | 31 | 182 | 23 | 993 | 4365 |
| B249 | 269 | 90 | 33 | 258 | 49 | 3159 | 4027 |
| Ca854 | 666 | 516 | 77 | 628 | 0 | 19 300 |
5710 |
| Ca866 | 650 | 373 | 57 | 647 | 0 | 12 098 |
2996 |
| Cl837 | 168 | 74 | 44 | 168 | 0 | 1400 | 3434 |
| Li812 | 152 | 60 | 39 | 151 | 0 | 597 | 4308 |
| Mg518 | 1548 | 1931 | 125 | 1245 | 314 | 108 185 |
3238 |
| N743 | 136 | 65 | 48 | 135 | 0 | 684 | 2468 |
| Na818 | 1045 | 506 | 48 | 1019 | 0 | 32 205 |
4936 |
| O844 | 286 | 150 | 53 | 276 | 0 | 3785 | 2047 |
| Zn213 | 247 | 70 | 29 | 239 | 23 | 815 | 4866 |
| Zn330 | 245 | 65 | 27 | 240 | 41 | 1047 | 4841 |
| AllLight | 741 123 |
247 080 |
33 | 687 803 |
242 331 |
2 077 900 |
|
‘Separation’ is calculated from the relative intensities to avoid artefacts introduced by the different total sum intensity between individual maps. From these values, the largest differences between deposits are given by their sodium, calcium and potassium contents (Fig. 4). Microcrystalline graphite samples from China (Lutang deposit, samples 4A and 23A) and Korea (sample 5A), which were not subjected to flotation, show the highest degree of impurity. The effect of chemical treatment is also evident. Samples from Inner Mongolia (China) (6B) and Korea (9B) of this category are moderate to low in Ca and Na content, while they are among the lowest in terms of potassium and aluminum. Brazilian samples show a fairly uniform composition with low calcium, sodium and potassium content, but high aluminum content. The group of samples from Mozambique and Madagascar are similar with respect to Ca and Na, but show a much larger scatter in K compared to Brazil. Notably, samples from Madagascar show the largest within-sample and between-sample variability, especially evidenced by the aluminum and potassium concentrations (Fig. 4B). Samples from Russia, Ukraine and Norway show a relative enrichment in all four elements (Na, Ca, K, Al), but due to severe within-sample and between-sample variations at each deposit, no characteristic fingerprint is described for any of the deposits solely based on four elements. This underlines the limitations of traditional discrimination plots in high-dimensional datasets,37 and shows the necessity of applying multivariate techniques for classification.
Despite the fact that the majority of the concentrates consists of graphitic carbon (Fig. 3A), the coexistence of non-C elements in these spectra shows that other minerals are present in the concentrates (Table 3). It also has to be considered that by overlapping ablation in both X and Y direction, a 100-shot deep 3D column of each pellet is presented as 2D maps. There are several instances where the chemical signature of multiple minerals is present at the same X–Y coordinate. This is due to, for example, quartz (SiO2) and/or K-feldspar (KAlSi3O8) underlying muscovite (KAl2AlSi3O10(OH)2) which would have a spectrum quite similar to pure muscovite. On the other hand, the set of co-detected elements helps to restrict the options to a small set of possible minerals based on straightforward exclusion criteria. The overarching majority of the minerals analyzed here (Table 3) are potassium and hydroxide-bearing silicate minerals, presumably sheet silicates. These commonly include K, Al, Si, Na, Mg, O and H as well as less commonly Ca and Li.
| Sample code | Country | Type | Proc. methodb | K-Na-Ca-Mg-mica | K-Mg-Fe-mica | Kln | Qz | Kfs | Pl | Cal | Other phases | Mineral amount | Peculiarity |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| a Microcrystalline. b gr = grinding, fl = flotation, ct = chemical treatment, xxx = very abundant; xx = abundant; x = multiple grains identified; 1 = one grain identified; - = not identified; Kln = kaolin-group minerals; Qz = quartz; Kfs = K-feldspar; Pl = plagioclase; Cal = calcite; Sps = spessartine; Cls = celsian; Act-Tr = actinolite-tremolite; Gln = glaucophane; Hbl = hornblende; Ca-Al-sil. = Ca-Al-silicate; Cpx = clinopyroxene; low int. = low intensity. | |||||||||||||
| 3B | Brazil | flake | gr & fl | x | - | xxx | - | - | - | - | Sps | xxx | kaolin-gr. |
| 21C | Brazil | flake | gr & fl | x | - | xxx | - | - | - | - | Cls | xxx | kaolin-gr. |
| 21N | Brazil | flake | gr & fl | x | - | xxx | - | - | 1 | Act-Tr | xxx | kaolin-gr. | |
| 22C | Brazil | flake | gr & fl | x | - | xxx | - | - | - | - | Gln | xxx | kaolin-gr. |
| 6B | China | flake | gr & fl + ct | - | - | - | - | - | - | - | NaOH, Ca(OH)2 | - | no minerals |
| 7B | China | flake | gr & fl | xxx | - | - | - | - | - | - | - | xx | Mg-Na-Li |
| 4A | China | mca | gr | xxx | - | - | - | - | - | - | - | xxx | Na-Ca-Mg |
| 23A | China | mc | gr | xxx | - | x | - | - | - | - | - | xxx | Na-Ca |
| 21A | China | flake | gr & fl | xx | - | xxx | - | - | - | - | - | xx | low int. |
| 22A | China | flake | gr & fl | xx | - | - | - | - | - | - | - | x | low int., Na-(Li) |
| 13B | Germany | flake | gr & fl | xxx | xx | x | x | xxx | |||||
| 5A | Korea | mc | gr | xxx | xx | - | - | - | - | - | - | xxx | Na-Ca-Li |
| 9B | Korea | mc | gr & fl + ct | - | - | - | - | - | - | - | NaOH, Ca(OH)2 | - | no minerals |
| 11B | Madagascar | flake | gr & fl | xx | xx | x | - | - | 1 | - | xx | ||
| 21D | Madagascar | flake | gr & fl | xx | - | xx | 1 | - | 1 | 1 | Hbl | xx | low K |
| 22D | Madagascar | flake | gr & fl | xxx | xx | - | - | - | - | - | - | xxx | |
| 23D | Madagascar | flake | gr & fl | xx | - | x | - | - | - | - | - | xx | low int. |
| 1B | Mozambique | flake | gr & fl | xx | - | - | xx | 1 | x | x | x | xx | Qz, low K |
| 21B | Mozambique | flake | gr & fl | xx | - | x | xx | - | x | - | - | x | Qz |
| 21J | Mozambique | flake | gr & fl | xx | 1 | xx | - | 1 | x | - | Ca-Al-silicate | xx | low K |
| 22B | Mozambique | flake | gr & fl | xxx | - | x | x | - | x | - | Sps | xxx | low K |
| 23B | Mozambique | flake | gr & fl | x | - | xx | x | - | x | - | Hbl | xxx | low K, high Mn |
| 23E | Namibia | vein | gr & fl | xxx | - | xx | - | - | - | - | - | xxx | Na |
| 2B | Norway | flake | gr & fl | xxx | xx | - | - | - | x | x | - | xxx | Mg |
| 15B | Norway | flake | gr & fl | xxx | xx | - | - | - | x | x | - | xxx | Mg-Ca |
| 22G | Norway | flake | gr & fl | xxx | x | - | - | - | x | x | - | xxx | Na-Ca |
| 23G | Norway | flake | gr & fl | xxx | xx | - | - | 1 | x | - | - | xx | Na-Mg-Ca |
| 5B | Russia | flake | gr & fl | xxx | - | x | - | - | - | - | - | xxx | Na |
| 21E | Russia | flake | gr & fl | xxx | - | - | - | - | - | x | - | xxx | Na-Ca-Li |
| 22E | Russia | flake | gr & fl | xxx | x | x | - | - | - | - | - | xxx | Na-Mg-Ca-(Li) |
| 4B | Ukraine | flake | gr & fl | xx | x | xx | - | - | x | - | - | xxx | |
| 22F | Ukraine | flake | gr & fl | xxx | xxx | x | x | - | - | 1 | - | xxx | Mg-Ca |
| 23F | Ukraine | flake | gr & fl | xx | xxx | xx | x | - | - | x | Cpx | xxx | Ca-Na-Mg-Mn |
The multi-elemental maps of this study clearly show that certain elemental impurities are spatially restricted (Fig. 5). These mineral impurities, which were not removed from graphite during preconcentration, are characteristic for the geological environment where the sample was mined from. Table 3 shows the minerals that were identified in the various concentrates. The most common minerals include various sheet silicates, probably mica or clay minerals. Also common are feldspars and quartz. Graphite-mica intergrowths are described in the literature,40 however, little was known about their abundance in concentrates. The observations of this study imply that the accumulation process of organic material always involves the input of anorganic material, which develops to minerals – to a great part mica – during metamorphism. Importantly, due to the large size (100 × 100 μm) of the ablation spots, we cannot assess the actual size of most ablated minerals. We can however say that there is almost no 100 × 100 μm domain, which does not contain non-carbon elements. A more detailed understanding of the mineral impurities, and as if there are also other types of chemical heterogeneities in graphite, would require a technique with higher spatial resolution (e.g., transmission electron microscopy or atom probe tomography), or the separation of minerals from graphite, which is beyond the scope of this paper.
The described mineralogical observations can be used for fingerprinting purposes. Certain types of minerals, and the combination of minerals, define the ‘fingerprint’ of individual deposits. While K–Na–Mg-bearing micas dominate in most of the studied concentrates (Fig. 5A–D), including deposits from China, Korea, Madagascar, Mozambique, Namibia, Norway and Russia, there are marked differences in their elemental composition and accompanying minerals. K, Mg and Fe-rich mica (s.l. biotite) is far less abundant, but well-recognizable by its intense K and Mg lines as well as a series of transition metal lines below 500 nm. They predominate in the concentrates from Ukraine, but are also abundant in the samples from Norway, Germany and Korea, as well as a minor phase in Madagascar graphite. Clay minerals of the kaolin group, in turn are a uniform mineralogical fingerprint of samples from Brazil (Fig. 5E–H) with occasional higher abundance in the samples from Shandong-China (sample 21A), Madagascar, Mozambique, Namibia and Ukraine. Quartz shows high abundance only in samples from Mozambique and/or its signal is intermixed with the previously mentioned phases in other samples. The remaining minerals do not appear to be diagnostic for any deposit. Overall, the mineralogical signature of the studied concentrates is mostly dominated by sheet silicates, which are harder to separate from graphite during the grinding and flotation steps, due to their similar habit and hydraulic properties to graphite flakes.
Apart from obvious mineral impurities, elemental inhomogeneities are apparent throughout the majority of all maps. This is especially apparent in the case of aluminum, silicon, oxygen and potassium. Two elements, aluminum and oxygen are detectable at almost every single shot, except for samples 6B and 9B. Those two samples have been subjected to chemical treatment during ore processing and only contain Ca, Na, O and H (with only very rare traces of Si), corresponding to remnants of Na and Ca-hydroxides, commonly used for removing silicate minerals from natural graphite concentrate via alkali autoclave-acid leaching.41 The absence of silicates and the presence of abundant hydroxides in those samples serve as a distinct fingerprint of chemically treated graphite and these features make them easily distinguishable from untreated products (Fig. 6A–D). On the other hand, microcrystalline samples, which were not subjected to flotation, mark the other end of the spectrum with the highest abundance of silicates (Fig. 6E–H) and an accordingly definite fingerprint.
The observed heterogeneity in individual maps contributes to the diagnostic geochemical fingerprint, capturing subtle differences even when the same minerals are present in varying quantities across different concentrates. This high degree of specificity enables detailed characterization at the sample level, in addition to distinguishing between deposits. As a result, the unique mineralogical composition and abundance reflected in each map (Table 3) allow for both inter-deposit and intra-deposit differentiation in practice.
The RF classifier shows a robust overall classification accuracy of ∼93% (Fig. 8). The high classification accuracy is striking, especially considering the significant differences between individual samples within the same deposits. In order to assess the contribution of each selected elemental line (i.e., feature) to the classification, feature importances were extracted. The most important features for classification are lines related to major elements, such as Ca, Al, K, Na, Si and Mg (ESI Table ST1†). Also, some minor elements, including Li, Ba, Zn, Sr and Mn contribute to the overall classification accuracy.
To ensure that the model's performance reflects actual patterns in the data rather than noise or overfitting to specific features, careful attention is paid to model optimization. Overfitting is avoided by not selecting unrealistic hyperparameters (i.e., >100 trees, and <10 samples for splitting a tree) and by checking the ‘area under curve,’ which is used as an evaluation metric for each hyperparameter (ESI SI2†). These curves imply no overfitting. Furthermore, the performance of the classifier is robust over a broad range of hyperparameters, resulting in only a few percent reduction in the classification accuracy, if a very conservative hyperparameter-set is applied ESI2.† The effect of the number of spectra (i.e., dataset size) was also tested and the results are summarized in the ESI SI3.† If entire spectra are used instead of manually selected wavelengths classification accuracy decreases notably (see details in ESI SI3†).
While the model numerically does not overfit the data (ESI SI2†), the extreme heterogeneity within and between samples, it is difficult to argue that a 1 mm × 1 mm map is sufficient to classify an entire deposit. However, the observed high classification accuracy shows that the RF classifier is capable of learning the main features of the acquired multivariate LIBS dataset, independent of the representativity of the dataset.
To test the generalization potential of the model, entire samples should be left out from training and used for testing. This can only partially be done with our dataset as four (or more) samples are only available from four deposits, where the train
:
test ratio can be kept close to the values used in common practice, usually between 90
:
10 and 70
:
30. If only deposits represented by four or more samples are included (Fig. 9), classification accuracy reaches ∼65%. This is achieved by testing all combinations where one entire sample is left out for testing from each deposit and the other samples are used for training. This result shows the severe effect if we have no control over the geological heterogeneity of the deposits and the changes in processing practices, which can both have a large impact on the chemical signature of the traded material. Low classification accuracy in the case of Madagascar is also diagnostic, as it can reflect different provenance (e.g., different mine from the same deposit), changing processing practices over years, on-site mixing or within-deposit heterogeneity. The results also showcase that, in this context, the amount and versatility of the available sample material is of greater importance than the choice of the classifier and its hyperparameters. Fig. 9B demonstrates that if the underlying training data describes the test population well (i.e., the highly heterogeneous Madagascar deposit is not included), a correct prediction is made with a high probability. It is also important to note that these observations are not LIBS-specific: any chemical fingerprinting method will face similar challenges with sample heterogeneity. Here the speed of the LIBS analysis is advantageous as it can analyze more sample within a short period of time, compared to most other techniques.
Footnote |
| † Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d5ja00053j |
| This journal is © The Royal Society of Chemistry 2025 |