Preprocessing Data
latools expects data to be organised in a particular way. If your data do not meet these expectations, you’ll have to do some pre-processing to get them into a format that latools can deal with. Read on to learn about the ‘preprocessing’ tools included in latools.
If you’ve got data that can’t be processed with the functions below, please let us know and we’ll work out how to accommodate your data.
1. Data in a single long file
To work with data stored this way, you have to split the long file into several shorter files, each containing the ablations of a single sample. This can be done using latools.preprocessing.split.long_file().
What this function does:
- Imports your data, along with the list of sample names that you provide.
- Applies autorange() to identify ablations.
- Matches up the sample names and ablations.
- Saves a single file for each sample in an output folder, which can be imported by analyse().
You’ll end up with a single directory containing one file for each sample in the sample list, named with the sample names that you provide. The number of samples in the list must match the number of ablations in the long file.
Example usage:
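import latools as la

# placeholder names: replace with your own, one per ablation in the long file
sample_list = ['sample_1', 'sample_2', 'sample_3']

la.preprocessing.long_file('path/to/long_data_file.csv', dataformat='DEFAULT',
                           sample_list=sample_list)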
The dataformat argument can be a dataformat dictionary, the name of a dataformat file, or the name of a latools configuration.
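To make the splitting steps for a long file more concrete, here is a minimal, self-contained sketch of the same idea in plain Python. It is not the latools implementation: find_ablations() is a crude threshold-based stand-in for autorange(), split_long_file() and the two-column (time, counts) layout are hypothetical, and the data are synthetic. It also keeps only the signal-on rows for each sample, which may differ from how latools divides the file.

```python
import csv
import tempfile
from pathlib import Path


def find_ablations(signal, threshold):
    """Return (start, stop) index pairs for runs where signal > threshold.

    A crude stand-in for autorange(); not the latools algorithm.
    """
    ranges = []
    start = None
    for i, value in enumerate(signal):
        if value > threshold and start is None:
            start = i                      # signal switched on
        elif value <= threshold and start is not None:
            ranges.append((start, i))      # signal switched off
            start = None
    if start is not None:                  # data ended during an ablation
        ranges.append((start, len(signal)))
    return ranges


def split_long_file(rows, sample_list, out_dir, threshold):
    """Write one CSV per sample, each holding the rows of one ablation."""
    ranges = find_ablations([counts for _, counts in rows], threshold)
    # The number of names must match the number of ablations found.
    if len(ranges) != len(sample_list):
        raise ValueError(
            f"found {len(ranges)} ablations but {len(sample_list)} sample names"
        )
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for name, (start, stop) in zip(sample_list, ranges):
        path = out_dir / f"{name}.csv"
        with path.open("w", newline="") as f:
            writer = csv.writer(f)
            writer.writerow(["time", "counts"])
            writer.writerows(rows[start:stop])
        written.append(path)
    return written


# Synthetic (time, counts) rows: zero background with two ablations.
counts = [0, 0, 100, 110, 105, 0, 0, 0, 95, 100, 0]
rows = [(t, c) for t, c in enumerate(counts)]

out_dir = Path(tempfile.mkdtemp())
files = split_long_file(rows, ["sample_A", "sample_B"], out_dir, threshold=50)
print([p.name for p in files])
```

The explicit length check mirrors the requirement that the sample list and the ablations in the long file match: a mismatch raises an error up front, so names are never silently assigned to the wrong ablation.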
2. Data in multiple files without sample names
Some instruments save data as sequentially numbered files, with the sample ID stored either externally (e.g. in a laser log file) or in the file headers.
External log file
Todo
Not implemented yet. This will be implemented when there is demand for it.
File names in headers
Currently supported.