MRC format#
The .mrc
format is widely used for tomographic data. The implementation of this plugin
is based on this specification
and has partial support for FEI’s custom header.
Note
When reading 4D-STEM data saved by the Velox software, the data are read as a stack
of diffraction patterns, but the navigation_shape
argument can be used to
specify the shape of the navigation space.
Note
For .mrc
files, the file_reader
takes the mmap_mode
keyword argument
to load the file using a different mode (default is copy-on-write) . However,
note that lazy loading does not support in-place writing (i.e lazy loading and
the r+
mode are incompatible).
See also the format documentation by the Collaborative Computational Project for Electron cryo-Microscopy (CCP-EM).
This plugin does not support writing .mrc
files, which can however be done
using the mrcz plugin. No additional feature of the
mrcz format should be used in order
to write a .mrc
compliant file. In particular, the compressor
argument should
not be passed (Default is None
):
import numpy as np
from rsciio import mrcz
data = np.random.randint(100, size=(10, 100, 100)).astype('int16')
s = hs.signals.Signal2D(data)
s_dict = s.as_dictionary()
mrcz.file_writer('test.mrc', s_dict)
Alternatively, use hyperspy.api.signals.BaseSignal.save()
, which will pick the
mrcz
plugin automatically:
import hyperspy.api as hs
import numpy as np
data = np.random.randint(100, size=(10, 100, 100)).astype('int16')
s = hs.signals.Signal2D(data)
s.save("data.mrc")
MRC Format (Direct Electron)#
Loading from Direct Electron’s .mrc
as well as reading the metadata from the .txt file
saved by the software is supported by passing the metadata_file
argument to the
file_reader
function. The metadata_file
argument can be a string or a file-like
object.
This will automatically set the navigation shape based on the Scan - Size X
and = 256
Scan - Size Y
as well as the Scan - Repeats
and Scan - Point Repeats
parameters in the metadata file. The navigation shape can be overridden
by passing the navigation_shape
argument to the file_reader
function.
API functions#
- rsciio.mrc.file_reader(filename, lazy=False, mmap_mode=None, endianess='<', navigation_shape=None, distributed=False, chunks='auto', metadata_file=None)#
File reader for the MRC format for tomographic data.
- Parameters:
- filename
str
,pathlib.Path
Filename of the file to read or corresponding pathlib.Path.
- lazybool, default=False
Whether to open the file lazily or not.
- mmap_mode{
None
, “r+”, “r”, “w+”, “c”}, default=None Argument passed to
numpy.memmap
. A memory-mapped array is stored on disk, and not directly loaded into memory. However, it can be accessed and sliced like any ndarray. Lazy loading does not support in-place writing (i.e lazy loading and the"r+"
mode are incompatible). IfNone
(default), the value is"r"
whenlazy=True
, otherwise it is"c"
.- endianess
str
, default=”<” "<"
or">"
, depending on how the bits are written to the file.- navigation_shape
tuple
orNone
, default=None Specify the shape of the navigation space. If
None
, the navigation shape will be infer from metadata and if not possible, the data will be loaded as a stack with a navigation dimension equal to one.- distributedbool, default=False
Whether to load the data using memory-mapping in a way that is compatible with dask-distributed. This can sometimes improve performance when reading large files. And splitting the data loading/processing over multiple workers.
- chunks
tuple
ofint
orNone
, default=None Define the chunking used for saving the dataset. If
None
, calculates chunks for the signal, with preferably at least one chunk per signal space.- metadata_file
str
The filename of the metadata file.
- filename
- Returns:
list
ofdict
List of dictionaries containing the following fields:
‘data’ – multidimensional
numpy.ndarray
ordask.array.Array
‘axes’ – list of dictionaries describing the axes containing the fields ‘name’, ‘units’, ‘index_in_array’, and either ‘size’, ‘offset’, and ‘scale’ or a numpy array ‘axis’ containing the full axes vector
‘metadata’ – dictionary containing the parsed metadata
‘original_metadata’ – dictionary containing the full metadata tree from the input file
When the file contains several datasets, each dataset will be loaded as separate dictionary.