Reading/writing segments and flags¶

The SegmentList, DataQualityFlag, and DataQualityDict objects each include read() and write() methods to enable reading from and writing to a number of different file formats for segment-like objects. As with other classes in GWpy, the format keyword argument can be used to manually specify the input or output format, if the file extension isn’t obvious enough.

The read() and write() methods take different arguments and keywords based on the input/output file format, see the following sections for details on reading/writing for each of the built-in formats. Those formats are:

LIGO_LW XML
HDF5
JSON

`LIGO_LW` XML¶

Additional dependencies: python-ligo-lw

The LIGO Scientific Collaboration uses a custom scheme of XML in which to store tabular data, called LIGO_LW. Complementing the scheme is a python library - python-ligo-lw - which allows users to read and write all of the different types of tabular data produced by gravitational-wave searches.

Segments are stored in LIGO_LW format using a trio of tables:

`LIGO_LW` XML tables for segment storage¶
Table name	Contents
`segment_definer`	Definitions for each flag, including names and versions
`segment_summary`	Known segments for each flag
`segment`	Active segments for each flag

Reading¶

The DataQualityFlag.read() method takes in the file path (or paths) and the name of the flag to read:

>>> f = DataQualityFlag.read('segments.xml', 'L1:DMT-ANALYSIS_READY:1')

This will parse of each of three tables for references to the given name, returning the metadata and segments as a DataQualityFlag. The result may be something like:

>>> print(f)
DataQualityFlag('L1:DMT-ANALYSIS_READY:1',
                known=[[1000000000 ... 1000000100)],
                active=[[1000000000 ... 1000000034)
                        [1000000065 ... 1000000100)],
                description=None)

This indicates a single ‘known’ segment starting at GPS time 1000000000, with two active segments.

These results are simulated, and do not actually indicate operating times of the LIGO-Livingston observatory.

The coalesce=True keyword argument can be used to combine overlapping segments into a single, longer segment.

Writing¶

To write a DataQualityFlag to file in LIGO_LW format, use the write() method:

>>> f.write('new-segments.xml')

As with writing tables, if the target file already exists, an IOError will be raised, use overwrite=True to force a new file to be written.

To write a table to an existing file, use append=True:

>>> f.write('new-segments.xml', append=True)

To replace the segment tables in an existing file, while preserving other tables, use both append=True and overwrite=True:

>>> f.write('new-table.xml', append=True, overwrite=True)

Extra attributes can be written to the tables via the attrs={} keyword, all attributes are set for all three of the segment-related tables:

>>> f.write('new-table.xml', append=True, overwrite=True, attrs={'process_id': 0})

Note

The python-ligo-lw library reads and writes files using an updated version of the LIGO_LW format compared to glue.ligolw used to. GWpy should support both format versions natively when _reading_, but only supports writing using the updated format.

`DataQualityDict`¶

The DataQualityDict DataQualityDict.read() and DataQualityDict.write() methods work in an almost identical manner, taking a list of flag names when reading:

>>> fdict = DataQualityFlag.read('segments.xml', ['H1:DMT-ANALYSIS_READY:1', 'L1:DMT-ANALYSIS_READY:1'])

Identical arguments should be used relative to the DataQualityFlag.write() method when writing:

>>> fdict.write('new-segments.xml')

HDF5¶

GWpy uses HDF5 Groups to store a DataQualityFlag, with each of the known and active segment lists stored in a Dataset, and extra metadata stored in the Group’s attributes.

Reading¶

To read a DataQualityFlag from an HDF5-format file:

>>> f = DataQualityFlag.read('segments.hdf')

As with reading other classes from HDF5, the path keyword should be used to specify the name of the HDF5 group that contains the given flag.

The coalesce=True keyword can be used to coalesce() the known and active segment lists before returning - by default the segments will be returned as read from the file.

Writing¶

To write a DataQualityFlag to an HDF5-format file:

>>> f.write('new-segments.hdf5')

As with reading, the path keyword should be used to specify the name of the HDF5 group to which the given flag should be written.

Alternatively, an HDF5 group can be passed directly to write() when writing multiple objects to the same file.

`DataQualityDict`¶

As with LIGO_LW XML, the DataQualityDict DataQualityDict.read() and DataQualityDict.write() methods work in an almost identical manner, taking a list of flag names when reading:

>>> fdict = DataQualityFlag.read('segments.hdf5', ['H1:DMT-ANALYSIS_READY:1', 'L1:DMT-ANALYSIS_READY:1'])

Identical arguments should be used relative to the DataQualityFlag.write() method when writing:

>>> fdict.write('new-segments.hdf5')

JSON¶

The DQSEGDB server uses JSON as the intermediate format for returning information during queries.

Reading¶

To read a DataQualityFlag from JSON, simply pass the path of the file:

>>> f = DataQualityFlag.read('segments.json')

See json.load() for acceptable keyword arguments options.

Writing¶

To write a DataQualityFlag to JSON:

>>> f = DataQualityFlag.write('new-segments.json')

See json.dump() for keyword arguments options.

Reading/writing segments and flags¶

LIGO_LW XML¶

Reading¶

Writing¶

DataQualityDict¶

HDF5¶

Reading¶

Writing¶

DataQualityDict¶

JSON¶

Reading¶

Writing¶

`LIGO_LW` XML¶

`DataQualityDict`¶

`DataQualityDict`¶