This notebook serves as a basic introduction to loading and viewing data released in associaton with the publication titled GW190412: Observation of a Binary-Black-Hole Coalescence with Asymmetric Masses avaliable through DCC.
The data used in these tutorials can be downloaded from the same DCC page.
Note: This notebook has been prepared for the released samples that are available since DCC version 12. Preliminary samples were distributed with preprint versions of the paper. The samples initially released with the published paper contained a minor bug that only affected the calculation of the network matched-filter SNR. This bug has been fixed with the current release.
The released data file can be read in using the PESummary or h5py libraries. For this notebook we'll start with simple stuff using h5py. Then we'll use PESummary v0.9.1 to read the data  files as well as for plotting. For general instructions on how to manipulate the data file and/or read this data file with h5py, see the PESummary docs.
# import useful python packages
%matplotlib inline
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
import h5py
import scipy
# There is a known incompatibility between some older versions of numpy/scipy and seaborn. 
# Make sure you have an up-to-date version of scipy installed.
# This notebook has been tested with the following versions
for name, package in zip(('numpy', 'scipy', 'seaborn', 'h5py'), (np, scipy, sns, h5py)):
    print('{} version: {}'.format(name, package.__version__))
Some simple stuff with "vanilla" h5py
# read in the data
fn = 'GW190412_posterior_samples_v3.h5'
data = h5py.File(fn,'r')
# print out parametrized waveform family names ("approximants" in LIGO jargon).
print('approximants:',data.keys())
# print out top-level data structures for one approximant. Here fore example we use the combined samples
# between IMRPhenomPv3HM and SEOBNRv4PHM. The data structure is the same for all approximants.
print('Top-level data structures:',data['combined'].keys())
# extract posterior samples for one of the approximants
posterior_samples = data['combined']['posterior_samples']
print('data structures in posterior_samples:',posterior_samples.dtype)
pnames = [item for item in posterior_samples.dtype.names]
print('parameter names:',pnames)
# get samples for one of the parameters
dL = posterior_samples['luminosity_distance']
print('dL shape, mean, std =',dL.shape,dL.mean(),dL.std())
# smooth it
from scipy.stats.kde import gaussian_kde
hs = gaussian_kde(dL)
# histogram, and overlay the smoothed PDF
plt.figure()
h, b, o = plt.hist(dL,bins=100)
hsmoothed = hs(b)*len(dL)*(b[1]-b[0])
plt.plot(b,hsmoothed)
plt.xlabel('luminosity distance')
plt.ylabel('posterior PDF')
plt.show()
# release memory for the data
#del data
Now use PESummary v0.9.1 to read the data files as well as for plotting.
# import ligo-specific python packages. 
# pesummary is a ligo-specific python package for reading and plotting the results of Bayesian parameter estimation.
# Install with "pip install pesummary" , and make sure you have version >= 0.3.0.
import pesummary
from pesummary.gw.file.read import read
print(pesummary.__version__)
There are 9 different approximants that were used to analyze GW190412 plus the combined samples of IMRPhenomPv3HM and SEOBNRv4PHM. They are all stored in the data file.
fn = 'GW190412_posterior_samples_v3.h5'
data = read(fn)
labels = data.labels
print(labels)
To illustrate the data structure we pick the combined posteriors and plot the respective data.
samples_dict = data.samples_dict
posterior_samples = samples_dict["combined"]
prior_samples = data.priors["samples"]["combined"]
parameters = posterior_samples.keys()
print(parameters)
As an example, we'll show the different posterior distributions derived from the combined (precession + higher multipoles) data and the posterior distribution derived using the different approximants for the luminosity_distance parameter.
from pesummary.gw.plots.latex_labels import GWlatex_labels
parameter = "luminosity_distance"
latex_label = GWlatex_labels[parameter]
fig = posterior_samples.plot(parameter, type="hist", kde=False, prior=prior_samples[parameter])
fig.set_size_inches(12, 8)
plt.show()
from pesummary.core.plots.plot import _1d_comparison_histogram_plot
samples = []
for label in labels:
    samples.append(samples_dict[label][parameter])
colors = ['b', 'r', 'k', 'y', 'orange', 'g','purple','cyan','grey','violet']
fig = _1d_comparison_histogram_plot(parameter, samples, colors, latex_label, labels, kde=True)
fig.set_size_inches(12, 8)
plt.show()
Below is another examples for the mass ratio of the combined data set.
parameter = "mass_ratio"
latex_label = GWlatex_labels[parameter]
fig = posterior_samples.plot(parameter, type="hist", kde=True, prior=prior_samples[parameter])
fig.set_size_inches(12, 8)
plt.show()
Make a corner plot:
from pesummary.gw.plots.plot import _make_corner_plot
fig = _make_corner_plot(posterior_samples, GWlatex_labels, corner_parameters=['mass_1_source', 'mass_2_source', 'luminosity_distance', 'theta_jn', 'chi_eff', 'chi_p'])
plt.show()
The psds that were used for each analysis can also be extracted from this file and plotted
psd = data.psd["combined"]
fig = psd.plot(fmin=20.0)
fig.set_size_inches(12, 8)
plt.show()