Lockman SWIRE master catalogue¶

Preparation of Red Cluster Sequence Lensing Survey (RCSLenS) data¶

This catalogue comes from dmu0_RCSLenS.

In the catalogue, we keep:

The id as unique object identifier;
The position;
The g, r, i, z, y auto magnitudes.

Strange magnitudes¶

The missing values seems to be encoded as -99. but there are also quite some 99. magnitudes.

The “sensible” range of magnitudes seems to go from 14 to 37 (depending on the bands and given that 37 is really faint and may not be reliable). In addition to that there are some very low magnitudes under -40. and very high ones above 90. We don't know the meaning of these extreme values so we are removing all the negative magnitudes and and those above 80.

We are also removing the sources for which we have no magnitude information given the modifications above.

from herschelhelp_internal import git_version
print("This notebook was run with herschelhelp_internal version: \n{}".format(git_version()))

This notebook was run with herschelhelp_internal version: 
44f1ae0 (Thu Nov 30 18:27:54 2017 +0000)

%matplotlib inline
#%config InlineBackend.figure_format = 'svg'

import matplotlib.pyplot as plt
plt.rc('figure', figsize=(10, 6))

from collections import OrderedDict
import os

from astropy import units as u
from astropy.coordinates import SkyCoord
from astropy.table import Column, Table
import numpy as np

from herschelhelp_internal.flagging import  gaia_flag_column
from herschelhelp_internal.masterlist import nb_astcor_diag_plot, remove_duplicates
from herschelhelp_internal.utils import astrometric_correction, mag_to_flux

OUT_DIR =  os.environ.get('TMP_DIR', "./data_tmp")
try:
    os.makedirs(OUT_DIR)
except FileExistsError:
    pass

RA_COL = "rcs_ra"
DEC_COL = "rcs_dec"

I - Column selection¶

imported_columns = OrderedDict({
        "id": "rcs_id",
        "ALPHA_J2000": "rcs_ra",
        "DELTA_J2000": "rcs_dec",
        "CLASS_STAR": "rcs_stellarity",
        "MAG_g": "m_rcs_g",
        "MAGERR_g": "merr_rcs_g",
        "MAG_r": "m_rcs_r",
        "MAGERR_r": "merr_rcs_r",        
        "MAG_i": "m_rcs_i",
        "MAGERR_i": "merr_rcs_i",
        "MAG_z": "m_rcs_z",
        "MAGERR_z": "merr_rcs_z",
        "MAG_y": "m_rcs_y",
        "MAGERR_y": "merr_rcs_y"    
    })


catalogue = Table.read("../../dmu0/dmu0_RCSLenS/data/RCSLenS_Lockman-SWIRE.fits")[list(imported_columns)]
for column in imported_columns:
    catalogue[column].name = imported_columns[column]

epoch = 2017

# Clean table metadata
catalogue.meta = None

# Adding flux and band-flag columns
for col in catalogue.colnames:
    if col.startswith('m_'):
        
        errcol = "merr{}".format(col[1:])
        
        # Remove missing values (-99, 99?) and extreme magnitudes (see above).
        mask = (catalogue[col] < 0) | (catalogue[col] > 80)
        catalogue[col][mask] = np.nan
        catalogue[errcol][mask] = np.nan
        
        flux, error = mag_to_flux(np.array(catalogue[col]), np.array(catalogue[errcol]))
        
        # Fluxes are added in µJy
        catalogue.add_column(Column(flux * 1.e6, name="f{}".format(col[1:])))
        catalogue.add_column(Column(error * 1.e6, name="f{}".format(errcol[1:])))

        # We add NAN filled aperture columns because no aperture fluxes are present
        # EDIT: Better not add empty columns if we can avoid.
        #nancol = np.zeros(len(catalogue))
        #nancol.fill(np.nan)
        #catalogue.add_column(Column(nancol, 
         #                           name="m_ap{}".format(col[1:])))
        #catalogue.add_column(Column(nancol, 
        #                            name="merr_ap{}".format(col[1:])))
        #catalogue.add_column(Column(nancol, 
        #                           name="f_ap{}".format(col[1:])))
        #catalogue.add_column(Column(nancol, 
        #                           name="ferr_ap{}".format(col[1:])))
        
        # Band-flag column
        if 'ap' not in col:
            catalogue.add_column(Column(np.zeros(len(catalogue), dtype=bool), name="flag{}".format(col[1:])))
        
# TODO: Set to True the flag columns for fluxes that should not be used for SED fitting.

catalogue[:10].show_in_notebook()

1.1 Remove all nan rows¶

orig_len = len(catalogue)
mask = ~(np.isnan(catalogue['m_rcs_g']) 
        & np.isnan(catalogue['m_rcs_r'])
        & np.isnan(catalogue['m_rcs_i'])
        & np.isnan(catalogue['m_rcs_z'])
        & np.isnan(catalogue['m_rcs_y'])
        )
catalogue = catalogue[mask]
print(orig_len-len(catalogue), 'out of ', orig_len, ' objects removed due to all nan magnitudes.')

481360 out of  2607590  objects removed due to all nan magnitudes.

II - Removal of duplicated sources¶

We remove duplicated objects from the input catalogues.

SORT_COLS = []
FLAG_NAME = 'rcs_flag_cleaned'

nb_orig_sources = len(catalogue)

catalogue = remove_duplicates(
    catalogue, RA_COL, DEC_COL, 
    sort_col= SORT_COLS,
    flag_name=FLAG_NAME)

nb_sources = len(catalogue)

print("The initial catalogue had {} sources.".format(nb_orig_sources))
print("The cleaned catalogue has {} sources ({} removed).".format(nb_sources, nb_orig_sources - nb_sources))
print("The cleaned catalogue has {} sources flagged as having been cleaned".format(np.sum(catalogue[FLAG_NAME])))

The initial catalogue had 2126230 sources.
The cleaned catalogue has 2052639 sources (73591 removed).
The cleaned catalogue has 72807 sources flagged as having been cleaned

III - Astrometry correction¶

We match the astrometry to the Gaia one. We limit the Gaia catalogue to sources with a g band flux between the 30th and the 70th percentile. Some quick tests show that this give the lower dispersion in the results.

gaia = Table.read("../../dmu0/dmu0_GAIA/data/GAIA_Lockman-SWIRE.fits")
gaia_coords = SkyCoord(gaia['ra'], gaia['dec'])

nb_astcor_diag_plot(catalogue[RA_COL], catalogue[DEC_COL], 
                    gaia_coords.ra, gaia_coords.dec, near_ra0=True)

delta_ra, delta_dec =  astrometric_correction(
    SkyCoord(catalogue[RA_COL], catalogue[DEC_COL]),
    gaia_coords, near_ra0=True
)

print("RA correction: {}".format(delta_ra))
print("Dec correction: {}".format(delta_dec))

RA correction: -0.07070391181969171 arcsec
Dec correction: -0.1302633008947396 arcsec

catalogue[RA_COL] +=  delta_ra.to(u.deg)
catalogue[DEC_COL] += delta_dec.to(u.deg)

nb_astcor_diag_plot(catalogue[RA_COL], catalogue[DEC_COL], 
                    gaia_coords.ra, gaia_coords.dec, near_ra0=True)

IV - Flagging Gaia objects¶

catalogue.add_column(
    gaia_flag_column(SkyCoord(catalogue[RA_COL], catalogue[DEC_COL]), epoch, gaia)
)

GAIA_FLAG_NAME = "rcs_flag_gaia"

catalogue['flag_gaia'].name = GAIA_FLAG_NAME
print("{} sources flagged.".format(np.sum(catalogue[GAIA_FLAG_NAME] > 0)))

27935 sources flagged.

V - Flagging objects near bright stars¶

VI - Saving to disk¶

catalogue.write("{}/RCSLenS.fits".format(OUT_DIR), overwrite=True)

idx	rcs_id	rcs_ra	rcs_dec	rcs_stellarity	m_rcs_g	merr_rcs_g	m_rcs_r	merr_rcs_r	m_rcs_i	merr_rcs_i	m_rcs_z	merr_rcs_z	m_rcs_y	merr_rcs_y	f_rcs_g	ferr_rcs_g	flag_rcs_g	f_rcs_r	ferr_rcs_r	flag_rcs_r	f_rcs_i	ferr_rcs_i	flag_rcs_i	f_rcs_z	ferr_rcs_z	flag_rcs_z	f_rcs_y	ferr_rcs_y	flag_rcs_y
0	CDE1040A1_012221	164.8274632	55.9342951	0.59	26.4383	0.192844	24.954	0.0591562	24.3109	0.0468637	24.2471	0.128944	nan	nan	0.096534	0.017146	False	0.378791	0.0206384	False	0.68492	0.0295633	False	0.726372	0.0862653	False	nan	nan	False
1	CDE1040A1_012351	164.8598581	55.9352114	0.51	nan	nan	24.0958	0.111241	24.3293	0.179642	nan	nan	nan	nan	nan	nan	False	0.834987	0.08555	False	0.673411	0.11142	False	nan	nan	False	nan	nan	False
2	CDE1040A1_012523	164.8440663	55.9366953	0.05	24.9588	0.0891576	24.0017	0.0422543	23.8614	0.0494575	23.3469	0.096879	nan	nan	0.37712	0.0309681	False	0.910584	0.0354378	False	1.03619	0.0472006	False	1.66433	0.148506	False	nan	nan	False
3	CDE1040A1_012541	164.856439	55.936853	0.05	24.7429	0.0641804	24.0179	0.0365814	23.8584	0.0421868	23.8193	0.125855	nan	nan	0.460086	0.0271968	False	0.897098	0.0302257	False	1.03906	0.0403731	False	1.07716	0.124861	False	nan	nan	False
4	CDE1040A1_012593	164.844011	55.9373469	0.69	25.0123	0.0667518	25.01	0.067012	24.947	0.0838051	nan	nan	nan	nan	0.358988	0.0220708	False	0.359749	0.0222038	False	0.381241	0.029427	False	nan	nan	False	nan	nan	False
5	CDE1040A1_012610	164.8309266	55.9375784	0.53	25.7485	0.158175	25.0131	0.0894123	24.6255	0.0859212	24.1334	0.171663	nan	nan	0.182222	0.0265469	False	0.358724	0.0295415	False	0.512625	0.0405673	False	0.806566	0.127524	False	nan	nan	False
6	CDE1040A1_012657	164.8606267	55.9380141	0.7	25.164	0.0863055	24.5895	0.0555585	24.4154	0.0632862	24.1565	0.154603	nan	nan	0.312176	0.024815	False	0.529907	0.027116	False	0.622071	0.0362597	False	0.789587	0.112433	False	nan	nan	False
7	CDE1040A1_012667	164.8430236	55.9382088	0.48	nan	nan	nan	nan	nan	nan	nan	nan	nan	nan	nan	nan	False	nan	nan	False	nan	nan	False	nan	nan	False	nan	nan	False
8	CDE1040A1_012677	164.8294469	55.9382729	0.9	nan	nan	24.0691	0.0291555	nan	nan	23.2083	0.0539764	nan	nan	nan	nan	False	0.855776	0.0229803	False	nan	nan	False	1.89095	0.0940069	False	nan	nan	False
9	CDE1040A1_012702	164.8438543	55.9384471	0.58	25.2618	0.104723	25.1174	0.0949201	nan	nan	nan	nan	nan	nan	0.285285	0.0275167	False	0.325867	0.0284888	False	nan	nan	False	nan	nan	False	nan	nan	False