In the present era of ever larger photometric sky surveys, designing efficient automated methods for unbiased selection of specific categories of objects and prediction of their properties based on photometric data is becoming an increasingly urgent quest. A good example of such objects are quasars (QSOs), both because of our interest in their physics and possible cosmological applications.
In my talk I present a catalog of 343,000 QSOs derived from one of the largest photometric surveys up to date - the Kilo-Degree Survey (KiDS) Data Release 4. The catalog is characterised by robust photometric redshifts with uncertainties, and its high magnitude depth was achieved with artificial neural networks (ANN) applied in a controlled way. The catalog is based on optical ugri and near-infrared ZYJHKs bands. With purity 97%, completeness 94%, and redshift error (mean and scatter) 0.009 +/- 0.12, the catalog is limited at r < 23.5. The robustness of the catalog was demonstrated e.g. by comparing the number counts of our QSO candidates to models from eBOSS, and by cross-matching the QSOs with Gaia in order to confirm zero parallaxes. In the talk, I will explain how machine learning (ML) methods can be successfully applied to observations fainter than the training data known from spectroscopy. It includes building inference subsets using high dimensional t-SNE visualisations of the feature space, and properly adapting bias vs variance tradeoff of the ML models with tests against the problem of extrapolation in the feature space. Our success of the extrapolation challenges the way that models are optimised and applied at the faint data end. The resultant catalog is ready for cosmological and active galactic nucleus (AGN) studies, and developed methodology is ready to be used on future yet larger photometric datasets.
Slides: in PDF
Video: https://youtu.be/Kwv0CH4KKRA