Mwitondi, Kassim S.Al-Kuwari, Farha A.Saeed, Raed A.Zargari, Shahrzad A.2020-01-212020-01-2120182019Mwitondi, K. S., Al-Kuwari, F. A., Saeed, R. A., & Zargari, S. (2019). A statistical downscaling framework for environmental mapping. Journal of Supercomputing, 75(2), 984–997. https://doi.org/10.1007/s11227-018-2624-y09208542http://dx.doi.org/10.1007/s11227-018-2624-yhttps://hdl.handle.net/20.500.12519/12This article is not available at CUD collection. The version of scholarly record of this article is published in Journal of Supercomputing (2019), available online at: https://doi.org/10.1007/s11227-018-2624-y.In recent years, knowledge extraction from data has become increasingly popular, with many numerical forecasting models, mainly falling into two major categories—chemical transport models (CTMs) and conventional statistical methods. However, due to data and model variability, data-driven knowledge extraction from high-dimensional, multifaceted data in such applications require generalisations of global to regional or local conditions. Typically, generalisation is achieved via mapping global conditions to local ecosystems and human habitats which amounts to tracking and monitoring environmental dynamics in various geographical areas and their regional and global implications on human livelihood. Statistical downscaling techniques have been widely used to extract high-resolution information from regional-scale variables produced by CTMs in climate model. Conventional applications of these methods are predominantly dimensional reduction in nature, designed to reduce spatial dimension of gridded model outputs without loss of essential spatial information. Their downside is twofold—complete dependence on unlabelled design matrix and reliance on underlying distributional assumptions. We propose a novel statistical downscaling framework for dealing with data and model variability. Its power derives from training and testing multiple models on multiple samples, narrowing down global environmental phenomena to regional discordance through dimensional reduction and visualisation. Hourly ground-level ozone observations were obtained from various environmental stations maintained by the US Environmental Protection Agency, covering the summer period (June–August 2005). Regional patterns of ozone are related to local observations via repeated runs and performance assessment of multiple versions of empirical orthogonal functions or principal components and principal fitted components via an algorithm with fully adaptable parameters. We demonstrate how the algorithm can be extended to weather-dependent and other applications with inherent data randomness and model variability via its built-in interdisciplinary computational power that connects data sources with end-users. © 2018, The Author(s).enPermission to reuse the abstract has been secured from Springer New York LLC.Chemical transport modelsDownscalingEmpirical orthogonal functionsEnsemble modellingInterdisciplinary computationPrincipal component analysisPrincipal fitted componentsUnsupervised modellingChemical analysisClimate modelsEnvironmental Protection AgencyExtractionMappingNumerical methodsOrthogonal functionsOzoneChemical transport modelsDown-scalingEmpirical orthogonal functionsEnvironmental phenomenaPerformance assessmentPrincipal fitted componentsStatistical downscalingUS Environmental Protection AgencyPrincipal component analysisA statistical downscaling framework for environmental mappingArticleCopyright : 2018, The Author(s).