Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.
Advertisement
npj Climate and Atmospheric Science volume 5, Article number: 76 (2022)
504
3
Metrics details
The use of air quality monitoring networks to inform urban policies is critical especially where urban populations are exposed to unprecedented levels of air pollution. High costs, however, limit city governments’ ability to deploy reference grade air quality monitors at scale; for instance, only 33 reference grade monitors are available for the entire territory of Delhi, India, spanning 1500 sq km with 15 million residents. In this paper, we describe a high-precision spatio-temporal prediction model that can be used to derive fine-grained pollution maps. We utilize two years of data from a low-cost monitoring network of 28 custom-designed low-cost portable air quality sensors covering a dense region of Delhi. The model uses a combination of message-passing recurrent neural networks combined with conventional spatio-temporal geostatistics models to achieve high predictive accuracy in the face of high data variability and intermittent data availability from low-cost sensors (due to sensor faults, network, and power issues). Using data from reference grade monitors for validation, our spatio-temporal pollution model can make predictions within 1-hour time-windows at 9.4, 10.5, and 9.6% Mean Absolute Percentage Error (MAPE) over our low-cost monitors, reference grade monitors, and the combined monitoring network respectively. These accurate fine-grained pollution sensing maps provide a way forward to build citizen-driven low-cost monitoring systems that detect hazardous urban air quality at fine-grained granularities.
Pollution prediction in cities with dense populations can be critical for generating fine-grained policy recommendations and public health warnings1,2,3. The scale of accurate sensor-based monitoring required to achieve this can come at a huge cost and thus inhibit building a dense fine-grained pollution sensing map. The deployment of low-cost particulate matter sensors to replace or augment reference grade pollution air quality monitoring systems has been studied extensively recently, and have addressed issues of calibration4,5,6, design7,8, data selection9, and personal exposure quantification10,11. However, building a highly accurate large scale fine-grained pollution sensing and monitoring map that leverages the size of a pollution network has been largely unexplored. Specifically, modeling the behavior of noisy low-cost sensors in cities with high pollution and population density has not been studied previously, with recent state-of-the-art mapping approaches providing errors only in the range of 30–40%12,13. This high error lends the pollution sensing map unusable for policymaking and air quality hazard detection. Prior work on deploying low-cost sensor networks for air pollution have been successful on a small scale (within 2 km radius) with high rates of agreement for PM 2.5 measurements in Southeastern United States14. Survey studies have shown that there is a necessity for a paradigm shift towards crowd-funded sensor networks to enable fine-grained sensing-based applications on a large scale15. The question of calibration issues in such large scale settings has been explored recently with promising results without the need for significant recalibration16 after well-controlled laboratory calibration17. PM 2.5 prediction models recently have explored deep neural networks like long-short term memory (LSTM), convolution neural networks (CNN), attention-based models; vector regression, partial differential equations, but focus on a single unified model at a single location, rather than in a large scale sensor network setting18,19,20,21,22,23,24.
Recent work has also explored the use of distributed sensor networks to gather information on air pollution and other meteorological variables in urban contexts25,26,27,28,29. Clements et al. 30 provide a comprehensive review of many such works. Researchers have sought to learn more about how pollution sensing systems of low-cost sensors may be deployed in urban contexts14,31,32,33,34,35,36. With the exception of Gao et al. 36, who examine the performance of fine particulate sensors in Xi’an in China, most of these deployments have occurred in areas with significantly lower air pollution than the city of Delhi in India. Gao et al. 36 also point out that low-cost PM2.5 sensors may perform worse in very low pollution environments, suggesting that they may be relatively more useful when particulate concentrations are high. Related approaches in this space can be broadly classified into three groups—spatial interpolation approaches, land-use regression, and dispersion models Xie et al. 37, Jerrett et al. 38. In the case of dispersion models, they assume that an appropriate chemical transport model is identified along with their parameter values, and a high-quality emissions inventory. In the case of land-use regression models, having access to environmental characteristics that significantly influence pollution is critical. This additional data is often suited for longer range predictions, as the geographical and meteorological data vary over a longer temporal and coarser spatial grids39,40.
In this paper, we describe a methodology to model and predict urban air quality at a fine-grained level using dense and noisy, low-cost sensors. There are two main questions we seek to answer in this paper—(i) how can we use a network of low-cost and portable air quality monitors in order to build a fine-grained pollution heatmap in a city that provides accurate prediction?, (ii) does it help to augment existing monitoring networks by the local governments with low-cost air quality sensors?
We deploy a network of 28 low-cost sensors, many of them concentrated in the south Delhi area, in collaboration with Kaiterra41, a company that makes low-cost air quality monitors and air filters. We dramatically increase the density of the deployment by 28× in Delhi (area 573 mi2) with 28 sensors, compared to previous deployments (Xi’an – area 3898 mi2, 8 low-cost sensors). Further, the large longitudinal dataset we have been able to capture over 2 years as compared to prior work, which captured at most a few weeks of data, allows us to model long-term seasonal changes and train more complex neural network models that can adapt to seasonal and daily patterns. We build on prior work and model the pollution network in its entirety, with prediction models at each sensor location using data from near-by sensor locations.
We model pollution at any location in Delhi as measured by the concentration of fine particulate matter (PM2.5) measured in μgm−3 using historical data of up to 8 h from all the sensors in the network. We make this choice of building a fine-grained pollution sensing map over shorter timelines to leverage the primary advantage of low-cost sensors while overcoming the drawback of noise by aggregating numerous spatio-temporal measurements. By learning the variability of each of these noisy measurements through message passing neural networks (MPRNN) which have the ability to model each sensor separately, we learn to not only separate the signal from the noise, but build an accurate sensing network of low-cost sensors that achieves <10% root mean squared earror (RMSE) in predicting up to one hour in advance over a fine-grained spatio-temporal grid as compared to baseline modeling approaches that provide 30% RMSE. By using a sparse network of sensors, whose signals are shared through neural network embeddings, we learn to capture the information from nearby sources that might affect the readings of nearby sources (e.g., factory) and ignore the ones which are heavily localized (e.g., food cart). Such an accurate, fine-grained pollution sensing map (≤10% MAPE) is usable by policymakers in deciding which neighborhoods of the city need interventions to improve the air quality and population health. To the best of our knowledge, we are the first in attempting to model a city-scale sensor network deployment with low-cost sensors augmenting high-quality government monitoring stations. With a sensor network the size of a city, with 60 sensors spread across the city of Delhi (700 sq km), capturing spatio-temporal variations and constructing accurate pollution maps necessitates modeling each sensor separately. By increasing the scale and addressing the corresponding modeling challenges, our work has widespread implications for pollution sensing and its low-cost deployability.
Our data consists of PM2.5 concentration data averaged to the hour from the 28 low-cost sensors and the 32 government monitors, a total of 60 monitors, collected over a period of 24 months, from May 1, 2018, to May 1, 2020. We use the until Oct 30, 2019 for training (75%) and hold out the remaining (25%) for testing. We report two criteria—the RMSE and the mean absolute percentage error (MAPE). We evaluate our models on the data from the combined set of our 28 low-cost sensors and the 32 government monitors, as well as separately on each set. For each of these locations, we compare our model-based predictions with the ground truth of the measurement of the pollution sensor.
Overall, the MPRNN model with imputed data using STHM along with the spline correction provides a very highly accurate estimation of the PM concentration level across all locations (ref Table 1). The best performing model is able to predict PM2.5 concentrations with an average RMSE of 10.1 μgm−3 and MAPE of 9.6% across all the locations and over the testing period. While estimating a spline per location provides the best predictive performance, we note that using an average spline across all observed locations only marginally increases the RMSE and MAPE errors. The average spline is computed after averaging the data over all the locations. Across all locations, the median RMSE and MAPE are 9.15 μgm−3 and 8.64% respectively (ref Fig. 1). The best case values are 4.28 μgm−3 and 5.57% respectively, and the worst case values are 24.1 μgm−3 and 19.64% respectively. The location where we have minimum MAPE is at a location in Green Park, a very busy area of south Delhi, further validating the need for fine-grained pollution sensing in a large city like Delhi.
In this period, the PM2.5 concentration values ranges between 0 and 1000 μgm−3, and average value being ~130 μgm−3. a Bar plot comparing our methodology with other competing approaches. We note that modeling spatiotemporal interactions using a neural network such as MPRNN and accounting for intra-day periodic patterns in the form of spline corrections together make a big difference in the performance. b Distribution of MAPE for the best performing model – Per-Sensor Spline with STHM imputation + MPRNN, across all the locations shown as a cumulative density function (CDF). c Prediction errors of the best performing model (MPRNN+Spline) at every monitoring location on the map. d Errors of the final prediction zoomed into the regions with highest concentration of sensors (New Delhi and South Delhi).
The 3-way cubic spline fit shows a common trend of baseline pollution rising steadily up to 8 am, then decreasing up to 4 pm and then increasing again until midnight. We note that this is the composite polynomial model of the PM concentrations in an average day (ref Fig. 2). The median error of this model is about 40 μgm−3 at each of the three windows, 12 am–8 am, 8 am–4 pm and 4 pm–12 am, and this is reduced to about 10 μgm−3 post the neural network model fit on the residuals. Figure 2 and Supplementary Fig. 2 show the per-sensor splines and the average spline in detail. Not only do the per-sensor splines vary widely across space, we notice that regions with significantly high spline residual errors like the sensors A838, E8E4, and 2E9C in Supplementary Fig. 2, are all located in central locations of Delhi with well established commercial activity like Connaught Place, Sardarjung Enclave and Lado Sarai respectively. Further, in Supplementary Fig. 2, the outliers with significantly high residual error splines among the government monitoring stations are Patparganj DPCC, Punjabi Bagh DPCC, and DKSSR DPCC. While Patparganj is situated next to an industrial area, Punjabi Bagh is a well-known residential locality with established commercial activity centers, and DKSSR, short for Dr. Karni Singh Shooting Range, is a shooting range located in the outskirts of Delhi next to an interstate highway. The diversity of these splines across various geographical regions further indicate the need to model fine-grained pollution profiles in seemingly remote as well as central locations of Delhi. We also note that the average spline can sufficiently operate for bootstrapping at locations where we do not have enough sensor data to begin with.
The top two rows show the distribution of the residuals (in PM units of μg/m3) over space, before and after the spline correction. Three different splines were fitted over the residuals in three different time slots in the day. We observe that for the most part, locations that exhibited high residual errors after MPRNN fit (in the upper quantiles of the residual error distribution) continued to show high error (relative to other locations) even after spline correction, even though the magnitude of the residual does decrease. This phenomenon is partially explained by the high baseline values of the sensors with high residual errors, that is often coupled with high variance in measurement. a Slot 1 (12 AM–8 AM). b Slot 2 (8 AM–4 PM). c Slot 3 (4 PM–12 AM). d Composite cubic spline correction consisting of three splines fitted for three non-overlapping parts of the day—midnight to early morning (12 AM to 8 AM), midday (8 AM to 4 PM), and evening to midnight (4 PM to 12 AM). e Ground truth PM2.5 (blue), along with MPRNN prediction (green) and final prediction after spline correction (red) at one of our sensor locations in Chanakyapuri in New Delhi. f Ground truth PM2.5 (blue), along with MPRNN prediction (green) and final prediction after spline correction (red) at the CPCB monitor at Sirifort in South Delhi.
For the most part, locations that exhibited high residual errors after MPRNN fit continued to show high error (relative to other locations) even after spline correction, even though the magnitude of the residual decreases. This phenomenon is partially explained by the high baseline values of the sensors with high residual errors, that is often coupled with high variance in measurement.
The fewer the monitors we used in our hybrid model, the greater was the final prediction performance. As Supplementary Fig. 3 shows, with only one monitor in the network, the predictive errors are about 35 and 20 μgm−3, respectively, for the low-cost sensor network and government network. However, as we include data from more nodes in the network, final prediction error drops sharply to about 15% and then gradually tails off at about 10%. The error flattens out about 30 sensors, which is approximately the number of sensors of each type that we have in our experiment. We infer that having an even denser deployment likely adds little value to the predictive performance. Further, decreasing the amount of training data to train the model shows that at minimum, one year of data is required to capture the seasonal trends and achieve RMSE of almost 10% (Supplementary Table 3).
The low MAPE and RMSE across all monitors in Delhi provided by our Per-Sensor Spline+MPRNN with STHM imputation model are significant as it means that our model can detect hazardous air quality with high precision. The RMSE error is significantly lower than the observed variance in PM2.5 concentrations in a day, making it useful for short-term and intraday analyses as well. The WHO air quality standards prescribe that PM2.5 levels should not exceed 5 and 15 μgm−3 at an annual and daily average levels, while the Indian Government air quality standards prescribe 40 and 60 μgm−3, respectively. We note that for the 60 sensors, Delhi has exceeded these prescribed levels 371 out of the 641 days on a daily level, across 2 years of our measurement. The 9.6 % MAPE error that we are able to achieve, corresponds to the ability to detect hazardous air quality as per Indian government standards with 93.5% precision and 90.8% recall. This further indicates that the low error rate we have obtained leads to an almost exact prediction of hazardous air quality. This enables citizen-driven sensing where pollution sensor readings can be crowdsourced, and effective policy interventions like clean energy policies that penalize construction sites that have PM2.5 levels more than 25% higher than the nearest monitoring center can be operationalized42. Specifically, the improvement in predictive power is achieved in specific pollution hotspots like bus stations, markets, etc. (Fig. 1). In addition, we can provide transparency of the overall average pollution of the city43 and contribute towards increasing the co-benefits of clean energy policies44,45.
Since the data used to measure the model performance is new, it is important to understand the spatial variations and heterogeneity in measurements that underlies the sensor network. To further ensure that the improvement in model’s prediction performance is better than the noise in the data, we performed extensive calibration of the sensors. For this, we leveraged the calibration performed in-house by the sensor manufacturer (Kaiterra46) (more info in Appendix) which confirms that re-calibration is not required47, and also perform validation by comparing our sensor readings with the readings provided by the nearest government pollution monitoring station. Supplementary Figure 5 shows the cross-calibration of the average pollution value reported by the 28 government monitors with the average value of the 18 sensors in our testbed in the locality of South Delhi. We observe that the sensors have been fairly well calibrated with the reference monitors and report a similar average value across the city despite individual sensor level and spatio-temporal variations. This provides confidence in the data generated from this pilot to be useful as a reference for pollution modeling and forecasting.
Further, we also performed a nearest neighbor calibration where we compute temporal correlation of our sensor with the nearest government monitoring station of that sensor. Supplementary Table 4 shows that on average the correlation coefficients are >0.8, which indicates that there is no statistical significant difference between them on average (t-test, confidence level: 0.05, p-value: 0.0011). Further, in Supplementary Fig. 4, we see that when we order our sensors by the nearest neighboring government station, the cross-correlations between our sensors are correspondingly aligned, with high correlation between nearby sensors and low correlation between farther sensors. This further emphasizes the importance of the improvement in modeling as it significantly improves the prediction capabilities of a fine-grained sensor network, which can capture spatial variations in pollution of Delhi.
The development of fine-grained pollution sensing maps at low-costs can further catalyze the deployment of such monitoring networks in other polluted cities, where the pollution networks are sparse. With citizens procuring, deploying, and modeling pollution of cities accurately, this paper provides a way forward for developing high-quality fine-grained pollution sensing maps.
We model the spatio-temporal prediction problem as a graph prediction problem, where we predict a value at every node at a certain time using as input the historical values from neighboring nodes. In our setting, each sensor location v ∈ V is a node in an undirected graph. Assuming that air pollutants diffuse uniformly in all directions and exert their influence throughout our region of interest, in this case the greater Delhi region, we make the graph complete, where an edge exists between every pair of nodes. The end goal is to train a model that predicts at any node, the pollution level, measured in terms of the concentration of fine particulate matter PM2.5, at time t given one or more readings from neighboring locations prior to t. The first step is to interpolate the gaps in the data. We use a geostatistics model for this task, called the Spatio-temporal Hierarchical Model (STHM). Then we fit a cubic spline based on daily trends at each sensor location, and then finally train a Message-Passing Recurrent Neural Network (MPRNN) (Section 4.4) to predict residuals over the baseline. In order to account for the amount of influence based on the pairwise distances, we include the Euclidean distance between sensors as part of our feature embedding in our message-passing formulation. We test this model by predicting values at locations where sensors, and therefore ground truth information, are present, but the model is generalized enough to be used to predict at locations where there is no ground truth data available. If (y_{v,t}) is the reading of the sensor at location v, at timestamp t, and ({hat{y}}_{v,t}) is our corresponding prediction, the prediction model aims to minimize the mean absolute percentage loss:
Our pollution forecasting model for estimating the PM2.5 particulate matter concentration across space and time consists of three important steps. Given the variations in data availability across our pollution sensors, the first step of our method uses a standard Spatio-Temporal Hierarchical Model (STHM) to estimate the missing data. Our STHM model is a standard statistical modeling framework from geostatistics that combines multiple sources of information, accommodates missing values, and computes predictions in both space and time. Based on daily variation patterns observed at each of the pollution sensors, the second step in our method estimates a three-way cubic spline at each sensor location, one for each disjoint 8 h interval in a 24 h period (12 am to 8 am, 8 am to 4 pm and 4 pm to 12 am), representing three different patterns in the PM2.5 variations. The cubic splines for each sensor represented a baseline level of PM2.5 concentration. The cubic splines may provide a good approximation to the overall average daily variations across sensors but do not capture short term spatio-temporal variations represented by the residual errors in the baseline. The final step of our method is to train a Message-Passing Recurrent Neural Network (MPRNN) across the pollution monitoring points to estimate the residual errors from neighboring sensors. We will briefly describe the characteristics of our data and then explain the cubic spline and MPRNN methodology in this section. We refer the reader to the supplementary text for a detailed description of the STHM model.
The data used for the modeling the air pollution levels in Delhi was sourced from a combination of 32 local government monitors and a network of 28 low-cost sensors deployed by us in various locations of Delhi from May 2018 to May 2020. The average availability of each of these sensors are about 90 and 30% over the measured period, respectively. This disparity is attributed to a variety of factors such as disconnection for periodic necessary calibration, network outages and periodic servicing of sensors. The sensors are calibrated against the government sensors, by conducting a longitudinal comparison study by measuring in proximity to the location of the government monitoring centers. The locations and their summary statistics of the sensors by location is given by the Supplementary Tables 1 and 2, and are shown visually in the box plots in Supplementary Fig. 1.
We observe that on a daily basis, depending on the time of the day and the location, there is a low-frequency component that makes up an approximate “baseline level” of PM concentration. Based on this observation, we fit a piecewise polynomial function, called a spline, to model this low-frequency component. We divided a single day into a number of epochs and fit a spline for each epoch. Prior to implementing the cubic splines, we observed that the residual errors from the MPRNN model exhibits different errors at different times in the day. We then proceeded to fit cubic splines based on the daily spatio-temporal patterns per sensor and per location. For example, if our prediction error follows a temporal pattern of say, higher prediction error in the morning, while lower in the afternoon, we can leverage this fitting separate splines for morning and afternoon to subtract out this component. The spline can be of any order, but given our residual error patterns, but we found that piecewise cubic spline works best. Suppose at time t and location v, the raw PM value is given by yv,t. Then, the piecewise spline to predict y, with time period p is given by:
Note that the chosen parameters per sensor αv,p, βv,p, κv,p, νv,p, where p ∈ {“morning”, “afternoon”, “evening”}, depend on the patterns in our residual errors and are fit accordingly to minimize the root mean-squared residual error:
MPRNN, based on refs. 48,49, is a neural network architecture that is applied on a graph in order to predict values at each node in the graph. This approach enables to us incorporates spatial interactions between each pair of nodes as “messages” that are broadcast from every node to its neighbors. Each node has a modified version of a long short term memory (LSTM) network that iterates between message-passing and the recurrent computations.
Suppose yv,t is a quantity of interest at node v and time t, for which we would like to build a predictive model. Mathematically, we would like to learn a function ({{{mathcal{F}}}}) such that, ({y}_{v,t+1}={{{mathcal{F}}}}({v}_{1},{y}_{{v}_{1},t},{v}_{2},{y}_{{v}_{2},t},ldots ;{v}_{j}in {{{mathcal{V}}}})) where the set ({{{mathcal{V}}}}) denotes the set of all the nodes in the graph. A recurrent neural network unit is assigned to each node in the graph, with each node v maintaining a hidden state hv,t at time t. Through a message-passing phase and a time-recurrent phase, our model infers the next hidden state, hv,t+1 from which the PM value at v is decoded. A message-passing operation allows one segment to observe the hidden state of its neighboring segments.
The computation proceeds in five steps, as five layers of the neural network. In the first phase, the observation phase, the input observations ({Y}_{t}={{y}_{v,t}| vin {{{mathcal{V}}}}}) at time t are encoded into hv,t by the observation operation Ov. In the second and third phases, one or more iterations of messaging (M) and updating (U) operations are performed to propagate the observations in the graph. In the fourth phase, for each node, a time-recurrent operator Tv utilizing an LSTM unit takes as input the final hidden state hv,t and predicts the next hidden state hv,t+1. The final phase is the readout operation Rv, which decodes the hidden state to produce the output value to be predicted ({hat{y}}_{v,t+1}). These five steps are shown below. The message function takes as input the hidden states of a pair of nodes v and n and the Euclidean distance between them, dv,n as the influence of the pollution at a given location on the pollution at another location would depend on the distance between them. Hence, we include the distance in the embedding.
For a selection of nodes ({{{mathcal{W}}}}) in the graph, the components of the model ({{O}_{w},M,U,{T}_{w},{R}_{w},| win {{{mathcal{W}}}}}) are defined. During inference, the states ({H}_{t}={{h}_{w,t}| win {{{mathcal{W}}}}}) are maintained at each time step. The hidden state for each segment is initialized at t = 0 randomly during training and evaluation ({h}_{v,0} sim {{{mathcal{N}}}}(0,1)).
We used the data from May 1, 2018, to Nov 1, 2019, a period of 18 months, as the training period. The number of samples we had for training were 166,979 from our low-cost sensor network, and 371,806 from the government network, resulting in a total of 538,785 samples. The model was trained at each sensor location, using as input data from all the other monitors except itself, over the entire training period. We used the Adam optimizer50 with a learning rate of 0.001, and ran the training for 30 epochs to ensure a robust and well-trained model. To validate the model, we used the remaining 6 months data from Nov 1, 2019, to May 1, 2020. The number of ground truth samples available in this period were 20,408 and 91,493 in the low-cost network and government network, respectively, resulting in a total of 111901 samples. However, only 12 out of the 28 low-cost sensors were operational in the testing phase, since many of them had not been serviced properly, partly owing to the COVID-19 pandemic. The testing error reported under Results (§2), therefore, shows the predictions tested at 12 low-cost sensor locations and 32 government monitors, a total of 44 locations combined. Further, to understand the implications of availability of less data during training, we evaluated our model as shown in Supplementary Table 3 and found that with training data less than a year, our model’s performance significantly decreases as seasonal trends are not well captured.
The MPRNN is implemented using the Deep Graph Library51 and PyTorch 52 in Python. The model diagram is shown in Fig. 3.
a Network of air quality monitors in the entire greater Delhi region. b Model architecture, showing M sensor inputs feeding into the layers and producing a single real output, illustrated by zooming on the selected region in (a). The computation goes from top to bottom. The green boxes represent input PM concentrations from a set of locations, the gray boxes the hidden linear transformation layers, with the numbers in the boxes representing the number of internal parameters to be learned, and the orange box shows the RNN with the LSTM cells. Here 256 is the embedding size of the hidden layer messages passed, that was chosen empirically based on performance. The final output is the single real value of PM concentration. The input to the RNN is the vector output of length 256 from the hidden layer. More details are in the supplementary text. c Sample model of a low-cost sensor. d Our experimental testbed of monitors, and the quality of the PM2.5 data obtained. We had to contend with frequent outages and communication issues that plagued our sensor network and affected data availability.
We contrast our combined model with two alternative modeling approaches in order to set a baseline to benchmark the MPRNN model performance. The first one is the STHM itself, a state-of-the-art spatio-temporal modeling methodology. When the STHM is used solely for the prediction, it performs poorly, as it does not model unknown non-linear spatial dependencies due to dispersion. The second baseline is an alternative neural network formulation that collects information from a specified number (K) of nearest neighbors to a location L, and feeds them into a trained recurrent neural network, to predict the value at L. Unlike the MPRNN, this model does not account for explicit spatial influence between every pair of sensors, thus allowing us to see how a more simplified multi-variate non-linear model might perform. We call this model the k-Nearest Neighbor (k-NN) Spatial Neural Network.
The data that supports the findings of this study comprises two parts—the PM2.5 data from the government monitors and the data collected from our low-cost sensor network. The former is public data and can be accessed here53. The data can also be provided by the authors upon request. The latter is third-party data and the authors are bound by a confidentiality agreement with Kaiterra, the makers of the low-cost sensors, and can only be made available for confidential peer review, if requested by reviewers, within the terms of the data use agreement and if compliant with ethical and legal requirements.
All the relevant code can be obtained upon request from the corresponding author. The code is also available on GitHub: https://github.com/shivariyer/epod-nyu-delhi-pollution.
Shaddick, G., Thomas, M., Mudu, P., Ruggeri, G. & Gumy, S. Half the world’s population are exposed to increasing air pollution. NPJ Clim. Atmos. Sci. 3, 1–5 (2020).
Article Google Scholar
Rao, N. D., Kiesewetter, G., Min, J., Pachauri, S. & Wagner, F. Household contributions to and impacts from air pollution in India. Nat. Sustain. 4, 1–9 (2021).
Article Google Scholar
Geng, G. et al. Drivers of pm2. 5 air pollution deaths in china 2002–2017. Nat. Geosci. 14, 645–650 (2021).
Article Google Scholar
Liu, H.-Y., Schneider, P., Haugen, R. & Vogt, M. Performance assessment of a low-cost pm2. 5 sensor for a near four-month period in Oslo, Norway. Atmosphere 10, 41 (2019).
Article Google Scholar
Liu, X. et al. Low-cost sensors as an alternative for long-term air quality monitoring. Environ. Res. 185, 109438 (2020).
Article Google Scholar
Giordano, M. R. et al. From low-cost sensors to high-quality data: A summary of challenges and best practices for effectively calibrating low-cost particulate matter mass sensors. J. Aerosol Sci. 158, 105833 (2021).
Article Google Scholar
Tryner, J. et al. Design and testing of a low-cost sensor and sampling platform for indoor air quality. Building Environ. 206, 108398 (2021).
Article Google Scholar
Prakash, J. et al. Real-time source apportionment of fine particle inorganic and organic constituents at an urban site in Delhi city: An iot-based approach. Atmospheric Pollution Res. 12, 101206 (2021).
Article Google Scholar
Bi, J. et al. Publicly available low-cost sensor measurements for pm2.5 exposure modeling: Guidance for monitor deployment and data selection. Environ. Int. 158, 106897 (2022).
Article Google Scholar
Zusman, M. et al. Calibration of low-cost particulate matter sensors: Model development for a multi-city epidemiological study. Environ. Int. 134, 105329 (2020).
Article Google Scholar
Mahajan, S. & Kumar, P. Evaluation of low-cost sensors for quantitative personal exposure monitoring. Sustainable Cities Soc. 57, 102076 (2020).
Article Google Scholar
Spyropoulos, G. C., Nastos, P. T. & Moustris, K. P. Performance of aether low-cost sensor device for air pollution measurements in urban environments. accuracy evaluation applying the air quality index (aqi). Atmosphere 12, 1246 (2021).
Chu, H.-J., Ali, M. Z. & He, Y.-C. Spatial calibration and pm 2.5 mapping of low-cost air quality sensors. Sci. Rep. 10, 1–11 (2020).
Article Google Scholar
Jiao, W. et al. Community air sensor network (cairsense) project: Evaluation of low-cost sensor performance in a suburban environment in the southeastern united states. Atmos. Meas. Tech. 9, 5281–5292 (2016).
Article Google Scholar
Morawska, L. et al. Applications of low-cost sensing technologies for air quality monitoring and exposure assessment: How far have they gone? Environ. Int. 116, 286–299 (2018).
Article Google Scholar
Stavroulas, I. et al. Field evaluation of low-cost pm sensors (purple air pa-ii) under variable urban air quality conditions, in Greece. Atmosphere 11, 926 (2020).
Article Google Scholar
Tancev, G. & Pascale, C. The relocation problem of field calibrated low-cost sensor systems in air quality monitoring: a sampling bias. Sensors 20, 6198 (2020).
Article Google Scholar
Kim, H. S. et al. Development of a daily pm 10 and pm 2.5 prediction system using a deep long short-term memory neural network model. Atmos. Chem. Phys. 19, 12935–12951 (2019).
Article Google Scholar
Kalajdjieski, J., Mirceva, G. & Kalajdziski, S. Attention models for pm 2.5 prediction. In 2020 IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT) 1–8 (IEEE, 2020).
Lin, L., Chen, C.-Y., Yang, H.-Y., Xu, Z. & Fang, S.-H. Dynamic system approach for improved pm 2.5 prediction in Taiwan. IEEE Access 8, 210910–210921 (2020).
Article Google Scholar
Pérez, P., Trier, A. & Reyes, J. Prediction of pm2. 5 concentrations several hours in advance using neural networks in Santiago, Chile. Atmos. Environ. 34, 1189–1196 (2000).
Article Google Scholar
Song, L., Pang, S., Longley, I., Olivares, G. & Sarrafzadeh, A. Spatio-temporal pm 2.5 prediction by spatial data aided incremental support vector regression. In 2014 International Joint Conference on Neural Networks (ijcnn) 623–630 (IEEE, 2014).
Wang, Y., Wang, H., Chang, S. & Avram, A. Prediction of daily pm 2.5 concentration in china using partial differential equations. PLoS One 13, e0197666 (2018).
Article Google Scholar
Qin, D. et al. A novel combined prediction scheme based on cnn and lstm for urban pm 2.5 concentration. IEEE Access 7, 20050–20059 (2019).
Article Google Scholar
Liu, T. et al. Seasonal impact of regional outdoor biomass burning on air pollution in three Indian cities: Delhi, Bengaluru, and Pune. Atmos. Environ. 172, 83–92 (2018).
Article Google Scholar
Chambliss, S. E. et al. Local- and regional-scale racial and ethnic disparities in air pollution determined by long-term mobile monitoring. Proc. Natl Acad. Sci. USA 118, e2109249118 (2021).
Article Google Scholar
Liang, Y. et al. Wildfire smoke impacts on indoor air quality assessed using crowdsourced data in California. Proc. Natl Acad. Sci. USA 118, e2106478118 (2021).
Article Google Scholar
Ferraro, P. J. & Agrawal, A. Synthesizing evidence in sustainability science through harmonized experiments: Community monitoring in common pool resources. Proc. Natl Acad. Sci. USA 118, e2106489118 (2021).
Ludescher, J. et al. Network-based forecasting of climate phenomena. Proc. Natl Acad. Sci. USA 118, e1922872118 (2021).
Article Google Scholar
Clements, A. L. et al. Low-cost air quality monitoring tools: From research to practice (a workshop summary). Sensors 17, 2478 (2017).
Article Google Scholar
Lin, C. et al. Evaluation and calibration of aeroqual series 500 portable gas sensors for accurate measurement of ambient ozone and nitrogen dioxide. Atmos. Environ. 100, 111–116 (2015).
Article Google Scholar
Shusterman, A. A. et al. The Berkeley atmospheric co 2 observation network: Initial evaluation. Atmos. Chem. Phys. 16, 13449–13463 (2016).
Article Google Scholar
Moltchanov, S. et al. On the feasibility of measuring urban air pollution by wireless distributed sensor networks. Sci. Total Environ. 502, 537–547 (2015).
Article Google Scholar
Sun, L. et al. Development and application of a next generation air sensor network for the hong kong marathon 2015 air quality monitoring. Sensors 16, 211 (2016).
Article Google Scholar
Tsujita, W., Yoshino, A., Ishida, H. & Moriizumi, T. Gas sensor network for air-pollution monitoring. Sensors Actuators B: Chem. 110, 304–311 (2005).
Article Google Scholar
Gao, M., Cao, J. & Seto, E. A distributed network of low-cost continuous reading sensors to measure spatiotemporal variations of pm2. 5 in Xi’an, China. Environ. Pollution 199, 56–65 (2015).
Article Google Scholar
Xie, X. et al. A review of urban air pollution monitoring and exposure assessment methods. ISPRS Int. J. Geo-Inform. 6, 389 (2017).
Article Google Scholar
Jerrett, M. et al. A review and evaluation of intraurban air pollution exposure models. J. Exposure Sci. Environ. Epidemiol. 15, 185 (2005).
Article Google Scholar
Yeh, C. et al. Using publicly available satellite imagery and deep learning to understand economic well-being in Africa. Nat. Commun. 11, 1–11 (2020).
Article Google Scholar
U.S. Environmental Protection Agency, Office of Air Quality Planning and Standards. Air Quality Assessment Division, Research Triangle Park, NC (2021).
Technologies, K. Laser egg. kaiterra.com (2022).
Harigovind, A. Dust management committee recommends air quality monitors at all large construction sites in Delhi. https://indianexpress.com/article/cities/delhi/dust-management-committee-recommends-air-quality-monitors-at-large-delhi-construction-sites-7437599/ (2021).
Somvanshi, A. Delhi’s air quality and number games. https://www.downtoearth.org.in/blog/air/delhi-s-air-quality-and-number-games-76214 (2021).
Qian, H. et al. Air pollution reduction and climate co-benefits in china’s industries. Nat. Sustain. 4, 417–425 (2021).
Article Google Scholar
Tibrewal, K. & Venkataraman, C. Climate co-benefits of air quality and clean energy policy in India. Nat. Sustain. 4, 305–313 (2021).
Article Google Scholar
Johnson, C. How kaiterra ensures that sensedge devices are accurate and correctly calibrated. https://learn.kaiterra.com/en/resources/how-sensedge-devices-are-accurate-and-correctly-calibrated (2022).
Technologies, K. Does the laser egg need to be recalibrated? https://support.kaiterra.com/does-the-laser-egg-need-to-be-recalibrated (2022).
Gilmer, J., Schoenholz, S. S., Riley, P. F., Vinyals, O. & Dahl, G. E. Neural message passing for quantum chemistry. Proceedings of the 34th International Conference on Machine Learning (ICML). Vol. 70, 1263–1272 (2017).
Iyer, S. R., An, U. & Subramanian, L. Forecasting sparse traffic congestion patterns using message-passing rnns. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 3772–3776 (2020).
Kingma, D. P. & Ba, J. Adam: A method for stochastic optimization. International Conference on Learning Representations (ICLR). (2015).
Wang, M. et al. Deep graph library: A graph-centric, highly-performant package for graph neural networks. Preprint at https://arxiv.org/abs/1909.01315 (2019).
Paszke, A. et al. H. Advances in Neural Information Processing Systems 32 (eds. Wallach, H., Larochelle, H., Beygelzimer, A., d’Alché- Buc, F., Fox, E., & Garnett, R.) 8024–8035 (Curran Associates, Inc., 2019).
Central Pollution Control Board (CPCB). Central Control Room for Air Quality Management – All India. https://app.cpcbccr.com/ccr/#/caaqm-dashboard-all/caaqm-landing/caaqm-comparison-data (2022).
Download references
The work done by the authors Shiva Iyer, Ananth Balashankar, and Lakshminarayanan Subramanian in this paper was supported by funding from industrial affiliates in the NYUWIRELESS research group (https://www.nyuwireless.com), that funded Shiva Iyer in part as well as the air quality sensors used in the study. Shiva was also funded in part by an NSF Grant (award number OAC-2004572) titled “A Data-informed Framework for the Representation of Sub-grid Scale Gravity Waves to Improve Climate Prediction”. Mr. Balashankar is a Ph.D. student at New York University, and is also funded in part, by the Google Student Research Advising Program. We acknowledge our collaboration with Kaiterra for their efforts in the development and installation of the low-cost sensors. We acknowledge the data availability from CPCB on their public portal. We also acknowledge the contributions of Ulzee An, a former masters’ student, in writing code for older baseline models. Any opinions, findings and conclusions, or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of NYUWIRELESS or Kaiterra.
Department of Computer Science, New York University, New York, NY, USA
Shiva R. Iyer, Ananth Balashankar & Lakshminarayanan Subramanian
Swiss Data Science Center, ETH Zurich, Zurich, Switzerland
William H. Aeberhard
Columbia University, New York, NY, USA
Sujoy Bhattacharyya
Evidence for Policy Design (EPoD) at the Institute for Financial Management and Research (IFMR), New Delhi, New Delhi, India
Sujoy Bhattacharyya & Giuditta Rusconi
State Secretariat for Education, Research and Innovation (SERI), Bern, Switzerland
Giuditta Rusconi
Kai Air Monitoring Pvt Ltd, Gautam Buddha Nagar, UP, India
Lejo Jose & Nita Soans
Department of Economics, University of Chicago, Chicago, IL, USA
Anant Sudarshan
Department of Economics, Yale University, New Haven, CT, USA
Rohini Pande
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
You can also search for this author in PubMed Google Scholar
S.I., A.S., R.P., and L.S. contributed to problem conceptualization and design. S.I., A.B., W.A., and L.S. contributed to the spatio-temporal models. S.I., A.B., and W.A. contributed to the code, data analysis, and visualizations. S.B. and G.R. contributed to the sensor network deployment and data gathering efforts in Delhi guidance of R.P. S.I., A.B., W.A., R.P., A.S., and L.S. helped in writing and editing various sections of the paper.
Correspondence to Lakshminarayanan Subramanian.
Prof. Subramanian declares no competing non-financial interests but the following competing financial interests: Prof. Subramanian is a co-founder of Entrupy Inc, Velai Inc, and Gaius Networks Inc and has served as a consultant for the World Bank and the Governance Lab. Dr. Subramanian reports that Velai Inc broadly works in the area of socio-economic predictive models. All other authors declare no competing interests.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
Reprints and Permissions
Iyer, S.R., Balashankar, A., Aeberhard, W.H. et al. Modeling fine-grained spatio-temporal pollution maps with low-cost sensors. npj Clim Atmos Sci 5, 76 (2022). https://doi.org/10.1038/s41612-022-00293-z
Download citation
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41612-022-00293-z
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
Advertisement
© 2022 Springer Nature Limited
Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.