1 / 29

Overview of MEP ’s big data applications

This presentation provides an overview of the Ministry of Environmental Protection's (MEP) use of big data in promoting ecological civilization. It covers key programs and projects, including data integration and sharing, scientific decision making, innovation in eco-environmental supervision, improving public services, and developing a big data platform.

dyoungberg
Download Presentation

Overview of MEP ’s big data applications

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Overview of MEP’s big data applications

  2. Presentation Outline I.Background II. Plan and Progress III. Key Programs/Projects IV. Questions

  3. Background • The Chinese Government attach great importance to the role of big data in promoting the development of ecological civilization. • The State Council has released the Outline of Action Plan for Promoting the Development of Big Data, which makes arrangement for the activities on big data. • In the chapter on “Information Disclosure and Public Participation”, the new Environmental Protection Law requires promoting the application of information technology in environmental protection. • MEP Minister Chen Jining hopes applying big data to enhance synergy of relevant work and promote transformation of environmental management.

  4. Environmental Information Center (EIC) of MEP • In charge of the collection, storage, maintenance, analysis, publication and management of national environmental data and information. • Having more than 50 staff in 6 divisions, including Technology, Website, Network, Application Development, Information Security and General Affair. • Establishing and maintaining the National Environmental Information System (NEIS) for Digital Environmental Protection

  5. Presentation Outline I.Background II. Plan and Progress III. Key Programs/Projects IV. Questions

  6. Three objectives Scientific decision making Systematic and comprehensive Rational & effective Open & innovative Accurate & reliable Big Data Accurate supervision Public service People oriented Synergy

  7. Six Tasks • Promote integration and sharing of data resources • Strengthen scientific decision making in eco-environment • Make innovation in the mode of supervision on eco environment • Improve public services for eco environment • Develop a big data platform

  8. Task 1:Promote integration and sharing of data resources Promote public access to eco environment data Improve capacity in obtaining data resources Strengthen integration of data resources 02 04 Promote sharing of data resources Construct infrastructure for big data 03 01 05

  9. Task 2:Strengthen scientific decision making Driven by data quantification Driven by experience Way of decision making Predict and give early warning of environment risks Assess the performances of key work Analyze environmental load and carrying capacity Comprehensively judge environment situation

  10. Task3:Make innovation in the mode of supervision on eco environment Energy Early warning EIA review On-line monitoring Video monitoring Satellite remote sensing Environmental complaints by letters & visits APP Environment Pollution source E Q New pollution sources of approved construction projects release pollutants to the environment and affect the environment Mag of sources Accurate supervision Eco environment Supervision Pollutants Scientific monitoring& analysis Three dimension supervision

  11. Task 4: Improve public service for eco environment • Construct the e-administration hall and develop the mode combining both on-line and off-line services. • Develop the “one stop”platform for on-line review and approval • Strengthen inter-department sharing of administrative review and approval data and support combined review and approval. Improve on-line services • Disclose more environment information and improve the quality and timelessness of disclosed information. • Expand the interactive channel between government and the public, create convenience for access to environment information and improve public participation. Improve service quality for disclosing information Enhance government comprehensive service capacity • Employ industry and social data to develop environmental information service products. • Develop convenient applications including APP, enrich service contents and expand service scope.

  12. Task 5:Develop a big data platform Collection, transmission, management, analysis, sharing and access of data resources, which provide unified data support to the application of big data. Innovative applications such as comprehensive decision making for eco environment, environmental supervision and public services. Big data management platform Big data application platform

  13. Presentation Outline I.Background II. Plan and Progress III. Key Programs/Projects IV. Questions

  14. Analysis and prediction of particle size of urban air pollutants based on big data (I) POIs Weather Traffic Human activities Road network Air quality of local urban areas Real-time air quality monitoring report Historical air quality data

  15. Analysis and prediction of particle size of urban air pollutants based on big data (II) • Partition a city into disjoint grids Extract features for each grid from its affecting region • Meteorological features • Traffic features • Human mobility features • POI features • Road network features • Co-training-based semi-supervised learning model for each pollutant • Predict the AQI labels • Data sparsity • Two classifiers

  16. Analysis and prediction of particle size of urban air pollutants based on big data (III)

  17. Demonstration • http://urbanair.msra.cn • MEP Information Center big data model on temporal and spatial distribution of local key urban air pollutants in Beijing-Tianjin-Hebei region http://kqzl.chinaeic.net:8080/uAir/p/air/air-state Urban Air Windows Phone App

  18. Predict air quality with satellite data (I) Ground surface PM2.5 concentration with high temporal and spatial accuracy is estimated withsatellite remote sensing monitoring data on the thickness and scope of haze combined with monitoring data of ground monitoring sites At present, ground air quality monitoring sites in China are established based on administrative region with very limited amount of monitoring sites. They cannot reflect spatial distribution of pollutants and are easy to be affected by the environment surrounding monitoring sites. Satellite remote sensing technology could offset the regional limit of ground monitoring and make real-time monitoring of each inch of land. In addition, it could help real-time calculation and display PM2.5 concentration across the whole country.

  19. Predict air quality with satellite data (II) http://54.191.242.127:8200/

  20. Weather and pollution analytics Weather and pollution analytics aims to investigate the correlations between API (Air Pollution Index) and the atmosphere states of cities in China. Both the air quality and the weather data are measured by monitoring sties distributing over the whole country. We gathered the big data, and then employed various machine learning methods to analyze them in distributed computing platforms, such as Spark. After the exploration of big data, we can use the knowledge to forecast the air quality and further provide decision support to our government. Raw Data ETL Filter by Fields Hypothesis Testing of Data Distribution The Correlation Features between Air Quality and Atmosphere States of Cities Significance Test Correlation Analysis

  21. Weather and pollution analytics This chart shows the correlations between PM2.5 and wind speed at ground in 370 cities of China. The warm color and cool color indicate positive and negative correlation respectively. The correlations of cities have an obviously regional distribution.

  22. Backward Trajectory Analytics Regional pollutant sourcing and pathway analysis (national wide and city based), especially for high pollution events Typical big data analytics use case for AQM on pollutant sourcing between major cities, especially for AQM guidance on finding out major pollution pathway and sources for high pollution events. E.g., Haikou (tourist city, no local pollution sources) high pollution events during 2014 New Year holidays, the local gov. desire to find out the pollution sources. Pollution injection pathway cluster for Haikou city during 2014 new year holidays Based on our sourcing results, two major pollution pathways found for Haikou city (northeast straight flow and west golf circulation) Zhejiang 12.5 % Guangdong 75% Hunan 12.5% West pacific subtropical high Haikou

  23. Backward Trajectory Analytics 10-04 20:00 PM2.5 Beijing high pollution events during 2015 National Day: the pollution cause can be analyzed based on backward trajectory (with passed-city’s PM2.5 concentration) at beginning time of event Pollution transport pathway at beginning time of event Beijing: PM2.5 207 (10-04 20:00) Based on our results, major pollution transport pathway found to cause the high pollution event for Beijing city at beginning Shandong -> Hebei -> southeast of Beijing HengShui: PM2.5 189.5 (10-04 05:00)

  24. Case Study on the control of water pollution at river basin level ---TMDL Model (I)

  25. Case Study on the control of water pollution at river basin level ---TMDL Model (II) • Big data model is employed in prediction of surface water concentration at river basin level, TMDL allocation and so on. In many circumstances, it has better simulation accuracy and applicability compared with traditional hydrology and water quality models and is applied in the management of water quality of several river basins of the United States.

  26. Case Study on the control of water pollution at river basin level ---TMDL Model (III) • Big data model developed by USGS could simulate BOD and NH3 discharge of point sources and DO discharge of non-point sources. • It provides support to the granting of pollution discharge license of the region.

  27. Abnormal pattern mining for monitoring data(I) Background: The condition of pollutant sensors used by factories is hard to examine and maintain. The reported sensor readings often consist of a significant amount of erroneous data which degrades its overall quality. Careless usage of such sensor readings may mislead the conclusions and misguide the decisions made for environment protection. Solution: Aims to develop a ubiquitous framework for detecting, filtering, and pattern mining the anomalies in the time series data captured with pollutant sensors. Challenges: • Lack of precise ground truth labels on the anomaly data • Capturing and distinguishing among various types of anomalies including that caused by device failures, forged readings, and abnormal manufacturing activities, etc.

  28. Abnormal pattern mining for monitoring data(I) Case study: Industrial pollutant emission automatic monitoring data Time Series Analysis Normal Data Trend analysis Anomaly Detection Anomaly Data Filter • Forged readings • Device failures • Extreme values • Mean shifts • Pattern changes • … Pollutant Sensor Pattern mining Abnormal Data Time Series Data

  29. Thanks a lot!

More Related