1 / 28

Data Mining Approach for Traffic Flow Characterization in Complex Airspace: The New York Metroplex Case

This study explores a data mining framework to characterize and analyze traffic flows in the complex airspace of the New York Metroplex, aiming to identify key factors impacting operational performance, assess system capacity, and model current operations for simulation of new concepts.

jmurley
Download Presentation

Data Mining Approach for Traffic Flow Characterization in Complex Airspace: The New York Metroplex Case

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. A Data Mining Approach for Characterization of Traffic Flows in Complex Airspace: The New York Metroplex Case FAA Joint University Program – Quarterly Meeting August 1, 2016 Mayara Condé Rocha Murça Prof. John Hansman

  2. Motivation • Metroplex are critical nodes in the National Airspace System (NAS) • High traffic levels associated with large metropolitan regions • Complex airspace design resulting from procedures at airports closely located and need for deconfliction • Volume and airspace complexity result in TFM challenges and can lead to inefficiencies • Planning and execution of arrival and departure rates for metroplex airports is a challenge as terminal airspace is highly constrained and resources are shared • The parameters and conditions that drive metroplex operational complexity and capacity are not well understood

  3. Research Objective • To develop a data mining framework for characterization of traffic flows in metroplex systems at spatial and temporal scales in order to: • Identify key factors that impact metroplex operational performance • Assess system capacity from observed performance • Identify airspace interactions that prevent airport capacity from being used at full extent • Model current operations for simulation of new operational concepts Focus: New York Metroplex Terminal Airspace

  4. The New York Metroplex • Most complex airspace in the NAS • Three major commercial airports • Newark International (EWR) • John F. Kennedy International (JFK) • LaGuardia (LGA) • Very dense routes with small separation • Must be run in a coordinated way LGA 14 nm EWR 9 nm JFK 18 nm

  5. The New York MetroplexTraffic Characteristics Departure push Arrival push Balanced Data Source: ASPM 2013-2015

  6. The New York MetroplexInfrastructure JFK EWR LGA

  7. Data Mining Framework FILTERED AND RESAMPLED FLIGHT TRACKS RAW DATA: OBSERVED FLIGHT TRACKS (ETMS) FOR 69 DAYS 60 nm F1 = (x11, y11, x12, y12,…, x1n, y1n) F2 = (x21, y21, x22, y22,…, x2n, y2n) . . . Fm = (xm1, ym1, xm2 ym2,…, xmn, ymn) TRAINING DATA: 14 days TEST DATA: 55 days CLUSTERING AT SPATIAL SCALE -DBSCAN- CLASSIFICATION -RANDOM FORESTS- CLUSTERING AT TEMPORAL SCALE -HIERARCHICAL CLUSTERING- AIRSPACE STRUCTURE RESOURCE USE MATRIX/VECTOR RESOURCE USE PATTERNS CLUSTER TIME

  8. Data Mining FrameworkClustering at Spatial Scale • Density-Based Spatial Clustering of Applications with Noise – DBSCAN (Ester et al., 1996) • Concept: a cluster is a set of density-connected points • Two input parameters: • MinPts: Minimum number of points (to create a cluster) • Epsilon: Distance threshold

  9. Data Mining FrameworkClustering at Spatial Scale • Density-Based Spatial Clustering of Applications with Noise – DBSCAN (Ester et al., 1996) • Concept: a cluster is a set of density-connected points • Two input parameters: • MinPts: Minimum number of points (to create a cluster) • Epsilon: Distance threshold FUNDAMENTAL CONCEPTS

  10. Data Mining FrameworkClustering at Spatial Scale • Density-Based Spatial Clustering of Applications with Noise – DBSCAN (Ester et al., 1996) • Concept: a cluster is a set of density-connected points • Two input parameters: • MinPts: Minimum number of points (to create a cluster) • Epsilon: Distance threshold FUNDAMENTAL CONCEPTS Epsilon – neighborhood (of a point) Contains all neighboring points that are within a distance Epsilon

  11. Data Mining FrameworkClustering at Spatial Scale • Density-Based Spatial Clustering of Applications with Noise – DBSCAN (Ester et al., 1996) • Concept: a cluster is a set of density-connected points • Two input parameters: • MinPts: Minimum number of points (to create a cluster) • Epsilon: Distance threshold FUNDAMENTAL CONCEPTS Epsilon – neighborhood (of a point) Contains all neighboring points that are within a distance Epsilon Density-reachability A point x is (directly) density-reachable from a core point y if x belongs to the Epsilon-neighborhood of y

  12. Data Mining FrameworkClustering at Spatial Scale • Density-Based Spatial Clustering of Applications with Noise – DBSCAN (Ester et al., 1996) • Concept: a cluster is a set of density-connected points • Two input parameters: • MinPts: Minimum number of points (to create a cluster) • Epsilon: Distance threshold FUNDAMENTAL CONCEPTS Epsilon – neighborhood (of a point) Contains all neighboring points that are within a distance Epsilon Density-reachability A point x can be (indirectly) density-reachable from a core point y through a chain of points

  13. Data Mining FrameworkClustering at Spatial Scale • Density-Based Spatial Clustering of Applications with Noise – DBSCAN (Ester et al., 1996) • Concept: a cluster is a set of density-connected points • Two input parameters: • MinPts: Minimum number of points (to create a cluster) • Epsilon: Distance threshold FUNDAMENTAL CONCEPTS Epsilon – neighborhood (of a point) Contains all neighboring points that are within a distance Epsilon Density-reachability A point x can be (indirectly) density-reachable from a core point y through a chain of points Density-connectivity Two points are density-connected if there is a third point from which they are density-reachable

  14. Data Mining FrameworkClustering at Spatial Scale • Density-Based Spatial Clustering of Applications with Noise – DBSCAN (Ester et al., 1996) • Concept: a cluster is a set of density-connected points • Two input parameters: • MinPts: Minimum number of points (to create a cluster) • Epsilon: Distance threshold FUNDAMENTAL CONCEPTS Epsilon – neighborhood (of a point) Cluster 2 Cluster 1 Contains all neighboring points that are within a distance Epsilon Density-reachability A point x can be (indirectly) density-reachable from a core point y through a chain of points Density-connectivity Noise Two points are density-connected if there is a third point from which they are density-reachable

  15. Airspace StructureJFK Arrivals JFK Arrivals Northeast West 21 clusters South North

  16. Airspace StructureNY Arrivals JFK Arrivals EWR Arrivals LGA Arrivals 21 clusters 16 clusters 13 clusters North Northeast West South

  17. Data Mining Framework FILTERED AND RESAMPLED FLIGHT TRACKS RAW DATA: OBSERVED FLIGHT TRACKS (ETMS) FOR 69 DAYS 60 nm F1 = (x11, y11, x12, y12,…, x1n, y1n) F2 = (x21, y21, x22, y22,…, x2n, y2n) . . . Fm = (xm1, ym1, xm2 ym2,…, xmn, ymn) TRAINING DATA: 14 days TEST DATA: 55 days CLUSTERING AT SPATIAL SCALE -DBSCAN- CLASSIFICATION -RANDOM FORESTS- CLUSTERING AT TEMPORAL SCALE -HIERARCHICAL CLUSTERING- AIRSPACE STRUCTURE RESOURCE USE MATRIX/VECTOR RESOURCE USE PATTERNS CLUSTER TIME

  18. Data Mining FrameworkClassification Supervised Learning Random Forests

  19. Data Mining FrameworkClustering at Temporal Scale Hourly Resource Use Vectors Vector composed of dominant trajectory clusters from each arrival gate to each airport during the specified time scale: (Csouth-JFK, Csouth-EWR, Csouth-LGA,…, Cnorth-JFK, Cnorth-EWR, Cnorth-LGA) Dissimilarity Matrix Hierarchical Clustering Average linkage Hamming distance (fraction of features that differ)

  20. Resource Use PatternsDominant Aggregate NY Flow Pattern • Clustering of hourly Resource Use Vectors results in 75 clusters (Resource Use Patterns – RUP) • 10 RUP capture 80% of the data • Landing direction • Arrivals or departures favored • Approach procedure South Flow North Flow

  21. Resource Use PatternsSouth Flow RUP 3: S-DF-LGA22/31 RUP 2: S-AF-LGA22/13 RUP 9: S-AF-LGA22/31 RUP 1: S-DF-LGA22/13 IMC VMC VMC IMC VMC IMC VMC IMC Number of flights JFK Arrivals JFK Departures EWR Arrivals EWR Departures LGA Arrivals LGA Departures 15 0

  22. Resource Use PatternsNorth Flow RUP 8: N-DF-VAP@LGA RUP 10: N-AF-IAP@LGA RUP 7: N-AF-VAP@LGA RUP 4: N-DF-IAP@LGA IMC VMC VMC IMC VMC IMC VMC IMC Number of flights JFK Arrivals JFK Departures EWR Arrivals EWR Departures LGA Arrivals LGA Departures 15 0

  23. Resource Use PatternsMixed North and South Flow RUP 6: Mixed-Ndominant-AF RUP 5: Mixed-Sdominant-AF IMC IMC VMC VMC Number of flights JFK Arrivals JFK Departures EWR Arrivals EWR Departures LGA Arrivals LGA Departures 15 0

  24. Throughput and DelayJFK Arrivals IMC VMC VMC IMC

  25. Throughput and DelayJFK Arrivals VMC VMC

  26. Throughput and DelayJFK Arrivals SOUTH NORTH MIXED (1)Source: FAA Management Tips

  27. Summary and Future Work • Data mining framework to characterize traffic flows at spatial and temporal scales • New York Metroplex Terminal Airspace Analysis • Identified the current as-flown terminal area structure • Identified and characterized dominant flow patterns • Assessed arrival capacity based on observed throughput and terminal area delay • Future work • Identify key flow pattern characteristics that increase airspace complexity and reduce capacity • Identify other relevant factors that explain the variability in flow pattern performance • Develop a prediction model for transit times (delay) in the terminal area

  28. Questions? mayara@mit.edu

More Related