Maximizing Data Quality land Fitness for Purpose of Work Zone Data
- May 29, 2025
- Focus Area
- Transportation Systems Management & Operations
- Submitting On Behalf Of
- TRB Committee on RTSMO
- Urgency
- Critical - High Priority
- Cost
- $500,000 - 749,000
- Timeframe
- 1 - 2 years
- Type of Research
- Full Research Project
- Date Posted
- May 29, 2025
- Status
- Not Funded
Research Description
Research Idea Description:
Systems that can generate and deliver timely and actionable information about work zones are valuable. They have the potential to save lives and improve safety for both human and automated drivers. Work zones are useful edge cases for driver-assistance and automated driving systems, and detailed data about these cases are needed for training new AI.
WZDx feeds, produced by over 25 states and national parks, are intended to standardize and disseminate critical work zone information. While their structured format aims to ensure the availability of key data elements, many of these feeds fail to meet reliability standards necessary for integration with third-party systems like navigation platforms (e.g., Google Maps, Waze) or Original Equipment Manufacturers (OEMs)’ infotainment centers. These platforms are relied upon by drivers for real-time alerts, yet issues with data quality often hinder their effective use. Challenges with existing feeds include the following: ● Inconsistent Implementation Recently emerging data feeds, such as the Work Zone Data Exchange (WZDx), may not be implemented consistently across agencies. Subtle differences in semantics may impede “plug-and-play” interoperability, making it difficult or impossible for third-party mapping platforms and OEMs to integrate the feed into their products and services. ● Lack of Data Elements The current procurement processes for work zone vendors supplying equipment, including smart and connected devices, often fail to define or enforce the collection and provision of all data elements specified in feeds like WZDx. This is primarily due to a lack of clear use cases for the data, the higher costs associated with comprehensive data collection, and limited workforce capacity to manage and verify the data. Additionally, the absence of optional data elements can significantly impact the usability of the feed, as it limits the ability to generate complete and actionable information. For instance, a WZDx feed missing critical details like road names or directions, even if other data points are correctly populated, becomes less functional and may not be adopted by third-party users. ● Discrepancies Between Data Feeds and Ground Truth Many agencies create annual work zone plans that are incorporated into the WZDx feed, often reflecting extended construction projects spanning months and covering miles of roadway, including lane or road closures. However, vendors responsible for supplying this data frequently fail to deactivate devices once work is completed, leading to outdated or inaccurate information. In some cases, active devices are left running in non-active areas, such as parking lots, further misrepresenting real-world conditions. This results in data that is irrelevant and unusable for generating timely, life-saving warnings for drivers.
Third-party mapping companies and OEMs are understandably hesitant to use such unreliable data, as they would bear the liability for providing incorrect information to drivers. Additionally, the presence of inaccurate data undermines user trust in the system, ultimately jeopardizing the long-term goal of delivering accurate and actionable safety information.
Research Objectives:
Gain a thorough understanding of the business practices involved in the provision of current WZDx data feeds, including the efforts, policies, procurement processes, and costs involved in enabling agencies to produce reliable, actionable data. Examine the challenges agencies face in generating these feeds, such as ensuring compliance with standards, maintaining data accuracy, and addressing workforce or technological limitations. This analysis should uncover gaps and opportunities to improve data quality and consistency.
Key Components ● Literature Review: Perform a review of evaluation studies of existing data feed deployments, as well as key standards and existing implementation guidance ● Stakeholder Outreach: Perform a survey of stakeholders involved in the work zone data ecosystem. This may include agencies, mapping companies, OEMs, data-sharing platform providers, etc. ● In-Depth Interviews: Interview a selection of survey respondents for a deep dive into current practices, and lessons learned. ● Report: Summarize findings and develop a guide for best practices
Expected Outcomes:
Develop a report to provide practical guidance to assist agencies and all stakeholders in creating and sharing reliable data that end users can trust. Develop best practices to foster collaboration among agencies, vendors, and data users. These practices should focus on aligning goals with existing standards like Connected Work Zones (CWZ) and promoting the use of high-quality data to enhance safety outcomes. By creating clear guidelines for data creation, collection, and validation, this initiative can empower agencies and vendors to produce reliable feeds while reducing costs and overcoming resource constraints.
Additional Supporting Information
Author of Research Idea: Blake Hansen, PE, PTOE, PMP TRB ACP-10 Data for Operations Subcommittee of the Regional TSMO Committee Co-sponsored by TRB ACP-20 Freeway Operations
- Submitted By
- Thomas Jacobs
- Center for Advanced Transportation Technology
- 301-405-7328
Attachments
This idea has no attachments.
Comments
Join the Discussion
You can make your thoughts known by commenting, and by rating the idea and other comments. All you need to do is sign in, or register for an account with us!