Developing CAVEMAN to Monitor the Accuracy of Traffic Flow

Reprinted from TEC 50 Issue 3 (MARCH 2009)
Philip Tait
National Traffic Control Centre (NTCC), 3 Ridgeway, Quinton Business Park, Quinton Parkway, Birmingham, B32 1AF
PACS Numbers: 02.70.Lq, 02.70.Rw, 46.90.+s, 89.40.+k
Key Words: traffic, highway, simulation, modelling

Abstract

The National Traffic Control Centre (NTCC) collects traffic data in real time. Data received from the Traffic Monitoring Equipment are processed to calculate vehicle flow rates. The accuracy of these flow rates is determined using a self-monitoring validation methodology. This paper describes the application of the Long-term Improvement Process (LIP) to develop CAVEMAN which is now recognised as the approved methodology for validating the accuracy of vehicle flow data on the Highway's Agency Project Network.

Download a PDF version of this paper

Back to Traffic Science

1. Introduction

The Highways Agency's Project Network consists of the motorways and major trunk roads in England. The network is represented by a traffic data model containing approximately 12,000 links of which 4070 are currently designated as strategically important and contain reporting points. The National Traffic Control Centre (NTCC) is run by Serco on behalf of the HA and is responsible for maintaining the traffic data model; collecting and processing real time data from Automatic Number plate Recognition systems (ANPR) and Traffic Monitoring Equipment (TME) and ensuring that traffic data is reported accurately. The NTCC system calculates flows as vehicles per hour every five minutes for each reporting point. These flows are derived from traffic counts received from TME inductance loops located on major carriageways within the network. Approximately 30% of reporting point links have no installed monitoring equipment and the flow rates have to be derived using traffic counts from more than one location. In all cases, the accuracy of the flows must be validated and are subject to a contractual reporting process on a monthly basis.

2. Background

Traditionally, the only way to check the accuracy of flow data has been to make a video recording of the traffic and manually count the vehicles passing a reporting point. The counts are then compared with the flow data so that the accuracy of the data can be assessed. However, checking the accuracy of all the reporting points in this way would take a very long time and, given that the majority of data are accurate, would be extremely inefficient. To satisfy contractual requirements to demonstrate accuracy of flow data, NTCC currently performs validation checks on 40 sites selected randomly each month. The accuracy of this sample is applied to the full population to validate the accuracy of the full set of reporting points. However, this statistical method could lead to inaccurate flows being undetected for many years. One problem associated with video assessment is the possibility of manual counting errors. These could result in an accurate site being assessed as inaccurate and vice versa. Where a site is initially assessed as inaccurate, the counts can be re-checked to eliminate the errors. However, if an inaccurate site is assessed as accurate, it is unlikely to be rechecked and therefore the inaccuracy will never be detected, unless it is selected as part of a future random sample. The other major problem with the video survey validation method is that the flow data is only validated for the period of the survey (normally 2 hours) which may not be representative of the sample link flows on a day to day basis.

NTCC developed the Long-term Integration Process for detecting errors in loop-based flow data to improve the methodology for validating link flows on the project network. This applies the principles of Kirchhoff's law for electric current to vehicle flows on the Project Network and was reported in TEC [1]. The LIP methodology has now been used to develop CAVEMAN - the Continuous Assessment of Validation Equations by Monitoring the Agency's Network.

3. Creating the Validation model

Kirchhoff's law for electric current states that:

The sum of the currents entering a node must equal the sum of the currents exiting a node.

LIP applies this law to vehicle flow rates by representing the Project Network as links joined by nodes in a series of validation sets. Additionally, analysis of traffic flows on the Project Network have shown [1] that over a seven-day period, total flows measured on a carriageway in one direction are approximately equal to flows measured on the opposite carriageway enabling opposite flows to be included in validation sets. A small number of exceptions exist (notable amongst these is the Severn Bridge, where a toll is charged to cross into Wales but not in the opposite direction). Once such a location is known and its effect quantified, it can be included in a validation set by using a correction factor in order to scale the opposite flows.

There are two standard templates which represent geometries commonly found on the Project Network. The first template is based upon the layout of a motorway grade-separated junction; the second represents the layout of a dual carriageway. These templates assist in the development of validation equations for each junction set. For example, in the motorway grade-separated junction shown in Figure 1, the flow at Link 1 is compared with the sum of those at Links 6 and 7, with the sum of the flows at Links 2 and 4 and with the flow at Link 12. If all agree within a defined limit of accuracy, then all can be regarded as accurate. If they all disagree, the flow on Link 1 is inaccurate. If one is wrong and the others are correct, the flow on Link 1 is almost certainly accurate and the inaccuracy will be in one of the other flows. All the comparisons can be verified against each other.

Figure 1: Layout of motorway grade-separated junction

For dual carriageways, the number of validation equations that can be derived from a standard geometry is less than that for a grade-separated junction. This is due to the number of un-monitored roads which contribute to the flows measured on the main carriageways. In Figure 2 the flow on Link 1 can validate the flow on Link 6; Link 2 flow can validate flow measured on Link 5 and Link 3 flow can validate Link 4. Flows measured on Link 3 cannot be used directly to validate flows on Link 2 as the flows leaving and entering the main carriageway are unknown.

Figure 2: Layout of a standard dual carriageway

These principles have been used to develop the LIP validation model. This model represents the Project Network as a series of junction sets containing one or more reporting point links. Each set contains a series of validation equations that can be used to validate the flows measured on each link within the set.

4. Using the LIP process

The initial version of the LIP validation model was developed in 2005 [1]. Weekly analysis demonstrated that link flows can be validated where accurate and that validation exception errors can be used to identify flow inaccuracy.

Validation exception errors can be caused by:

Poorly calibrated loop sensitivity results in one or more lanes either not counting all vehicles or counting vehicles in an adjacent lane;
Poor loop installation which may result in the same effects as poor calibration, but cannot be corrected without re-installing the loops in the carriageway;
Loop faults resulting in erratic counting, either when flows are low or when the faults are intermittent;
Equipment faults leading to inconsistent counting;
Incorrect loop configurations leading to traffic being miscounted;
Incorrect site configuration causing data for one site to be extracted from a different site;
Changes to the network itself - e.g. when a 2 lane road is reduced to a single lane / if a bypass is built / etc.

Errors can also be raised in cases where the link flow validation equations are inaccurate.

These may occur when:

Long-term road-works, contra-flow, road closures etc. have made the model inaccurate for the period of analysis;
Link flow data have been calculated using erroneous flow formulae;
The validation flow model is inappropriate due to local traffic behaviour.

Once the inaccuracies have been identified, the causes can be investigated and faults rectified.

5. Evolution of CAVEMAN

Regular analysis of the Project Network using the LIP process established the methodology to be accurate, enabling the detection of flow inaccuracies due to equipment faults and network modelling errors. The application of the LIP process in its initial form proved to be cumbersome and time-consuming. However, over a period of time, it did allow NTCC to identify inaccurate traffic data and take corrective action that could be verified by video surveys. For a practical day-to-day accuracy verification process, a more streamlined and automated tool was required and the CAVEMAN (Continuous Assessment of Validation Equations by Monitoring the Agency Network) process was created.

The concept of "continuous assessment" was developed to allow an automatic measure of delivered service level. Previously, the service level had been assessed on a monthly basis using video survey results and using a statistical method to apply the results to the full reporting point population. The single criterion was (and still is) that the product of the percentage of all reported data accurate to within defined levels of tolerance and the overall availability must be greater than 95%. For example, if 98% of all reported data were accurate to within the defined levels of tolerance, in order to meet the criterion, the overall availability of data must be 97%. The main disadvantages of the video survey method, apart from the cost, were that it depended on the ability of individuals to count and classify vehicles and it only considered a 2-hour snapshot within the month. The consequence of the snapshot is that a data report that is in error for a short period of time is either reported as totally bad or totally good, depending on the timing of the snapshot.

CAVEMAN has now been accepted by the HA as the prescribed method for assessing service level for traffic flow data, alleviating the need for the video survey method.

6. The CAVEMAN process

The CAVEMAN calculation and reporting system has been developed as a Microsoft Access application. This extracts traffic data for the NTCC's ORACLE database and then uses the LIP methodology to analyse and report flow accuracy.

The daily monitoring process can be summarised by the following steps:

Link flow data are extracted for the day preceding the analysis date and averaged for each link in the CAVEMAN validation model and stored in a daily flow table;
Link flow data are processed to provide a 7-day flow table containing the averaged data for the seven days preceding the analysis date;
Validation equations are used to provide single day and 7-day error values for each validation equation defined in the model;
Daily and 7-day maximum and minimum error values are calculated for each link in the validation model;
The link errors values are compared to error acceptance criteria;
Daily exception reports are created to list link validation errors outside the acceptance criteria;
Investigation and action plan is enabled to diagnose problems and fix faults;
Fault status reports are used to monitor and progress outstanding faults.

7. Calculating Errors using Validation Equations

The CAVEMAN validation equations define conditions where the net flow described by a validation calculation is zero.

The following is an example showing the derivation of a validation equation for a typical node in the project network.

The diagram shows a representation of three links joining at a single node.

If a, b and c represent the vehicle flow rate on the links, the test criteria is

a + b - c = 0

The measure of the variance from the test condition is

Error(%) = 2 ×

|(a_T + b_T - c_T)|

(a_T + b_T + c_T)

× 100

This error calculation is used in the CAVEMAN model to calculate errors associated with unclassified (i.e. total) flows. Flow accuracy validation is also required for classified flows, where vehicles are divided into two length classifications: short vehicles less than 6.6m in length and long vehicles greater than 6.6m in length.

The Traffic Monitoring Equipment categorises the counted vehicles by length and supplies the count information in four categories. These four categories are used to identify vehicles as short (category 1+category 2) and long (category 3+ category 4).

The Caveman model uses a similar error calculation to test for the accuracy of classified flows:

Error(%) = 2 ×

|(a_s + b_s - c_s)| + |(a_l + b_l - c_l)|

(a_T + b_T + c_T)

× 100

Where a_s, b_s and c_s are the short vehicle flow rates, a_l, b_l and c_l are the long vehicle flow rates and a_T, b_T and c_T are the total flow rates on each link.

The allowable deviation from the validation criteria before an exception error is raised is defined by the Highway's Agencies link categorisation. The HA has classified the different roads on the Project Network as "A", "B" or "C", according to the importance of the routes. The required accuracy is shown in Table 1 for the three categories.

8. An example of the CAVEMAN process

This example demonstrates the application of the CAVEMAN validation process in order to test the accuracy of the link flow data on the M3 westbound between J3 and J4. This is a Category A reporting point link, identified by Link_ID 103047401in the HA Project Network.

Figure 3: CAVEMAN validation set used to validate flows on the M3 westbound between J3 and J4

Link 103047401 is a member of the validation link set for the M3, junctions 3 to 4 as shown in Figure 3. This figure shows the Project Network link identifiers and their equivalent Link position numbers in the CAVEMAN validation template for a grade-separated junction shown in Figure 1.

There are four test calculations that can be used to validate the accuracy of link 103047401 in this set. Over a seven day period, the total flow (i.e. vehicles/hour) measured on the link will equal:

The flows measured on links 103043201 (18) and 103047602 (17);
The flows on links 103048102 (11) and 104044701 (4);
The flow on link 103048201(1);
The flows on links 103048402 (2) and 104044701 (4).

Link flows are derived from traffic counts received from Traffic Monitoring Equipment and each link has one or more flow formulae which converts traffic counts measured at equipment locations to a vehicle flow rate at a reporting point. The flow formulae for each link used in this example are shown in Table 3. Where it is possible at least two flow formulae are identified for each link, with the most accurate designated as Priority 1, the next as Priority 2, etc. In cases where there are no alternative flow formulae, Priority 1 and Priority 2 formulae are the same. A full description of the derivation of flow formulae is described elsewhere [2].

By examining the Priority 1 flow formulae for links 103047401 (12), 103043201 (17) and 103043201(18) it can be seen equipment (represented by the LMS term in the flow formulae) which supplies the traffic count data for the link ID 17 and 18 are used to derive flow data for link ID 12. The proposed test to verify the flows measured on link 103047401 using flows observed on 103043201 and 103043201 is therefore invalid as the validation equation is recursive. These recursive validation equations are identified as part of the daily CAVEMAN analysis process. The results of recursive calculations can be used to identify errors in the validation model but cannot be used to validate link flows.

LINK_ID

Priority

formulae

103048201

Fl((((LMS30101199@+0)*0.5)+((LMS30101199@+1)*0.5))+(((LMS30101140@+0)*0.5)+((LMS30101140@+1)*0.5)))

Fl((LMS30101186@-4)+(LMS30101135@-4))

103048402

Fl(LMS30101199@+0)

Fl(((LMS30101186@-5)*1.05)-(LMS30101140@+0))

103044701

Fl(LMS30101149@+0)

103048102

Fl(LMS30101195@+0)

Fl(((LMS30101188@+5)+(LMS30101136@+5))*0.95)

103047401

Fl((((LMS30101188@+0)*0.6)+((LMS30101188@+1)*0.4))+(((LMS30101136@+0)*0.6)+((LMS30101136@+1)*0.4)))

Fl((LMS30101195@-5)+(LMS30101139@-5))

103047602

Fl(LMS30101188@+0)

Fl(((LMS30101195@-5)*1.05)-(LMS30101136@+0))

103043201

Fl(LMS30101136@+0)

Table 3: Flow formulae used to derive link flow rates from Traffic Monitoring Equipment

The calculation output for the sample set is summarised below and simulates a scenario where a fault developed on a TMU outstation mid-way though Day 2 causes a significant reduction in flow to be observed on the link 103047401. Tables 4a and 4b show the 7-day averaged flow data and results of the unclassified error validation error calculations as a result of the equipment fault.

LINK_ID

Day 1 (v/h)

Day 2 (v/h))

Day 3 (v/h)

Day 4 (v/h)

Day 5 (v/h)

Day 6 (v/h)

Day 7 (v/h)

103048201

1513

1743

1889

2094

2053

2026

2054

103048402

1488

1546

1651

1681

1639

1608

1629

103044701

208

224

252

267

268

270

276

103048102

1507

1553

1648

1704

1686

1667

1732

103047401

1793

1796

1583

1367

1093

943

811

103047602

1282

1283

1131

976

781

673

579

103043201

517

503

443

383

306

224

227

Table 4a: 7-day averaged flow data for the M3, junction 3 to 4 link set

Calculation

Recursive

Day 1 E (%)

Day 2 E (%)

Day 3 E (%)

Day 4 E (%)

Day 5 E (%)

Day 6 E (%)

Day 7 E (%)

17,18,-12

Yes

0.33

0.57

11,4,-12

Yes

4.45

18.21

36.21

56.50

69.02

84.94

1,-12

16.94

3.00

17.64

42.02

61.00

72.97

86.79

2,4,-12

5.57

1.44

18.36

35.08

54.25

66.28

80.58

Min Error (%)

4.45

1.04

17.64

35.08

54.25

66.28

80.58

Table 4b: 7-day error analysis for the M3, junction 3 to 4 link set

The minimum error used to monitor the accuracy of flow data is slow to respond to the fault, with the full affects being evident in the calculation output after 7 days. The CAVEMAN system would raise an exception report after two days.

Tables 5a and 5b show the same fault scenario using single day averaged data. In this case the CAVEMAN reporting would raise an exception after 1 day - allowing fault to be diagnosed and progressed at an earlier stage. In both cases, the error calculation results for the recursive formulae (17, 18,-12) are shown. This highlights the need to identify and exclude recursive validation formulae from the error reporting process.

LINK_ID

Day 1 (v/h)

Day 2 (v/h))

Day 3 (v/h)

Day 4 (v/h)

Day 5 (v/h)

Day 6 (v/h)

Day 7 (v/h)

103048201

2132

2217

2258

2320

1473

1690

2285

103048402

1693

1735

1767

1821

1175

1400

1812

103044701

264

281

295

290

188

156

283

103048102

1858

1825

1847

1976

1337

1437

1846

103047401

2278

1135

777

818

540

565

772

103047602

1493

754.5

522

567

381

406

516

103043201

791

383.5

257

253

160

161

258

Table 5a: single-day averaged flow data for the M3, junction 3 to 4 link set

Calculation

Recursive

Day 1 E (%)

Day 2 E (%)

Day 3 E (%)

Day 4 E (%)

Day 5 E (%)

Day 6 E (%)

Day 7 E (%)

17,18,-12

Yes

0.26

0.31

0.34

0.20

0.37

0.24

0.26

11,4,-12

7.09

59.96

93.56

93.90

95.45

95.23

93.55

1,-12

6.62

64.60

97.63

95.73

92.75

99.73

98.99

2,4,-12

15.16

55.96

90.56

88.29

86.55

93.40

92.29

Min Error (%)

6.62

55.96

90.56

88.29

86.55

93.40

92.29

Table 5b: single-day error analysis for the M3, junction 3 to 4 link set

9. Conclusions

Application of the CAVEMAN process to the HA Project network has demonstrated successful continuous assessment of flow accuracy on a daily basis using the LIP methodology; this has now been accepted by the HA as the prescribed method for assessing service level for traffic flow data. The process also enables the identification of equipment faults and Project Network link flow model inconsistencies and inaccurate flow formulae.

There is potential to use the LIP methodology in other traffic network applications where a suitable validation model can be derived. Examples may include the development of a model designed to validate the count data from Traffic Monitoring Equipment and the development of processes to investigate traffic behaviour in specific areas.

10. Acknowledgements

NTCC would like to acknowledge the support of the Highway's Agency in the development of CAVEMAN process as a method of validating the accuracy of flows on the Project Network.

References

[1] L D Howe Detecting Errors in Loop-Based Flow Data Using a Long-term Integration Process (LIP) TEC 48 Issue 6 (June 2007)
[2] [2] L D Howe Deriving Accurate Flow Data for Unmonitored Links TEC 49 Issue 3 (March 2008)