In this study, we explored combining traffic maps and smartphone trajectories to model traffic air pollution, exposure and health impact. The approach was step-by-step modeling through the causal chain: engine emission, traffic density versus traffic velocity, traffic pollution concentration, exposure along individual trajectories, and health risk. A generic street with a 100 km/h speed limit was used as an example to test the model. A single fixed-time trajectory had maximum exposure at a velocity of 45 km/h, where the pollution concentration peaks. For the street population, the maximum exposure shifted to a velocity of 15 km/h because of the higher vehicle density in congestion. This shift is a universal effect of exposure. In this approach, nearly every modeling step of traffic pollution depended on traffic velocity. A traffic map is therefore an efficient pre-processor for calculating real-time traffic pollution exposure at global scale using big data analytics.

Traffic pollution is the dominant source of air pollution in most metropolitan areas and has major health effects. 50% of the world’s population lives in urban areas covering only 0.4% of the earth’s surface, and 70% are projected to live in urban areas by 2050 [

In most cities, air pollution levels exceed the guideline maximum levels established by the WHO (World Health Organization) to protect human health. The people most exposed are those who spend much time in heavy traffic [

Recent research projects (e.g., CitiSense, CITI-SENSE, EveryAware, and iSPEX) provided air quality information at smartphones to citizens equipped with low-cost pollution sensors [

Since 2013, smartphones with location services have gained nearly 100% mobile phone market penetration. A second trend is that dedicated mobile networks are being built to add billions of new sensors to the internet, forming the Internet of Things (IoT). Indeed, 90% of all available data today were generated within the last two years. Trends in computer science include big data, data mining, advanced analytics, cognitive computing, virtual reality, and robotics. New algorithms are based on more abstract mathematics, including topology, network theory, and functional analysis. New knowledge is extracted from data lakes filled by streams of heterogeneous data. Simulators predict future states of complex systems as digital twins of the real systems. In this study, traffic emissions, pollution, exposure and health effects were modeled using traffic map and smartphone data.

The modelling principles were to value simplicity, computational speed and scalability, above accuracy. Accuracy improvements can be added to the model at a later stage. The overall method is to mathematically model effects along the causal chain starting with single vehicle tailpipe emissions, ending up with health risk. Traffic map data and smartphone trajectories are used wherever possible.

can be further estimated. The individual exposure and health risk can be scaled up to the population exposure and health risks.

Vehicle tailpipe emission is the sum of emissions from an idle engine and a working engine. The idle engine gives a constant base load while the engine is turned on [

$$q_j = \sum_{k=1}^{N} n_k E_{0,j}^{k} + \sum_{k=1}^{N} n_k E_{1,j}^{k} v_k \tag{1}$$

where $E_{0,j}^{k}$ is the idle engine emission factor (kg/s) of the $j^{\mathrm{th}}$ pollutant for vehicle type $k$, $E_{1,j}^{k}$ is the working engine emission factor (kg/m) of the $j^{\mathrm{th}}$ pollutant for vehicle type $k$, $n_k$ is the number of vehicles of type $k$ per unit street length (1/m), and $v_k$ is the velocity (m/s) of a vehicle of type $k$. On average, vehicles move with the local traffic flow velocity $v$, so $v_k = v$, and Equation (1) simplifies to

$$q_j = \sum_{k=1}^{N} n_k E_{0,j}^{k} + v \sum_{k=1}^{N} n_k E_{1,j}^{k} \tag{2}$$

The vehicle distribution $n_k$ changes slowly, over years, so the distribution of vehicle types is approximately constant in time and space at country level. Thus, a set of effective constant parameters $E_{0,j}$, $E_{1,j}$ and $n$ can be applied:

$$E_{0,j} = \frac{1}{n} \sum_{k=1}^{N} n_k E_{0,j}^{k} \tag{3}$$

$$E_{1,j} = \frac{1}{n} \sum_{k=1}^{N} n_k E_{1,j}^{k} \tag{4}$$

$$n = \sum_{k=1}^{N} n_k \tag{5}$$

Plug-in Electric Vehicles (PEVs) and Plug-in Electric Hybrid Vehicles (PEHVs) are positive for air quality, but market penetration varies among countries. For example, the share of PEVs of the new car sales in 2015 was 0.66% in the USA and 22.39% in Norway [

$$q_j = \left( E_{0,j} + E_{1,j} v \right) n \tag{6}$$
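As a minimal sketch of Equations (3)-(6), the fleet averaging and the resulting line-source emission rate could be computed as follows. The `VehicleType` class, the function names and all numerical values are hypothetical placeholders chosen for illustration; they are not EPA emission factors.

```python
from dataclasses import dataclass


@dataclass
class VehicleType:
    density: float  # n_k, vehicles per metre of street (1/m)
    e_idle: float   # E_0,j^k, idle engine emission factor (kg/s)
    e_work: float   # E_1,j^k, working engine emission factor (kg/m)


def effective_factors(fleet):
    """Return (E_0j, E_1j, n) as density-weighted averages, Eqs. (3)-(5)."""
    n = sum(t.density for t in fleet)
    e0 = sum(t.density * t.e_idle for t in fleet) / n
    e1 = sum(t.density * t.e_work for t in fleet) / n
    return e0, e1, n


def emission_rate(e0, e1, n, v):
    """Line-source emission rate q_j in kg/(s*m) at flow velocity v, Eq. (6)."""
    return (e0 + e1 * v) * n


# Hypothetical two-type fleet (placeholder numbers, not measured data).
FLEET = [
    VehicleType(density=0.03, e_idle=1.0e-6, e_work=2.0e-7),
    VehicleType(density=0.01, e_idle=4.0e-6, e_work=8.0e-7),
]

if __name__ == "__main__":
    e0, e1, n = effective_factors(FLEET)
    print(emission_rate(e0, e1, n, v=10.0))
```

By construction, the averaged form gives the same emission rate as summing Equation (2) over the vehicle types directly.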

In the next section, traffic map information is assessed.

A traffic map (for example from Google, Here, TomTom, Yandex, and Baidu) shows near-real-time traffic velocity or travel time by a color code of data on street segments in a street map [

$$v_i = \frac{s_i}{t_i} \tag{7}$$

Two velocity averages are used in traffic engineering [

#Math_24#, and the space mean velocity $v_s$, which is the average velocity of $m$ vehicles passing a fixed street segment of length $s$, where each velocity is calculated from the time interval $t_i$ taken to cross the street segment from start to end (for example, by a video camera recording the segment):

$$v_s = \left( \frac{1}{m} \sum_{i=1}^{m} \frac{1}{v_i} \right)^{-1} = \left( \frac{1}{m} \sum_{i=1}^{m} \frac{t_i}{s} \right)^{-1} = s \left( \frac{1}{m} \sum_{i=1}^{m} t_i \right)^{-1} \tag{8}$$

In practice, the time mean velocity is about 2% greater than the space mean velocity. A smartphone location service can be used to measure velocity from the spatial difference $s_i$ between individual GPS (Global Positioning System) positions over a fixed sampling time interval $t$, averaged over the vehicles on a street segment:

$$v_t(\mathrm{GPS}) = \frac{1}{m} \sum_{i=1}^{m} \frac{s_i}{t} \tag{9}$$
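The difference between the two averages can be illustrated with a short sketch. The function names and the sample crossing times are assumptions made for this example; the arithmetic mean of the velocities is used here as the time mean velocity, which is the standard traffic-engineering definition.

```python
def time_mean_velocity(velocities):
    """Arithmetic mean of individual vehicle velocities (m/s)."""
    return sum(velocities) / len(velocities)


def space_mean_velocity(segment_length, crossing_times):
    """Segment length divided by the mean crossing time, Eq. (8)."""
    return segment_length / (sum(crossing_times) / len(crossing_times))


def gps_time_mean_velocity(displacements, sampling_interval):
    """Mean of GPS displacements s_i over a fixed sampling interval t, Eq. (9)."""
    return sum(displacements) / (len(displacements) * sampling_interval)


if __name__ == "__main__":
    segment = 100.0            # m, hypothetical segment length
    times = [8.0, 10.0, 12.5]  # s, hypothetical crossing times
    velocities = [segment / t for t in times]
    print(time_mean_velocity(velocities), space_mean_velocity(segment, times))
```

The space mean is the harmonic mean of the individual velocities, so it never exceeds the time mean; the two coincide only when all vehicles travel at the same velocity.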

Global traffic maps are calculated by millions of GPS positions, other static and dynamic input data, filtering, position corrections, and historical data to fill in blanks [

In traffic engineering, the fundamental diagram for traffic flow relates traffic flux ($nv$), the number of vehicles passing a fixed point per unit time, to vehicle density [

$$\frac{1}{n} = c_1 + c_3 v + \frac{c_2}{v_0 - v} \tag{10}$$

where $v_0$ is the free-flow traffic velocity at zero density ($n = 0$), $c_1$ is a fixed distance, $c_2$ is the constant of the term that ensures zero density as $v \to v_0$, and $c_3$ is a constant time interval per vehicle. For safe driving in Norway, $c_3 \approx 3$ s. Inverting $1/n$ gives

$$n = \frac{1}{c_1 + c_3 v + \dfrac{c_2}{v_0 - v}} \tag{11}$$

Thus, the vehicle density in Equation (6) can be obtained from the traffic map velocity. The maximal density $n_{\max}$ occurs at complete standstill, $v = 0$:

$$n_{\max} = \frac{1}{c_1 + \dfrac{c_2}{v_0}} \tag{12}$$

By inserting Equation (11) in Equation (6), the emission rate per unit length $q_j$ becomes

$$q_j = \frac{E_{0,j} + E_{1,j} v}{c_1 + c_3 v + \dfrac{c_2}{v_0 - v}} \tag{13}$$
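A minimal sketch of Equations (11)-(13) is given below. Only $c_3 \approx 3$ s and the 100 km/h free-flow velocity come from the text; the values of `C1`, `C2` and the emission factors are assumed placeholders.

```python
V0 = 100.0 / 3.6  # free-flow velocity (m/s), the 100 km/h example street
C1 = 5.0          # m, assumed placeholder
C2 = 50.0         # m^2/s, assumed placeholder
C3 = 3.0          # s, time interval per vehicle quoted in the text


def van_aerde_density(v, v0=V0, c1=C1, c2=C2, c3=C3):
    """Vehicle density n (1/m) from traffic velocity v (m/s), Eq. (11)."""
    if not 0.0 <= v < v0:
        raise ValueError("velocity must satisfy 0 <= v < v0")
    return 1.0 / (c1 + c3 * v + c2 / (v0 - v))


def emission_rate_from_velocity(v, e0, e1):
    """Emission rate per unit length q_j in kg/(s*m), Eq. (13)."""
    return (e0 + e1 * v) * van_aerde_density(v)


if __name__ == "__main__":
    for kmh in (0, 15, 45, 90):
        v = kmh / 3.6
        print(kmh, van_aerde_density(v), emission_rate_from_velocity(v, 2.0e-6, 1.0e-6))
```

The velocity is the only dynamic input, which is why a traffic map can drive the whole calculation.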

MacNicholas (2009) [

$$\frac{v}{v_0} = \frac{1 - \left( \dfrac{n}{n_{\max}} \right)^{\alpha}}{1 + c \left( \dfrac{n}{n_{\max}} \right)^{\alpha}} \tag{14}$$

where $v_0$ is the free-flow velocity, $n_{\max}$ is the maximal vehicle density, and $c$ and $\alpha$ are curve-shape constants. The end-points are ($n = 0$, $v = v_0$) and ($n = n_{\max}$, $v = 0$). The free-flow velocity is based on the speed limit, and the maximum density is given by the average length of vehicles plus a safety margin. The parameters $\alpha$ and $c$ are specified by curve fitting to measured data. MacNicholas (2009) [

$$\frac{n}{n_{\max}} = \left( \frac{1 - \dfrac{v}{v_0}}{1 + c\, \dfrac{v}{v_0}} \right)^{1/\alpha} \tag{15}$$

Inserting Equation (15) in Equation (6) gives

$$q_j = \left( E_{0,j} + E_{1,j} v \right) n_{\max} \left( \frac{1 - \dfrac{v}{v_0}}{1 + c\, \dfrac{v}{v_0}} \right)^{1/\alpha} \tag{16}$$
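Equations (14) and (15) are inverses of each other, which can be checked numerically with a small sketch. The shape constants used below are arbitrary illustrative values, not fitted parameters.

```python
def macnicholas_velocity(density_ratio, c, alpha):
    """Velocity ratio v/v0 from density ratio n/n_max, Eq. (14)."""
    y = density_ratio ** alpha
    return (1.0 - y) / (1.0 + c * y)


def macnicholas_density(velocity_ratio, c, alpha):
    """Density ratio n/n_max from velocity ratio v/v0, Eq. (15)."""
    return ((1.0 - velocity_ratio) / (1.0 + c * velocity_ratio)) ** (1.0 / alpha)


if __name__ == "__main__":
    c, alpha = 3.0, 2.0  # hypothetical curve-shape constants
    for ratio in (0.1, 0.5, 0.9):
        v_ratio = macnicholas_velocity(ratio, c, alpha)
        print(ratio, v_ratio, macnicholas_density(v_ratio, c, alpha))
```

The round trip returns the original density ratio, and the end-points reproduce free flow at zero density and standstill at maximal density.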

Van Aerde [

Air dispersion is modelled by a Gaussian plume [ The concentration of the $j^{\mathrm{th}}$ pollutant, $c_j$ (in kg/m^{3}), at a position $(x, y, z)$ relative to the center of the line source, with $x$ downwind, $y$ crosswind and $z$ vertical, is given as [

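$$c_j(x, y, z) = \frac{q_j}{2\sqrt{2\pi}\, \sigma_z \left( u \sin\theta + u_0 \right)} \left\{ e^{-\frac{1}{2} \left( \frac{z - h}{\sigma_z} \right)^2} + e^{-\frac{1}{2} \left( \frac{z + h}{\sigma_z} \right)^2} \right\} \times \left[ \operatorname{erf}\left( \left| \frac{\left( \frac{L}{2} - y \right) \sin\theta - x \cos\theta}{\sqrt{2}\, \sigma_y} \right| \right) + \operatorname{erf}\left( \left| \frac{\left( \frac{L}{2} + y \right) \sin\theta + x \cos\theta}{\sqrt{2}\, \sigma_y} \right| \right) \right] \tag{17}$$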

where $q_j$ is the line source strength, or mass emission rate per unit length (kg/(s×m)), $\theta$ is the angle between the wind direction and the road in the range 0° - 180°, $h$ is the effective source height, $L$ is the line source length (the length of a street segment), $u$ is the average wind speed, and $u_0$ is the wind speed correction due to the traffic wake. The standard deviations $\sigma_y = \sigma_y(x)$ and $\sigma_z = \sigma_z(x)$ are the horizontal and vertical dispersion coefficients, which depend on atmospheric stability. The error function $\operatorname{erf}(x)$,

$$\operatorname{erf}(x) = \frac{2}{\sqrt{\pi}} \int_0^x e^{-\tau^2}\, d\tau \tag{18}$$

is approximately linear for small $x$, $\operatorname{erf}(x \ll 1) \approx 2x/\sqrt{\pi}$, and tends to unity for large $x$, $\operatorname{erf}(x \geq 2) \approx 1$. Atmospheric stability classes are A (very unstable), B, C and D (neutral), E and F (very stable). Consider, for simplicity, that the wind is perpendicular to the road, i.e., $\theta = 90^\circ$ ($\sin\theta = 1$, $\cos\theta = 0$). The two error functions model a tapering-off of the concentrations over a distance of the order of $\sigma_y$ at the ends of the street segment, i.e., at $y \approx \pm L/2$. For relevant $x$, the standard deviation $\sigma_y$ is small compared to the street half-length, so the tapering-off effect was ignored. Both error functions are then approximately equal to unity, their sum is equal to two, and Equation (17) reduces to

$$c_j(x, z) = \frac{q_j}{\sqrt{2\pi}\, \sigma_z \left( u \sin\theta + u_0 \right)} \left\{ e^{-\frac{1}{2} \left( \frac{z - h}{\sigma_z} \right)^2} + e^{-\frac{1}{2} \left( \frac{z + h}{\sigma_z} \right)^2} \right\} \tag{19}$$
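The reduction from the finite line source of Equation (17) to the long-segment form of Equation (19) can be checked numerically. The sketch below uses Python's `math.erf`; the function names and all input values are assumptions for illustration, and the dispersion coefficients are passed in as numbers already evaluated at the downwind distance $x$.

```python
import math


def line_source_concentration(q, x, y, z, h, length, theta, u, u0, sigma_y, sigma_z):
    """Finite line source concentration, Eq. (17); theta in radians."""
    wind = u * math.sin(theta) + u0
    vertical = (math.exp(-0.5 * ((z - h) / sigma_z) ** 2)
                + math.exp(-0.5 * ((z + h) / sigma_z) ** 2))
    s, c = math.sin(theta), math.cos(theta)
    arg_minus = abs(((length / 2 - y) * s - x * c) / (math.sqrt(2) * sigma_y))
    arg_plus = abs(((length / 2 + y) * s + x * c) / (math.sqrt(2) * sigma_y))
    lateral = math.erf(arg_minus) + math.erf(arg_plus)
    return q / (2 * math.sqrt(2 * math.pi) * sigma_z * wind) * vertical * lateral


def long_segment_concentration(q, z, h, theta, u, u0, sigma_z):
    """Long-segment limit where both error functions equal one, Eq. (19)."""
    wind = u * math.sin(theta) + u0
    vertical = (math.exp(-0.5 * ((z - h) / sigma_z) ** 2)
                + math.exp(-0.5 * ((z + h) / sigma_z) ** 2))
    return q / (math.sqrt(2 * math.pi) * sigma_z * wind) * vertical


if __name__ == "__main__":
    # Hypothetical inputs: 500 m segment, perpendicular wind, receptor 10 m downwind.
    args = dict(q=1.0e-7, z=1.5, h=0.5, theta=math.pi / 2, u=2.0, u0=0.5, sigma_z=3.0)
    print(line_source_concentration(x=10.0, y=0.0, length=500.0, sigma_y=5.0, **args))
    print(long_segment_concentration(**args))
```

At the middle of a long segment the two values agree, while at a segment end ($y = L/2$) the finite-source value drops to about half, which is the tapering-off described in the text.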

Next, the vertical standard deviation is modeled. Turbulent wakes or trailing vortices form behind vehicles at Reynolds numbers $Re$ greater than about 1000:

$$Re = \frac{\rho v l}{\mu} \tag{20}$$

where $\rho$ is the air density, $v$ is the traffic flow velocity, $l$ is the size of a vehicle and $\mu$ is the dynamic viscosity of air. Wake turbulence mixes released pollutants [
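A short worked example shows how easily the threshold in Equation (20) is exceeded. The air properties are typical values near 20 °C at sea level, and the 4 m vehicle size is an assumption for illustration.

```python
AIR_DENSITY = 1.2       # kg/m^3, typical value near 20 degrees C at sea level
AIR_VISCOSITY = 1.8e-5  # Pa*s, dynamic viscosity of air near 20 degrees C


def reynolds_number(v, size, rho=AIR_DENSITY, mu=AIR_VISCOSITY):
    """Reynolds number of a vehicle of the given size (m) at velocity v (m/s), Eq. (20)."""
    return rho * v * size / mu


def wake_onset_velocity(size, re_critical=1000.0, rho=AIR_DENSITY, mu=AIR_VISCOSITY):
    """Velocity (m/s) at which the Reynolds number reaches re_critical."""
    return re_critical * mu / (rho * size)


if __name__ == "__main__":
    print(reynolds_number(5.0 / 3.6, 4.0))  # crawling traffic at 5 km/h
    print(wake_onset_velocity(4.0))         # onset velocity for a 4 m vehicle
```

Under these assumptions the onset velocity is a few millimetres per second, so any moving traffic, including slow congestion, generates a turbulent wake.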

Consider two cars A and B, with B in front of car A. In congestion, the distance between the inlet suction of car A and the tailpipe outlet of a car B may be one meter. The exhaust gas of car B is almost directly sucked into car A, and the people in car A are heavily exposed to pollution. At this stage, this added congestion exposure is ignored.

Turbulent mixing increases the size of the emission source by $\sigma_{y,0}$ and $\sigma_{z,0}$:

$$\sigma_y = \left( \sigma_{y,0}^2 + \sigma_{y,1}(x)^2 \right)^{1/2} \tag{21}$$

$$\sigma_z = \left( \sigma_{z,0}^2 + \sigma_{z,1}(x)^2 \right)^{1/2} \tag{22}$$

Empirical Pasquill Gifford sigmas [

$$\sigma_{y,1}(x) = a_1 x \left( 1 + \frac{x}{a_2} \right)^{a_3} \tag{23}$$

$$\sigma_{z,1}(x) = a_4 x \left( 1 + \frac{x}{a_2} \right)^{a_5} \tag{24}$$
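The dispersion coefficients of Equations (21)-(24) can be sketched as below. The sketch assumes that the flattened term in Equations (23) and (24) reads $x/a_2$, with $a_2$ a length scale; the function names and every parameter value are hypothetical placeholders, not fitted Pasquill-Gifford constants.

```python
import math


def sigma_plume(x, a_scale, a2, a_exp):
    """Plume spread a_scale * x * (1 + x / a2) ** a_exp, form of Eqs. (23)-(24)."""
    return a_scale * x * (1.0 + x / a2) ** a_exp


def sigma_total(sigma_0, sigma_1):
    """Combine initial wake mixing and plume spread in quadrature, Eqs. (21)-(22)."""
    return math.hypot(sigma_0, sigma_1)


if __name__ == "__main__":
    # Placeholder constants for one stability class (illustrative only).
    a2, a4, a5 = 700.0, 0.05, -0.5
    sigma_z0 = 3.0  # m, assumed initial vertical mixing from the traffic wake
    for x in (0.0, 10.0, 100.0):
        print(x, sigma_total(sigma_z0, sigma_plume(x, a4, a2, a5)))
```

Directly at the street ($x = 0$) the spread equals the wake mixing length, and the plume term only dominates further downwind.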

The tailpipe and the suction inlet have small vertical positions compared to the mixing length, $z \pm h \ll \sigma_{z,0} \leq \sigma_z$. Thus, the two exponential terms in Equation (19) are both approximately equal to unity and their sum is equal to two, so that:

$$c_j = \frac{2 q_j}{\sqrt{2\pi}\, \sigma_z \left( u \sin\theta + u_0 \right)} = \frac{2 \left( E_{0,j} + E_{1,j} v \right)}{\sqrt{2\pi}\, \sigma_z \left( u \sin\theta + u_0 \right) \left( c_1 + c_3 v + \dfrac{c_2}{v_0 - v} \right)} \tag{25}$$

Velocity is the key variable of pollution concentration. Next, exposure is modeled.
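To illustrate the velocity dependence of Equation (25), the sketch below scans the concentration over velocity and locates its maximum. All constants except $c_3$ and the 100 km/h free-flow velocity are assumed placeholders, so the peak found here is not expected to match the value reported for the model's actual input parameters.

```python
import math

V0 = 100.0 / 3.6        # m/s, free-flow velocity of the example street
C1, C2, C3 = 5.0, 50.0, 3.0  # c1 (m) and c2 (m^2/s) assumed; c3 = 3 s from the text
E0, E1 = 2.0e-6, 1.0e-6      # kg/s and kg/m, placeholder emission factors
SIGMA_Z = 3.0                # m, assumed vertical dispersion at the street
WIND = 2.0                   # m/s, assumed value of u*sin(theta) + u0


def concentration(v):
    """Street-level concentration in kg/m^3 at traffic velocity v (m/s), Eq. (25)."""
    density = 1.0 / (C1 + C3 * v + C2 / (V0 - v))
    return 2.0 * (E0 + E1 * v) * density / (math.sqrt(2.0 * math.pi) * SIGMA_Z * WIND)


def peak_velocity(step=0.01):
    """Grid search for the velocity of maximum concentration, staying below V0."""
    grid = [i * step for i in range(int(V0 / step))]
    return max(grid, key=concentration)


if __name__ == "__main__":
    v_peak = peak_velocity()
    print(v_peak * 3.6, concentration(v_peak))  # peak velocity in km/h
```

The concentration has an interior maximum because the working-engine emission grows with velocity while the vehicle density falls towards zero at free flow.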

Human exposure is concentration times the residence time [

$$X_{ij} = \sum_{k=1}^{N} c_{jk}\, t_{ik} \tag{26}$$

where $X_{ij}$ is the total exposure of person $i$ to pollutant $j$ over a specified period, $c_{jk}$ is the concentration of pollutant $j$ in microenvironment or street segment $k$, $t_{ik}$ is the residence time of person $i$ in segment $k$, and $N$ is the total number of microenvironments.

Individual time-activity patterns are mapped by smartphone location service trajectories. Exposure depends on two types of trajectories: i) Fixed-time trajectory: an individual trajectory of fixed time duration, such as the working hours of a taxi driver or the time spent at home by people residing near a street with heavy traffic; and ii) Fixed-route trajectory: an individual who has to move from location A to B, however long it takes, such as a commuter who travels the same route from home to work every workday.

Fixed-time exposure is a sum over time intervals $t_{ik}$ up to a given total time $T = \sum_{k=1}^{N} t_{ik}$ of person $i$ at position $p(t) = (x_p(t), y_p(t), z_p(t))$:

$$X_{ij} = \sum_{k=1}^{N} c_{jk}(p(t))\, t_{ik} \tag{27}$$

The concentration $c_{jk}(p)$ includes the sum of concentration contributions from all street segments and is mathematically a convolution. Residents may have a large daily time $T$, but they are not directly at the peak pollution on the street.

Now, consider a fixed-route trajectory. The residence time $t_{ik}$ of exposure at street segment $k$ is related to the traffic flow velocity $v_{ik}$ and the length $s_{ik}$ of the road segment as:

$$s_{ik} = v_{ik}\, t_{ik} \tag{29}$$

Solving for $t_{ik}$ and inserting into the exposure gives

$$X_{ij} = \sum_{k=1}^{N} c_{jk}\, \frac{s_{ik}}{v_{ik}} \tag{30}$$

$$X_{ij} = \sum_{k=1}^{N} X_{ijk} = \sum_{k=1}^{N} \frac{E_{0,j} + E_{1,j} v_{ik}}{\sqrt{\pi/2}\; \sigma_z \left( u \sin\theta + u_0 \right) \left( c_1 + c_3 v_{ik} + \dfrac{c_2}{v_{0,ik} - v_{ik}} \right)}\, \frac{s_{ik}}{v_{ik}} \tag{31}$$

where the residence time and velocity for a given street segment are functions of time, $t_{ik} = t_{ik}(t)$ and $v_{ik} = v_{ik}(t)$. During rush hours, the exposed time for a fixed route is longer than outside rush hours. Equation (31) has velocity singularities for $v_{ik} \to v_{0,ik}$ and $v_{ik} \to 0$, with associated high and low velocity regimes.
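The fixed-route exposure of Equation (31) can be sketched as a sum over street segments. The `Segment` class, the parameter dictionary and all numbers except $c_3$ are hypothetical; in practice the segment lengths and velocities would come from a traffic map and a smartphone trajectory.

```python
import math
from dataclasses import dataclass


@dataclass
class Segment:
    length: float     # s_ik, segment length (m)
    velocity: float   # v_ik, current traffic velocity (m/s), 0 < v < free_flow
    free_flow: float  # v_0,ik, free-flow velocity (m/s)


def segment_exposure(seg, e0, e1, c1, c2, c3, sigma_z, wind):
    """Exposure X_ijk in kg*s/m^3 for one segment, a single term of Eq. (31)."""
    density = 1.0 / (c1 + c3 * seg.velocity + c2 / (seg.free_flow - seg.velocity))
    conc = (e0 + e1 * seg.velocity) * density / (math.sqrt(math.pi / 2) * sigma_z * wind)
    return conc * seg.length / seg.velocity


def route_exposure(route, **params):
    """Total exposure X_ij along a fixed route, Eq. (31)."""
    return sum(segment_exposure(seg, **params) for seg in route)


# Placeholder parameters; wind stands for u*sin(theta) + u0.
PARAMS = dict(e0=2.0e-6, e1=1.0e-6, c1=5.0, c2=50.0, c3=3.0, sigma_z=3.0, wind=2.0)

if __name__ == "__main__":
    v0 = 100.0 / 3.6
    congested = [Segment(1000.0, 2.0, v0), Segment(500.0, 3.0, v0)]
    flowing = [Segment(1000.0, 20.0, v0), Segment(500.0, 22.0, v0)]
    print(route_exposure(congested, **PARAMS), route_exposure(flowing, **PARAMS))
```

With the same route geometry, the congested case gives a much larger exposure than the flowing case, mainly because the travel time $s_{ik}/v_{ik}$ grows as the velocity drops.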

The low velocity regime, characterized by $v_{ik} \ll E_{0,j} / E_{1,j}$ and $v_{ik} \ll v_{0,ik}$, is derived by Taylor expanding [

$$X_{ijk} \approx \frac{2 E_{0,j} \left( 1 - \left( \dfrac{1 + \dfrac{c_3}{c_2} v_{0,ik}^2}{1 + \dfrac{c_1}{c_2} v_{0,ik}} - \dfrac{E_{1,j}}{E_{0,j}} v_{0,ik} \right) \dfrac{v_{ik}}{v_{0,ik}} \right)}{\sqrt{2\pi}\, \sigma_z \left( u \sin\theta + u_0 \right) \left( c_1 + \dfrac{c_2}{v_{0,ik}} \right)}\, t_{ik} \approx \frac{2 E_{0,j}\, n_{\max,ik}}{\sqrt{2\pi}\, \sigma_z \left( u \sin\theta + u_0 \right)}\, t_{ik} \sim v_{ik}^{-1} \tag{32}$$

The exposure is given by the travel time, $t_{ik} = s_{ik} v_{ik}^{-1}$, and the exposure per travelled distance may become large. Hence, congestion is a high pollution exposure regime.

In the high velocity regime, $v_{ik} \gg E_{0,j} / E_{1,j}$, the velocity effects of travel time and working engine cancel, and exposure is proportional to vehicle density:

$$X_{ijk} \approx \frac{2 E_{1,j}\, s_{ik}}{\sqrt{2\pi}\, \sigma_z \left( u \sin\theta + u_0 \right)}\, n_k \tag{33}$$

where $n_k \approx \dfrac{v_{0,ik} - v_{ik}}{c_2}$. In the limit of free-flow velocity, $v_{ik} \to v_{0,ik}$, $n_k \to 0$ and the exposure vanishes. The high-velocity regime is a low exposure regime. To the best of our knowledge, the discovered effect of velocity on exposure is new. In the next section, the input parameters to the model are specified.
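The two limiting regimes can be checked numerically against the exact expressions. The sketch below compares the velocity-dependent factor $(E_{0,j} + E_{1,j} v)\, n(v)$ with its first-order expansion from Equation (32), and the density with its near-free-flow approximation; all constants except $c_3$ are assumed placeholders.

```python
V0 = 100.0 / 3.6             # m/s, free-flow velocity
C1, C2, C3 = 5.0, 50.0, 3.0  # c1 and c2 assumed; c3 = 3 s from the text
E0, E1 = 2.0e-6, 1.0e-6      # placeholder emission factors (kg/s, kg/m)


def density(v):
    """Exact vehicle density n(v) from Eq. (11)."""
    return 1.0 / (C1 + C3 * v + C2 / (V0 - v))


def emission_density(v):
    """Exact velocity-dependent factor (E0 + E1*v) * n(v)."""
    return (E0 + E1 * v) * density(v)


def emission_density_low_v(v):
    """First-order Taylor expansion around v = 0, the bracket of Eq. (32)."""
    slope = (1.0 + C3 * V0 ** 2 / C2) / (1.0 + C1 * V0 / C2) - E1 * V0 / E0
    return E0 / (C1 + C2 / V0) * (1.0 - slope * v / V0)


def density_high_v(v):
    """Near-free-flow approximation n ~ (v0 - v) / c2 used with Eq. (33)."""
    return (V0 - v) / C2


if __name__ == "__main__":
    print(emission_density(0.1), emission_density_low_v(0.1))
    print(density(V0 - 0.01), density_high_v(V0 - 0.01))
```

For these placeholder values the expansions agree with the exact expressions to within a fraction of a percent at low velocity and a few percent very close to free flow.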

To specify the input parameters of the traffic exposure model, engine emission factors for pollutants from the US EPA are shown in Tables 1-3 [

^{1}Light-duty gasoline-fueled vehicles; ^{2}Light-duty gasoline-fueled trucks; ^{3}Heavy-duty gasoline-fueled vehicles; ^{4}Light-duty diesel vehicles; ^{5}Light-duty diesel trucks; ^{6}Heavy-duty diesel vehicles; ^{7}Motorcycles

meter. Based on the normalized Equation (15), the density-velocity shape parameters are assumed to be fixed.

The traffic map provides static data such as street segment length s i k and orientation, and dynamic velocities. A smartphone location service provides trajectories. Weather conditions (e.g., wind direction, and atmospheric stability classes) can be given by a near real-time weather map layer. Currently, traffic and weather map layer data are not available for public use, so collaboration with data providers is needed.

A person moving through a city accumulates a dose of pollution through exposure that gives an incremental increase in health risk that is statistically reflected in the public health. Traditionally, one distinguishes between short-term (i.e. minute, hour, day) acute exposure to pollution that may result in headache/irritation or an asthma attack, and long term, years to lifetime, exposure that can lead to chronic effects including cancer, chronic obstructive pulmonary disease, and neurological problems.

The dose equals concentration times respiration rate times duration and is linear in exposure. The respiration rate for normal adults is 12 - 20 breaths per minute. Each breath volume (or tidal volume) is about 0.5 liters (roughly 7 ml/kg of body weight), and total lung volume is about 6 liters. Taking 16 breaths per minute as the average, the 12 - 20 range corresponds to a spread of ±25%. Respiration rate increases with increasing heart rate, possibly linearly. Except for runners, cyclists and other high-activity persons, people in traffic sit passively in a vehicle and have a heart rate close to the resting heart rate, in the range 60 - 100 beats/minute.
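As a worked example of the dose relation, the sketch below multiplies concentration, ventilation and duration. It assumes a resting tidal volume of about 0.5 liters per breath, a typical physiological value; the function name and the sample concentration are illustrative only.

```python
def inhaled_dose(concentration, breaths_per_minute, tidal_volume_l, minutes):
    """Inhaled dose in micrograms.

    concentration is in micrograms per cubic metre, tidal_volume_l in liters
    per breath; ventilation is converted to cubic metres per minute.
    """
    ventilation = breaths_per_minute * tidal_volume_l / 1000.0  # m^3/min
    return concentration * ventilation * minutes


if __name__ == "__main__":
    # One hour at 40 micrograms/m^3 with 16 breaths per minute of 0.5 liters each.
    print(inhaled_dose(40.0, 16.0, 0.5, 60.0))
```

Because the dose is linear in each factor, doubling the time spent in traffic doubles the dose at a given concentration.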

It is assumed that the risk $R$ (both for an individual and for a population) saturates at a maximum level $R_{\max}$, where an increase in exposure gives no further increase in the risk. The exposure level that saturates the risk depends on the seriousness of the risk. For example, the risk of a slight headache due to traffic pollution will saturate at a small exposure, while the number of years lost due to early death will saturate at an extremely high exposure. The saturation effect can be modelled by a logistic differential equation as:

$$\frac{dR}{dX} = r R \left( 1 - \frac{R}{R_{\max}} \right) \tag{34}$$

For small risks, $R \ll R_{\max}$, the risk grows exponentially as a function of exposure with a rate $r$. The growth rate is reduced linearly as the risk increases, and the risk stops growing at the maximum risk $R_{\max}$. The logistic risk differential equation can be solved analytically by partial fraction expansion after the $R$-terms on the right-hand side of Equation (34) are moved to the left-hand side. The initial condition is a background risk $R_b = R(X = X_b)$ at exposure $X = X_b$, which gives:

$$R = R_{\max} \left( 1 + \left( \frac{R_{\max}}{R_b} - 1 \right) e^{-r (X - X_b)} \right)^{-1} \tag{35}$$

Consider a far-from-saturation regime, with $(R_b / R_{\max})\, r (X - X_b) \ll 1$, $(R_b / R_{\max}) < 1$, and $r (X - X_b) \ll 1$. Then the following sequence of approximations is justified:

$$R \approx R(X = 0)\, e^{rX} \approx R_b\, e^{r (X - X_b)} \approx R_b \left( 1 + r (X - X_b) \right) \tag{36}$$

Dividing Equation (36) by $R_b$ and subtracting unity from both sides gives:

$$\frac{R - R_b}{R_b} \approx r (X - X_b) = \alpha_b\, \frac{X - X_b}{X_b} \tag{37}$$

where $\alpha_b = r X_b$. This approximation applies to serious health effects such as early deaths. Since the relative increase in risk is proportional to the relative increase in exposure, the exposure figures can be used as a proxy for health risk figures.
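The logistic risk of Equation (35) and its linear approximation in Equation (36) can be compared with a short sketch. The function names and all parameter values are hypothetical and chosen only to place the example far from saturation.

```python
import math


def logistic_risk(x, r, r_max, r_b, x_b):
    """Risk R at exposure x from the logistic solution, Eq. (35)."""
    return r_max / (1.0 + (r_max / r_b - 1.0) * math.exp(-r * (x - x_b)))


def linear_risk(x, r, r_b, x_b):
    """Far-from-saturation approximation R ~ R_b * (1 + r * (x - x_b)), Eq. (36)."""
    return r_b * (1.0 + r * (x - x_b))


if __name__ == "__main__":
    params = dict(r=0.01, r_b=1.0e-4, x_b=10.0)  # hypothetical values
    for x in (10.0, 11.0, 15.0):
        print(x, logistic_risk(x, r_max=1.0, **params), linear_risk(x, **params))
```

At the background exposure the solution returns the background risk, at very large exposure it saturates at the maximum risk, and for small exposure increments the linear form is a close match.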

Traffic pollution’s impact on health depends both on accumulated exposure (one cause) and on the vulnerability of the person. For example, children and elderly people are more vulnerable to pollution, but also less exposed in traffic. Other factors are body weight, other diseases such as asthma, and exposure to other sources of pollution.

Traffic maps predict one-hour or daily traffic based on historic and current traffic. Individual preferred route selection can be optimized by weighting “time to target location” versus “pollution exposure to target location”. Cities have a typical daily M-shaped density peak of morning and afternoon rush hours due to the tidal flow of commuters.

Moreover, one may predict population health risk to optimize urban planning of transportation infrastructure, and residential and working areas. It may even be possible to develop urban simulators as a digital twin to the city where every person in the city has simulated trajectories and automatic collection of exposures and health risks, and used to answer “what if” questions as a valuable tool for politicians and urban planners.

The plots in Figures 1-11 are explained in

pears as an inverse in the concentration, so the inverse standard deviation is representative of the decaying amplitude of concentration away from the street. Results showed that a calmer meteorological situation (i.e., the very stable atmospheric stability class) leads to a higher pollution concentration, and vice versa. ^{3} for a traffic flow velocity of 45 km/h.

_{2} is 40 μg/m^{3} winter averages and a red limit is 40 μg/m^{3} annual averages [

Traffic map companies have developed methods to predict future or typical traffic based on current and historic traffic. Future traffic can be predicted on a short-term basis, typically one hour ahead. Combined with a smartphone trajectory, this traffic prediction can be used to predict the next-hour exposure for a given planned route. It would then be straightforward to compare several possible routes and make an informed choice by weighting “time to target location” versus “pollution exposure to target location”, optimizing the route based on individual preferences.
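One simple way to weight “time to target location” against “pollution exposure to target location” is a normalized weighted sum. This is only an illustrative assumption, not a method given in the study; the function names, the route tuples and the numbers are hypothetical.

```python
def route_cost(travel_time, exposure, weight, time_scale, exposure_scale):
    """Weighted cost: weight = 1 considers only time, weight = 0 only exposure."""
    return weight * travel_time / time_scale + (1.0 - weight) * exposure / exposure_scale


def best_route(routes, weight):
    """Pick the route name with the lowest cost.

    routes is a list of (name, travel_time, exposure) tuples; both quantities
    are normalized by their maximum over the candidate routes.
    """
    time_scale = max(t for _, t, _ in routes)
    exposure_scale = max(x for _, _, x in routes)
    return min(
        routes,
        key=lambda r: route_cost(r[1], r[2], weight, time_scale, exposure_scale),
    )[0]


if __name__ == "__main__":
    # Hypothetical candidates: (name, travel time in s, predicted exposure).
    candidates = [("main road", 600.0, 9.0), ("side streets", 900.0, 4.0)]
    for w in (0.0, 0.5, 1.0):
        print(w, best_route(candidates, w))
```

The weight expresses the individual preference: a commuter in a hurry would choose a value near one, while a person with asthma might choose a value near zero.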

Traffic is well known to display certain typical patterns. Large cities have a typical daily M-shaped density curve, with morning and afternoon rush hour peaks due to the flow of commuters in and out of the city. The peak sizes typically vary with the day of the week. This information can be used for projecting future long-term exposure, identifying high exposure groups, and checking whether negative health impact is well correlated with high exposure groups.

Furthermore, one may use the prediction or average maps to estimate where the future population health risk is highest and direct infrastructure investments to minimize a combination of “population travel time” and “population health risk”. Our modelling of exposure showed that the high exposure at low velocity scales as $X_{ijk} \sim v_{ik}^{-1}$, and this correlates well with the traffic map itself, since a small $v_{ik}$ gives both high exposure and congestion. Because most people travel at the peaks of the M-shaped rush hour curve, reducing the size of the rush hour peaks alone would significantly improve population health. Exposure maps and health risk maps could be a highly useful tool for planning urban transport infrastructure in relation to where people live and work. Even more useful for predictions would be urban simulators in which every person in the city has a simulated trajectory and thereby receives simulated exposures and health risks. An urban simulator could then be used to answer all kinds of “what if” questions and could be a highly valuable tool for politicians and urban planners. We predict that by 2030 urban trajectory simulators will be routinely applied in urban planning.

It is feasible to combine traffic map data with smartphone location service trajectories and big data analytics to simulate near real-time traffic air pollution exposure and health risk. Advantages of the approach are: i) low cost, ii) near real-time, iii) effortless citizen participation, and iv) global scalability.

Nearly every modeling step of traffic air pollution depends on traffic velocity. A traffic map is a super-efficient pre-processor for calculating real-time traffic pollution.

Universally, the exposure and health risk have a peak at lower velocities than the peak of concentration. Congestion is a higher health risk than conventionally believed.

We thank Mr. Mike Kobernus at NILU (Norwegian Institute for Air Research) for help with the language. Any remaining errors are the responsibility of the authors.

