This paper abandons the “rational man” hypothesis in traditional economics and utilize s the existing “reciprocal” principal-agent model (Pu Y. J, 2007) to analyze the benefits of environmental protection inputs. The results of the study show that if humans increase their environmental investment on the basis of rational input is set to , naturally, the increase in revenue under the assumption of “reciprocity” is r, when η > 2r, the environmental investment made by humans under irrational assumptions can bring Higher income level than “rational people”. Thus, the enlightenment of government behavior is: first, the government should fully recognize the “reciprocal” characteristics of the natural environment when formulating environmental protection policies; second, the government should further increase investment on the basis of rational input, when the increased investment meets the aforementioned conditions, the purpose of increasing human income can be achieved.
The harmonious development of human society and the natural environment has always been an important issue in the process of social and economic development. In the process of development of human society, once the pollution and damage caused to the natural environment exceeds the limit of environmental tolerance that exceeds the bearer threshold of environment, not only the social and economic development is unsustainable, but also the human society has to pay a heavy cost. Therefore, it is necessary to invest funds for environmental protection in economic development.
However, people have long recognized the importance of environmental protection investment. With the continuous development of the social economy, the population has increased greatly and the demand for resources has become more and more huge, especially the environmental and ecological pollution caused by the development of modern industry is getting worse. People gradually realized that natural resources are not inexhaustible and even began to become scarce resources that restrict human social and economic development. Therefore, environmental resources have become an important part of social productivity and their protection behavior should be involved in the distribution of national income. This part of national income allocated to environmental protection is environmental protection investment [
This paper discusses this question from a special perspective, and it can provide a new way of thinking for the benefit of green investment. The structure of this paper is as follows: Part 2 is a review of relevant literature, which provides theoretical basis for the research; Part 3 is an analysis of the “prisoner’s dilemma” of green investment; Part 4 is a principal-agent model of environmental protection investment under the condition of “rational person”; Part 5 is a principal-agent model of environmental protection investment under the theory of reciprocity; Part 6 is the conclusion and enlightenment.
The relationship between man and nature is interdependent and indivisible. Nature is the material basis and premise of human existence and man is the product of the long-term development of nature. This is the harmony between man and nature [
Panayotou [
Other scholars have discussed the relationship between environmental protection investment and production efficiency. There are three main hypotheses: “constraint hypothesis”, “porter hypothesis” and “uncertainty hypothesis”. The “constraint hypothesis” believes that if the corresponding capital is invested to reduce pollution, which will internalize the negative externalities of the enterprise, what is more, increasing the cost burden of the enterprise and affecting the production efficiency of the enterprise. Gray’s [
In addition, some scholars’ research on the benefits of environmental protection investment is based on the input-output model. For example, Leontief [
These studies are based on the classical hypothesis of complete rationality in traditional economics. The “rational person” assumes that people are all “rational people” who pursue self-interest, the purpose and motivation of all parties are to maximize the self-interest, which invisibly strengthens the contradictions and conflicts in social economic relations to a certain extent. Obviously, it is contrary to the connotation of environmental protection and cannot provide a reasonable explanation for the harmonious development of man and nature. In actual social life, the parties involved in behavioral choices do not have the conditions for complete information and the behavioral decisions made by people are not completely rational, they also are influenced by human factors such as personal feelings. Herbert Simon [
If the hypothesis of “rational people” is not conducive to the study and interpretation of the emergence of some cooperative behaviors in reality, what should be explained? From a biological point of view, the earliest explanation is that cooperation is conducive to the evolution of the group, so the biological individual will consider the interests of the group that is the theory of “group selection” [
With the deepening of relevant research, behavioral economics and its reciprocity theory can break this limitation and explain it to some extent. Behavioral economics is an interdisciplinary subject that uses psychological principles and experimental methods to verify the basic behavioral assumptions of economics and uses behavioral games as its microscopic basis to study the laws of economic activity. Professor D. Kahneman of Princeton University and Professor A. Tversky of Stanford University [
Similarly, when discussing the relationship between man and nature, we often call on people to “protect nature is to protect human beings”, which fully demonstrates that the act of protecting nature is reciprocal to nature and mankind; building socialism in contemporary China. In a harmonious society, we also strongly advocate the construction of “harmony” between man and nature. The “harmony” here also emphasizes the reciprocity between man and nature. Therefore, the aforementioned theory of fair reciprocity can also be extended to the fair reciprocity and cooperation between man and nature. Saving nature and protecting the environment are essentially the reciprocal behavior between man and nature [
Since the reform and opening up more than 30 years ago, China’s economy has been moving at a rapid pace. Although it has harvested fruitful results in economic and social development, it has also paid the price of environmental problems. Economic construction has been at the Primary location for many years and environmental protection has fallen to a secondary position, in exchange for the rapid growth of the economy at the expense of natural resource consumption and environmental damage. Environmental problems such as smog and water pollution continue to remind us that this development is not the best way. If people and nature are regarded as both sides of the game, the choice of this behavior is in the “prisoner’s dilemma.”
In recent years, people have gradually recognized the importance of environmental protection and are also increasing their efforts and funding to manage and improve the natural environment. However, there are still many unreasonable behaviors in the development. The governance and investment of environmental protection often start to take action after some major environmental events have taken place.
We can explain this by creating a simple game model. Under the assumption of “rational man”, since both human beings and nature are rational, if humans invest in environmental protection, they will be transformed into corresponding cost pressures, which will affect human economic benefits. when environmental problems exceeds the natural bearing limit, nature will retaliate against human beings through major environmental events, which can accelerate the recovery of nature itself, but will cause significant economic losses to human society. According to this, the following game matrix can be formed:
Thus human and natural behavioral choices will constitute the income matrix shown in
1) The relationship of human income in four cases. First of all, in the case of nature does not retaliate against the human society, it is clear that when humans do not invest in environmental protection, they have greater economic benefits than when they invest in environmental protection. Even if the living environment
Natural Humanity | No retaliation | retaliation |
---|---|---|
investment | (A, B) | (C, D) |
No investment | (E, F) | (G, H) |
has been improved because of human input, nature has not retaliated against human beings. Therefore, human beings’ feelings about environmental crises are not obvious and the investment in the environment has not brought corresponding benefits to have more feelings, so A < E; Secondly, in the case of nature choose to retaliate against human society, it is obviously that humans choose to invest less than economic loss when they choose not to invest, so C < G; Because the damage caused by natural retaliation is often very large, it is much more invested than humans in environmental protection, so G < A. In summary, “not investing” is a dominant strategy of human beings, the magnitude relationship of the benefits is C < G < A < E.
2) The relationship of natural income in four cases. First of all, In the case of humans choose to invest, the nature chooses retaliation for greater gains, because on the one hand the natural environment has benefited from human input, on the other hand the recovery has been completed through retaliation, so B < D, secondly, in the case of humans choose not to invest, the benefits of natural environment choice retaliation are also greater, so F < H, for nature, revenue is the same by retaliation against humans to complete environmental recovery, or through human input management to complete environmental recovery, that is to say B = H. All in all, “retaliation” for human beings is a dominant strategy of nature, the magnitude relationship of the benefits is F < H = B < D.
Based on the above analysis, “not investing” is a dominant strategy of human beings. “Retaliation” is also a dominant strategy of natural. Therefore, (no investment, retaliation) is a Nash equilibrium in this case, (G, H) is the benefits of all parties under the Nash equilibrium conditions. This equilibrium can explain in the real economic life that while developing the economy, it ignores the protection of the environment until it invites nature to retaliate against humans through major natural events. Under the assumption of rational people, it is easy to fall into a “prisoner’s dilemma”, that is, human beings have no incentive to invest in environmental protection, and naturally they will retaliate against human beings while human beings continue to deplete natural resources.
In fact, there is a better choice in this game model, which is (investment, no retaliation). This is because, when choosing (investing, not retaliation), the human benefit is the income G when A is greater than the choice (no investment, retaliation), and naturally the income in both cases is equal, i.e. B = H, then choose The total return A + B in the case of (investment, no retaliation) is significantly greater than the gain G + H at the time of selection (no investment, retaliation). The concrete manifestation in reality is that while developing the economy, we must also pay attention to the protection of the natural environment, and control the consumption and destruction of nature within the range that the natural environment can bear, and then make people and nature develop harmoniously. However, people often wake up after experiencing natural violent revenge, only to know that they must protect the environment and make huge environmental protection investment. “First pollution and governance” is the situation in the model (investment, retaliation). The basic situation facing our country is this. In addition, (no investment, no retaliation) this situation is impossible.
The central problem of the principal-agent model is to solve the moral problems that arise due to information asymmetry. The model is a representative and generalized principal-agent model, which is widely used in the study of principal-agent relationships at home and abroad. In the model, the agent selects the level of effort to maximize its deterministic equivalent net income based on the constraints of incentive compatibility, and ensures that its deterministic equivalent net income is not less than its minimum net retained income. Under these two constraints, the principal seeks to maximize the incentives for his desired utility [
In essence, we can regard it as a game between human beings and nature. Human beings rely on nature to make gains, whether nature will retaliate against humans, and when and how to retaliate against human beings. It is said that the information is asymmetrical, or that human beings get the information at a high price, because once the natural retaliation against humans will cause irreversible losses. Therefore, we try to use the Holmstrom-Migrom principal-agent model [
As mentioned in the previous section, the best way to develop the economy is to focus on the protection of the natural environment while controlling the economy, and to control the consumption and destruction of natural resources within the limits of the natural environment. However, when investing in the protection of the environment, how much is invested is efficient, and whether the best way of economic development can be achieved is the issue that we should focus on.
Let a be the contribution or sacrifice naturally made to meet the development needs of human beings. It is a one-dimensional variable; the total benefit of human beings obtained from nature is π : π = α + θ , where θ is the process by which humans obtain benefits from nature. Other uncertainties faced in the middle, and θ obeys a normal distribution with 0 as the mean and σ 2 as the variance, so expect E ( π ) = a and variance V a r ( π ) = σ 2 .
In general, we assume that humans are risk-neutral, and naturally risk-averse (because natural adventures will cause significant losses to humans), and the absolute risk aversion of humans and nature is constant, and both sides The optimal choice is linear [
Nature as an agent, its income function is: s ( π ) = α + β π , α is the spontaneous contribution of human beings to environmental protection in life (such as people out of morality or compassion, to rescue wild animals; or consciously garbage Classification and recycling, etc.), β is the share or proportion of human (government) that uses part of the total national economic income for environmental protection purposes.
Since human beings are risk-neutral as principals, given the natural (agent) income function s ( π ) = α + β π , the human (client) utility function can be set to v [ π − s ( π ) ] , and the expected utility of human beings is:
E v [ π − s ( π ) ] = v { E [ π − s ( π ) ] } = v { E [ − α + ( 1 − β ) π ] } = v [ − α + ( 1 − β ) a ]
Since the first derivative of function v [ π − s ( π ) ] is a constant, it is possible to assume that w is the benefit of the human (client) after the environmental protection input, and there is v ( w ) = w , then the expected utility of the human (client) is equal to the expected return, that is E v [ π − s ( π ) ] = − α + ( 1 − β ) a .
As the previous assumption, the absolute risk aversion of the natural (agent) is constant and is a constant, set to ρ , and then set m as the actual income of the natural (agent), then the calculation formula according to the risk aversion degree is:
ρ = − u ″ u ′ , ρ > 0
So ρ = − d ln u ′ d m , ln u ′ = − ρ m + B , where B is an arbitrary constant.
Then there are u ′ = e B e − ρ m , the solution gets u = − e B ρ e − ρ m + C , where B and C are constants. For the sake of simplicity, we can set B = ln ρ , and thus there are e B ρ = 1 , so the whole formula becomes u = − e − ρ m .
According to the principal-agent model, we set c ( a ) as the cost of the natural (agent) sacrifice to satisfy the needs of humans (principals) a the cost of complete self-recovery, which is equivalent to the cost of money, and c ( a ) = b a 2 / 2 , where b > 0 is the cost coefficient.
Therefore, the actual benefits of nature (agents) are:
m = s ( π ) − c ( a ) = α + β ( a + θ ) − b a 2 2
According to the definition of deterministic equivalent income: If u ( χ ) = E u ( m ) , where m is random income and u ( χ ) is the utility function of income, then χ is m deterministic equivalent income. Therefore, for nature as
an agent, there are: − e − ρ χ = ∫ − e − ρ m g ( θ ) d θ , of which g ( θ ) = A e − θ 2 2 σ 2 is a normal distribution density function of θ , so there are:
− e − ρ χ = − ∫ e − ρ [ α + β ( a + θ ) − b a 2 2 ] A e − θ 2 2 σ 2 d θ ,
After finishing, get:
− e − ρ χ = − e − ρ ( α + β a − b a 2 2 − ρ β 2 σ 2 2 )
So have: χ = α + β a − b a 2 2 − ρ β 2 σ 2 2 = E ( m ) − ρ β 2 σ 2 2 , therefore, χ is a deterministic equivalent income of m.
Assume that the maximum environmental pollution and damage caused by human economic development, that is, the carrying threshold of the natural environment is n, that is, the ultimate return of natural being is lower than this limit, it is necessary to conduct human revenge. So it satisfies the constraints:
χ = α + β a − b a 2 2 − ρ β 2 σ 2 2 ≥ n
Order ∂ χ ∂ a = 0 , calculate a = β b .
Therefore, under rational conditions, the maximum benefit and input of humans (principals) should be the solution to the following problems:
max α , β [ − α + ( 1 − β ) a ] s .t . α + β a − b a 2 2 − ρ β 2 σ 2 2 ≥ n a = β b
In the principal-agent model, the constraint of the above problem is an equation, so the constraint is brought into the objective function, which can be calculated:
α = n + 1 2 ( ρ σ 2 − 1 b ) 1 ( 1 + b ρ σ 2 ) 2
β = 1 1 + b ρ σ 2 > 0
At this time, the expected benefits that human beings receive as principals are:
E v = − n + 1 2 b ( 1 + b ρ σ 2 )
α As a human being’s spontaneous contribution to environmental protection in life, its promotion relies on the vigorous publicity and education of government departments and public media; β as a large-scale and purposeful environmental protection investment of government departments, it is environmental protection. The main part of the investment; and under the rational assumptions, the maximum benefit that humans can get is Ev, which is certainly much greater than the gains that humans receive when they suffer losses due to natural retaliation.
Considering nature as a party to participate in the game, if the assumption of “rational man” is abandoned to transform the principal-agent model, then Rabin’s reciprocity theory of “returning the money and repaying the teeth” is more suitable to explain the nature of nature. Nature can be described in this way: if human beings are friendly to nature and know how to be kind to nature when acquiring resources, then nature will also return to human beings, so that human society and economy will continue to develop; if human beings are not friendly to nature, they will be unrestrained in nature. Development and destruction, then it will also retaliate against humans. Therefore, we can try to introduce Rabin’s reciprocity theory into the principal-agent model, and then analyze the investment and benefits of environmental protection.
If humans (principals) increase the investment of environmental protection funds on the basis of rational input, the total income of nature (agent) under the maximum contribution condition is higher than its minimum acceptable limit n, The increase of the input amount is 0 ≤ r ≤ η ; and naturally as an agent, its behavior is consistent with the previous reciprocity, so naturally it will make more contributions to human beings than under rational conditions to increase human income, and this increased contribution is a * . Therefore, the natural deterministic benefit is n + r , and n + r is less than the natural benefit of naturally increasing the contribution of human beings without increasing their contribution to environmental protection, that is 0 ≤ r ≤ η .
Under the assumption of “rational people”, people’s spontaneous contribution to environmental protection in life is α 0 , even under the assumption of “reciprocity”. Therefore:
α 0 + η + β ( a + a * ) − 1 2 ρ β 2 σ 2 − b 2 ( a + a * ) 2 = n + r Solution:
a * = 1 b 2 b ( η − r ) + ( 1 − b ρ σ 2 ) [ β 2 − 1 ( 1 + b ρ σ 2 ) 2 ] (tossing negative roots)
And because humans increase the proportion of the purpose of environmental protection investment on the basis of rational input, so β is unchanged, and
β = 1 1 + b ρ σ 2 , so there are:
a * = 1 b 2 b ( η − r )
Because of 0 ≤ r ≤ η , we get a * ≥ 0 , which means a * is a meaningful real solution. In the case that humans (principals) increase their input, 0 ≤ r ≤ η on the one hand shows that the contribution of nature to human beings is greater than that under rational conditions, and that nature as an agent has chosen to be smaller than its rational state. In return for human beings, the benefits of human beings must also increase, and the natural behavior must be “reciprocal”.
Thus, under irrational conditions, the expected benefits of humans (clients) are:
E v * = − α 0 + ( 1 − β ) ( a + a * ) − η = − α 0 + ( 1 − β ) [ β b + 1 b 2 b ( η − r ) ] − η
Find E v * to η partial derivatives and make them equal to 0:
η = r + b ρ 2 σ 4 2 ( 1 + b p σ 2 ) 2
Obviously, r ≤ η .
E v * = E v − r + b ρ 2 σ 4 2 ( 1 + b p σ 2 ) 2
among them, E v = − n + 1 2 b ( 1 + b p σ 2 ) is the expected return of human beings under rational assumptions.
As can be seen from the calculation results of E v * , There will be E v * ≥ E v when b ρ 2 σ 4 2 ( 1 + b ρ σ 2 ) 2 ≥ r . And because
η = r + b ρ 2 σ 4 2 ( 1 + b ρ σ 2 ) 2 , b ρ 2 σ 4 2 ( 1 + b ρ σ 2 ) 2 = η − r
so b ρ 2 σ 4 2 ( 1 + b ρ σ 2 ) 2 > r can be rewritten as η > 2 r , that is to say, when humans
increase the environmental investment on the basis of rational input more than twice the amount of natural increase under the “reciprocity” hypothesis, human beings Under irrational conditions, the gains obtained under rational conditions are greater, and the actual returns are also higher than their carrying thresholds.
Using the principal-agent model based on the hypothesis of “rational man” to study the behavioral game between human and nature, the optimal strategy is to require the constraint to give the natural (agent) actual income exactly equal to the natural carrying threshold. Or the natural income of nature is at the edge of the carrying threshold, which is the inevitable choice of human (client) as a “rational person.” That is to say, in the game, human beings as rational agents will inevitably give nature as little real income as possible while keeping nature (agents) from retaliation, because if you increase the actual income of nature, then increasing the cost of human beings is not conducive to gaining benefits. On the other hand, humans also believe that rational nature does not allow humans to gain more benefits by increasing profits.
However, after the improvement of the principal-agent model, it is proved that at η > 2 r time, the income of human and nature in the irrational state can be greater than the maximum benefit in the rational state, and then the total income of the system composed of man and nature. It is also greater than the total system revenue under rational conditions. This is a result of making people’s investment in environmental protection more efficient, and mutual benefit and win-win between man and nature. In fact, this is a Pareto improvement of human and natural game behavior.
It is also foreseeable that in different natural environments or in different aspects of the natural environment, the natural reciprocity motivations are not the same (That is r, there are big and small), so humans can use the characteristics of natural irrationality. Carry out environmental protection investment. Pay attention to environmental protection inputs while obtaining resources from nature, For example, investment in environmental protection under the conditions put
in more rational than η = r + b ρ 2 σ 4 2 ( 1 + b ρ σ 2 ) 2 , Thereby, human society and nature
develop in harmony, and thus gain greater benefits, achieve sustainable use of natural resources and sustainable development of human society.
The existing analysis and research on environmental protection investment and benefit are generally carried out under the assumption of “rational person”. According to the current behavioral economics and behavioral game theory, this paper expands the “rational person” hypothesis into the “reciprocity” hypothesis. The commission-agent model of environmental protection investment benefit analysis embodying “reciprocal” irrational behavior was constructed. Through the research and analysis of the model, the results show that the introduction of “reciprocity” makes it possible for humans to give more benefits than their carrying limit in the process of environmental protection input, it will increase the benefits of humans. When humans increase their environmental investment on the basis of rational input by more than twice the increase in the amount of income under the assumption of “reciprocity”, humans and nature receive higher returns under the assumption of “reciprocity”. The benefits obtained under the assumption of “rational people” will enable environmental protection inputs to have higher benefits and achieve sustainable economic development. The game result based on the “rational man” hypothesis is not a mutually beneficial win-win result, and the game result based on “reciprocity” shows that man and nature can achieve mutual benefit and win-win.
Therefore, the enlightenment on the behavior of environmental protection government is: First, the government should not regard the natural environment as the “rational” nature of pursuing profit maximization when formulating environmental protection policies, but should have a certain degree of “reciprocity”. Naturally, it can be seen from the above η > 2 r condition that the greater the reciprocity of nature, the smaller the increase in environmental protection investment. Secondly, when the government invests in environmental protection, it only needs to increase the rational input. The amount of environmental protection input η is greater than twice the natural increase r in income under the assumption of “reciprocity”, which can achieve the goal of further increase in environmental protection investment efficiency under the “rational” assumption.
In addition, for the convenience of research, in this paper, the maximum carrying capacity of the natural environment, that is, the carrying threshold of the natural environment, is set to a constant in the construction of the model. However, in practice, with the deepening of regional social and economic activities and the development of the regional environmental system itself, the regional environmental carrying capacity will change [
The authors thank the financial support of Humanity and Social Science Project 18SKSJ052 from the Chongqing Education Commission. (Project title: Regional competition and cooperation, economic development and sustainable growth in Chongqing.)
Also thanks for the financial support of the national social science fund of China 18XJY022 (Fund title: Study on the Feasible Ways to Develop Green Finance in the New Era).
The authors declare no conflicts of interest regarding the publication of this paper.
Zhao, C.Y., Xiang, T.T. and Peng, H. (2018) Benefit Analysis of Environmental Protection Investment Based on Reciprocity Theory. Theoretical Economics Letters, 8, 3395-3410. https://doi.org/10.4236/tel.2018.815208