Research Study on Corporate Taxation

Description
Many countries impose corporate tax, also called corporation tax or company tax, on the income or capital of some types of legal entities.

American Economic Journal: Economic Policy (August 2010): 1–31 http://www.aeaweb.org/articles.php?doi=10.1257/pol.2.3.1

Dividend and Corporate Taxation in an Agency Model of the Firm†
By Raj Chetty and Emmanuel Saez*

Recent evidence on the effect of dividend taxes on firm behavior is inconsistent with neoclassical theories of dividend and corporate taxation. We develop a simple agency model in which managers and shareholders have conflicting interests to explain the evidence. In this model, dividend taxation induces managers to undertake unproductive investments by retaining earnings, and creates a first-order deadweight cost. In contrast, corporate taxes do not distort the manager’s payout decision and may only create second-order efficiency costs. Corporate income taxation may therefore be a more efficient way to generate revenue than dividend taxation, challenging existing intuitions based on neoclassical models. (JEL D21, G35, H25, H32)

The 2003 dividend tax reform in the United States has sparked a new wave of research on the effects of dividend and corporate taxation (Jouahn Nam, Jun Wang, and Ge Zhang 2004; Chetty and Saez 2005; Jeffrey R. Brown, Nellie Liang, and Scott Weisbenner 2007). Chetty and Saez (2005) document four empirical results:
• Regular dividends rose sharply after the 2003 tax cut, with an implied net-of-tax elasticity of dividend payments of 0.75.
• The response was very rapid (total dividend payouts rose by 20 percent within one year of enactment) and was stronger among firms with high levels of accumulated assets.
• The response was much larger among firms where top executives owned a larger fraction of outstanding shares (see also Nam, Wang, and Zhang 2004 and Brown, Liang, and Weisbenner 2007).
• The response was much larger among firms with large shareholders on the board of directors.

* Chetty: Department of Economics, Harvard University, Littauer Center, 1805 Cambridge Street, Cambridge, MA 02138, and National Bureau of Economic Research (e-mail: [email protected]); Saez: Department of Economics, University of California, Berkeley, 549 Evans Hall #3880, Berkeley, CA 94709, and National Bureau of Economic Research (e-mail: [email protected]). We thank Martin Feldstein, Roger Gordon, Kevin Hassett, James Poterba, two anonymous referees, and numerous seminar participants for very helpful comments. Gregory Bruich, Keli Liu, Joseph Rosenberg, and Ity Shurtz provided outstanding research assistance. Financial support from National Science Foundation grants SES-0134946 and SES-0452605 is gratefully acknowledged.
† To comment on this article in the online discussion forum, or to view additional materials, visit the articles page at http://www.aeaweb.org/articles.php?doi=10.1257/pol.2.3.1.


It is difficult to reconcile these four findings with either of the two leading theories of corporate taxation: the “old view” (Arnold C. Harberger 1962, 1966; Martin S. Feldstein 1970; James M. Poterba and Lawrence H. Summers 1985) and the “new view” (Mervyn A. King 1977; Alan J. Auerbach 1979; David F. Bradford 1981). The increase in dividends appears to support the old view because dividends should not respond to permanent dividend tax changes under the new view.1 However, the increase in dividend payments is too rapid to be explained by the supply-side investment mechanism of the old view model.2 The rapid dividend payout response could potentially be explained by incorporating a signaling value for dividends as in Poterba and Summers (1985) or B. Douglas Bernheim (1991).3 However, neither signaling models nor the standard old and new view models directly predict findings three and four on the cross-sectional heterogeneity in the dividend payout response by firm ownership structure.4 In this paper, we propose a simple alternative model of dividend and corporate income taxation that matches the four empirical findings based on the agency theory of the firm (Michael C. Jensen and William H. Meckling 1976). The critical feature of the model is a divergence between the preferences of managers and shareholders. We model this divergence as arising from perks and pet projects, although the underlying source of the conflict between managers and shareholders does not matter for our analysis. Shareholders can provide incentives to managers to invest and pay out dividends through costly monitoring and pay-for-performance. Only the large shareholders of the firm choose to monitor the firm in equilibrium (Andrei Shleifer and Robert W. Vishny 1986, 1997). 
In this model, a dividend tax cut leads to an immediate increase in dividend payments because it increases the manager’s preference for dividends relative to the pet project and increases the amount of monitoring by large shareholders. Firms where managers place more weight on profit maximization, either because the manager owns a large number of shares or because there are more large shareholders, are more likely to increase dividends in response to a tax cut. After showing that the positive predictions of the agency model fit the recent evidence on dividend taxation, we characterize its implications for the efficiency costs of dividend and corporate taxation by deriving empirically implementable formulas for excess burden. We obtain two results that challenge intuitions from existing neoclassical models. First, dividend taxes create a deadweight cost, even if the marginal source of investment is retained earnings, by distorting the tradeoff between pet
1 One way of reconciling the dividend increase with the new view is if the tax cut was perceived as temporary by firms. However, Auerbach and Kevin A. Hassett (2007) document that the share prices of immature firms that are predicted to pay dividends in the future rose when the reform was announced, suggesting that firms perceived the tax cut as fairly permanent.
2 Poterba’s (2004) estimates using an old view model implied that the 2003 tax reform would increase dividend payments by 20 percent in the long run, but that only a quarter of the long-run effect would occur within three years after the tax cut.
3 There is debate in the corporate finance literature about the signal content of dividends. Conditional on information available at time t, dividend increases have little predictive power for future earnings (see e.g., Shlomo Benartzi, Roni Michaely, and Richard H. Thaler 1997; Gustavo Grullon et al. 2005).
4 The empirical evidence is also not fully explained by Hans-Werner Sinn’s (1991) “life cycle” model in which firms progress from the old view to the new view. In that model, the payout response should be smaller among firms with higher levels of accumulated assets, but the data exhibit the opposite pattern.


project investment and dividend payouts. Second, if the contract between shareholders and the manager is second-best inefficient, as is the case in a model with diffuse shareholders, dividend taxation creates a first-order efficiency cost. In contrast, the corporate tax may generate only standard second-order efficiency costs because it does not amplify the manager’s incentive to hoard cash for pet projects.5 This suggests that corporate taxes may be a more efficient way to generate revenue than dividend taxes. Indeed, our analysis suggests that a Pigouvian dividend subsidy would be desirable to correct the negative externality created by agency problems in firms.

The most important limitation of our analysis is that it does not explicitly model share repurchases, which give firms a way to return money to shareholders without paying dividend taxes. In the Appendix, we extend the model to permit costly share repurchases, as in Poterba and Summers (1985). The formulas for excess burden remain the same, but the first-order agency-related term depends upon the elasticity of total payouts (share repurchases plus dividends) with respect to taxes. Intuitively, dividend taxes do not have first-order efficiency costs if they simply induce substitution between dividends and repurchases without changing pet project investment. Note that this extension of our analysis relies on a reduced form model for share repurchases and does not explain the puzzle of why firms pay dividends even though dividends are tax-disadvantaged. Understanding the microeconomic foundations of the cost of share repurchases is an issue of great importance for future work, independent of its potential implications for taxation.

This paper is related to two contemporaneous theoretical studies motivated by evidence from the 2003 dividend tax cut.
Roger Gordon and Martin Dietz (2008) contrast the effects of dividend taxation in new view, signaling, and agency models, and conclude that the agency model is most likely to fit the empirical evidence. The central difference between our model and Gordon and Dietz’s (2008) agency model is in the assumption about which agent sets the firm’s dividend policy. Gordon and Dietz (2008) assume that dividend payout decisions are made by shareholders, whereas we assume that they are made by management. This leads to different results in both the positive and efficiency analysis. Gordon and Dietz’s (2008) model does not directly predict a link between executive or board share ownership and behavioral responses to dividend taxation. Taxing dividends does not create a first-order distortion in their model, since dividends are always set at the second-best efficient level by shareholders. Their model does, however, generate the empirically validated prediction that dividend policies change rarely over time, which our model does not produce. Our model and Gordon and Dietz’s (2008) analysis should therefore be viewed as complementary efforts to explain different aspects of dividend policies. A second recent study is that of Anton Korinek and Joseph E. Stiglitz (2009), who build on Sinn’s (1991) model to analyze the effects of temporary changes in dividend tax rates. They incorporate financing constraints and establish new results on intertemporal tax arbitrage opportunities for firms. In contrast with our model, Korinek and Stiglitz (2009) assume that retained earnings are allocated efficiently
5 The corporate tax does not always have second-order efficiency costs in our model. If it distorts the manager’s contract, it too may generate first-order efficiency costs.


by the manager. As a result, they obtain the new view neutrality result that permanent dividend tax policy changes have no effects on economic efficiency. The remainder of the paper is organized as follows. In Section I, we present a neoclassical two-period model that nests the old and new views as a benchmark. In Section II, we introduce agency problems into the model and characterize manager and shareholder behavior. In Section III, we characterize behavioral responses to dividend taxation and compare the agency model’s predictions with the recent empirical evidence. In Section IV, we analyze the efficiency consequences of dividend and corporate taxation. Section V concludes.
I. The Old and New Views in a Two-Period Model

We begin with a neoclassical two-period model that nests the old and new views and serves as a point of departure for our agency analysis. Consider a firm that has initial cash holdings of $X$ at the beginning of period 0. These cash holdings represent profits from past operations.6 The firm can raise additional funds by issuing equity ($E$). The firm’s manager can do two things with the firm’s cash holdings: pay out dividends or invest the money in a project that yields revenue in the next period. Let $I$ denote the level of investment and $D = X + E - I$ denote the firm’s dividend payment in period 0. In period 1, the firm generates net profits of $f(I)$, where $f$ is a strictly concave function.7 The firm then closes and returns its net-of-tax profits and principal to shareholders.

The firm’s profits are subject to two types of taxes. First, the firm pays a corporate tax at rate $t_c$ on its net profits in period 1, so that net-of-corporate-tax profits are $(1 - t_c) f(I)$. Second, it pays a dividend tax at rate $t_d$ on distributed profits in all periods. However, the principal invested by shareholders ($E$) is not subject to the dividend tax.8 Hence, the net-of-tax payout in period 0 is $(1 - t_d)D$, and the net-of-tax payout in period 1 is $(1 - t_d)[(1 - t_c) f(X + E - D) + X - D] + E$. Investors can also purchase a government bond that pays a fixed, untaxed interest rate of $r > 0$ (which is unaffected by the dividend tax rate).9 The manager’s objective is to choose the level of equity issues and dividends (and investment) that maximize the value of the firm:

$$\max_{D,\,E} \; V = (1 - t_d)D - E + \frac{(1 - t_d)[(1 - t_c) f(X + E - D) + X - D] + E}{1 + r}. \tag{1}$$

6 We can allow part of the existing cash holdings $X$ to represent the principal of shareholders without any impact on the analysis as long as firms cannot return the principal before liquidation and firms do not choose to distribute all their past profits in period 0.
7 The gross production function is $F(I) = f(I) + I$. $f(I)$ denotes profits net of the depreciation of capital used for production.
8 In the United States, distributed profits are considered dividends for tax purposes, but returning shareholders’ principal is not considered a dividend.
9 Throughout this paper, we abstract from general-equilibrium effects through which changes in $t_d$ may affect the equilibrium rate of return, $r$.


To characterize these choices, it is useful to distinguish between two cases: (1) a cash-rich firm that has retained profits $X$ such that its net-of-corporate-tax marginal return $(1 - t_c) f'(X) \le r$; and (2) a cash-constrained firm that has cash $X$ such that $(1 - t_c) f'(X) > r$. The “new view” model considers firms of the first type, while the “old view” pertains to firms of the second type.

Cash-Rich Firms (The New View). First observe that the firm will never set $E > 0$ and $D > 0$ simultaneously. If a firm issued equity and paid dividends, it could strictly increase its value $V$ by reducing both $E$ and $D$ by \$1 and lowering its tax bill by \$$t_d r/(1 + r)$. Now consider the marginal value of issuing equity when $D = 0$ for the cash-rich firm:

$$\frac{\partial V}{\partial E}(D = 0) = -1 + \frac{(1 - t_d)(1 - t_c) f'(X) + 1}{1 + r} = \frac{(1 - t_d)(1 - t_c) f'(X) - r}{1 + r} \le 0.$$

This expression implies that a cash-rich firm optimally sets $E^* = 0$. The optimal choice of dividends satisfies the first-order condition

$$(1 - t_c) f'(X - D^*) = r.$$
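The first-order condition above and the \$1 argument that precedes it can be checked numerically. The following is a minimal sketch, not part of the original analysis: it assumes a particular profit function, f(I) = 2*sqrt(I), and hypothetical parameter values (X = 100, t_c = 0.3, r = 0.1), chosen only so that the firm is cash-rich. The function names are illustrative.

```python
import math


def f(i):
    """Assumed profit function f(I) = 2*sqrt(I); strictly concave, as the model requires."""
    return 2.0 * math.sqrt(i)


def firm_value(d, e, x, td, tc, r):
    """Firm value V from equation (1) for dividends d and equity issues e."""
    invest = x + e - d  # I = X + E - D
    period1 = (1 - td) * ((1 - tc) * f(invest) + x - d) + e
    return (1 - td) * d - e + period1 / (1 + r)


def optimal_dividend(x, td, tc, r, steps=10_000):
    """Grid-search the value-maximizing D in [0, X], holding E = 0."""
    grid = (x * n / steps for n in range(steps + 1))
    return max(grid, key=lambda d: firm_value(d, 0.0, x, td, tc, r))


# Hypothetical parameters: (1 - t_c) f'(X) = 0.07 <= r, so the firm is cash-rich.
X, TC, R = 100.0, 0.3, 0.1
for td in (0.0, 0.15, 0.35):
    print(td, optimal_dividend(X, td, TC, R))

# Cutting both E and D by $1 leaves investment unchanged and raises V
# by t_d * r / (1 + r), the tax saving described in the text.
gain = firm_value(30.0, 5.0, X, 0.2, TC, R) - firm_value(31.0, 6.0, X, 0.2, TC, R)
print(gain, 0.2 * R / (1 + R))
```

Under these assumptions the search returns the same dividend for every value of the dividend tax rate, in line with the closed form D* = X - ((1 - t_c)/r)^2 implied by the first-order condition for this f.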

Cash-rich firms invest to the point where the net-of-corporate-tax marginal product of investment $(1 - t_c) f'(I)$ equals the return on investment in the bond, $r$. Increases in the corporate tax rate reduce the level of investment, increase period 0 dividend payments, and reduce period 1 dividend payments. However, the dividend tax rate $t_d$ has no impact on dividend payments and investment levels. This is the classic “new view” dividend tax neutrality result (King 1977; Auerbach 1979; Bradford 1981). The source of this result is transparent in the two-period case: the $(1 - t_d)$ term factors out of the value function in equation (1) when $E = 0$. Dividend taxation has no impact on the behavior of cash-rich firms because they must pay the dividend tax regardless of whether they pay out profits in the current or next period. In contrast, the corporate tax changes the relative price of paying out dividends immediately and investing to earn further profits, and therefore distorts behavior.

Cash-Constrained Firms (The Old View). Now consider a firm with $X$ such that $(1 - t_c) f'(X) > r$. The marginal value of paying dividends when $E = 0$ for this “cash-constrained” firm is

$$\frac{\partial V}{\partial D}(E = 0) = 1 - t_d - \frac{1 - t_d}{1 + r}\,[(1 - t_c) f'(X) + 1] = (1 - t_d)\,\frac{r - (1 - t_c) f'(X)}{1 + r} < 0.$$


A cash-constrained firm does not pay dividends in the first period because its marginal product of investment exceeds the interest rate. This firm therefore invests all the cash it has: $I = X + E$. The optimal choice of equity issues is given by

$$E^* = 0 \quad \text{if } (1 - t_d)(1 - t_c) f'(X) < r \tag{2}$$

$$(1 - t_d)(1 - t_c) f'(X + E^*) = r \quad \text{if } (1 - t_d)(1 - t_c) f'(X) \ge r. \tag{3}$$
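Conditions (2) and (3) can be illustrated with a short sketch. It is not taken from the paper: it assumes the profit function f(I) = 2*sqrt(I), for which f'(I) = 1/sqrt(I) and condition (3) has a closed form, and it uses hypothetical parameter values. The function name is illustrative.

```python
import math


def equity_and_investment(x, td, tc, r):
    """Return (E*, I*) for a cash-constrained firm, assuming f(I) = 2*sqrt(I).

    The caller is expected to pass a firm with (1 - tc) * f'(x) > r.
    """
    net = (1 - td) * (1 - tc)      # combined net-of-tax factor
    if net / math.sqrt(x) < r:     # condition (2): corner solution, no equity
        return 0.0, x
    invest = (net / r) ** 2        # condition (3): net * f'(X + E*) = r
    return invest - x, invest


# Hypothetical low-cash firm (X = 10): a dividend tax cut raises E* and I*.
print(equity_and_investment(10.0, 0.2, 0.3, 0.1))
print(equity_and_investment(10.0, 0.0, 0.3, 0.1))

# Hypothetical medium-cash firm (X = 40): no equity at t_d = 0.2,
# but it starts issuing equity once the dividend tax is removed.
print(equity_and_investment(40.0, 0.2, 0.3, 0.1))
print(equity_and_investment(40.0, 0.0, 0.3, 0.1))
```

The second pair of calls is meant to show the extensive margin: under these assumed numbers the tax wedge alone keeps the X = 40 firm at the corner solution.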

Conditions (2) and (3) show that firms that finance their marginal dollar of investment from new equity issues invest to the point where the marginal net-of-dividend and corporate tax return to investment equals the return on investment in the bond, $r$. Firms that have $X$ sufficiently large, so that $(1 - t_d)(1 - t_c) f'(X) < r$, have a net-of-tax return below the interest rate for the first dollar of equity. These “medium-cash” firms choose the corner solution of no equity (and no dividends) because of the tax wedge.

Unlike for cash-rich firms, the dividend tax distorts the behavior of low-cash firms. Implicit differentiation of (3) shows that increases in $t_d$ reduce equity issues and investment ($\partial I^*/\partial t_d < 0$, $\partial E^*/\partial t_d < 0$). This is because the $(1 - t_d)$ term does not factor out of the value function in equation (1) when $D = 0$ and $E > 0$. Intuitively, a dividend tax increase lowers the marginal product of investment, but does not affect the price of investment for cash-constrained firms. Firms therefore reduce investment, issue less equity, and pay fewer dividends in period 1, the classic “old view” predictions (Poterba and Summers 1985). Corporate taxes produce the same effects because they affect the value of cash-constrained firms in exactly the same way as dividend taxes. Note that dividend payments are not affected by tax changes in the short run. Following a dividend or corporate tax change, investment and equity issues respond immediately (period 0), and dividends change only when the additional investment pays off (period 1).

Efficiency Costs. Finally, we characterize the efficiency cost of introducing a dividend and corporate tax for the two types of firms. Let

$$P_d = D + \frac{(1 - t_c) f(I) + X - D}{1 + r}$$

denote the dividend tax base, i.e., the total dividend payout over the two periods, and

$$P_c = \frac{f(I)}{1 + r}$$

denote the corporate tax base. Total surplus in the economy is $W = V + t_d P_d + t_c P_c$. Using the envelope conditions, differentiating (1) yields $dV/dt_d = -P_d$ and $dV/dt_c = -(1 - t_d)P_c$. Therefore, we obtain the standard Harberger-type formulas for marginal deadweight burden:

$$\frac{dW}{dt_d} = t_d \frac{dP_d}{dt_d} + t_c \frac{dP_c}{dt_d} \tag{4}$$

$$\frac{dW}{dt_c} = t_d \left( \frac{dP_d}{dt_c} - \frac{\partial P_d}{\partial t_c} \right) + t_c \frac{dP_c}{dt_c}, \tag{5}$$

where $\partial P_d/\partial t_c = -P_c$ denotes the mechanical effect of increasing $t_c$ on the firm’s payout, and $dP_d/dt_c - \partial P_d/\partial t_c$ thus measures the distortion in dividend payments created by the corporate tax due to behavioral responses. Equations (4) and (5) apply to both cash-rich and cash-constrained firms. To obtain further insight into the key determinants of excess burden, it is helpful to consider the old and new view firms separately.

For new view firms, dividend taxes do not distort behavior: $dP_d/dt_d = dP_c/dt_d = 0$. In addition, because these firms choose $D$ and $I$ to maximize $P_d$ itself, $dP_d/dt_c = -P_c$. Hence, for new view firms, the formulas for excess burden simplify to

$$\frac{dW}{dt_d} = 0 \tag{6}$$

$$\frac{dW}{dt_c} = t_c \frac{dP_c}{dt_c}. \tag{7}$$

Intuitively, an increase in $t_c$ does not distort total dividend payments because the marginal reduction in period 1 dividends is canceled out by the marginal increase in period 0 dividends for a profit-maximizing firm. As a result, the only distortionary effect of the corporate tax comes from its effect on the corporate tax base itself.

For old view firms, dividend and corporate taxes both distort the return to investment in the same way, implying $dP_c/dt_d = dP_c/dt_c$. Because old view firms pay dividends only in period 1, the effects of $t_d$ and $t_c$ on the dividend tax base are fully determined by their effects on profits, which equal the corporate tax base: $dP_d/dt_d = (1 - t_c)\,dP_c/dt_d$ and $dP_d/dt_c = -P_c + (1 - t_c)\,dP_c/dt_c$. Combining these results, we obtain

$$\frac{dW}{dt_c} = \frac{dW}{dt_d} = (t_c + t_d - t_c t_d)\,\frac{dP_c}{dt_c}. \tag{8}$$

Intuitively, both dividend and corporate taxes reduce the profits earned by old view firms in the same way. The total revenue obtained from the two taxes is $(t_d(1 - t_c) + t_c)P_c$, leading to the formula in (8).

This analysis yields two general lessons about efficiency costs that we will revisit below. First, dividend taxation has an efficiency cost only for firms which finance investment from new equity issues, whereas corporate taxation has an efficiency cost for both types of firms. Because most investment is accounted for by firms with large amounts of retained earnings, this leads to the view that dividend taxes are a more efficient instrument for raising tax revenue than corporate taxes. Second, when one starts from a situation with no taxes, the introduction of a small corporate tax has a second-order (i.e., small) efficiency cost, as does the introduction of a small dividend tax for old view firms. The main predictions of the old and new view models are summarized on the left side of Table 1. The central assumption underlying these results is that firms’


Table 1. Summary of Key Predictions: Neoclassical versus Agency Models

| | Neoclassical model: low cash (old view) | Neoclassical model: medium cash | Neoclassical model: high cash (new view) | Agency model: high cash | Agency model: very high cash |
|---|---|---|---|---|---|
| Initial cash $X$ | $f'(X) > r/[(1 - t_d)(1 - t_c)]$ | $r/[(1 - t_d)(1 - t_c)] \ge f'(X) \ge r/(1 - t_c)$ | $r/(1 - t_c) > f'(X)$ | $r/(1 - t_c) > f'(X)$ and $g'(X - I^*) \ge \omega r$ | $r/(1 - t_c) > f'(X)$ and $g'(X - I^*) < \omega r$ |
| Dividends $D$ | $D = 0$ | $D = 0$ | $D > 0$, $f'(X - D) = r/(1 - t_c)$ | $D = 0$ | $D > 0$, $g'(X - I^* - D) = \omega r$ |
| Equity issues $E$ | $E > 0$, $f'(X + E) = r/[(1 - t_d)(1 - t_c)]$ | $E = 0$ | $E = 0$ | $E = 0$ | $E = 0$ |
| Productive investment $I$ | $I > X$, $f'(I) = r/[(1 - t_d)(1 - t_c)]$ | $I = X$, $f'(I) = f'(X)$ | $I < X$, $f'(I) = r/(1 - t_c)$ | $g'(X - I) = \omega(1 - t_c) f'(I)$, $(1 - t_c) f'(I) > r$ | $(1 - t_c) f'(I) = r$ (i.e., $I = I^*$), $g'(J) = \omega r$ |
| Effects of reducing dividend tax $t_d$ | No effect on $D$; $I$ increases, $E$ increases | Intensive margin: no effect on $D$, $E$, $I$. Extensive margin: some firms shift to low cash regime, start issuing $E$ and increase $I$ | No effect on $D$, $E$, $I$ | Intensive margin: no effect on $D$ and $E$, $I$ increases, $J$ decreases. Extensive margin: some firms shift to very high cash regime, start paying dividends | $D$ increases, $J$ decreases; no effect on $I$ and $E$ |
| Heterogeneity of $D$ response to tax cut by ownership structure | none | none | none | Extensive margin: higher likelihood and larger $D$ initiations if exec. or board share high | Larger increase in $D$ if exec. or board share high (if third derivatives of $g$, $c$ small) |
| Efficiency cost of $t_d$ | second-order: $dW/dt_d = (t_c + t_d - t_c t_d)\,dP_c/dt_c$ | none: $dW/dt_d = 0$ | none: $dW/dt_d = 0$ | first-order if $\alpha < 1$: $dW/dt_d = (t_c + t_d - t_c t_d)\,dP_c/dt_c + (1 - t_d)(1 - t_c)(1 - \alpha)\,dP_c/dt_c$ | first-order if $\alpha < 1$: $dW/dt_d = [t_d + (1 - t_d)(1 - \alpha)]\,dP_d/dt_d$ |
| Efficiency cost of $t_c$ | second-order: $dW/dt_c = (t_c + t_d - t_c t_d)\,dP_c/dt_c$ | none: $dW/dt_c = 0$ | second-order: $dW/dt_c = t_c\,dP_c/dt_c$ | first-order if $\alpha < 1$: $dW/dt_c = (t_c + t_d - t_c t_d)\,dP_c/dt_c + (1 - t_d)(1 - t_c)(1 - \alpha)\,dP_c/dt_c$ | second-order: $dW/dt_c = t_c\,dP_c/dt_c$ |

Notes: This table summarizes the firm’s choice of dividends ($D$), equity issues ($E$), and investment ($I$) in the neoclassical and agency models. Behavior depends on the level of initial cash holding $X$, which varies across the columns. $I^*$ denotes the optimal investment level from the shareholders’ perspective given the corporate tax $t_c$, which satisfies $f'(I^*) = r/(1 - t_c)$. In the agency model, we only consider the case where initial cash is high enough so that the firm does not issue equity. Positive predictions reported are for the model in Sections III and IV with an exogenous manager share $\alpha$ and endogenous monitoring $\gamma$ so that $\omega = \alpha(1 - t_d)(1 + \gamma)$. The efficiency costs are reported for the special case in Section IVA with exogenous $\alpha$ and no monitoring. Section IVB shows that the formulas extend with endogenous $\alpha$ and monitoring by substituting $\alpha$ for $\alpha_B$ (share ownership of large shareholders). Note that the efficiency cost formulas ignore changes in the thresholds that define the low versus high cash categories and therefore apply only to firms in the interior of these categories.

managers choose policies solely to maximize firm value. This assumption contrasts with the modern corporate finance literature, which emphasizes the tension between executives’ and shareholders’ interests in explaining corporate behavior and payout policies. The next section incorporates these considerations into the model.
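As a recap of the benchmark model, formulas (6) and (7) for new view firms can be checked by finite differences. The sketch below is illustrative only: it assumes the profit function f(I) = 2*sqrt(I) and hypothetical values r = 0.1, X = 100, t_c = 0.3, and the function names are not from the paper.

```python
import math

R = 0.1    # assumed bond return
X = 100.0  # assumed initial cash, large enough that the firm is cash-rich


def new_view_outcome(tc):
    """Return (W, P_c) for a new view firm, assuming f(I) = 2*sqrt(I) and E = 0."""
    invest = ((1 - tc) / R) ** 2       # solves (1 - t_c) f'(I) = r
    profit = 2.0 * math.sqrt(invest)   # f(I)
    dividend = X - invest              # D = X - I
    p_d = dividend + ((1 - tc) * profit + X - dividend) / (1 + R)
    p_c = profit / (1 + R)
    # W = V + t_d P_d + t_c P_c with V = (1 - t_d) P_d, so t_d cancels:
    # this is formula (6), dW/dt_d = 0.
    return p_d + tc * p_c, p_c


def central_diff(fn, t, h=1e-5):
    """Two-sided finite-difference derivative of fn at t."""
    return (fn(t + h) - fn(t - h)) / (2 * h)


tc = 0.3
dW_dtc = central_diff(lambda t: new_view_outcome(t)[0], tc)
dPc_dtc = central_diff(lambda t: new_view_outcome(t)[1], tc)
print(dW_dtc, tc * dPc_dtc)  # the two sides of formula (7)
```

With these assumed numbers both sides of (7) come out at roughly -5.45, so the marginal welfare loss from the corporate tax equals the tax rate times the response of the corporate tax base.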
II. An Agency Model of Firm Behavior

In the remainder of the paper, we restrict attention to cash-rich firms, i.e., those with $(1 - t_c) f'(X) < r$. Firms with $(1 - t_c) f'(X) > r$ never pay dividends. Since


our goal is to construct a model consistent with recent evidence on dividend payout behavior, it is the behavior of cash-rich firms that is of greatest interest.10 The predictions of the agency model are summarized in the right half of Table 1.

A. Setup

The source of agency problems in corporations is a divergence between the objectives of managers and shareholders. We model the source of the divergence as a “pet project” that generates no profits for shareholders, but yields utility to the manager. In particular, the manager can now do three things with the firm’s cash $X$: pay out dividends $D$, invest $I$ in a “productive” project that yields net profits $f(I)$ for shareholders, or invest $J$ in a pet project that gives the manager private benefits of $g(J)$.11 Assume that both $f$ and $g$ are strictly concave. The function $g$ should be interpreted as a reduced-form means of capturing divergences between the managers’ and shareholders’ objectives. For example, the utility $g(J)$ may arise from allocation of funds to perks, tunneling, a taste for empire building, or a preference for projects that lead to a “quiet life.”12 While there is debate in corporate finance about which of these elements of $g(J)$ is most important, the underlying structure that determines $g(J)$ does not matter for our analysis.

Manager’s Objective. The agency problem arises because shareholders cannot observe real investment opportunities and have to let the manager choose $I$, $J$, and $D$. Shareholders push managers toward profit maximization through two channels: incentive pay and monitoring. Incentive pay is achieved through features of the manager’s compensation contract such as share grants and bonuses. We model such financial incentives by assuming that the shareholders compensate the manager with a fraction $\alpha$ of the shares of the company. Monitoring effectively reduces the manager’s utility from the pet project because it increases the probability that pet projects are detected and penalized. We model monitoring by assuming that $\gamma \ge 0$ units of monitoring reduces the utility the manager derives from the pet project from $g(J)$ to $g(J)/(1 + \gamma)$. Given the shareholders’ choice of $\alpha$ and $\gamma$, the manager chooses $I$ and $D$ to maximize

$$V^M = \alpha(1 - t_d)\left[ D + \frac{(1 - t_c) f(I) + X - D}{1 + r} \right] + \frac{1}{1 + \gamma}\,\frac{g(J)}{1 + r} \tag{9}$$

10 The working paper version (Chetty and Saez 2007) extends the efficiency analysis to the case with equity issues.
11 The manager returns the capital used for investment in the pet project ($J$) back to the shareholders in period 1.
12 There is a large literature in corporate finance providing evidence for agency models. Recent examples include Raghuram Rajan, Henri Servaes, and Luigi Zingales (2000), David S. Scharfstein and Jeremy C. Stein (2000), and Marianne Bertrand and Sendhil Mullainathan (2003). Empirical studies have also provided support for the agency theory as an explanation of why firms pay dividends (see e.g., Larry H. P. Lang and Robert H. Litzenberger 1989; William G. Christie and Vikram K. Nanda 1994; Rafael La Porta et al. 2000; George W. Fenn and Liang 2001; Mihir A. Desai, C. Fritz Foley, and James R. Hines, Jr. 2007).


subject to the constraint $I + J + D = X$. Monitoring increases the weight managers put on profits relative to the pet project by a factor $1 + \gamma$. Let $\omega = \alpha(1 - t_d)(1 + \gamma)$ denote the relative weight that managers place on profits. When $\omega$ is low, the manager has little stake in the profits of the firm and is therefore tempted to retain excess earnings and invest in the pet project.13

Shareholders’ Objectives. Next, we model how shareholders choose the level of monitoring ($\gamma$). Following Shleifer and Vishny (1986), each shareholder who chooses to monitor the firm incurs a cost of monitoring, whereas the benefits of better manager behavior accrue to all shareholders. There are $N$ shareholders, each of whom owns a fraction $\alpha_i$ of the shares (so that $\sum_{i=1}^{N} \alpha_i = 1 - \alpha$). Each shareholder chooses a level of monitoring $\gamma_i \ge 0$. The total monitoring level is $\gamma = \sum_{i=1}^{N} \gamma_i$. Shareholders incur a fixed cost $k$ if they monitor the firm, i.e., if they set $\gamma_i > 0$. In addition, they pay a convex and increasing variable cost $c(\gamma_i)$ to do $\gamma_i$ units of monitoring, where $c'(\gamma_i = 0) = 0$. Each shareholder chooses $\gamma_i$ to maximize his net profits

$$V_i = (1 - t_d)\,\alpha_i \left[ D + \frac{(1 - t_c) f(I) + X - D}{1 + r} \right] - k \cdot 1(\gamma_i > 0) - c(\gamma_i), \tag{10}$$

where $1(\gamma_i > 0)$ is an indicator function. In the Nash equilibrium, $\gamma$ is determined such that each shareholder’s choice of $\gamma_i$ is a best response to the others’ behavior. It is well-known from the public goods literature that monitoring will be below the social optimum (i.e., the level that would be chosen if one shareholder owned the entire firm) in equilibrium.14 There is a threshold level $\bar{\alpha}$ such that small shareholders with $\alpha_i < \bar{\alpha}$ will not monitor the firm, while large shareholders with $\alpha_i > \bar{\alpha}$ do monitor the firm. Since the number of large shareholders is typically small, it is natural to assume that these individuals cooperatively choose the level of monitoring $\gamma$ by forming a “board of directors” that is in charge of monitoring the manager. Let $\alpha_B$ denote the total fraction of shares held by the board of directors. The board chooses $\gamma$ to maximize its joint profits net of monitoring costs:

$$V^B = (1 - t_d)\,\alpha_B \left[ D(\gamma) + \frac{(1 - t_c) f(I(\gamma)) + X - D(\gamma)}{1 + r} \right] - c(\gamma). \tag{11}$$
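The board’s problem in (11) can be sketched numerically once functional forms are fixed. Everything specific below is an assumption made for illustration and is not from the paper: f(I) = 2*sqrt(I), g(J) = 2*phi*sqrt(J), a quadratic monitoring cost c(gamma) = cost * gamma^2 / 2 (convex, increasing, with c'(0) = 0), the parameter values, and the function names. The manager’s response D(gamma), I(gamma) uses the threshold rules that the paper states later as Lemma 1.

```python
import math


def manager_choice(omega, x, tc, r, phi):
    """Manager's (D, I) for weight omega, assuming f = 2*sqrt(I), g = 2*phi*sqrt(J)."""
    i_star = ((1 - tc) / r) ** 2                   # (1 - t_c) f'(I*) = r
    omega_bar = phi / (r * math.sqrt(x - i_star))  # threshold g'(X - I*) / r
    if omega <= omega_bar:                         # no dividends are paid
        kappa = (phi / (omega * (1 - tc))) ** 2    # equals (X - I) / I
        return 0.0, x / (1 + kappa)
    pet = (phi / (omega * r)) ** 2                 # omega * r = g'(J)
    return x - i_star - pet, i_star


def board_value(gamma, alpha, alpha_b, x, td, tc, r, phi, cost):
    """Equation (11) with the assumed cost c(gamma) = cost * gamma**2 / 2."""
    omega = alpha * (1 - td) * (1 + gamma)
    d, i = manager_choice(omega, x, tc, r, phi)
    payout = d + ((1 - tc) * 2.0 * math.sqrt(i) + x - d) / (1 + r)
    return (1 - td) * alpha_b * payout - 0.5 * cost * gamma ** 2


def best_monitoring(alpha, alpha_b, x, td, tc, r, phi, cost,
                    gamma_max=10.0, steps=1000):
    """Grid-search the board's preferred monitoring level on [0, gamma_max]."""
    grid = (gamma_max * n / steps for n in range(steps + 1))
    return max(grid, key=lambda g: board_value(g, alpha, alpha_b, x, td, tc,
                                               r, phi, cost))


# Hypothetical firm: manager owns 5 percent, the board owns 20 percent.
params = dict(alpha=0.05, alpha_b=0.2, x=100.0, tc=0.3, r=0.1, phi=0.2, cost=1.0)
print(best_monitoring(td=0.35, **params))
print(best_monitoring(td=0.15, **params))
```

In this particular parameterization the board monitors more at the lower dividend tax rate, which is the direction the introduction describes; that comparison is a property of the assumed numbers and functional forms, not a general proof.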

Ownership Structure. To close the model, we must specify how the firm’s ownership structure ($\alpha$ and $\alpha_B$) is determined. We draw a distinction between the short-run positive analysis and the long-run efficiency analysis in the specification of
13 The pet project $g(J)$ is presumably small relative to the firm’s productive project $f(I)$. However, $\omega$ is also likely to be small in large publicly traded corporations, where executives own a small fraction of total shares and diffuse share ownership can lead to a low level of monitoring. Combining a small pet project $g(J)$ with a small $\omega$ can make the manager deviate substantially from the shareholders’ optimal investment level.
14 The Coasian solution (Ronald H. Coase 1960) is unlikely to emerge in this setting because of transaction costs in coordinating many small shareholders.


the firm’s ownership structure. In the short run, ownership structures are relatively stable in practice.15 Since the evidence on dividend payout behavior we are attempting to explain concerns the effect of the 2003 dividend tax reform within a two year horizon, we take $\alpha$ and $\alpha_B$ as fixed in our positive analysis. In the longer run, and particularly when new firms are started, $\alpha$ and $\alpha_B$ are presumably endogenous to the tax regime. In the efficiency analysis in Section IV, we model how $\alpha$ and $\alpha_B$ are determined. Allowing for endogenous ownership structure is particularly important in the efficiency analysis because the deadweight cost of taxation depends critically on how $\alpha$ and $\alpha_B$ are determined.

B. Manager Behavior

We now characterize the manager’s behavior as a function of his weight on profits $\omega = \alpha(1 - t_d)(1 + \gamma)$. The manager chooses $I$ and $D$ to

$$\max_{I,\, D \geq 0} \; \omega \left[ D + \frac{(1 - t_c) f(I) + X - D}{1 + r} \right] + \frac{g(X - I - D)}{1 + r}.$$

Assume that $g'(0) > \omega f'(X)$, which guarantees an interior optimum in investment behavior. Then $I$ and $D$ are determined by the following first-order conditions:

$$\omega (1 - t_c) f'(I) = g'(X - I - D) \tag{12}$$

$$\omega r \leq g'(X - I - D) \quad \text{with strict equality if and only if } D > 0. \tag{13}$$

Let $D(\omega)$ and $I(\omega)$ denote the dividend and investment choices of the manager as a function of $\omega$. To characterize the properties of these functions, define the threshold

$$\bar{\omega} = \frac{g'(X - I^*)}{r} > 0,$$

where $I^*$ denotes the optimal investment level from the shareholders’ perspective: $(1 - t_c) f'(I^*) = r$. Note that $\bar{\omega}$ is a monotonic decreasing function of $X$. We therefore label firms with $\omega > \bar{\omega}$ as “very high-cash” firms, and those with $\omega < \bar{\omega}$, but $(1 - t_c) f'(X) > r$, as “high cash” in Table 1.

Lemma 1: $D(\omega)$ and $I(\omega)$ follow threshold rules:
(i) If $\omega \leq \bar{\omega}$, then $D(\omega) = 0$, and $I(\omega)$ is chosen such that $\omega(1 - t_c) f'(I) = g'(X - I)$.
(ii) If $\omega > \bar{\omega}$, then $I(\omega) = I^*$, and $D(\omega) > 0$ is chosen such that $\omega r = g'(X - I^* - D)$.
15 Chetty and Saez (2007) present evidence that managerial and board share ownership is much more stable than dividend payments in the three years after the 2003 dividend tax cut.

[Figure 1: period 0 dividends ($D$), profitable investment ($I$), and pet investment ($J$) on the vertical axis (0 to 2), plotted against $\omega$, the manager’s weight on profits, on the horizontal axis (0 to 0.2).]

Figure 1. Manager’s Decision Rules as a Function of Weight on Profits
Notes: This figure plots the manager’s optimal choice of dividends, profitable investment, and pet project investment as a function of his weight on profits, $\omega$. The simulation assumes a total cash holding of $X = 2$, profitable investment production function $f(I) = (1/10)(2I - I^2/2)$, pet production function $g(J) = (1/100)[2J - J^2/2]$, and interest rate $r$ = 10 percent.
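The threshold rules in Lemma 1 can be traced numerically under the parameterization stated in the Figure 1 notes. The following is a minimal sketch, not the authors' simulation code: it assumes $t_c = 0$, uses closed forms derived from the first-order conditions (12) and (13) for these quadratic functions, and clamps pet investment at zero for very large $\omega$ (an assumption that only matters outside the plotted range). All function names are illustrative.

```python
# Sketch of Lemma 1 under the Figure 1 parameterization (t_c = 0 assumed).
X = 2.0   # total cash holding
R = 0.10  # interest rate r

def f_prime(i):
    # f(I) = (1/10)(2I - I^2/2)
    return (2.0 - i) / 10.0

def g_prime(j):
    # g(J) = (1/100)(2J - J^2/2)
    return (2.0 - j) / 100.0

I_STAR = 1.0                         # solves f'(I*) = r
OMEGA_BAR = g_prime(X - I_STAR) / R  # threshold weight on profits, about 0.1

def decision(omega):
    """Return (D, I, J) for a manager with weight omega on profits."""
    if omega <= OMEGA_BAR:
        # No dividends: omega * f'(I) = g'(X - I) gives I = 20w / (1 + 10w).
        i = 20.0 * omega / (1.0 + 10.0 * omega)
        return 0.0, i, X - i
    # Dividend payer: I = I*, and omega * r = g'(J) pins down J.
    j = max(0.0, 2.0 - 100.0 * R * omega)  # clamp at zero is an assumption
    return X - I_STAR - j, I_STAR, j

if __name__ == "__main__":
    for omega in (0.02, 0.05, 0.08, 0.12, 0.15, 0.18):
        d, i, j = decision(omega)
        print(f"omega={omega:.2f}  D={d:.3f}  I={i:.3f}  J={j:.3f}")
```

Below the threshold the sketch returns zero dividends and splits cash between $I$ and $J$; above it, $I$ stays at $I^*$ and dividends rise one for one with the fall in $J$.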

Proof:
Consider $\omega \leq \bar{\omega}$. Suppose the firm sets $D > 0$. Then the first order conditions (13) and (12) imply that $(1 - t_c) f'(I) = r$, and hence $I = I^*$. This implies $\omega r = g'(X - I^* - D) > g'(X - I^*) = \bar{\omega} r$, contradicting the supposition. Hence, $\omega \leq \bar{\omega} \Rightarrow D(\omega) = 0$.

Now consider $\omega > \bar{\omega}$. Suppose the firm sets $D = 0$. Then the first order conditions (12) and (13) imply that $(1 - t_c) f'(I) \geq r$, and hence $I \leq I^*$. This implies $\omega r \leq g'(X - I) \leq g'(X - I^*) = \bar{\omega} r$, contradicting the supposition. Hence, $\omega > \bar{\omega} \Rightarrow D(\omega) > 0$, and (13) yields the desired expression for $D(\omega)$.

Figure 1 illustrates the threshold rules that the manager follows by plotting $D(\omega)$, $I(\omega)$, and $J(\omega)$ with quadratic production functions when $t_c = 0$. When $\omega$ is below the threshold value $\bar{\omega}$, the marginal value of the first dollar of dividends is negative in the manager’s objective function. The optimal level of dividends is therefore zero, the corner solution. Intuitively, if managers have a sufficiently weak interest in profit maximization, they retain as much money as possible for pet projects and do


not pay dividends. For $\omega$ above $\bar{\omega}$, further increases in the weight on profits $\omega$ lead to increases in dividends and reductions in pet investment on the intensive margin:

$$D'(\omega) = -\frac{r}{g''(J(\omega))} > 0 \quad \text{for } \omega > \bar{\omega}. \tag{14}$$

Now consider the manager’s investment choice. When $\omega \leq \bar{\omega}$, the manager pays no dividends, and splits retained earnings between investment in the profit-generating project and the pet project. He chooses $I$ to equate his private marginal returns of investing in the two projects, as in equation (12). An increase in $\omega$ increases productive investment $I$ and reduces pet investment $J$:

$$I'(\omega) = -\frac{(1 - t_c) f'(I(\omega))}{\omega (1 - t_c) f''(I(\omega)) + g''(X - I(\omega))} > 0 \quad \text{for } \omega < \bar{\omega}. \tag{15}$$

Once $\omega > \bar{\omega}$, the manager has enough cash to pay a dividend to shareholders. He sets the investment level such that $(1 - t_c) f'(I) = r$, implying that $I$ is fixed at $I^*$ for $\omega > \bar{\omega}$. Intuitively, the manager would only pay a dividend if his private return to further investment in the profitable project was below the interest rate. Since the tradeoff between dividends and profitable investment is the same for managers and shareholders, the manager only begins to pay a dividend once he has reached the optimal level of investment from the shareholder’s perspective, $I^*$.

C. Board Behavior

In the short run, the board’s only decision is to choose the level of monitoring. The board takes $\alpha_B$ as fixed and chooses $\gamma$ to maximize

$$V^B = (1 - t_d)\,\alpha_B P_d(\omega) - c(\gamma), \tag{16}$$

where $P_d(\omega) = D(\omega) + [(1 - t_c) f(I(\omega)) + X - D(\omega)]/(1 + r)$ denotes the firm’s total payout as a function of $\omega$. Because both $D$ and $I$ are (weakly) increasing in $\omega$, $P_d(\omega)$ is also (weakly) increasing in $\omega$. We have $d\omega/d\gamma = \alpha(1 - t_d)$. Hence, the first order condition with respect to $\gamma$ is

$$c'(\gamma) = (1 - t_d)\,\alpha_B P_d'(\omega)\,\alpha(1 - t_d). \tag{17}$$

Intuitively, the board chooses $\gamma$ such that the marginal increase in the board’s share of profits by raising $\gamma$ is offset by the marginal cost of monitoring. The second-order condition for an interior maximum is

$$(1 - t_d)\,\alpha_B P_d''(\omega)[\alpha(1 - t_d)]^2 - c''(\gamma) < 0. \tag{18}$$


Since $c'(\gamma = 0) = 0$, by assumption, the optimal $\gamma$ is always in the interior, and hence (18) must be satisfied at the optimal level of monitoring $\gamma(t_d)$.16 This second-order condition turns out to be useful for the comparative statics analysis below.

III. Positive Analysis: Effects of Dividend Taxation

We now characterize the effect of dividend tax changes on firm behavior to show that the agency model explains the four empirical findings discussed in the introduction as well as other evidence. For any variable $x \in \{D, I, J\}$,

$$\frac{dx}{dt_d} = \frac{dx}{d\omega} \cdot \frac{d\omega}{dt_d}$$

because $t_d$ affects the manager’s objective only through his weight on profits $\omega$. We have characterized $dx/d\omega$ in the previous section. As $\omega = (1 + \gamma)\alpha(1 - t_d)$,

$$\frac{d\omega}{dt_d} = -\alpha(1 + \gamma) + \alpha(1 - t_d)\frac{d\gamma}{dt_d}. \tag{19}$$

To calculate $d\gamma/dt_d$, implicitly differentiate the board’s first-order condition for $\gamma$ in (17) to obtain

$$\frac{d\gamma}{dt_d} = -\frac{\alpha_B\left[2\alpha(1 - t_d)P_d'(\omega) + [\alpha(1 - t_d)]^2(1 + \gamma)P_d''(\omega)\right]}{c'' - P_d''\alpha_B(1 - t_d)[\alpha(1 - t_d)]^2}. \tag{20}$$

Combining (19) and (20) leads to

$$\frac{d\omega}{dt_d} = -\frac{2\alpha_B\alpha^2(1 - t_d)^2 P_d'(\omega) + \alpha(1 + \gamma)c''}{c'' - P_d''\alpha_B(1 - t_d)[\alpha(1 - t_d)]^2} < 0. \tag{21}$$

The board’s second-order condition for $\gamma$ in (18) implies that the denominator of this expression is positive. The numerator is positive because $P_d$ is increasing in $\omega$ and $c$ is convex. Equation (21) therefore shows that a reduction in the dividend tax rate leads to an increase in the weight $\omega$ that managers put on profits through two channels. First, a decrease in $t_d$ mechanically increases the net stake $\alpha(1 - t_d)$ that the manager has in the firm, effectively by reducing the government’s stake ($t_d$) in the firm’s profits. Second, a decrease in $t_d$ generally increases the level of monitoring $\gamma$ by the board.17 Intuitively, monitoring rises when $t_d$ is cut because the return to monitoring rises: the external shareholders’ net stake $(1 - t_d)\alpha_B$ increases, while the cost of monitoring is unchanged.
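The algebra linking the board’s first-order condition (17) to equations (20) and (21) is compressed in the text. One way to fill in the intermediate steps is sketched below. The shorthand $\Lambda$ for the common denominator is introduced here purely for convenience and is not the authors’ notation; arguments of $P_d'$, $P_d''$, and $c''$ are suppressed.

```latex
% Board FOC (17), with \omega = \alpha (1 - t_d)(1 + \gamma):
c'(\gamma) = \alpha_B \, \alpha \, (1 - t_d)^2 \, P_d'(\omega)

% Differentiate both sides with respect to t_d:
c'' \frac{d\gamma}{dt_d}
  = -2 \alpha_B \alpha (1 - t_d) P_d'
    + \alpha_B \alpha (1 - t_d)^2 P_d'' \frac{d\omega}{dt_d}

% Substitute (19) for d\omega/dt_d and collect the d\gamma/dt_d terms:
\Lambda \frac{d\gamma}{dt_d}
  = -\alpha_B \left[ 2 \alpha (1 - t_d) P_d'
      + [\alpha (1 - t_d)]^2 (1 + \gamma) P_d'' \right],
\qquad
\Lambda \equiv c'' - P_d'' \alpha_B (1 - t_d) [\alpha (1 - t_d)]^2

% Insert this into (19); the P_d'' terms cancel, leaving (21):
\frac{d\omega}{dt_d}
  = -\alpha (1 + \gamma) + \alpha (1 - t_d) \frac{d\gamma}{dt_d}
  = -\frac{2 \alpha_B \alpha^2 (1 - t_d)^2 P_d' + \alpha (1 + \gamma) c''}{\Lambda}
```

Since the second-order condition (18) is exactly the statement $\Lambda > 0$, the sign of $d\omega/dt_d$ follows from the sign of the numerator alone.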

16 The second-order condition could hold with equality, a knife-edge case that we rule out by assumption.
17 It is possible that $d\gamma/dt_d > 0$ if the third derivatives $g'''(J)$, $f'''(I)$, $c'''(\gamma)$ are sufficiently large in magnitude. When $f$, $g$, and $c$ are quadratic, $d\gamma/dt_d$ is unambiguously negative. Hence, barring sharp changes in the local curvature of the production functions, monitoring falls with the dividend tax rate.


Given that $d\omega/dt_d < 0$, it is straightforward to characterize the short-run effect of dividend taxation on firm behavior. Because the manager follows a threshold rule in $\omega$, changes in $t_d$ lead to both intensive and extensive margin responses. We therefore analyze the effects of a discrete dividend tax cut from $t_d = t_1$ to $t_d = t_2 < t_1$ on a firm’s behavior. Let $\Delta x = x(t_2) - x(t_1)$ denote the change in a variable $x$ caused by the tax cut, and note that $\Delta\omega = \omega(t_2) - \omega(t_1) > 0$ from (21).

Proposition 2: A dividend tax cut ($t_2 < t_1$) has the following effects on behavior for a cash-rich firm:
(i) If $\omega(t_2) \leq \bar{\omega}$: $\Delta D = 0$, $\Delta I > 0$, $\Delta J < 0$, and $\Delta I + \Delta J = 0$.
(ii) If $\omega(t_1) < \bar{\omega} < \omega(t_2)$: $\Delta D > 0$, $\Delta I > 0$, $\Delta J < 0$, and $\Delta I + \Delta J < 0$.
(iii) If $\bar{\omega} \leq \omega(t_1)$: $\Delta D > 0$, $\Delta I = 0$, and $\Delta J < 0$.

Proof:
(i) When $\omega(t_2) \leq \bar{\omega}$, $D(t_2) = 0$ by Lemma 1. Since $\omega(t_2) > \omega(t_1)$, $D(t_1) = 0$ also. Therefore $\Delta D = 0$. Since $I + J + D = X$, and $X$ is fixed, it follows that $\Delta I + \Delta J = 0$. Finally, (15) implies that $dI/dt_d = (dI/d\omega)(d\omega/dt_d) < 0$ when $\omega \leq \bar{\omega}$. Hence, $\Delta I > 0$ and $\Delta J = -\Delta I < 0$.
(ii) When $\omega(t_1) < \bar{\omega} < \omega(t_2)$, Lemma 1 implies $D(t_1) = 0$, while $D(t_2) > 0$. Hence, $\Delta D > 0$. Since $\Delta D > 0$, $\Delta I + \Delta J = -\Delta D < 0$. By Lemma 1, $I(t_2) = I^*$, while $I(t_1)$ satisfies $(1 - t_c)\,\omega(t_1) f'(I(t_1)) = g'(X - I(t_1))$. Since $\omega(t_1) r < g'(X - I(t_1))$ by (13), it follows that $(1 - t_c) f'(I(t_1)) > r = (1 - t_c) f'(I^*)$, which implies $I(t_1) < I(t_2)$. Hence, $\Delta I > 0$ and $\Delta J = -\Delta D - \Delta I < 0$.
(iii) When $\bar{\omega} \leq \omega(t_1)$, $I(t_1) = I(t_2) = I^*$ because $\omega(t_2) > \omega(t_1)$. Equation (14) implies that $dD/dt_d = (dD/d\omega)(d\omega/dt_d) < 0$ when $\omega > \bar{\omega}$. Hence, $t_2 < t_1 \Rightarrow \Delta D > 0$. Finally, $\Delta J = -\Delta D < 0$.

Proposition 2 shows that the dividend tax cut (weakly) increases dividend payments for all cash-rich firms because it raises the weight $\omega(t_d)$ that managers place on profits. The effect differs across three regions of $\omega$. For managers who place a very low weight on profits ($\omega(t_2) < \bar{\omega}$), dividend payments remain undesirable after the tax cut and $\Delta D = 0$. The second region consists of firms who were nonpayers prior to the tax cut ($\omega(t_1) < \bar{\omega}$), but cross the threshold for paying when the tax rate is lowered to $t_2$. These firms initiate dividend payments after the tax cut. The third region consists of firms who had $\omega$ high enough that they were already paying dividends prior to the tax cut. The tax cut leads these firms to place greater weight on net-of-tax profits relative to the pet project, and therefore leads to increases in the level of dividends. Note that these changes in dividend payout policies occur in


period 0 itself. This is consistent with the evidence that many firms announced dividend increases in the weeks after the 2003 tax reform was enacted.

Now consider the effect of the dividend tax cut on investment behavior. The tax cut increases the net-of-tax return to the profit-generating project while leaving the return to pet investment unaffected. As a result, the manager substitutes from investing in perks to the profit-generating project, and $I$ (weakly) increases while $J$ falls. In the first region, where $\omega(t_2) < \bar{\omega}$, the manager shifts toward $I$ from $J$, but total investment ($I + J$) is unchanged. In the second region, where the firm initiates a dividend payment, investment in $I$ rises to the shareholders’ optimum $I^*$, while investment in $J$ is reduced to finance the dividend payment and the increase in $I$. In this region, total investment falls when the tax rate is cut. Finally, when $\omega(t_1) > \bar{\omega}$, the manager maintains $I$ at $I^*$ and reduces investment in $J$ to increase the dividend payment.

An interesting implication of these results is that a dividend tax cut weakly lowers total investment $I + J$ for cash-rich firms with an agency problem. Total investment, $I + J$, is the measure that is typically observed empirically since it is difficult to distinguish the components of investment in existing datasets. This prediction contrasts with the old view model, where a tax cut raises investment, and with the new view model, where a tax cut has no effect on investment. Intuitively, a tax cut reduces the incentive for cash-rich firms to (inefficiently) over-invest in the pet project. It is important to note that the same result does not apply to cash-constrained firms in the agency model. A tax cut raises equity issues and productive (as well as unproductive) investment by such firms. Hence, a dividend tax cut leads to an (efficiency increasing) reallocation of capital and investment across firms, but its effect on aggregate investment is ambiguous.
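The three regions of Proposition 2 can be illustrated with a small numerical sketch. It assumes the quadratic parameterization from the Figure 1 notes with $t_c = 0$, holds monitoring fixed at $\gamma = 0$ so that $\omega = \alpha(1 - t_d)$ (this shuts down the monitoring channel and is a simplification, not the paper's full model), and uses a tax cut from 35 to 20 percent. The ownership shares passed in are hypothetical values chosen only so that one firm lands in each region.

```python
# Sketch of Proposition 2 with gamma fixed at 0 (assumption), t_c = 0.
X, R = 2.0, 0.10
I_STAR = 1.0      # solves f'(I*) = r for f(I) = (1/10)(2I - I^2/2)
OMEGA_BAR = 0.10  # g'(X - I*) / r for g(J) = (1/100)(2J - J^2/2)

def decision(omega):
    """Return (D, I, J) from the threshold rules in Lemma 1."""
    if omega <= OMEGA_BAR:
        i = 20.0 * omega / (1.0 + 10.0 * omega)
        return 0.0, i, X - i
    j = max(0.0, 2.0 - 10.0 * omega)
    return X - I_STAR - j, I_STAR, j

def tax_cut_effect(alpha, t1=0.35, t2=0.20):
    """Return (dD, dI, dJ) for a cut in the dividend tax from t1 to t2."""
    d1, i1, j1 = decision(alpha * (1.0 - t1))
    d2, i2, j2 = decision(alpha * (1.0 - t2))
    return d2 - d1, i2 - i1, j2 - j1

if __name__ == "__main__":
    # Hypothetical ownership shares, one per region of Proposition 2.
    for alpha in (0.10, 0.14, 0.20):
        d_d, d_i, d_j = tax_cut_effect(alpha)
        print(f"alpha={alpha:.2f}  dD={d_d:+.3f}  dI={d_i:+.3f}  dJ={d_j:+.3f}")
```

In the middle case the fall in $J$ exceeds the rise in $I$ by exactly the initiated dividend, which is the sense in which total investment $I + J$ falls after the tax cut.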
This result is potentially consistent with the large empirical literature on investment and the user cost of capital, which has failed to identify a robust relationship between tax rates and aggregate investment (see e.g., Robert S. Chirinko 1993; Desai and Austan D. Goolsbee 2004).

Next, we examine how the effect of the tax cut on dividend payments varies across firms with different ownership structures. We, again, distinguish between extensive and intensive margin responses.

Proposition 3: Heterogeneity of dividend response to tax cut ($t_2 < t_1$) by ownership structure:
(i) Extensive margin: likelihood of initiation. If $\omega(t_1) < \bar{\omega}$, then initiation likelihood increases with $\alpha$ and $\alpha_B$:
• If $\Delta D > 0$ for $\alpha$ then $\Delta D > 0$ for $\alpha' > \alpha$.
• If $\Delta D > 0$ for $\alpha_B$ then $\Delta D > 0$ for $\alpha_B' > \alpha_B$.
(ii) Extensive margin: size of initiation. If $\omega(t_1) < \bar{\omega} < \omega(t_2)$: $\partial \Delta D/\partial \alpha > 0$, $\partial \Delta D/\partial \alpha_B > 0$.
(iii) Intensive margin. If $\bar{\omega} \leq \omega(t_1)$ and $g$ and $c$ are quadratic: $\partial \Delta D/\partial \alpha > 0$, $\partial \Delta D/\partial \alpha_B > 0$.


Proof:

(i) The result follows directly from the effect of $\alpha$ and $\alpha_B$ on $\omega$. Observe that

$$\frac{\partial \omega}{\partial \alpha} = (1 - t_d)(1 + \gamma) + \alpha(1 - t_d)\frac{\partial \gamma}{\partial \alpha} = \frac{(1 + \gamma)(1 - t_d)c'' + P_d'\,\alpha\,\alpha_B(1 - t_d)^3}{c'' - P_d''\alpha_B(1 - t_d)[\alpha(1 - t_d)]^2} > 0$$

using the second-order condition for $\gamma$ in (18). Similarly,

$$\frac{\partial \omega}{\partial \alpha_B} = \alpha(1 - t_d)\frac{\partial \gamma}{\partial \alpha_B} = \frac{\alpha^2(1 - t_d)^3 P_d'(\omega)}{c'' - P_d''\alpha_B(1 - t_d)[\alpha(1 - t_d)]^2} > 0.$$

Note that $\Delta D > 0$ at a given $\alpha \Leftrightarrow D(\omega(t_2, \alpha)) > 0$. Since $\partial\omega/\partial\alpha > 0$, we know that $\omega(t_2, \alpha') > \omega(t_2, \alpha)$. From (14), we have $\partial D/\partial\omega > 0$, which in turn implies $D(\omega(t_2, \alpha')) > D(\omega(t_2, \alpha)) > 0 \Rightarrow \Delta D > 0$ for $\alpha'$. Exploiting the result that $\partial\omega/\partial\alpha_B > 0$ yields the analogous result for $\alpha_B$.

(ii) When $\omega(t_1) < \bar{\omega} < \omega(t_2)$, $D(t_1) = 0$ and hence $\Delta D = D(t_2)$. It follows that $\partial \Delta D/\partial x = \partial D(t_2)/\partial x = (\partial D/\partial\omega)(\partial\omega/\partial x)$ for $x \in \{\alpha, \alpha_B\}$. We know that $\partial D/\partial\omega > 0$ from (14). Since $\partial\omega/\partial\alpha > 0$ and $\partial\omega/\partial\alpha_B > 0$ from (i), it follows that $\partial D(t_2)/\partial\alpha > 0$ and $\partial D(t_2)/\partial\alpha_B > 0$, which proves the claim.

(iii) When $\bar{\omega} < \omega(t_1)$, the dividend level is positive both at the initial and new tax rate, and hence there is an intensive-margin response. Using equation (21), we have

$$\frac{dD}{dt_d} = \frac{dD}{d\omega}\frac{d\omega}{dt_d} = \frac{r}{g''(J(\omega))} \cdot \frac{2\alpha_B\alpha^2(1 - t_d)^2 P_d'(\omega) + \alpha(1 + \gamma)c''}{c'' - P_d''\alpha_B(1 - t_d)[\alpha(1 - t_d)]^2}. \tag{22}$$

When $\bar{\omega} < \omega$, $P_d(\omega) = D(\omega) + ((1 - t_c) f(I^*) + X - D)/(1 + r)$. Since $g''(J(\omega))$ is constant when $g$ is quadratic, we have $D'(\omega) = -r/g''$ constant, and hence $D''(\omega) = 0$, $P_d'(\omega)$ is also constant, and $P_d''(\omega) = 0$. Equation (22) therefore simplifies to

$$\frac{dD}{dt_d} = \frac{r}{g''}\left\{ \alpha(1 + \gamma) - 2\alpha_B\alpha^2(1 - t_d)^2 \frac{r}{g''c''} \right\}.$$

Recognizing that $c'' > 0$ and $g'' < 0$ are constant, we have

$$\Delta D = \frac{r}{g''}\left\{ \int_{t_1}^{t_2} \alpha(1 + \gamma(t_d))\,dt_d + \frac{2}{3}\alpha_B\alpha^2\left[(1 - t_2)^3 - (1 - t_1)^3\right]\frac{r}{g''c''} \right\}.$$

Because $t_1 > t_2$, the first term inside the curly brackets is negative. The second term inside the curly brackets is also negative because $t_1 > t_2$ and $g'' < 0$. Because the

[Figure 2A: period 0 dividends ($D$) on the vertical axis, plotted against $\alpha$, the fraction of shares owned by the manager, on the horizontal axis (0.03 to 0.09), for $t_d$ = 35% and $t_d$ = 20%. Region 1: no change. Region 2: extensive margin. Region 3: intensive margin.]

Figure 2A. Effect of Tax Cut on Dividends by Managerial Shareownership

multiplicative factor $r/g''$ outside the curly brackets is negative, we have $\partial \Delta D/\partial\alpha > 0$ and $\partial \Delta D/\partial\alpha_B > 0$.

Figure 2A plots $D$ against $\alpha$ in two dividend tax regimes with $t_1$ = 35 percent and $t_2$ = 20 percent, and the corporate tax $t_c = 0$. The figure illustrates the three results in Proposition 3. First, among the set of firms who were nonpayers prior to the tax cut, those with large executive shareholding (high $\alpha$) are more likely to initiate dividend payments after the tax cut. This is because managers with higher $\alpha$ are closer to the threshold ($\bar{\omega}$) of paying dividends to begin with, and are therefore more likely to cross that threshold. Second, conditional on initiating, firms with higher $\alpha$ initiate larger dividends. Because $D(t_2)$, the optimal dividend conditional on paying, is rising in $\alpha$, the size of the dividend increase, $\Delta D = D(t_2)$, is larger for firms with higher values of $\alpha$ in this region. Third, among the firms who were already paying dividends prior to the tax cut, the intensive-margin increase in the level of dividends is generally larger for firms with higher $\alpha$.18 Intuitively, the manager’s incentives are more sensitive to the tax rate when he owns a larger fraction of the firm. These three results apply analogously to the board’s shareholding ($\alpha_B$), as shown in Figure 2B.

18 This result holds as long as there are no sharp changes in the local curvature of the production functions. If $g'''(J)$ and $c'''(\gamma)$ are sufficiently large in magnitude, it is possible to have $\partial^2 D/\partial t_d \partial\alpha_B > 0$.


[Figure 2B: period 0 dividends ($D$) on the vertical axis, plotted against $\alpha_B$, the fraction of shares owned by the board, on the horizontal axis (0 to 0.1), for $t_d$ = 35% and $t_d$ = 20%. Region 1: no change. Region 2: extensive margin. Region 3: intensive margin.]

Figure 2B. Effect of Tax Cut on Dividends by Board Shareownership
Notes: These figures show how the effect of a dividend tax cut on dividends varies across firms with different ownership structures. In Figure 2A, the lower curve plots dividends versus the fraction of shares owned by the manager ($\alpha$) when the tax rate is 35 percent. The upper curve plots the same when the tax rate is 20 percent. Figure 2B plots dividends versus the fraction of shares owned by the board of directors in the two tax regimes. Simulations use the same parametric assumptions as in Figure 1 along with $c(\gamma) = (1/1000)\gamma^2$.
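The kind of simulation behind Figures 2A and 2B can be sketched as follows. This is an illustrative reconstruction under stated assumptions, not the authors' code: it uses the functional forms from the figure notes with $t_c = 0$, solves the board's problem (16) by a simple grid search over $\gamma$ (chosen because $P_d$ has a kink at $\bar{\omega}$, so the objective need not be concave), and takes the board share `alpha_b = 0.10` and the grid bounds as hypothetical values, since the figure notes do not report them.

```python
# Sketch: board chooses monitoring gamma, manager then sets dividends.
X, R = 2.0, 0.10
I_STAR, OMEGA_BAR = 1.0, 0.10  # implied by the Figure 1 functional forms

def f(i):
    # Profitable project: f(I) = (1/10)(2I - I^2/2)
    return (2.0 * i - i * i / 2.0) / 10.0

def decision(omega):
    """Manager's (D, I, J) from the threshold rules in Lemma 1."""
    if omega <= OMEGA_BAR:
        i = 20.0 * omega / (1.0 + 10.0 * omega)
        return 0.0, i, X - i
    j = max(0.0, 2.0 - 10.0 * omega)
    return X - I_STAR - j, I_STAR, j

def payout(omega):
    """Total payout P_d(omega) with t_c = 0."""
    d, i, _ = decision(omega)
    return d + (f(i) + X - d) / (1.0 + R)

def board_gamma(alpha, alpha_b, t_d, gamma_max=5.0, n_steps=5000):
    """Grid search for the gamma maximizing (1 - t_d) * alpha_b * P_d - c(gamma)."""
    best_gamma, best_value = 0.0, float("-inf")
    for k in range(n_steps + 1):
        gamma = gamma_max * k / n_steps
        omega = alpha * (1.0 - t_d) * (1.0 + gamma)
        value = (1.0 - t_d) * alpha_b * payout(omega) - gamma ** 2 / 1000.0
        if value > best_value:
            best_gamma, best_value = gamma, value
    return best_gamma

def dividends(alpha, alpha_b, t_d):
    gamma = board_gamma(alpha, alpha_b, t_d)
    return decision(alpha * (1.0 - t_d) * (1.0 + gamma))[0]

if __name__ == "__main__":
    alpha_b = 0.10  # hypothetical board share
    for alpha in (0.03, 0.05, 0.07, 0.09):
        d_hi = dividends(alpha, alpha_b, 0.35)
        d_lo = dividends(alpha, alpha_b, 0.20)
        print(f"alpha={alpha:.2f}  D(35%)={d_hi:.3f}  D(20%)={d_lo:.3f}")
```

Sweeping `alpha` with `alpha_b` held fixed mimics the construction of Figure 2A, and sweeping `alpha_b` with `alpha` held fixed mimics Figure 2B; the specific levels depend on the hypothetical values chosen here.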

A change in $t_d$ has a greater effect on $\omega$ when $\alpha_B$ is large, leading to a larger dividend response.

Auxiliary Predictions. The agency model predicts that firms with more assets and cash holdings (higher $X$) are more likely to initiate dividend payments following a tax cut.19 In contrast, neoclassical models that nest the old and new views (Sinn 1991) predict that firms with higher assets will respond less to a tax cut. Chetty and Saez (2005) document that firms with higher assets or cash holdings were more likely to initiate dividends after the 2003 tax reform, consistent with the agency model. The importance of the interests of “key players” (executives and large external shareholders) is underscored by Chetty and Saez’s (2005) finding that firms with large nontaxable shareholders (such as pension funds) were much less likely to

19 Firms with higher $X$ are closer to the threshold of paying dividends because $\bar{\omega}$ is falling in $X$ and $\omega$ is rising in $X$. A tax cut is therefore more likely to make firms with higher $X$ cross the threshold and initiate dividend payments.


change dividend payout behavior in response to the 2003 tax reform. Although we have not allowed for heterogeneity in tax rates across shareholders in our stylized model, it is straightforward to show that the introduction of nontaxable shareholders would generate this prediction. If the board includes nontaxable large shareholders, a change in $t_d$ has a smaller impact on the board’s incentive to increase monitoring. Hence, a tax cut causes a smaller increase in $\gamma$ and generates smaller $\Delta D$.

IV. Efficiency Costs of Dividend and Corporate Taxation

We divide our analysis of the efficiency costs of dividend and corporate taxes into two parts. We first build intuition using a special case where ownership structure ($\alpha$ and $\alpha_B$) is fixed and monitoring ($\gamma$) is fixed at 0. We then relax these assumptions and characterize efficiency costs when the manager’s contract is endogenously determined. The lessons obtained from the special case carry over to the general model with some qualifications.

A. Fixed Contracts

When $\gamma$ is fixed at 0, total surplus in the economy ($W$) is simply the sum of the shareholders’ payoff, the manager’s payoff, and government revenue from the dividend and corporate taxes:

$$W = V^M + V^S + t_d P_d(\omega) + t_c P_c(\omega)$$

$$= \alpha(1 - t_d)\left[ D + \frac{(1 - t_c) f(I) + X - D}{1 + r} \right] + \frac{g(J)}{1 + r} + (1 - \alpha)(1 - t_d)P_d(\omega) + t_d P_d(\omega) + t_c P_c(\omega),$$

where $P_d = D + ((1 - t_c) f(I) + X - D)/(1 + r)$ and $P_c = f(I)/(1 + r)$ denote the dividend and corporate tax bases as above. Recognizing that $D$ and $I$ are chosen by the manager to maximize his own surplus, we exploit envelope conditions and obtain the following expressions for the marginal excess burden of raising the two tax rates:

$$\frac{dW}{dt_d} = t_c\frac{dP_c}{dt_d} + t_d\frac{dP_d}{dt_d} + (1 - t_d)(1 - \alpha)\frac{dP_d}{dt_d} \tag{23}$$

$$\frac{dW}{dt_c} = t_c\frac{dP_c}{dt_c} + t_d\left( \frac{dP_d}{dt_c} - \frac{\partial P_d}{\partial t_c} \right) + (1 - t_d)(1 - \alpha)\left( \frac{dP_d}{dt_c} - \frac{\partial P_d}{\partial t_c} \right), \tag{24}$$

where $\partial P_d/\partial t_c = -P_c$ denotes the mechanical effect of increasing $t_c$ on the firm’s payout as above. The first two terms in each of these formulas correspond exactly to those in the equations for deadweight loss in the neoclassical model in (4) and (5). These terms reflect the traditional Harberger-type distortions created by taxes because the firm underinvests relative to the social optimum. Although these terms


are identical to those in the neoclassical models, the elasticities themselves may differ: even cash-rich firms have $dP_d/dt_d < 0$, in contrast with the new view model. The third term in the two formulas arises from the agency problem ($\alpha < 1$). This term reflects the externality that the manager imposes on other shareholders by under-providing dividends and investing in the pet project. An increase in tax rates exacerbates this preexisting distortion. Note that unlike the Harberger terms, which are second-order (proportional to $t_d$ and $t_c$), the agency term is first-order.

This first-order term disappears if $t_d$ is set at $t_d^*$ such that $\alpha(1 - t_d^*) = 1$, as $t_d^* + (1 - t_d^*)(1 - \alpha) = 0$. The dividend subsidy $t_d^* < 0$ exactly corrects the externality due to the misalignment between managers’ and shareholders’ objectives. Absent revenue requirements, setting $t_d = t_d^* < 0$ and $t_c = 0$ maximizes social welfare. Rather than taxing dividends, it would be desirable to implement a Pigouvian dividend subsidy to correct the externality that arises from the misalignment between managers’ and shareholders’ objectives. In contrast, there is no such rationale for subsidizing corporate profits in the agency model.

As in the neoclassical model, it is helpful to distinguish firms that pay dividends in period 0 from those that do not to gain more insight into the excess burden formulas. First consider “very high-cash” firms that have $X$ large enough so that $\omega > \bar{\omega}$. By Lemma 1, these firms pay dividends in period 0, set $I = I^*(t_c)$, and set $J$ such that $\alpha(1 - t_d) r = g'(J)$. For such firms, profitable investment is unaffected by the dividend tax ($\partial I/\partial t_d = 0$), implying $dP_d/dt_d = (r/(1 + r))(dD/dt_d)$. Conversely, the corporate tax does not affect pet project investment ($\partial J/\partial t_c = 0$) because $t_c$ does not affect the tradeoff between $D$ and $J$. Because the manager sets $I$ to maximize $P_d$, the only effect of a change in the corporate tax on total dividend payouts is the mechanical effect:

$$\frac{dP_d}{dt_c} = -P_c + \left\{ \frac{(1 - t_c) f'(I)}{1 + r}\frac{\partial I}{\partial t_c} + \frac{r}{1 + r}\frac{dD}{dt_c} \right\} = -P_c - \frac{r}{1 + r}\frac{\partial J}{\partial t_c} = -P_c = \frac{\partial P_d}{\partial t_c}.$$

Combining these results, we obtain the following expressions for marginal excess burden for dividend-paying (very high cash) firms:

$$\frac{dW}{dt_d} = [t_d + (1 - t_d)(1 - \alpha)]\frac{dP_d}{dt_d} \tag{25}$$

$$\frac{dW}{dt_c} = t_c\frac{dP_c}{dt_c}. \tag{26}$$
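The step from the general formulas (23) and (24) to the special cases (25) and (26) can be spelled out as below. This is a sketch that relies only on the two facts stated in the text for very high-cash firms: $I = I^*(t_c)$ does not depend on $t_d$, and the total effect of $t_c$ on $P_d$ equals the mechanical effect.

```latex
% Very high-cash firms: I = I^*(t_c) is independent of t_d, and P_c = f(I)/(1+r), so
\frac{dP_c}{dt_d} = 0 .

% Substituting into (23) removes the corporate-base term and yields (25):
\frac{dW}{dt_d}
  = t_d \frac{dP_d}{dt_d} + (1 - t_d)(1 - \alpha) \frac{dP_d}{dt_d}
  = \left[ t_d + (1 - t_d)(1 - \alpha) \right] \frac{dP_d}{dt_d} .

% For the corporate tax, the total and mechanical effects on P_d coincide:
\frac{dP_d}{dt_c} - \frac{\partial P_d}{\partial t_c} = 0 ,

% so the second and third terms of (24) vanish and only the Harberger term remains, which is (26):
\frac{dW}{dt_c} = t_c \frac{dP_c}{dt_c} .
```

The agency term $(1 - t_d)(1 - \alpha)$ therefore survives only in the dividend tax formula, which is the source of the first-order versus second-order contrast.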

The dividend tax has a first-order deadweight cost whereas the corporate tax has a second-order deadweight burden that coincides with that in the neoclassical new view model. Intuitively, for firms that have sufficient cash holdings to pay dividends, investment is set at the optimal level from the shareholders’ perspective. The agency problem only distorts the tradeoff between period 0 dividends and pet project investment. Dividend taxes encourage managers to increase pet project investment,


exacerbating this preexisting agency problem. In contrast, corporate taxes do not affect the tradeoff between pet investment and period 0 dividends.

Now consider high cash firms that do not issue equity, but also do not pay dividends ($\omega < \bar{\omega}$). Such firms set $I$ such that $\alpha(1 - t_c)(1 - t_d) f'(I) = g'(X - I)$. For these firms, dividend and corporate taxes both distort the return to investment in the same way, implying $dP_c/dt_d = dP_c/dt_c$. The effects of $t_d$ and $t_c$ on the dividend tax base are fully determined by their effects on profits, implying $dP_d/dt_d = (1 - t_c)\,dP_c/dt_d$ and $dP_d/dt_c = -P_c + (1 - t_c)(dP_c/dt_c)$. Combining these results, we obtain

$$\frac{dW}{dt_d} = \frac{dW}{dt_c} = (t_c + t_d - t_c t_d)\frac{dP_c}{dt_c} + (1 - t_d)(1 - t_c)(1 - \alpha)\frac{dP_c}{dt_c}. \tag{27}$$

The first term in this formula coincides with that in equation (8) for excess burden for firms that do not pay dividends in the neoclassical (old view) model. The second term is due to the agency problem, which increases the excess burden of both the corporate and dividend tax for firms with $X < \bar{X}$. For managers choosing between an untaxed pet project investment and taxed profitable investment at the margin, both the dividend and corporate taxes distort investment behavior. Because these managers are already underinvesting in $I$ from the shareholders’ perspective, both taxes exacerbate this preexisting distortion to the same degree.

How are the two lessons about efficiency costs obtained from the neoclassical analysis in Section I affected by agency problems? First, dividend taxation always generates deadweight loss, even for cash-rich firms. Second, the dividend tax creates first-order deadweight costs by distorting dividend payout decisions, whereas the corporate tax generates second-order efficiency costs for firms that pay dividends.

To see the importance of the distinction between the first-order and second-order terms, consider the marginal excess burden of raising the dividend tax from the current rate of $t_d$ = 15 percent. In the Execucomp data used in Chetty and Saez (2005), total executive share ownership averages less than $\alpha = 0.03$ in all years.20 In equation (25) for dividend-paying firms, the first-order agency term $(1 - t_d)(1 - \alpha)$ therefore accounts for $(1 - t_d)(1 - \alpha)/(t_d + (1 - t_d)(1 - \alpha))$ = 84 percent of the marginal excess burden of a dividend tax increase. Hence, agency effects are likely to be the primary driver of any efficiency costs of dividend taxes. A useful feature of the formulas for excess burden in (25) and (26) is that they are functions of a small set of parameters that can, in principle, be estimated empirically, such as the elasticities of dividend payments and corporate profits with respect to tax rates. The primitives of the model, such as the pet project payoff $g(J)$, affect efficiency costs only through the high-level elasticities that enter the formula. Estimating these structural parameters would be difficult as they represent reduced forms of complex contracts and payoffs for shareholders and management.
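The share of the marginal excess burden attributable to the agency term in equation (25) is simple arithmetic. The sketch below evaluates it at $t_d = 0.15$ and at the upper bound $\alpha = 0.03$ quoted in the text; the function name is illustrative. Because the share is decreasing in $\alpha$, the value at $\alpha = 0.03$ is a lower bound when actual ownership is smaller.

```python
def agency_share(t_d, alpha):
    """Share of the bracket in (25) due to the first-order agency term."""
    agency_term = (1.0 - t_d) * (1.0 - alpha)
    return agency_term / (t_d + agency_term)

if __name__ == "__main__":
    share = agency_share(t_d=0.15, alpha=0.03)
    print(f"agency share of marginal excess burden: {share:.3f}")
```

Evaluated at these inputs the ratio is roughly 0.85, in line with the 84 percent figure reported above.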

20 Although this calculation focuses solely on stock ownership, accounting for other forms of incentive-based pay is unlikely to raise $\alpha$ significantly. Existing studies have measured $\alpha$ more broadly by computing the change in the wealth of a CEO when his firm’s value increases by $1. These studies estimate that $\alpha$ is less than 1 percent on average for CEOs of publicly traded corporations in the United States (see Kevin J. Murphy 1999 for a survey).


The formulas for excess burden we have derived above ignore the possibility that the firm may return profits to shareholders through share repurchases instead of dividends. In the Appendix, we extend the model to allow for costly share repurchases, as in Poterba and Summers (1985). We obtain the same excess burden formulas as those above, except that the first-order agency term depends upon the effect of tax changes on total payout. Intuitively, the cash left over for pet project investment is determined by total payout, and not just dividends. Therefore, an increase in $t_d$ has first-order deadweight costs if it reduces total payout and does not simply induce substitution between share repurchases and dividends.21 An increase in $t_c$ continues to have second-order deadweight costs for very high-cash firms. The main limitation of this approach to incorporating share repurchases is that it relies on an ad-hoc cost to explain why firms pay dividends despite the tax advantage of repurchases. Microfounded models of share repurchases may have different welfare implications.

B. Endogenous Contracts

We now show that the formulas derived above generalize to a model with endogenous contracts and monitoring. We begin by modeling how the manager’s contract ($\alpha$) is determined, and then turn to the efficiency analysis, which takes into account the impact of taxes on this contract.

Determination of Manager’s Contract. We model the determination of the manager’s contract using the standard principal-agent framework in the corporate finance literature with a risk-neutral principal and risk-averse manager. The critical assumption we make is that this contract is chosen by the board of directors, who initially own a fraction $\alpha_B$ of the firm’s shares, and whose objective is to maximize their own profits net of monitoring costs. The remaining shares $1 - \alpha_B$ are owned by small shareholders whose interests are not directly represented on the board of directors. This captures the fundamental conflict between ownership and management, that small minority shareholders are passive investors who do not participate in management decisions.

For reasons we describe below, it is important to ensure that the board has a set of tools that spans the set of tax instruments available to the government. We therefore expand the manager’s compensation contract to include three components. First, the board can compensate the manager by giving him a fraction $\alpha$ of the company’s shares. Second, the manager receives a fixed salary $S$ independent of profits and dividends. The salary $S$ is paid in period 2 before the firm is liquidated. Third, the manager receives a bonus equal to a share $b$ of after-tax corporate profits $(1 - t_c) f(I)$ generated in period 2. In addition to these three choice variables, the board continues to choose the level of monitoring $\gamma$ as above.

The board faces a tradeoff in setting the manager’s contract because he is averse to risk. If the manager were risk neutral, he would buy the entire firm to resolve the agency problem and maximize total surplus. For tractability, we use a standard
21 In Chetty and Saez (2006), we present suggestive evidence that companies did not substitute dividends for repurchases. However, further empirical work is needed to estimate this substitution elasticity precisely.


CARA-Normal framework to model the risk the manager faces. In particular, assume that the firm’s profits are given by $f(I) + \varepsilon$, where $\varepsilon \sim N(0, \sigma^2)$. In this generalized model, the manager’s total consumption is

$$V^M = \alpha(1 - t_d)\left[X - I - J + \frac{(1 - t_c)(1 - b)(f(I) + \varepsilon) + I + J - S}{1 + r}\right] + \frac{S}{1 + r} + \frac{(1 - t_c)\, b\, (f(I) + \varepsilon)}{1 + r} + \frac{g(J)}{(1 + r)(1 + \gamma)}.$$

It is convenient to rewrite this expression as

$$V^M = \tilde{\alpha}\left[X - I - J + \frac{\tilde{\tau}(f(I) + \varepsilon) + I + J - S}{1 + r}\right] + \frac{S}{1 + r} + \frac{g(J)}{(1 + r)(1 + \gamma)},$$

where $\tilde{\alpha} = (1 - t_d)\alpha$ and $\tilde{\tau} = (1 - t_c)[(1 - b) + b/\tilde{\alpha}]$. Introducing $\tilde{\alpha}$ and $\tilde{\tau}$ allows us to eliminate $t_c$ and $t_d$ from the manager’s objective, which is another way to see that the government and private sector have equivalent tools. Any change in managerial incentives caused by changes in government policies can, in principle, be fully undone by changes in the manager’s contract.

The manager’s utility function is $u(V^M) = -(1/\rho)e^{-\rho V^M}$, where $\rho$ denotes the level of absolute risk aversion. Exploiting the CARA-Normal properties, the expected value received by the manager can be written as

$$EV^M = \tilde{\alpha}\left[X - I - J + \frac{\tilde{\tau} f(I) + I + J - S}{1 + r}\right] + \frac{S}{1 + r} + \frac{g(J)}{(1 + r)(1 + \gamma)} - \frac{\rho}{2}\,\frac{\tilde{\alpha}^2 \tilde{\tau}^2 \sigma^2}{(1 + r)^2}.$$
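The certainty-equivalent step can be checked numerically. The sketch below is an illustration rather than part of the original analysis: it assumes a payoff that is linear in the shock, V = a + k·eps with eps ~ N(0, sigma²), and compares the CARA certainty equivalent obtained by numerical integration with the closed form a - (rho/2)·k²·sigma². The names `a`, `k`, `rho` and all parameter values are hypothetical; `k` plays the role of the manager's exposure to the profit shock.

```python
import math

def cara_ce_numeric(a, k, sigma, rho, n=20001, width=12.0):
    """Certainty equivalent of V = a + k*eps, eps ~ N(0, sigma^2), under
    u(V) = -(1/rho) * exp(-rho * V), using trapezoidal integration."""
    lo, hi = -width * sigma, width * sigma
    step = (hi - lo) / (n - 1)
    expected_utility = 0.0
    for i in range(n):
        eps = lo + i * step
        density = math.exp(-0.5 * (eps / sigma) ** 2) / (sigma * math.sqrt(2 * math.pi))
        utility = -(1.0 / rho) * math.exp(-rho * (a + k * eps))
        weight = 0.5 if i in (0, n - 1) else 1.0  # trapezoid end points
        expected_utility += weight * utility * density * step
    # Invert u(.) to get the sure payoff that yields the same utility.
    return -math.log(-rho * expected_utility) / rho

def cara_ce_closed_form(a, k, sigma, rho):
    """Mean payoff minus the CARA-Normal risk premium (rho/2) * Var(V)."""
    return a - 0.5 * rho * (k * sigma) ** 2

# Hypothetical values chosen only for illustration.
a, k, sigma, rho = 1.0, 0.3, 1.5, 2.0
ce_numeric = cara_ce_numeric(a, k, sigma, rho)
ce_closed = cara_ce_closed_form(a, k, sigma, rho)
print(ce_numeric, ce_closed)
```

The two numbers should agree to many decimal places, which is the sense in which the risk term enters the manager's expected value only through a variance penalty.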

Note that the maximization problem of the manager who chooses $I$ and $J$ to maximize $EV^M$ is identical to the problem in the deterministic model solved in Lemma 1. Hence, $I$ and $J$ depend upon $\tilde{\alpha}$, $\tilde{\tau}$, and $\gamma$. The board chooses $S$, $b$, $\alpha$, and $\gamma$ to maximize the board’s share value, taking into account the manager’s incentive constraints and participation constraint $EV^M \ge 0$. As above, denote by $P_c = f(I)/(1 + r)$ the corporate tax base and $P_d = X - I - J + [(1 - t_c)(1 - b) f(I) + I + J - S]/(1 + r)$ the dividend tax base. Note that we can rewrite $P_d$ as

$$P_d(\tilde{\alpha}, \tilde{\tau}, \gamma, t_c) = X - I - J + \frac{\dfrac{1 - t_c - \tilde{\alpha}\tilde{\tau}}{1 - \tilde{\alpha}}\, f(I) + I + J - S}{1 + r},$$


where the dependence on $t_c$ captures the mechanical change in $P_d$ holding fixed the manager’s contract $(\tilde{\alpha}, \tilde{\tau}, \gamma)$. Note that $\partial P_d/\partial t_c = -P_c/(1 - \tilde{\alpha})$. With this notation, the board chooses $(\tilde{\alpha}, \tilde{\tau}, \gamma)$ to maximize

$$W_S = \left(\alpha_B(1 - t_d) - \tilde{\alpha}\right) P_d(\tilde{\alpha}, \tilde{\tau}, \gamma, t_c) - c(\gamma). \tag{28}$$

The minority shareholders’ surplus is

$$W_M = (1 - \alpha_B)(1 - t_d)\, P_d(\tilde{\alpha}, \tilde{\tau}, \gamma, t_c). \tag{29}$$

Since the manager’s surplus is pinned at zero by his participation constraint, total surplus in the economy ($W$) is the sum of the shareholders’ welfare and government revenue:

$$W = t_d P_d + t_c P_c + W_S + W_M.$$
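The change of variables that removes the tax rates from the manager's objective, and the matching rewrite of the dividend tax base, can be sanity-checked with random numbers. The sketch below is only an illustration: `alpha_t` and `tau_t` are names chosen here for the two composite contract terms, `fI` and `gJ` stand for the values of f(I) and g(J), and the sampling ranges are arbitrary assumptions.

```python
import random

def payoff_original(alpha, b, S, td, tc, X, I, J, fI, eps, gJ, r, gamma):
    """Manager's consumption with the contract (alpha, b, S) and the
    tax rates (td, tc) appearing explicitly."""
    period2 = ((1 - tc) * (1 - b) * (fI + eps) + I + J - S) / (1 + r)
    return (alpha * (1 - td) * (X - I - J + period2)
            + S / (1 + r)
            + (1 - tc) * b * (fI + eps) / (1 + r)
            + gJ / ((1 + r) * (1 + gamma)))

def payoff_tilde(alpha_t, tau_t, S, X, I, J, fI, eps, gJ, r, gamma):
    """The same payoff written with the composite terms only."""
    period2 = (tau_t * (fI + eps) + I + J - S) / (1 + r)
    return (alpha_t * (X - I - J + period2)
            + S / (1 + r)
            + gJ / ((1 + r) * (1 + gamma)))

def dividend_base_original(b, S, tc, X, I, J, fI, r):
    return X - I - J + ((1 - tc) * (1 - b) * fI + I + J - S) / (1 + r)

def dividend_base_tilde(alpha_t, tau_t, S, tc, X, I, J, fI, r):
    share = (1 - tc - alpha_t * tau_t) / (1 - alpha_t)
    return X - I - J + (share * fI + I + J - S) / (1 + r)

rng = random.Random(0)
max_gap = 0.0
for _ in range(1000):
    alpha, b = rng.uniform(0.01, 0.5), rng.uniform(0.0, 0.5)
    td, tc = rng.uniform(0.0, 0.5), rng.uniform(0.0, 0.5)
    S, X = rng.uniform(0, 5), rng.uniform(50, 100)
    I, J = rng.uniform(1, 20), rng.uniform(1, 20)
    fI, eps, gJ = rng.uniform(1, 30), rng.gauss(0, 1), rng.uniform(0, 10)
    r, gamma = rng.uniform(0.01, 0.2), rng.uniform(0.0, 2.0)
    # Composite terms as defined in the text.
    alpha_t = (1 - td) * alpha
    tau_t = (1 - tc) * ((1 - b) + b / alpha_t)
    gap_v = abs(payoff_original(alpha, b, S, td, tc, X, I, J, fI, eps, gJ, r, gamma)
                - payoff_tilde(alpha_t, tau_t, S, X, I, J, fI, eps, gJ, r, gamma))
    gap_p = abs(dividend_base_original(b, S, tc, X, I, J, fI, r)
                - dividend_base_tilde(alpha_t, tau_t, S, tc, X, I, J, fI, r))
    max_gap = max(max_gap, gap_v, gap_p)
print(max_gap)
```

A gap at the level of floating-point rounding indicates that the two parametrizations describe the same payoffs, which is what lets the board undo any tax-induced change in incentives through the contract.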

Efficiency Cost. Using the envelope theorem, we have $dW_S/dt_d = -\alpha_B P_d$ and $dW_S/dt_c = (\alpha_B(1 - t_d) - \tilde{\alpha})(\partial P_d/\partial t_c) = -P_c(\alpha_B(1 - t_d) - \tilde{\alpha})/(1 - \tilde{\alpha})$. We therefore have $dW_M/dt_d = -(1 - \alpha_B)P_d + (1 - \alpha_B)(1 - t_d)\,dP_d/dt_d$ and $dW_M/dt_c = (1 - \alpha_B)(1 - t_d)\,dP_d/dt_c$. Combining these results yields

$$\frac{dW}{dt_d} = t_c \frac{dP_c}{dt_d} + \left[t_d + (1 - t_d)(1 - \alpha_B)\right]\frac{dP_d}{dt_d}, \tag{30}$$

$$\frac{dW}{dt_c} = t_c \frac{dP_c}{dt_c} + \left[t_d + (1 - t_d)(1 - \alpha_B)\right]\left(\frac{dP_d}{dt_c} - \frac{\partial P_d}{\partial t_c}\right). \tag{31}$$

Equations (30) and (31) coincide with those in the special case above, replacing $\alpha$ with $\alpha_B$. To understand these equations, it is useful to distinguish between two cases: $\alpha_B = 1$ and $\alpha_B < 1$.

Case 1: $\alpha_B = 1$. When there are no minority shareholders, the first-order terms in (30) and (31) disappear and deadweight burden becomes a second-order function of the tax rate as in the neoclassical old view model. The marginal deadweight cost of taxation is small at low tax rates even though the contract between the manager and board has $\alpha < 1$, leading to inefficient pet project investment and under-provision of dividends by the manager. This result contrasts with the intuition developed in the previous section that taxing a market with a preexisting distortion leads to a first-order efficiency cost, which is a classic result in public finance (Auerbach 1985; Hines 1999; Auerbach and Hines 2003; Lawrence H. Goulder and Roberton C. Williams III 2003; Louis Kaplow 2008). There are two reasons that our result differs from that of other studies in the tax literature. First, we have designed the model so that the government does not have


an intrinsic technological advantage in fixing the agency problem relative to the private sector. Any change in incentives for the manager that can be achieved by changing the tax system $(t_d, t_c)$ can be achieved by changing the private contract $(\alpha, b, S)$. Second, the contract between the manager and the shareholders is constrained efficient when $\alpha_B = 1$: absent taxes, the compensation of the manager is designed to maximize surplus subject to the technological constraint that only managers can make the investment and payout decisions for the firm. Hence, the size of the preexisting distortion due to agency problems is endogenously minimized by the private sector when $\alpha_B = 1$ in this model. In contrast, the preexisting distortions analyzed in the previous section, and in the studies cited above, are exogenously fixed. The government has a technological advantage in fixing these distortions (it can use a dividend subsidy whereas the private sector cannot), and thus dividend taxes have first-order costs.

The general lesson, which is of relevance beyond dividend taxation, is that identifying a preexisting distortion is not sufficient to infer that government taxes or subsidies will have first-order effects on welfare. It is critical to understand the private sector’s ability to alter the size of the distortion, in particular whether the private sector has the same tools as the government and whether the private sector reaches the second-best efficient outcome. In the context of dividend taxation, there is no obvious reason that government intervention is superior to the tools available to shareholders as a method of resolving agency problems.22

Case 2: $\alpha_B < 1$. When $\alpha_B < 1$, the interests of diffuse shareholders are ignored by the board and the private contract no longer maximizes total private surplus. As a result, taxes have first-order effects, as shown by the $(1 - t_d)(1 - \alpha_B)$ terms in (30) and (31).23 As in the fixed contract case, setting $t_d = t_d^* < 0$ (where $\alpha_B(1 - t_d^*) = 1$) corrects the externality and eliminates these first-order terms. Setting $t_d = t_d^*$ and $t_c = 0$ thus maximizes social welfare absent revenue constraints. With endogenous contracts, the size of the first-order term in the excess burden formulas is determined by $\alpha_B$ instead of $\alpha$. This is because the ultimate source of the externality is that the large shareholders under-provide monitoring and pay-for-performance incentives to the manager when $\alpha_B < 1$.

The model can be further generalized to permit endogenous determination of the fraction of large shareholders $\alpha_B$, as shown in Chetty and Saez (2007). Large shareholders often buy a large block of shares through tender offers (Shleifer and Vishny 1986). Such tender offers are made in the self-interest of the acquirer and do not take into account the interests of the remaining diffuse shareholders. This creates an agency problem because $\alpha_B$ is determined in a way that does not maximize

22 Through regulation, governments may be able to affect the contracting technology in a way that the private sector itself cannot (Shleifer and Vishny 1997). For example, if shareholders’ rights are protected in courts, shareholders may have more control over managers, reducing $c(\gamma)$ and leading to a first-order efficiency gain. The key point is that dividend taxes do not affect contracting technology directly, holding fixed the regulatory structure embodied by the function $c(\gamma)$.

23 In the case with endogenous contracts, the corporate tax can have first-order effects even for very high-cash firms because it distorts the manager’s contract, which in turn affects payout decisions.


total private surplus. As a result, dividend taxation continues to generate first-order efficiency costs and a dividend subsidy can be used to correct the externality.

The results with endogenous contracts explain why our formulas for excess burden differ from those obtained in Gordon and Dietz’s (2008) agency model. Gordon and Dietz (2008) assume that the board of directors sets the level of dividends on behalf of all shareholders, which is analogous to assuming $\alpha_B = 1$ in our model. This is the reason that the efficiency cost of dividend taxation takes the standard second-order Harberger form in their model.
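The aggregation behind equations (30) and (31), and the corrective rate that removes the first-order term, can be illustrated with a short numerical sketch. It is an assumption-laden check, not the authors' derivation: the tax bases and their responses are drawn at random rather than solved from the model, and the function names and the names `alpha_b` (board's share) and `alpha_t` (manager's after-tax share) are hypothetical.

```python
import random

def first_order_weight(td, alpha_b):
    """Coefficient on the dividend-base response in (30) and (31)."""
    return td + (1 - td) * (1 - alpha_b)

def corrective_dividend_tax(alpha_b):
    """Rate solving alpha_b * (1 - td) = 1; a subsidy when alpha_b < 1."""
    return 1 - 1 / alpha_b

def dW_dtd_by_parts(td, tc, alpha_b, Pd, dPd, dPc):
    revenue = Pd + td * dPd + tc * dPc      # d(td*Pd + tc*Pc)/dtd
    board = -alpha_b * Pd                   # envelope theorem
    minority = -(1 - alpha_b) * Pd + (1 - alpha_b) * (1 - td) * dPd
    return revenue + board + minority

def dW_dtd_formula(td, tc, alpha_b, dPd, dPc):
    return tc * dPc + first_order_weight(td, alpha_b) * dPd

def dW_dtc_by_parts(td, tc, alpha_b, alpha_t, Pc, dPd, dPc):
    partial_Pd = -Pc / (1 - alpha_t)        # mechanical effect of tc on Pd
    revenue = Pc + tc * dPc + td * dPd      # d(td*Pd + tc*Pc)/dtc
    board = (alpha_b * (1 - td) - alpha_t) * partial_Pd
    minority = (1 - alpha_b) * (1 - td) * dPd
    return revenue + board + minority

def dW_dtc_formula(td, tc, alpha_b, alpha_t, Pc, dPd, dPc):
    partial_Pd = -Pc / (1 - alpha_t)
    return tc * dPc + first_order_weight(td, alpha_b) * (dPd - partial_Pd)

rng = random.Random(1)
max_gap = 0.0
for _ in range(1000):
    td, tc = rng.uniform(-0.5, 0.5), rng.uniform(0.0, 0.5)
    alpha_b, alpha_t = rng.uniform(0.1, 1.0), rng.uniform(0.01, 0.09)
    Pd, Pc = rng.uniform(10, 100), rng.uniform(1, 20)
    dPd, dPc = rng.uniform(-50, 50), rng.uniform(-10, 10)
    gap_d = abs(dW_dtd_by_parts(td, tc, alpha_b, Pd, dPd, dPc)
                - dW_dtd_formula(td, tc, alpha_b, dPd, dPc))
    gap_c = abs(dW_dtc_by_parts(td, tc, alpha_b, alpha_t, Pc, dPd, dPc)
                - dW_dtc_formula(td, tc, alpha_b, alpha_t, Pc, dPd, dPc))
    max_gap = max(max_gap, gap_d, gap_c)

weight_at_correction = first_order_weight(corrective_dividend_tax(0.4), 0.4)
weight_full_board = first_order_weight(0.2, 1.0)
print(max_gap, weight_at_correction, weight_full_board)
```

With a board share of 0.4 the corrective rate is a subsidy of 150 percent, at which the weight on the dividend-base response is zero; with a board share of one the weight collapses to the tax rate itself, the second-order Harberger case.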
V. Conclusion

The public finance literature on corporate taxation has focused primarily on models of profit-maximizing firms. In contrast, since Jensen and Meckling (1976), the corporate finance literature has emphasized deviations from profit maximization by managers as a central determinant of firm behavior. This paper has taken a step toward bridging this gap. We analyzed the effects of dividend taxation in an agency model, and showed that it can explain many aspects of the empirical evidence on firms’ responses to taxation that pose problems for existing neoclassical models.

We used this model to characterize the efficiency cost of dividend taxation. Dividend taxation has first-order efficiency costs when managers’ interests differ from shareholders’ and companies are owned by diffuse shareholders, which is perhaps the most plausible description of modern corporations (Shleifer and Vishny 1997). Our analysis suggests that the main source of inefficiency from increasing the dividend tax rate is the misallocation of capital by managers because of reduced monitoring, and not the distortion to the overall level of investment emphasized in the “old view” model. From a policy perspective, if agency problems are prevalent, dividend taxation should be used relatively little if the government has other tools (e.g., progressive income taxation integrated with corporate taxation) that have similar distributional effects but do not create first-order distortions.

We see two important directions for future research. First, while our model explains evidence on the effects of dividend taxation, it does not directly explain other stylized facts about dividends such as the smoothness of dividends, payment of dividends while issuing equity, and the use of dividends despite the tax advantage of share repurchases. To fully understand the effects of dividend and corporate taxation, it is critical to build a micro-founded model that explains this evidence without appealing to ad hoc costs.
Second, our analysis calls for further empirical work related to agency issues in corporate taxation. In our model, a dividend tax cut raises efficiency by improving the allocation of capital: firms with excess cash holdings invest less following a tax cut, while cash-constrained firms invest more. Testing whether tax reforms generate such heterogeneous investment responses across firms would shed light on the empirical importance of this allocation efficiency mechanism.
Appendix: Incorporating Share Repurchases

Suppose the firm can return money to shareholders through untaxed share repurchases in period 0, which we denote by R. Returning R to shareholders through


repurchases has a cost $c(R)$ that is distributed across all shareholders and is increasing and convex.24 The neoclassical model in Section II can be extended to allow such share repurchases by replacing equation (1) with

$$\max_{D,E}\; V = R - c(R) + (1 - t_d)D - E + \frac{(1 - t_d)\left[(1 - t_c) f(X + E - D - R) + X - D - R\right] + E}{1 + r}. \tag{32}$$

Cash-rich firms paying dividends $D > 0$ set $R$ such that $c'(R) = t_d$, so that an increase in $t_d$ increases $R$, creating partial substitution between dividends and share repurchases. Cash-constrained firms that raise equity $E > 0$ do not repurchase shares. Intermediate firms may repurchase shares. The efficiency formulas (4) and (5) are unchanged.

In the agency model of Section IVA with exogenous $\alpha$ and no monitoring, let us focus on very high-cash firms that pay dividends $D > 0$ for simplicity. For such firms, the resource constraint is $I + J = X - D - R$, and we can write the manager’s value as

$$V^M = \alpha[R - c(R)] + \alpha(1 - t_d)\left(D + \frac{(1 - t_c) f(I) + I + J}{1 + r}\right) + \frac{g(J)}{1 + r}.$$

Denoting by $\bar{D} = D + R$ the total period 0 payout, we have $I + J = X - \bar{D}$ and

$$V^M = \alpha[t_d R - c(R)] + \alpha(1 - t_d)\left(\bar{D} + \frac{(1 - t_c) f(I) + I + J}{1 + r}\right) + \frac{g(J)}{1 + r}.$$

This is the sum of the problem in the baseline agency model with $\bar{D}$ replacing $D$ plus a separable repurchase problem involving $R$ that is equivalent to the repurchase problem in the neoclassical model. The first-order condition for $R$ is therefore $c'(R) = t_d$, as in the neoclassical model above. The first-order conditions for the other variables are identical to those in the baseline model without repurchases: $(1 - t_c) f'(I) = r$ and $\alpha(1 - t_d)\, r/(1 + r) = g'(J)/(1 + r)$. Hence, the key comparative static results for the agency model in Sections II and III hold with repurchases.

Now consider the efficiency analysis. Social welfare is

$$W = V^M + V^S + t_d P_d + t_c P_c = \alpha[R - c(R)] + \alpha(1 - t_d)\left(D + \frac{(1 - t_c) f(I) + I + J}{1 + r}\right) + \frac{g(J)}{1 + r} + (1 - \alpha)(R - c(R)) + (1 - \alpha)(1 - t_d)P_d + t_d P_d + t_c P_c,$$

24 In practice, share repurchases are taxed at a lower rate than dividends. It is straightforward to introduce a tax rate $t_s$ on share repurchases without changing the analysis, as the results do not depend upon the specification of $c(R)$.


where $P_d = D + ((1 - t_c) f(I) + I + J)/(1 + r)$ and $P_c = f(I)/(1 + r)$ denote the dividend and corporate tax bases as above. The marginal excess burden of raising the dividend tax is

$$\frac{dW}{dt_d} = t_c \frac{dP_c}{dt_d} + t_d \frac{dP_d}{dt_d} + (1 - t_d)(1 - \alpha)\frac{dP_d}{dt_d} + (1 - \alpha)(1 - c'(R))\frac{dR}{dt_d} \tag{33}$$

$$= t_c \frac{dP_c}{dt_d} + t_d \frac{dP_d}{dt_d} + (1 - t_d)(1 - \alpha)\frac{d(P_d + R)}{dt_d}, \tag{34}$$

where $d(P_d + R)/dt_d$ is the effect of the dividend tax on total payout. This formula coincides with (23) except that the first-order term has $d(P_d + R)/dt_d$ instead of $dP_d/dt_d$. Intuitively, $R$ is chosen optimally by the manager from the shareholders’ perspective, so the first-order agency related term in the excess burden formula depends only on the distortion in pet project investment. Pet project investment is determined by total payout, not just dividend payments, and thus total payout is what matters for the agency problem. In contrast, the standard Harberger terms are related to distortions in the dividend tax base itself and therefore continue to have the same form as in the model without repurchases. Similarly, the excess burden of raising the corporate tax rate is

$$\frac{dW}{dt_c} = t_d\left(\frac{dP_d}{dt_c} - \frac{\partial P_d}{\partial t_c}\right) + t_c \frac{dP_c}{dt_c} + (1 - t_d)(1 - \alpha)\left(\frac{d(P_d + R)}{dt_c} - \frac{\partial P_d}{\partial t_c}\right).$$

As $c'(R) = t_d$, $R$ is unaffected by $t_c$, and therefore $d(P_d + R)/dt_c = dP_d/dt_c = -P_c = \partial P_d/\partial t_c$. Hence, even with share repurchases, corporate taxes have second-order deadweight burden for very high-cash firms:

$$\frac{dW}{dt_c} = t_c \frac{dP_c}{dt_c}.$$
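The appendix results can be illustrated for a very high-cash firm by solving the three first-order conditions in closed form and differentiating welfare numerically. Everything specific in the sketch below is an assumption made for illustration: the functional forms f(I) = 2A·sqrt(I), g(J) = 2B·sqrt(J), c(R) = kappa·R²/2, the parameter values, and the function names. It is a minimal check of the dividend tax formula with total payout, not a calibration.

```python
import math

# Hypothetical parameter values, chosen so that dividends stay positive.
PARAMS = dict(X=200.0, r=0.10, tc=0.30, alpha=0.10, A=1.0, B=0.04, kappa=0.02)

def solve_firm(td, X, r, tc, alpha, A, B, kappa):
    """Choices of a very high-cash firm under the assumed functional forms
    f(I) = 2*A*sqrt(I), g(J) = 2*B*sqrt(J), c(R) = kappa*R**2/2."""
    I = (A * (1 - tc) / r) ** 2              # (1 - tc) f'(I) = r
    J = (B / (alpha * (1 - td) * r)) ** 2    # alpha (1 - td) r = g'(J)
    R = td / kappa                           # c'(R) = td
    D = X - I - J - R                        # resource constraint
    f = 2 * A * math.sqrt(I)
    g = 2 * B * math.sqrt(J)
    c = kappa * R ** 2 / 2
    Pd = D + ((1 - tc) * f + I + J) / (1 + r)    # dividend tax base
    Pc = f / (1 + r)                             # corporate tax base
    # Manager value + outside shareholders' value + tax revenue.
    W = R - c + Pd + g / (1 + r) + tc * Pc
    return dict(I=I, J=J, R=R, D=D, Pd=Pd, Pc=Pc, W=W)

def derivative(key, td, h=1e-5):
    """Central finite difference of one outcome with respect to td."""
    up = solve_firm(td + h, **PARAMS)
    down = solve_firm(td - h, **PARAMS)
    return (up[key] - down[key]) / (2 * h)

td = 0.20
alpha, tc = PARAMS["alpha"], PARAMS["tc"]
baseline = solve_firm(td, **PARAMS)

dW_numeric = derivative("W", td)
dR = derivative("R", td)
d_total_payout = derivative("Pd", td) + dR
dW_formula = (tc * derivative("Pc", td)
              + td * derivative("Pd", td)
              + (1 - td) * (1 - alpha) * d_total_payout)
print(dW_numeric, dW_formula, d_total_payout, dR)
```

In this parametrization a higher dividend tax raises repurchases but lowers total payout, and the welfare derivative computed directly matches the expression built from the Harberger terms plus the total-payout term.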

References
Auerbach, Alan J. 1979. “Wealth Maximization and the Cost of Capital.” Quarterly Journal of Economics, 93(3): 433–46.
Auerbach, Alan J. 1985. “The Theory of Excess Burden and Optimal Taxation.” In Handbook of Public Economics, Vol. 1, ed. Alan J. Auerbach and Martin Feldstein, 61–127. Amsterdam: North-Holland.
Auerbach, Alan J., and Kevin A. Hassett. 2007. “The 2003 Dividend Tax Cuts and the Value of the Firm: An Event Study.” In Taxing Corporate Income in the 21st Century, ed. Alan J. Auerbach, James R. Hines, Jr., and Joel B. Slemrod, 93–126. Cambridge, UK: Cambridge University Press.
Auerbach, Alan J., and James R. Hines, Jr. 2003. “Taxation and Economic Efficiency.” In Handbook of Public Economics, Vol. 3, ed. Alan J. Auerbach and Martin S. Feldstein, 1347–1421. Amsterdam: North-Holland.
Benartzi, Shlomo, Roni Michaely, and Richard H. Thaler. 1997. “Do Changes in Dividends Signal the Future or the Past?” Journal of Finance, 52(3): 1007–34.


Bernheim, B. Douglas. 1991. “Tax Policy and the Dividend Puzzle.” RAND Journal of Economics, 22(4): 455–76.
Bertrand, Marianne, and Sendhil Mullainathan. 2003. “Enjoying the Quiet Life? Corporate Governance and Managerial Preferences.” Journal of Political Economy, 111(5): 1043–75.
Bradford, David F. 1981. “The Incidence and Allocation Effects of a Tax on Corporate Distributions.” Journal of Public Economics, 15(1): 1–22.
Brown, Jeffrey R., Nellie Liang, and Scott Weisbenner. 2007. “Executive Financial Incentives and Payout Policy: Firm Responses to the 2003 Dividend Tax Cut.” Journal of Finance, 62(4): 1935–65.
Chetty, Raj, and Emmanuel Saez. 2005. “Dividend Taxes and Corporate Behavior: Evidence from the 2003 Dividend Tax Cut.” Quarterly Journal of Economics, 120(3): 791–833.
Chetty, Raj, and Emmanuel Saez. 2006. “The Effects of the 2003 Dividend Tax Cut on Corporate Behavior: Interpreting the Evidence.” American Economic Review, 96(2): 124–29.
Chetty, Raj, and Emmanuel Saez. 2007. “An Agency Theory of Dividend Taxation.” National Bureau of Economic Research Working Paper 13538.
Chirinko, Robert S. 1993. “Business Fixed Investment Spending: Modeling Strategies, Empirical Results, and Policy Implications.” Journal of Economic Literature, 31(4): 1875–1911.
Christie, William G., and Vikram Nanda. 1994. “Free Cash Flow, Shareholder Value, and the Undistributed Profits Tax of 1936 and 1937.” Journal of Finance, 49(5): 1727–54.
Coase, Ronald H. 1960. “The Problem of Social Cost.” Journal of Law and Economics, 3(1): 1–44.
Desai, Mihir A., C. Fritz Foley, and James R. Hines, Jr. 2007. “Dividend Policy Inside the Multinational Firm.” Financial Management, 36(1): 5–26.
Desai, Mihir A., and Austan D. Goolsbee. 2004. “Investment, Overhang, and Tax Policy.” Brookings Papers on Economic Activity, 2: 285–338.
Feldstein, Martin S. 1970. “Corporate Taxation and Dividend Behaviour.” Review of Economic Studies, 37(1): 57–72.
Fenn, George W., and Nellie Liang. 2001. “Corporate Payout Policy and Managerial Stock Incentives.” Journal of Financial Economics, 60(1): 45–72.
Gordon, Roger, and Martin Dietz. 2008. “Dividends and Taxes.” In Institutional Foundations of Public Finance: Economic and Legal Perspectives, ed. Alan J. Auerbach and Daniel N. Shaviro, 204–24. Cambridge, MA: Harvard University Press.
Goulder, Lawrence H., and Roberton C. Williams III. 2003. “The Substantial Bias from Ignoring General Equilibrium Effects in Estimating Excess Burden, and a Practical Solution.” Journal of Political Economy, 111(4): 898–927.
Grullon, Gustavo, Roni Michaely, Shlomo Benartzi, and Richard H. Thaler. 2005. “Dividend Changes Do Not Signal Changes in Future Profitability.” Journal of Business, 78(5): 1659–82.
Harberger, Arnold C. 1962. “The Incidence of the Corporation Income Tax.” Journal of Political Economy, 70(3): 215–40.
Harberger, Arnold C. 1966. “Efficiency Effects of Taxes on Income from Capital.” In Effects of Corporation Income Tax, ed. M. Krzyzaniak, 107–17. Detroit: Wayne State University Press.
Hines, James R., Jr. 1999. “Three Sides of Harberger Triangles.” Journal of Economic Perspectives, 13(2): 167–88.
Jensen, Michael C., and William H. Meckling. 1976. “Theory of the Firm: Managerial Behavior, Agency Costs and Ownership Structure.” Journal of Financial Economics, 3(4): 305–60.
Kaplow, Louis. 2008. The Theory of Taxation and Public Economics. Princeton, NJ: Princeton University Press.
King, Mervyn A. 1977. Public Policy and the Corporation. London: Chapman and Hall.
Korinek, Anton, and Joseph E. Stiglitz. 2009. “Dividend Taxation and Intertemporal Tax Arbitrage.” Journal of Public Economics, 93(1–2): 142–59.
Lang, Larry H. P., and Robert H. Litzenberger. 1989. “Dividend Announcements: Cash Flow Signalling vs. Free Cash Flow Hypothesis?” Journal of Financial Economics, 24(1): 181–91.
La Porta, Rafael, Florencio Lopez-de-Silanes, Andrei Shleifer, and Robert W. Vishny. 2000. “Agency Problems and Dividend Policies Around the World.” Journal of Finance, 55(1): 1–33.
Murphy, Kevin J. 1999. “Executive Compensation.” In Handbook of Labor Economics, Vol. 3B, ed. Orley Ashenfelter and David Card, 2485–2563. Amsterdam: North-Holland.
Nam, Jouahn, Jun Wang, and Ge Zhang. 2004. “The Impact of Dividend Tax Cut and Managerial Stock Holdings on Firm’s Dividend Policy.” University of New Orleans Department of Economics and Finance Working Paper 2004-09.
Poterba, James. 2004. “Taxation and Corporate Payout Policy.” American Economic Review, 94(2): 171–75.


Poterba, James M., and Lawrence H. Summers. 1985. “The Economic Effects of Dividend Taxation.” In Recent Advances in Corporate Finance, ed. Edward I. Altman and Marti G. Subrahmanyam, 227–84. Homewood, IL: Dow Jones-Irwin Publishing.
Rajan, Raghuram, Henri Servaes, and Luigi Zingales. 2000. “The Cost of Diversity: The Diversification Discount and Inefficient Investment.” Journal of Finance, 55(1): 35–80.
Scharfstein, David S., and Jeremy C. Stein. 2000. “The Dark Side of Internal Capital Markets: Divisional Rent-Seeking and Inefficient Investment.” Journal of Finance, 55(6): 2537–64.
Shleifer, Andrei, and Robert W. Vishny. 1986. “Large Shareholders and Corporate Control.” Journal of Political Economy, 94(3): 461–88.
Shleifer, Andrei, and Robert W. Vishny. 1997. “A Survey of Corporate Governance.” Journal of Finance, 52(2): 737–83.
Sinn, Hans-Werner. 1991. “The Vanishing Harberger Triangle.” Journal of Public Economics, 45(3): 271–300.


