Processing math: 100%
World Scientific
Skip main navigation

Cookies Notification

We use cookies on this site to enhance your user experience. By continuing to browse the site, you consent to the use of our cookies. Learn More
×

Labor Market Returns to Education and English Language Skills in the People's Republic of China: An Update

    https://doi.org/10.1162/adev_a_00124Cited by:12 (Source: Crossref)

    Abstract

    We reexamine the economic returns to education in the People's Republic of China (PRC) using data from the Chinese General Social Survey 2010. We find that the conventional ordinary least squares estimate of wage returns to schooling is 7.8%, while the instrumental variable estimate is 20.9%. The gains from schooling rise sharply with higher levels of education. The estimated returns are 12.2% in urban provinces and 10.7% in coastal provinces, higher than in rural and inland areas. In addition, the wage premium for workers with good English skills (speaking and listening) is 30%. These results are robust to controls for height, body mass index, and English language skills, and to corrections for sample selection bias. Our findings, together with a critical review of existing studies, confirm the growing significance of human capital as a determinant of labor market performance in postreform PRC.

    I.  Introduction

    The People's Republic of China (PRC) saw a four-fold increase in the level of consumption per capita and unprecedented economic growth during 1980–2010. The country's transition to a market economy saw the dissolution of social safety net programs and the end of full employment. Substantial physical capital investment during this transition led to greater demand for high-skilled labor, thereby increasing the importance of education as a determinant of labor market earnings (Heckman and Yi 2012). In prereform years, wages were administratively set, which suppressed the true returns to cognitive skills and schooling (Fleisher and Chen 1997, Chen and Feng 2000, Démurger 2001, Fleisher and Wang 2004). Returns to schooling were low in the early years after the beginning of economic reform in 1978 but increased in the 1990s (Zhao and Zhou 2002).1 Therefore, an updated analysis of how education is paying off in the labor market is important for understanding the evolution of income distribution in transition economies.

    The PRC's rapid economic growth was accompanied by a considerable increase in earnings inequality.2 Moreover, the country's postreform “open door” policy attracted foreign direct investment and multinational companies, leading to strong demand for skilled workers along the rapidly expanding industrial coast.3 Therefore, it is important to understand how skills and education are rewarded across rural and urban locations, and across coastal and inland cities.4

    Understanding the determinants of rising returns to education—a labor market phenomenon in transition economies—can also help us understand the difference between the PRC and other transition countries in terms of labor market characteristics. Unsurprisingly, following the shift from an administratively determined wage system to a market-oriented one in the early 1990s, there has been a significant increase in research on the economic profitability of human capital investment in the PRC.

    Most estimates of labor market returns correspond to the early years of reform and hence are unlikely to be a good guide given the unprecedented transformation of the PRC economy during the 1990s. Spatial differences in infrastructure growth and physical investment are also likely to have caused important variations in the way schooling impacts labor market earnings (Fleisher and Chen 1997). Therefore, we add to the existing literature by using the Chinese General Social Survey (CGSS) 2010 dataset and provide an up-to-date account of the labor market returns to education in the PRC.

    Our empirical model accounts for two important determinants of earnings: health capital, which includes height, body mass index (BMI), and self-reported health status; and English language proficiency that were both ignored by most of the recent studies on the PRC. Moreover, our empirical analysis addresses concerns over endogeneity and sample selection biases. We use information on parental death during the respondent's childhood and parental schooling as excluded instruments to estimate the instrumental variable (IV) model. Nonrandom selection into waged work is modeled using data on various measures of nonlabor income. Lastly, we report estimates for various subgroups—men versus women, rural versus urban, and coastal versus inland provinces—to document the heterogeneous nature of returns to schooling and skills in postreform PRC.

    The rest of the paper is organized as follows. Section II briefly reviews the literature. Section III and section IV describe the data and empirical framework used in our study, respectively. Section V presents our econometric results. We conclude in section VI.

    II.  Literature Review: What Do We Know about Returns to Education in the People's Republic of China?

    Existing studies on the PRC have estimated a Mincer-type earnings function using a variety of micro datasets. Our review of the published literature on returns to education for the period 1987–2016 identified a total of 68 studies (Table 1).5

    Table 1.  Summary of Existing Studies on Returns to Education in the People's Republic of China

    Data SourceSampleAuthorStudy PeriodMethodReturns Estimate (%)
    Chinese Household Income Project (CHIP)Urban and ruralJohnson and Chow (1997) 1988OLS3.3–4.0
    UrbanKnight and Song (1991) 1986OLS2.4–3.0
    Knight and Song (1995) 1988OLS2.3
    Xie and Hannum (1996) 1988OLS2.2–4.5
    Liu (1998) 1988OLS2.9–3.6
    Maurer-Fazio (1999) 1988OLS4.5 (female), 2.9 (male)
    Li (2003) 1995OLS5.4
    Bishop and Chiou (2004) 1988–1995OLS2.8 (1988)–5.6 (1995)
    Li and Luo (2004) 1995OLS; IV; GMM7.5–8.9; 15.3–15.6; 15.0
    Appleton, Song, and Xia (2005) 1988–2002OLS3.6 (1988)–7.5 (2002)
    Bishop, Luo, and Wang (2005) 1988–1995OLS1.5 (1988)–4.4 (1995)
    Hauser and Xie (2005) 1988–1995OLS2.0 (1988)–7.4 (1995)
    Knight and Song (2005) 1995–1999OLS3.2 (1995)–4.1 (1999)
    Yang (2005) 1988–1995OLS3.3–3.9 (1988) to 5.9–7.3 (1995)
    Démurger et al. (2009) 2002OLS4.2–7.4
    Zhong (2011) 2002OLS; IV6.0–6.8; 4.2–6.5
    Wang (2013) 1995–2002OLS; IV5.6 (1995, female)–8.1 (2002, female), 3.6–6.6 (male); 7.3–11.8 (female), 4.4–8.8 (male)
    Qu and Zhao (2017) 2002–2007OLS6.0–9.6 (2002) to 2.1–4.0 (2007)
    RuralKnight and Song (1993)1988OLSNot statistically different from 0
    Parish, Zhe, and Li (1995) 1993OLS3.1
    MigrantsDémurger et al. (2009) 2002OLS3.6–7.3
    Zhu (2015) 2002–2007OLS; NP3.8 (2002)–4.9 (2007); 4.8–5.5
    Qu and Zhao (2017) 2002–2007OLS2.3–2.9 (2002) to 3.3–3.8 (2007)
    China Health and Nutrition Survey (CHNS)Urban and ruralFang et al. (2012) 1997–2006OLS; IV9; 20
    UrbanChen and Hamori (2009) 2004–2006OLS; IV7.7 (female), 8.1 (male), 7.9 (married women), 8.0 (married men); 14.5 (married women), 12.6 (married men)
    Qiu and Hudson (2010) 1989–2000OLS5.1–6.9
    Kang and Peng (2012) 1989–2009OLS; IV2.2 (1989, female)–10.3 (2009, female), 2.6–7.0 (male); 5.7–8.9 (female), 5.6–11.0 (male)
    Ren and Miller (2012) 1993–2006OLS2.0 (1993, female)–5.2 (2006, female), 0.8–5.6 (male)
    Chinese Twins SurveyUrbanLi, Liu, Ma, and Zhang (2005) 2002OLS; FE; GLS8.2–8.4; 2.5–2.7; 2.5–2.7
    Li et al. (2007) 2002OLS; FE; GLS6.3–7.0; 3.2; 3.3
    Zhang, Liu, and Yung (2007) 2002OLS; IV8.8–9.8; 9.1–10.5
    Li, Liu, and Zhang (2012) 2002OLS; FE8.4; 2.7–3.8
    China Urban Labor Survey (CULS)UrbanGiles, Park, and Wang (2008) 2001OLS; IV8.3–9.6; 8.0–8.3
    Cai and Du (2011) 2001–2010OLS10.2 (2001)–11 (2010)
    Gao and Smyth (2015) 2001–2010OLS; IV; Lewbel IV6.8 (2001)–8.6 (2010); 8.2–9.1; 5.2–4.8
    Panel Data of Urban Residents in 20 citiesUrbanZhao and Zhou (2002) 1978–1993OLS1.8–3.5
    Fleisher and Wang (2005) 1975–1991OLS; IV1.4 (1975)–5.9 (1991); -0.1–15.8
    Urban Household Survey (UHS)UrbanZhang et al. (2005) 1988–2001OLS; Heckman5.2 (1988, female)–13.2 (2001, female), 2.9–8.4 (male); 5.2–12.5 (female), 2.8–7.5 (male)
    Ge and Yang (2011) 1988–2007OLS3.6 (1988)–11.4 (2007)
    Rural Urban Migration in China (RUMiC)UrbanSakellariou and Fang (2016) 2009IV6–9
    MigrantsSakellariou and Fang (2016) 2009IV7–8
    China Household Finance Survey (CHFS)UrbanMishra and Smyth (2015) 2011OLS; IV; Lewbel IV8.6–8.7; 17.5–18.9; 12.9–13.5
    OthersUrbanJamison and Van Der Gaag (1987)1985OLS5.5 (female), 4.5 (male)
    Byron and Manaloto (1990) 1986OLS; WLS3.7; 3.9
    Maurer-Fazio (1999) 1992OLS4.9 (female), 3.7 (male)
    Qian and Smyth (2008b) 2005OLS12.1 (full sample), 9.3 (female), 13.6 (male)
    Deng and Li (2010) 2008OLS5.6–6.8
    Mishra and Smyth (2013) 2009–2010OLS; IV10.8–11.9; 21.4–22.9
    Jamison and Van Der Gaag (1987)1985OLS5.5 (female), 4.5 (male)
    RuralYang (1997) 1990OLS2.3
    Wei et al. (1999) 1991OLS4.8
    De Brauw and Rozelle (2008) 2000Heckman4.3
    MigrantsMeng and Zhang (2001) 1996OLS4.8
    De Brauw and Rozelle (2008) 2000Heckman7.8
    Deng and Li (2010) 2008OLS6.8
    Frijters, Lee, and Meng (2010) 2008OLS3.0–4.0

    FE = fixed effects, IV = instrumental variable, GLS = generalized least squares, GMM = generalized method of moments, NP = nonparametric kernel regression (nonparametric estimation), OLS = ordinary least squares, WLS = weighted least squares.

    Notes: Only studies that used household survey data are presented in the table. Eleven studies that estimated only returns to higher education (college education) were excluded from the table (Gustafsson and Li 2000; Zhou 2000; Knight and Song 2003; Heckman and Li 2004; Fleisher et al. 2005; Wang et al. 2007; Li et al. 2012; Wang 2012; Carnoy et al. 2013; Meng, Shen, and Xue 2013; Messinis 2013). Eight studies that used only firm survey data were excluded from the table (Peng 1992; Gregory and Meng 1995; Fleisher, Dong, and Liu 1996; Meng and Kidd 1997; Fleisher and Wang 2001, 2004; Ho et al. 2002; Maurer-Fazio and Dinh 2004). One study (Mishra and Smyth 2015) used both household survey data (CHFS) and firm survey data (2007 Shanghai-matched worker–firm survey), but only the results based on household survey data are listed in the table.

    Source: Authors’ review of the literature.

    Of these studies, 52 included residents in urban areas, 8 included residents in rural areas, and 10 were rural–urban migrants, while only 6 covered both urban and rural areas. Most studies (59) used household survey datasets. These include the Chinese Household Income Project (CHIP, 27 studies); China Health and Nutrition Survey (CHNS, 5); Chinese Twins Survey (4); China Urban Labor Survey (CULS, 3); Panel Data of Urban Residents from 20 cities in six provinces (3); China Urban Household Income and Expenditure Survey (CUHIES, 2); and Urban Household Survey (UHS, 2). A total of 13 studies used data from other well-established household surveys, such as the Chinese Labor Market Research Project (CLMRP) and Rural Urban Migration in China (RUMiC), among others. The remaining 8 studies used data from several firm-based surveys, while only 1 study (Mishra and Smyth 2015) used data from both a household survey (China Household Finance Survey) and a firm-level survey (Shanghai matched worker-firm survey 2007). In this section, we discuss only those studies that used household survey datasets.

    A stylized fact from the literature is that returns to education in the PRC labor market in the 1980s and early 1990s were extremely low compared with the average returns in other Asian countries (9.6%), low- and middle-income countries (11.2%–11.7%), and the world (10.1%) (Psacharopoulos 1994). The rate of return in studies using data from the 1986, 1988, and 1993 CHIP surveys ranged from 1.5% to 4.5% for urban areas (Knight and Song 1991, 1995; Xie and Hannum 1996; Johnson and Chow 1997; Liu 1998; Maurer-Fazio 1999) and 0%–4% for rural areas (Knight and Song 1993; Parish, Zhe, and Li 1995; Johnson and Chow 1997). Apart from the findings using CHIP dataset, researchers who employed data from other household surveys during this period found comparatively low rates of return to schooling, around 3.7%–5.9% for urban areas, compared with 2.3%–4.8% for rural areas (Jamison and Van Der Gaag 1987, Byron and Manaloto 1990, Yang 1997, Wei et al. 1999, Maurer-Fazio 1999, Zhou 2000, Zhao and Zhou 2002, Fleisher and Wang 2005).

    Another stylized fact is that returns to education have increased since the mid-1990s, along with improvements in wages and workers’ contractual rights (Chan and Nadvi 2014). Studies that employed CHIP datasets found that the economic returns to each additional year of schooling increased to around 4.4%–8.9% in 1995 (Li 2003; Bishop and Chiou 2004; Li and Luo 2004; Bishop, Luo, and Wang 2005; Hauser and Xie 2005; Yang 2005), 4.1% in 1999 (Knight and Song 2005), 7.5%–8.1% in 2002 among urban residents (Appleton, Song, and Xia 2005; Wang 2013), and 3.6%–7.3% in 2002 among migrants (Démurger et al. 2009).

    Findings from studies using non-CHIP datasets also indicate an increased rate of return after 1995. For example, research using another widely used dataset, CHNS, found that the rate of return rose sharply to 6.9% in 2000 (Qiu and Hudson 2010), 8.1% in 2004 (Chen and Hamori 2009), and around 9% in 2006 (Fang et al. 2012) in urban areas. Again, based on the CHNS dataset, Ren and Miller (2012) found that the returns to women increased from 2% in 1993 to 7% in 2004, while the returns to men increased from 0.8% to 3.1%. Similarly, Kang and Peng (2012) documented a larger increase in returns to education for Chinese women than men using the expanded CHNS dataset from 1989 to 2009. More precisely, the rate increased from 2.2% in 1989 to 10.3% in 2009 for women, but only from 2.6% to 7% for men. Additionally, these increased returns to schooling since the mid-1990s have been recorded in a large number of studies that used non-CHIP or non-CHNS survey datasets, including studies on rural workers (De Brauw and Rozelle 2008); migrant workers (Meng and Zhang 2001; Maurer-Fazio and Dinh 2004; De Brauw and Rozelle 2008; Deng and Li 2010; Frijters, Lee, and Meng 2010; Sakellariou and Fang 2016); and urban workers using the Chinese Twins Survey dataset (Li, Liu, Ma, and Zhang 2005; Zhang, Liu, and Yung 2007; Li et al. 2007; Li et al. 2012); CULS (Giles, Park, and Wang 2008; Cai and Du 2011; Gao and Smyth 2015); and CUHIES (Meng, Shen, and Xue 2013).

    Apart from the overall returns to education, earlier studies looked into returns to specific education levels. Studies based on data from the period after higher education reform documented a sharp increase in returns to college education (Heckman and Li 2004; Fleisher et al. 2005; Giles, Park, and Wang 2008; Qian and Smyth 2008b; Zhong 2011; Li et al. 2012; Wang 2012; Carnoy et al. 2013; Meng, Shen, and Xue 2013), compared with those from before the reform period (Gustafsson and Li 2000, Knight and Song 2003, Li 2003, Bishop and Chiou 2004). Moreover, research on the postreform period argued that graduates from elite colleges earned a premium over other college graduates even after controlling for cognitive ability, academic major, college location, and students’ individual characteristics and family backgrounds (Zhong 2011, Li et al. 2012). Existing literature also found that women benefited more from a university education than men, and similarly, urban residents earned more than rural residents with the same college degree (Qian and Smyth 2008, Wang 2012).

    The pattern of returns to education in different regions has also changed since the mid-1990s. In contrast to the finding of Liu (1998), Li (2003) observed that the rate of return was higher in less developed provinces, such as Gansu, than in high-income provinces, such as Guangdong.

    There are additional stylized facts relating to methodological issues. First, recent research has employed an instrumental variable (IV) approach to solve the endogeneity bias in educational attainment.6 For the PRC, the IV estimates were higher than the corresponding ordinary least squares (OLS) estimates (Fleisher and Wang 2004; Heckman and Li 2004; Li and Luo 2004; Fleisher et al. 2005; Fleisher and Wang 2005; Zhang, Liu, and Yung 2007; Giles, Park, and Wang 2008; Chen and Hamori 2009; Zhong 2011; Fang et al. 2012; Kang and Peng 2012; Wang 2012; Mishra and Smyth 2013; Wang 2013; Gao and Smyth 2015; Mishra and Smyth 2015; Sakellariou and Fang 2016). Most of these studies used family-background variables to estimate the IV model. For instance, Heckman and Li (2004) used the 2000 CUHIES, and parental education and year of birth as instruments for an individual's education. Similarly, based on the 1995 CHIP data, Li and Luo (2004) estimated returns to schooling for young workers in urban areas using parental education and variables related to siblings as instruments. Moreover, using the 1988–2002 CHIP data, Fleisher et al. (2005) explored the private returns to schooling at the university level. They found that the IV and semiparametric estimates on the rate of return for college graduates were higher when parental schooling was the proxy for ability.7

    In summary, while findings from existing research vary in terms of data sources, methods, and study periods, they generally confirm that gains from schooling have increased significantly. The estimated returns to schooling are higher in urban areas than in rural locations, and higher for female workers than for male workers. Moreover, the IV estimates that used parental education as instruments for an individual's schooling yielded higher returns than the OLS estimates. For the prereform period, the OLS estimates of the rate of return are around 1.4%–1.9% in urban areas, compared with 0%–2.6% in rural areas. For the postreform period, the OLS estimates show an increase of 3.3%–9% for the full sample, compared with the IV estimates of up to 20%. The OLS estimates also show an increase of 0%–4.8% for the rural sample, and OLS estimates of 1.5%–12.1% for the urban sample, compared with the IV estimates of 4.2%–22.9%.

    III.  Data

    In this paper, we use data from the CGSS 2010. The main advantage of CGSS over existing datasets (such as CHNS, CHIP, CLMRP, and RUMiC) is that, in addition to being representative of rural and urban areas of the PRC, it offers information on both language skills and health of the respondents. The CGSS 2010 sampled a total of 11,783 individuals, where 38.7% were from rural areas and 51.8% were women. Table A1 provides a breakdown of the sample observations across different groups and work status: (i) agricultural waged work, (ii) nonagricultural waged work, (iii) self-employed, (iv) in the labor force but unemployed, and (v) not in the labor force. Most studies relied on the second age group, females age 16–55 years and males age 16–60 years (16 is the youngest legal working age in the PRC, while 55 and 60 are the official retirement age). In this study, we follow Schultz (2002) and restrict the analysis to women age 25–55 years and men age 25–60 years. Our main analysis is restricted to individuals in waged work, both in agricultural and nonagricultural sectors. After ignoring cases with missing data, our working sample contains 4,223 waged workers. Table A2 summarizes all variables used in the regression analysis.

    IV.  Empirical Framework

    As explained in section II, past studies on the PRC rarely controlled for cognitive skills despite the fact that market reforms of the 1990s were likely to have increased demand for such language and numeracy skills. Although schooling is expected to capture returns to cognitive skills, recent research documents a systematic economic return to cognitive skills around the world independent of schooling completed (Hanushek et al. 2015). Therefore, it is useful to know, in the context of the PRC, the pathways through which schooling is rewarded in the labor market.

    Similarly, individuals with more schooling may have higher wages because they have better health and healthier behaviors.8 At the same time, school attendance may ignore skills acquired through social channels and in the workplace. Existing studies on the PRC have not fully considered the interaction between schooling, skills, and health capital in determining labor market success. Recent studies have instead focused on the possibility that schooling is endogenous, owing to omitted health components, or that return to schooling is understated, because it does not capture the quality of human capital. Consequently, researchers have modeled schooling attainment as an endogenous determinant of earnings by employing instrumental variable techniques (Li and Luo 2004, Heckman and Li 2004, Mishra and Smyth 2013, Chen and Hamori 2009, Mishra and Smyth 2015, Gao and Smyth 2015, Sakellariou and Fang 2016). In addition, some researchers have accounted for nonrandom selection into waged work by employing Heckman's (1979) two-step procedure (Zhang et al. 2005, Chen and Hamori 2009).

    Keeping the above issues in mind, we specify a Mincerian earnings function where the log of monthly employment income (measured in renminbi) is regressed on years of schooling; work experience; work experience squared; gender; marital status; and a series of additional control variables including ethnicity; hukou type; marital status; health factors (height, self-reported health status, and BMI); proficiency in English; and location dummies.9 In addition, we account for the endogeneity of years of schooling in the earnings function.

    Existing studies on developed and developing countries such as the PRC have attempted to address the issue in an IV framework in two settings: experimental and nonexperimental. Experimental studies rely on various institutional reforms, such as changes in the minimum age of leaving school (Harmon and Walker 1995), which result in exogenous variation in educational attainment. Nonexperimental studies, on the other hand, use family background (Li and Luo 2004); parents’ education (Heckman and Li 2004, Mishra and Smyth 2013); and spouse's education (Chen and Hamori 2009, Mishra and Smyth 2013, Gao and Smyth 2015) as instruments for education in the PRC and other countries (Trostel, Walker, and Woolley 2002). In this paper, we follow the second approach.

    Therefore, in addition to OLS estimates, we present IV estimates where we instrument schooling completed using the following as excluded instruments: whether a parent died when the respondent was 14 years old, father's education, and mother's education. Following Case, Paxson, and Ableidinger (2004) and Gertler, Levine, and Ames (2004), we assume that timing of parental death is exogenous and serves as a negative shock to the respondent's schooling. On the other hand, the father's and mother's education are not correlated to their children's inherent abilities but have influence on their children's education when we use them as excluded instruments. It should be noted that studies that used parental education as an instrument to estimate returns to education in the PRC have often done so only for a subsample. This is because of how the survey is designed, where the instruments are available only for the respondents whose parents are present in the same household (Wang 2013). Our dataset doesn't suffer from this problem as all respondents are asked about parental background in a retrospective manner.

    Apart from the endogeneity problem, another common methodological concern is the sample selection problem. If individuals select into the labor force on the basis of some unobserved attributes that also affect their wages, OLS estimates would yield biased estimates of the correlation between education and wages. In this paper, we follow Heckman (1979) to correct for nonrandom selection into waged work. First, we estimate a probit function for labor force participation where a sample selection correction term, lambda, is computed. Then the earnings function is estimated with the selection correction term as an extra variable. For the purpose of identifying the lambda term, at least one variable needs to be excluded from the wage equation, which is otherwise included in the probit equation. In our model, we follow Duraisamy (2002) and Asadullah (2006) who used data on nonlabor income (i.e., income received from bequest) as an excluded variable, leaving it out of the wage equation.10

    V.  Results

    A.  Ordinary Least Squares Estimates of Returns to Education

    In this section, we estimate returns to education by adding additional controls for factors that are correlated with both wages and schooling. Moreover, we formally include a measure of English language skills alongside schooling.11 Table 2 reports OLS estimates of the Mincerian earnings function for the full sample. To understand the true returns to education, we pursue a stepwise approach, sequentially adding controls for language proficiency and three measures of health—height, self-reported health status, and BMI—in the regression function. Four patterns follow from our analysis.

    Table 2.  Ordinary Least Squares Estimates of the Determinants of Earnings with and without Controls for Language Skills and Health Endowments (full sample)

    (1)(2)(3)(4)(5)
    Personal characteristics
    Experience0.0040.0050.0060.0080.008
    (0.59)(0.75)(1.03)(1.30)(1.20)
    Experience squared−0.001−0.001**−0.001**−0.001**−0.001**
    (1.01)(2.22)(2.32)(2.27)(2.19)
    Female−0.376***−0.393***−0.246***−0.237***−0.235***
    (14.64)(15.32)(7.05)(6.83)(6.75)
    Minority−0.0020.0030.007−0.012−0.011
    (0.04)(0.06)(0.17)(0.27)(0.27)
    Nonagricultural hukou0.205***0.196***0.175***0.174***0.173***
    (5.59)(5.36)(4.79)(4.79)(4.76)
    Currently married0.0550.0740.0710.0480.048
    (1.29)(1.76)(1.69)(1.16)(1.15)
    Schooling and cognitive skills
    Years of education0.088***0.080***0.079***0.078***0.078***
    (20.98)(18.51)(18.39)(18.24)(18.16)
    Good English skills0.317***0.306***0.306***0.307***
    (7.10)(6.88)(6.92)(6.94)
    Health capital
    Height (centimeters)0.014***0.013***0.014***
    (6.14)(5.79)(5.84)
    Self-reported health status:
    Bad−0.178***−0.179***
    (3.94)(3.95)
    Good0.116***0.112***
    (3.75)(3.60)
    Body mass index (BMI):
    BMI < 18.5, underweight−0.061
    (1.24)
    25 ≤ BMI < 30, overweight0.002
    (0.07)
    BMI≥30, obese−0.147*
    (1.68)
    Geographic location
    Rural−0.420***−0.423***−0.423***−0.413***−0.413***
    (11.54)(11.68)(11.73)(11.52)(11.50)
    Eastern (coastal) region0.404***0.388***0.376***0.370***0.371***
    (12.90)(12.43)(12.10)(11.97)(12.00)
    Western region−0.052−0.057−0.039−0.022−0.021
    (1.60)(1.77)(1.21)(0.68)(0.66)
    Constant6.238***6.164***3.712***3.770***3.773***
    (60.03)(59.35)(9.01)(9.19)(9.19)
    Number of observations4,2234,2234,2234,2234,223
    Adjusted R-squared0.490.490.500.510.51

    Notes: *, **, and *** indicate significance at the 10%, 5%, and 1% levels, respectively. “Good English skills” is a dummy variable which indicates whether a respondent's English skills (including speaking and listening) are at or above the standard proficiency level (=1) or not (=0). For self-reported health status, the reference category is “in normal health condition.” For body mass index (BMI), the reference category is “normal, 18.5 ≤ BMI < 25.” For regional dummies, the reference group is “central region.”

    Sources: Chinese General Social Survey (CGSS) and authors’ calculations.

    First, education has a significant and positive impact on earnings in the PRC even after we control for English language proficiency and health capital (specification 3). The rate of return to an additional year of schooling ranges from 7.8% to 8.8% in the full sample. Our OLS estimate is similar to the estimated average rate reported in existing literature on the PRC, which ranges between 7% and 10% (Chen and Hamori 2009, Mishra and Smyth 2015). The biggest decline in estimated returns to education (from 8.8% to 8%) occurs when we control for language proficiency (specification 1 versus 2). The decline in the rate of return to education after controlling for language skills may simply be because English is part of the institutional education received in school. Therefore, when such components of education are included in the regression, they underestimate the true returns to education.

    Second, in contrast to Mishra and Smyth (2015) where language proficiency has no statistically significant relationship with wages in the PRC, our results indicate a clear correlation—individuals with good English speaking and listening abilities earn wages that are 30% higher than those who do not have these skills (column 5). This positive earnings premium from foreign language skills is consistent with existing studies focusing on both developed countries (Leslie and Lindley 2001, Dustmann and Fabbri 2003 on the United Kingdom, Bleakley and Chin 2004 on the United States) and other developing countries (Azam, Chin, and Prakash 2013 on India; Di Paolo and Tansel 2015 on Turkey). Moreover, compared with returns to other skills, the returns to a foreign language (i.e., English skills) are extremely high (Fasih, Patrinos, and Sakellariou 2013).12

    Third, consistent with the literature for both developed countries (Case and Paxson 2008, 2009; Heineck 2008; Hübler 2006) and developing countries (Schultz 2002, 2003; Dinda et al. 2006), health capital matters for earnings in the PRC. The OLS estimates suggest an additional centimeter of adult height is associated with a 1.4% higher wage in the full sample. This result is very close to some of the recent studies on returns to health capital in the PRC, including Gao and Smyth (2010) who were the first to confirm the height–wage premium in the PRC using the CULS 2005 data. They found that the wage return to height in urban areas is 1.1% and 0.9% for men and women, respectively. A later study by Elu and Price (2013) documented a similar rate of return to height (1.1%) based on urban and rural sample data from the CHNS 2006. Besides the height–wage premium, the returns to self-reported health status in our paper are also close to the results found by Zhang (2011) and Fang et al. (2012).

    Fourth, work experience is not rewarded in terms of higher wages in the full sample. Subsample estimates of the earnings function presented in Table 3 show that this is also true for rural areas of the PRC.13 However, we find a significant and inverse U-shaped relationship between experience and earnings in urban areas of the PRC. This is consistent with previous studies on urban areas of the PRC (Bishop and Chiou 2004; Appleton, Song, and Xia 2005; Gao and Smyth 2015). The return to work experience is low, only 2.7% in urban areas of the PRC using the CGSS 2010 dataset. This is in line with Appleton, Song, and Xia (2005), who document an increase in returns to education but a decrease in the returns to work experience in postreform PRC. Bishop and Chiou (2004) also report evidence of declining returns to experience in urban areas of the PRC between 1988 and 1995. One possible explanation for this declining return is that, unlike education, experience was overrewarded prior to the reform. Payments for seniority were a central feature of the prereform wage structure.14 The other possibility is that skills acquired in a socialist economy by older workers have declined in value following the PRC's labor market transition to one more market oriented.

    Table 3.  Ordinary Least Squares Estimates of the Determinants of Earnings with and without Controls for Language Skills and Health Endowments (urban versus rural)

    UrbanRural
    (1)(2)(3)(4)(5)(1)(2)(3)(4)(5)
    Personal characteristics
    Experience0.021***0.027***0.028***0.028***0.027***0.0090.0060.0040.0010.001
    (2.57)(3.28)(3.42)(3.42)(3.35)(0.83)(0.57)(0.35)(0.07)(0.01)
    Experience squared−0.001***−0.001***−0.001***−0.001***−0.001***−0.001−0.001−0.001−0.001−0.001
    (2.87)(3.46)(3.52)(3.36)(3.33)(0.94)(1.17)(1.27)(1.45)(1.38)
    Female−0.369***−0.377***−0.278***−0.278***−0.278***−0.503***−0.511***−0.336***−0.321***−0.319***
    (12.16)(12.49)(6.29)(6.30)(6.28)(11.71)(11.85)(6.19)(5.94)
    Minority−0.102−0.091−0.093−0.101−0.0990.0410.0410.0530.0290.030
    (1.63)(1.47)(1.50)(1.64)(1.61)(0.68)(0.69)(0.90)(0.51)(0.51)
    Nonagricultural hukou0.0450.0450.0340.0310.0310.634***0.611***0.595***0.586***0.584***
    (1.25)(1.28)(0.95)(0.88)(0.88)(6.12)(5.86)(5.75)(5.72)(5.69)
    Currently married−0.032−0.017−0.013−0.024−0.0250.265***0.266***0.241***0.197**0.197**
    (0.69)(0.37)(0.29)(0.53)(0.54)(3.32)(3.34)(3.04)(2.50)(2.50)
    Schooling and cognitive skills
    Years of education0.132***0.124***0.123***0.122***0.122***0.026***0.024***0.024***0.023***0.022***
    (26.64)(23.89)(23.68)(23.59)(23.51)(3.75)(3.42)(3.35)(3.29)(3.22)
    Good English skills0.215***0.210***0.209***0.208***0.298**0.289**0.282**0.285**
    (5.08)(4.96)(4.96)(4.92)(1.91)(1.86)(1.83)(1.86)
    Health capital
    Height (centimeters)0.009***0.009***0.009***0.018***0.016***0.016***
    (3.08)(3.01)(3.08)(5.19)(4.61)(4.64)
    Self-reported health status:
    Bad−0.192***−0.194***−0.109*−0.108*
    (2.91)(2.95)(1.71)(1.70)
    Good0.0420.0350.214***0.211***
    (1.13)(0.96)(4.24)(4.16)
    Body mass index (BMI):
    BMI < 18.5, underweight−0.052−0.046
    (0.81)(0.64)
    25 ≤ BMI < 30, overweight−0.0120.044
    (0.33)(0.79)
    BMI ≥ 30, obese−0.240**−0.023
    (2.38)(0.16)
    Geographic location
    Eastern (coastal) region0.475***0.465***0.458***0.452***0.453***0.244***0.244***0.233***0.216***0.216***
    (13.00)(12.74)(12.55)(12.40)(12.45)(4.41)(4.41)(4.25)(3.98)(3.97)
    Western region0.0840.085*0.098**0.102**0.102**−0.198***−0.200***−0.178***−0.152***−0.149***
    (1.87)(1.89)(2.19)(2.28)(2.27)(4.35)(4.40)(3.94)(3.38)(3.29)
    Constant5.449***5.418***3.878***3.906***3.898***6.395***6.366***3.192***3.361***3.354***
    (45.35)(45.28)(7.54)(7.60)(7.58)(35.68)(35.42)(5.01)(5.32)(5.30)
    Number of observations2,2882,2882,2882,2882,2881,9351,9351,9351,9351,935
    Adjusted R-squared0.430.440.440.440.450.190.190.210.220.23

    Notes: *, **, and *** indicate significance at the 10%, 5%, and 1% levels, respectively. “Good English skills” is a dummy variable which indicates whether a respondent's English skills (including speaking and listening) are at or above the standard proficiency level (=1) or not (=0). For self-reported health status, the reference category is “in normal health condition.” For body mass index (BMI), the reference category is “normal, 18.5 ≤ BMI < 25.” For regional dummies, the reference group is “central region.”

    Sources: Chinese General Social Survey (CGSS) and authors’ calculations.

    B.  Ordinary Least Squares Estimates versus Instrumental Variable and Heckman Two-Step Estimates

    We check the reliability of OLS estimates on the causal relationship between education capital and wages by comparing them with estimates using the IV and Heckman two-step models. Table 4 presents the returns to schooling based on OLS, IV, and Heckman sample selection correction estimation models for the full sample. Subsample specific results (female versus male, urban versus rural, and coastal versus inland regions) are also presented in the bottom panels of Table 4. All regressions control for personal characteristics, location dummies, and height, which is a predetermined health endowment (height). IV estimates are based on early parental death and parental education as excluded instruments. This serves as a way to address potential endogeneity bias in the estimated returns to education. On the other hand, excluding nonlabor income from bequest in the Heckman model identifies the selectivity term (lambda). Comparing OLS and selectivity-corrected Heckman estimates can help us understand the extent of sample selection bias in the OLS estimates.

    Table 4.  Ordinary Least Squares, Instrumental Variable, and Heckman Estimates of the Returns to Education

    OLSIVHeckman Two-Step
    Full sample (N = 4,223)0.078*** (18.16)0.209*** (10.42)0.082*** (16.19)
    F-test on excluded IVs171.19
    Sargan overid test (p-value)0.56
    Lambda−0.045 (0.25)
    Female sample (N = 1,797)0.090*** (13.80)0.237*** (8.39)0.097*** (5.72)
    F-test on excluded IVs99.05
    Sargan overid test (p-value)0.48
    Lambda−1.974 (2.20)
    Male sample (N = 2,426)0.071*** (12.00)0.179*** (6.47)0.074*** (12.00)
    F-test on excluded IVs78.76
    Sargan overid test (p-value)0.68
    Lambda0.085 (0.36)
    Urban sample (N = 2,288)0.122*** (23.51)0.219*** (11.77)0.134*** (18.08)
    F-test on excluded IVs161.69
    Sargan overid test (p-value)0.77
    Lambda0.165 (0.90)
    Rural sample (N = 1,935)0.022*** (3.22)0.088*** (1.52)0.021*** (2.72)
    F-test on excluded IVs41.39
    Sargan overid test (p-value)0.72
    Lambda0.054 (0.14)
    Eastern (coastal) region (N = 1,586)0.107*** (15.41)0.248*** (10.55)0.123*** (13.84)
    F-test on excluded IVs107.54
    Sargan overid test (p-value)0.52
    Lambda0.596 (2.51)
    Central region (N = 1,435)0.056*** (7.69)0.249*** (3.05)0.063*** (6.39)
    F-test on excluded IVs22.87
    Sargan overid test (p-value)0.24
    Lambda−0.458 (1.48)
    Western region (N = 1,202)0.054*** (6.77)0.101*** (2.89)0.057*** (6.73)
    F-test on excluded IVs44.48
    Sargan overid test (p-value)0.23
    Lambda−0.075 (0.20)

    IV = instrumental variable, OLS = ordinary least squares.

    Notes: *, **, and *** indicate significance at the 10%, 5%, and 1% levels, respectively. Early parental death along with father's and mother's education are used as excluded instruments in the IV model. Nonlabor income received from bequest is used as an excluded identifying variable in the Heckman model. For regional dummies, the reference group is “central region.” All regressions were controlled for covariates included in model 5 of Table 2.

    Sources: Chinese General Social Survey (CGSS) and authors’ calculations.

    In the OLS model, the estimated return is 7.8%. Furthermore, the result of the endogeneity test in column 2 rejects the null hypothesis that the OLS estimates are consistent. Using father's and mother's education and whether a parent died when the respondent was 14 years old as instruments, the IV rate of return yields 20.9%, which is 13.1 percentage points higher than the OLS return. Moreover, consistent with the international literature (Mendolicchio and Rhein 2014), we find that returns to education for female workers (OLS: 9%; IV: 23.7%) are higher than for male workers (OLS: 7.1%; IV: 17.9%) in both methods. The gender difference in returns to schooling increases by approximately 3% after correcting for endogeneity bias.

    Table 4 also reports returns to schooling for urban versus rural residents, and coastal versus inland provinces. Returns to schooling are higher for urban workers (OLS: 12.2%) than their rural counterparts (OLS: 2.2%), which is consistent with earlier studies that report a clear gap in returns to education between urban and rural areas (Zhang 2011). Once again, the OLS estimates are smaller than the IV estimates in all of these subsamples. In addition, the true rate of return is underestimated by 9.7 percentage points for urban workers and by 14.1 percentage points for workers in the coastal region, compared with only 6.6 percentage points for rural workers and 4.7 percentage points for workers in the western area.

    One explanation for the relatively larger size of the IV estimates is that the instruments are weak or nearly invalid, or both (Murray 2006, Wooldridge 2002). The first stage regression results of the IV model along with the diagnostic test results are presented in Table A3. The F-test statistic corresponding to the estimated coefficients of early parental death and parental education are both significant and large (19 and 151, respectively), implying that the instruments are strong and significant determinants of years of schooling completed. Results also show that if a parent died when the child was 14 years old, then his years of schooling are reduced dramatically.

    Turning to Heckman estimates, we do not find significant evidence of sample selection bias in our analysis. The identifying variable in the probit model has the expected sign (see Table A3). Higher unearned income from bequest is found to significantly decrease labor market participation. Nonetheless, the lambda term is not significant.

    Overall, results from Table 4 confirm that for CGSS data, we can rely on OLS estimates to examine the causal relationship between schooling and earnings. OLS, if anything, only leads to more conservative estimates of the true returns to years of education completed in the PRC.15 Therefore, the next section exclusively discusses estimates obtained from the OLS regression of wages to understand how returns to education and language skills vary in the PRC.

    C.  Heterogeneous Returns to Education and Language Skills

    Next, we explore two particular channels through which returns to skills and schooling may have changed in postreform years. First, we reestimate returns to education and language skills for all subsamples. Second, we reestimate the returns to different levels of education vis-à-vis language skills for the full sample and all subsamples. Because the OLS method is shown to consistently produce a conservative estimate in the previous section, we use this to understand the heterogeneous nature of the returns in our data.16

    Table 5 repeats the analysis presented in Table 2 for various subsamples, but only results specific to education and language skills are reported. The subsamples are female, male, urban, rural, eastern region, central region, and western region. First, we find that returns to education for female workers (9%) are still higher than for male workers (7.1%), even after controlling for personal characteristics; health indicators (height, self-reported health status, and BMI); and geographic locations, which is consistent with findings from previous studies (Kang and Peng 2012, Mishra and Smyth 2013, Wang 2013). The returns to women with good English skills (36%) are also higher than the returns to men (23%, see column 5). Second, in addition to this gender gap in returns to schooling, we observe a clear rural–urban gap in the returns. Our finding is consistent with Meng, Shen, and Xue (2013) who find that the rates of return to each additional year of schooling increased from 8% to 9.3% during 1988–2009. This increase is even larger in urban areas (about 3 percentage points higher), which is similar to the finding of Gao and Smyth (2015) for the period 2001–2010.

    Table 5.  Ordinary Least Squares Estimates of the Returns to Education versus Language Skills, by Gender and Location

    (1)(2)(3)(4)(5)
    Female sampleYears of education0.103***0.093***0.091***0.090***0.090***
    (N = 1,797)(16.13)(14.17)(13.98)(13.90)(13.80)
    Good English skills0.379***0.372***0.369***0.362***
    (5.88)(5.79)(5.80)(5.67)
    Adjusted R-squared0.510.520.520.530.53
    Male sampleYears of education0.079***0.072***0.071***0.071***0.071***
    (N = 2,426)(13.67)(12.21)(12.04)(12.08)(12.00)
    Good English skills0.243***0.232***0.232***0.232***
    (3.95)(3.78)(3.80)(3.80)
    Adjusted R-squared0.450.460.460.470.47
    Urban sampleYears of education0.132***0.124***0.123***0.122***0.122***
    (N = 2,288)(26.64)(23.89)(23.68)(23.59)(23.51)
    Good English skills0.215***0.210***0.209***0.208***
    (5.08)(4.96)(4.96)(4.92)
    Adjusted R-squared0.430.440.440.440.45
    Rural sampleYears of education0.026***0.024***0.024***0.023***0.022***
    (N = 1,935)(3.75)(3.42)(3.35)(3.29)(3.22)
    Good English skills0.298**0.289**0.282**0.285**
    (1.91)(1.86)(1.83)(1.86)
    Adjusted R-squared0.190.190.210.220.23
    Eastern (coastal)Years of education0.122***0.109***0.108***0.108***0.107***
    region(18.32)(15.50)(15.33)(15.39)(15.41)
    (N = 1,586)Good English skills0.319***0.312***0.309***0.304***
    (5.70)(5.56)(5.55)(5.46)
    Adjusted R-squared0.440.450.450.450.46
    Central regionYears of education0.063***0.059***0.058***0.056***0.056***
    (N = 1,435)(8.76)(8.04)(7.95)(7.72)(7.69)
    Good English skills0.209**0.214**0.194**0.196**
    (2.29)(2.34)(2.16)(2.19)
    Adjusted R-squared0.340.350.350.370.37
    Western regionYears of education0.060***0.057***0.056***0.056***0.054***
    (N = 1,202)(7.44)(7.00)(6.99)(6.93)(6.77)
    Good English skills0.221**0.214*0.212*0.230**
    (1.88)(1.83)(1.82)(1.98)
    Adjusted R-squared0.420.430.440.440.45

    Notes: *, **, and *** indicate significance at the 10%, 5%, and 1% levels, respectively. “Good English skills” is a dummy variable which indicates whether a respondent's English skills (including speaking and listening) are at or above the standard proficiency level (=1) or not (=0). Full specifications for models 1–5 are shown in Table 2.

    Sources: Chinese General Social Survey (CGSS) and authors’ calculations.

    Turning to region-specific estimates, our analysis shows clear regional differences in the returns to education. The bottom three panels of Table 5 report estimates by region. We find that the eastern region of the PRC (i.e., coastal provinces) has a comparatively higher rate of return to schooling (10.7%) than the central (5.6%) and western regions (5.4%).

    One explanation for this regional difference in returns to education might be the observed widening gap in the production of cognitive skills, assessed in terms of differences in per student recurrent expenditure, teacher quality, and physical conditions of schools between coastal and inland areas (Qian and Smyth 2008a; Cheng 2009; Bickenbach and Liu 2013; Yang, Huang, and Liu 2014; Whalley and Xing 2014). Zhong (2011) examined the relationship between college quality and returns to higher education in the PRC and confirmed that the returns vary significantly depending on school quality. Moreover, he found that the maximum earnings gap between recipients of high- and low-quality higher education is 28%, and the gap for annual returns reached 1.4% after controlling for ability. Thus, better education quality at both basic education level (Cheng 2009) and higher education level (Bickenbach and Liu 2013) has resulted in higher returns to education in coastal areas of the PRC.

    Table 6 shows the returns to different levels of education for the full sample and seven subsamples. We find that the returns to schooling increase with higher levels of education, which are consistent with results found in studies of developing countries (Kuepié and Nordman 2016). We calculate the average rate of return ri specific to each level using the estimated OLS coefficients in the following way:

    ri=(βi-βi-1)/(Yi-Yi-1)
    where i is the level of education, Yi is the year of schooling at education level i, and βi is the estimate of the coefficient on the corresponding education level dummy in the wage regression. Thus, the rate of return to higher education, a bachelor's degree and above, is 31.9%, which is higher than the returns found in some studies that focused on the prehigher education expansion period. For example, based on 1981–1987 data from the Chinese Academy of Social Sciences, Meng and Kidd (1997) found that the rate of return to a bachelor's degree or higher relative to primary education is 29.1% in 1981 and 31.3% in 1987.17 Moreover, we also find that female workers benefit more from having higher education than men. Similarly, urban residents are rewarded more than rural residents with the same level of college education, which is consistent with findings from Qian and Smyth (2008b) and Wang (2012).

    Table 6.  Ordinary Least Squares Estimates of the Returns to Schooling by Levels of Education (full sample and subsamples)

    Full SampleFemaleMaleUrbanRuralEastern (Coastal) RegionCentral RegionWestern Region
    Level of education
    Junior secondary0.149***0.149***0.126***0.229***0.088*0.183***0.097*0.137**
    (4.29)(2.83)(2.72)(3.97)(1.88)(2.67)(1.84)(2.14)
    Senior secondary0.453***0.582***0.366***0.602***0.251***0.529***0.306***0.474***
    (10.53)(8.40)(6.58)(10.04)(3.51)(7.09)(4.42)(5.42)
    Semibachelor0.872***1.114***0.693***1.018***0.665***0.916***0.704***0.915***
    (15.92)(13.36)(9.44)(15.58)(3.23)(10.68)(7.23)(7.57)
    Bachelor and above1.191***1.501***0.981***1.335***0.733***1.296***0.904***1.018***
    (20.06)(16.23)(12.58)(19.25)(2.72)(14.51)(7.46)(7.77)
    Good English skills0.150***0.147**0.115*0.154***0.1580.205***0.0550.103
    (3.28)(2.24)(1.80)(3.55)(1.00)(3.61)(0.58)(0.87)
    Number of observations4,2231,7972,4262,2881,9351,5861,4351,202
    Adjusted R-squared0.530.560.480.450.230.480.380.46

    Notes: *, **, and *** indicate significance at the 10%, 5%, and 1% levels, respectively. “Good English skills” is a dummy variable which indicates whether a respondent's English skills (including speaking and listening) are at or above the standard proficiency level (=1) or not (=0). Full specification is in model 5 of Table 2. For “level of education,” the reference category is “at or below primary level,” following Meng and Kidd (1997).

    Sources: Chinese General Social Survey (CGSS) and authors’ calculations.

    Given such convexities in the earnings function, income inequality is unlikely to be reduced through school education unless equality in access to higher education is ensured.18 This is also confirmed by the fact that educational endowments (schooling as well as skills) are distributed unequally in the PRC. The average number of years of schooling in Shanghai is 13.8, which is clearly higher than in the full sample (9.7), the eastern region including Shanghai (11.6), the eastern region excluding Shanghai (11.4), the central region (8.9), and the western region (8.1). Moreover, the percentage of respondents that have good English skills in Shanghai is also higher (43.1%) than in the full sample (11.2%), the eastern region including Shanghai (20.1%), the eastern region excluding Shanghai (17.8%), the central region (6.3%), and the western region (5.4%).

    VI.  Conclusion

    In this paper, we have reexamined the economic returns to education in the PRC using a recent dataset that is representative of all provinces. When the endogeneity problem is not addressed, OLS estimates underestimate the true returns to schooling in the PRC. The IV estimates yield a much higher return to schooling—20.9% compared with the OLS estimate of 7.8%. In addition to commonly used instruments such as father's and mother's education, we used parental death when the respondent was 14 years old, which proved to be a strong excluded instrument in the first stage regression.

    In general, our estimates are much higher than what has been reported in earlier studies on the PRC, particularly those that used prereform labor market datasets. This confirms that returns to education have steadily increased following the process of transition toward a market economy. Our evidence also confirms that individuals in coastal and urban locations (particularly nonstate sector employees) and young workers with market-relevant language skills were rewarded with higher returns to their education than their counterparts in rural and inland locations. The findings support the conclusions of recent studies that it took about 2 decades for the PRC to raise their workers’ respective returns to education to the 10% level (Hung 2008; Meng, Shen, and Xue 2013).

    The transition of the Chinese labor market from a centrally planned to a market-oriented system has contributed to a significant increase in earnings inequality by increasing the rewards for education and work experience. The estimated return is much larger for higher education compared with secondary education. Market reforms may have also increased the price of unobserved skills (Meng, Shen, and Xue 2013). This may explain why we find a systematic labor market advantage enjoyed by those with English language skills and why the return is highest in coastal provinces where private sector jobs have the highest concentration. This finding is consistent with the evidence that schooling contributes to labor market performance in educationally advanced countries by enhancing labor market relevant functional literacy skills. Given our evidence on the convexities in returns to education and the significance of human capital as a determinant of labor market performance in postreform PRC, policies that improve access to cognitive skills are likely to reduce income inequality and boost economic growth in the coming decades.

    Notes

    1 This pattern of rising returns to education is similar to the experience of other economies in Central and Eastern Europe that went through the transition from a planned economy to a market economy (Hung 2008).

    2 According to one account, the average real earnings of Chinese urban male workers increased by 350% during 1988–2009, increasing the variance in log earnings by 94% (Meng, Shen, and Xue 2013).

    3 For the interplay between human capital and foreign direct investment in the PRC, see Liu, Xu, and Liu (2004); Su and Liu (2016); and Salike (2016).

    4 According to Hung (2008), the returns to education in Central and Eastern Europe were about 2%–4% in the pretransition period, while those in the PRC were even lower at less than 2%.

    5 For existing meta-analyses of studies on returns to education in the PRC, see Liu and Zhang (2013) and Awaworyi and Mishra (2014). Moreover, for a review of developing country estimates, see Psacharopoulos and Patrinos (2004).

    6 For relevant international studies, see Arabsheibani and Lau (1999); Trostel, Walker, and Woolley (2002).

    7 Recently, some researchers have used the Lewbel (2012) IV method rather than the traditional IV approach to study the returns to schooling in the PRC, especially in urban areas (Gao and Smyth 2015, Mishra and Smyth 2015). Findings from either the conventional IV approach or the Lewbel IV method suggest that measurement errors exert a downward bias on OLS estimates.

    8 The positive relationship between schooling and health is well established in the literature (see, for example, Grossman 2008; Silles 2009; Conti, Heckman, and Urzua 2010; and Heckman et al. 2014).

    9 Since CGSS does not have data on work experience or tenure, we use information on age and school completion to define postschool experience. We assume the legal age for starting work is 16 years old. For those who completed secondary schooling, we calculate experience as current age minus years of schooling minus 6, but for those who didn't complete secondary schooling, experience is current age minus 16. This definition is consistent with existing studies on the PRC (Qian and Smyth 2008b, Gao and Smyth 2015, Mishra and Smyth 2015).

    10 We also considered income from land leasing and sale of property as additional identifying variables, but these were not significant in the first stage.

    11 English language skills are measured as a binary indicator and refers to proficiency at or above the standard level.

    12 This is also true for the PRC. For example, Giles et al. (2003), using data from the China Adult Literacy Survey (CALS), find that the estimated return to adult literacy (capturing knowledge of the vernacular) for residents in urban areas of the PRC is 9.3%–11.4%.

    13 For rural areas of the PRC, Li, De Brauw, Rozelle, and Zhang (2005) also find experience to be insignificant, based on Heckman estimates of the earnings function.

    14 Moreover, Appleton et al. (2002) document an inverse U-shaped relationship between general work experience and the probability of retrenchment in the PRC in 1999. If experience was overrewarded in the prereform period, then experienced workers would be at greater risk of retrenchment and their wage premiums would subsequently decline. Other studies employing a similar measure of “postschool experience” in the context of urban areas of the PRC are Qian and Smyth (2008b) and Mishra and Smyth (2015). While Qian and Smyth (2008b), using 2005 survey data from the PRC's Institute of Labor Studies (ILS), do not find any significant relationship between experience and wages, Mishra and Smyth (2015) confirm a convex relationship between experience and earnings.

    15 Another reason to treat OLS estimates as conservative is because the larger value of the IV estimates may be capturing treatment effects only for the subgroup of observations that comply with the instrument, i.e., the causal effect is identified for the observations affected by the instrument (“compliers”) so that the estimates are of a “local average treatment effect” (LATE), averaged across these compliers (Imbens and Rubin 1997, Wooldridge 2002, Murray 2006). In our case, the IV estimation arguably captures the returns to education only for those individuals whose schooling are very sensitive to their parents’ support. If so, the effect size cannot be generalized to the whole population.

    16 This approach to using OLS to understand heterogeneous returns assumes that across subsamples studied, the direction and extent of downward bias in OLS estimates remain the same.

    17 Studies based on data from the posthigher education reform period documented a sharp increase in returns to college education (Heckman and Li 2004, Fleisher et al. 2005, Qian and Smyth 2008b).

    18 For evidence on the role of higher education in explaining income inequality in the PRC, see Yang and Qiu (2016).

    Appendix Table A1.  Distribution of Sample Individuals by Work Status

    NWaged Work (Agricultural)Waged Work (Nonagricultural)Self-EmployedIn Labor Force but UnemployedNot in Labor Force
    Without age limitationFull sample11,72424.9%29.0%9.8%6.7%29.6%
    Female6,07925.1%23.2%7.5%6.3%37.9%
    Male5,64524.8%35.2%12.3%7.1%20.6%
    Urban7,1734.4%39.0%12.4%7.3%36.9%
    Rural4,55157.2%13.2%5.7%5.7%18.2%
    Female: 16–55 years old Male: 16–60 years oldFull sample8,64424.5%38.1%12.5%7.2%17.7%
    Female4,27925.5%31.8%10.1%6.9%25.7%
    Male4,36523.6%44.2%15.0%7.5%9.7%
    Urban5,3634.0%50.7%15.7%8.1%21.5%
    Rural3,28158.1%17.5%7.4%5.8%11.2%
    Female: 25–55 years old Male: 25–60 years oldFull sample7,74726.3%38.2%13.3%7.2%15.0%
    Female3,80927.3%31.6%10.8%6.9%23.4%
    Male2,93825.4%44.6%15.6%7.4%7.0%
    Urban4,7454.5%51.7%16.9%8.1%18.8%
    Rural3,00260.9%16.8%7.6%5.6%9.1%

    Source: Chinese General Social Survey (CGSS).

    Appendix Table A2.  Descriptive Statistics for Waged Workers

    MeanSD
    Monthly employment income (renminbi)1,631.372,283.98
    Personal characteristics
    Years of experience27.8610.06
    Female*0.430.49
    Minority*0.090.29
    Nonagricultural hukou*0.400.49
    Currently married*0.890.31
    Schooling and cognitive skills
    Years of education (years of schooling)9.704.45
    Level of education:
    Bachelor and above*0.110.31
    Semibachelor*0.110.31
    Senior secondary*0.190.49
    Junior secondary*0.300.46
    Primary and below (base group)*0.290.45
    Good English skills*0.110.32
    Health capital
    Height (centimeters)165.387.49
    Self-reported health status:
    Bad*0.120.32
    Normal (base group)*0.210.40
    Good*0.670.47
    Body mass index (BMI):
    BMI < 18.5, underweight*0.070.25
    18.5 ≤ BMI < 25, normal (base group)*0.720.45
    25 ≤ BMI < 30, overweight*0.190.39
    BMI ≥ 30, obese*0.020.14
    Excluded instruments (IV model)
    EducationFather (in years)5.274.61
    EducationMother (in years)3.384.30
    Parent died when respondent was 14 years old*0.030.18
    Labor force participation identifying variable (Heckman model)
    Nonlabor income received from bequest (renminbi)30.15707.22
    Geographic location
    Rural*0.460.50
    Eastern region*0.380.48
    Central region*0.340.47
    Western region*0.280.45

    IV = instrumental variable, SD = standard deviation.

    Note: *indicates dummy variables equal to 1 if true, and otherwise equal to 0.

    Sources: Chinese General Social Survey (CGSS) and authors’ calculations.

    Appendix Table A3.  First Stage Regression of Instrumental Variable and Heckman Models (full sample estimates only)

    IV First Stage (individual's schooling)Heckman First Stage (labor force participation)
    Personal characteristics
    Age0.0350.044***
    (0.72)(12.90)
    Age squared−0.001−0.001***
    (1.50)(14.01)
    Female−0.801***−0.181***
    (5.84)(15.89)
    Minority−0.109−0.001
    (0.66)(0.01)
    Nonagricultural hukou2.241***−0.016*
    (16.34)(1.81)
    Currently married0.315*−0.005
    (1.93)(0.48)
    Schooling and cognitive skills
    Years of education0.005***
    (5.07)
    Good English skills2.454***0.061***
    (14.62)(5.09)
    Health capital
    Height (centimeters)0.024***−0.001
    (2.64)(0.30)
    Self-reported health status:
    Bad−0.827***−0.039***
    (4.67)(3.33)
    Good0.1080.029***
    (0.89)(3.38)
    Body mass index (BMI):
    BMI < 18.5, underweight−0.2030.001
    (1.05)(0.07)
    25 ≤ BMI < 30, overweight0.130−0.023**
    (1.06)(2.49)
    BMI ≥ 30, obese−0.379−0.021
    (1.12)(0.79)
    Family background (instruments)
    Parent died when respondent was 14 years old (yes = 1)−0.857***
    (3.00)
    EducationFather(years)0.138***
    (9.60)
    EducationMother (years)0.122***
    (7.71)
    Labor force participation identifying variable
    Nonlabor income received from bequest (renminbi)−0.012***
    (3.93)
    Geographic location
    Rural−1.682***0.106***
    (12.19)(11.98)
    Eastern region0.535***0.005
    (4.43)(0.54)
    Western region−0.497***0.026***
    (3.98)(2.85)
    Constant4.394**
    (2.38)
    Adjusted R-squared/Pseudo R-squared0.540.17
    Number of observations4,2236,618
    F-test of significance: parental death only19.03***
    F-test of significance: parental education variables only151.61***

    IV = instrumental variable.

    Notes: *, **, and *** indicate significance at the 10%, 5%, and 1% levels, respectively. Early parental death along with father's and mother's education are used as excluded instruments in the IV model. Nonlabor income received from bequest is used as an excluded identifying variable in the Heckman model.

    Sources: Chinese General Social Survey (CGSS) and authors’ calculations.