INTRODUCTION
In terms of education level, Cuban immigrants positively self-select in their migration decision, that is, the people with the highest levels of education are those who tend to migrate. In general, migrants have more years of study than the mean of distribution of years of study of workers in Cuba, according to Cobas and Fernández (2014), and therefore they are workers with extensive skills in their source country. In addition, when considering legal migration, there is a tendency for the U.S. government to grant visas to both younger and better qualified individuals.
Since it is the most highly skilled people who migrate, it is undoubtedly of interest to analyze the whole distribution of earnings and quantify the effect of socioeco nomics variables in different location of the distribution of earnings.
Following to Motel and Patten (2012), the United States has been the main destination for migrants from Cuba, and other Latin American countries. The U.S. Census for 2012 reveals that 52,4 million people (16,95 % of the entire population) in the United States are of Hispanic origin (U.S. Census Bureau, 2012); 1,12 million of them, i.e. 3 % of the foreigners living in the U.S. were born in Cuba.
It is well known that an important proportion of the labor force in the U.S. is made up of immigrants. According to the U.S. Bureau of Labor Statistics (2013) in 2012 0,6 % of U.S. workers comprised people of Latin ethnicity, Cubans made up 5 %, making them the third biggest group of Hispanic immigrants in the U.S.
In line with Friedberg (1992), upon arrival in the U.S., immigrants can be expected to be at an earnings disadvantage with respect to natives because they lack certain skills and information that natives have. Over time, they may increase their income when they improve their English and adapt to the specific skill sets of the country.
In step with Aupetit and Gérard (2009), the migration of higher educated individu als has a negative impact on countries of birth since most educational investment on migrants is lost. Following to Cuecuecha (2005), in terms of benefits to the host country, migration increases its production capacity and technological capabilities, and from an economic point of view entails no significant costs in terms of social services, which would be required if the majority of those who migrated had less education.
This paper describes the earnings of Cuban immigrants in the U.S. using Ordinary Least Square (OLS) and Quantile Regression (QR). Quantile Regression is a method for estimating the relationship between a response variable and a set of explanatory variables for the whole conditional probability distribution of the response variable.
The explanatory variables considered are those that areconsidered as most important in the relevant literature: years of education, age on arrival in the U.S., potential job experience, sex, marital status, ethnicity, citizenship status and proficiency in English. The applications of QR range from the field of education; like the studies of Rangvid (2007), to biostatistics; like Terry, Wei and Esserman (2007) and economics like Chamarbagwala (2010).
Some papers, using cross-section models, have tended to confuse the true as similation of immigrants in the U.S. labor market. Chiswick (1978) found that the wage of Cubans immigrants in the U.S. who arrived in 1965-1969 increased by about 37 % within the first 10 years after immigration. This could lead to the conclusion that the earnings of immigrants grow rapidly and this rapid growth enables immigrants to overtake the earnings of native workers within 10-15 years of immigration. Borjas (1985) compared different cohorts, at the same points of their U.S. life cycle, using cross-section models within immigrant cohorts, and found that for the same period, 1965-1969, the earnings of Cuban immigrants decreased by about 25 %.
Some recent studies about earnings distribution of Latin immigrants in the U.S. reveal considerable disadvantages with respect to native-born people. Chiswick and Miller (2008) find that immigrants from non-English speaking countries have mean hourly earnings of around 12 % less than those of native born workers in the U.S.; Elliott and Lindley (2008) concludes that differences in human capital endowment and socioeconomic characteristics explain some of the lack of earnings assimila tion; Hunt (2012) concludes that non-English-speaking immigrants earn less than their native counterparts. Borjas (2015) finds that Cuban immigrants who arrived in the United States between 1995 and 1999 experienced a wage decrease of 4,4 % in their first 10 years in the U.S.
The contribution of this article is twofold. Firstly, it uses QR method to estimate the effects of different socioeconomic characteristics on the conditional probability distribution of the earnings of Cuban immigrants in the U.S. to analyze the distri bution of earnings of Cuban immigrants in the U.S. and quantify the effect of this socioeconomic variables. Secondly, it illustrates the differences in the impact of these characteristics between higher and lower income workers. Taking into account that a QR model proposes different regression lines for the different quantiles of the earnings distribution, the contribution of the socioeconomics characteristics on different levels of earnings can be compared.
The paper is structured as follows: Section 1 reviews the presence of Cubans in the U.S., Section 2 describes the methodology used, Section 3 presents the results of our estimations and Section 4 sets out the main conclusions.
1. CUBANS IN THE U.S.
Although most of the population of Cuban origin in the U.S. have arrived within the past 50 years, the Cuban presence in the country has a long history. According to Pérez (1986), between 1871 and 1958 some 221.000 Cubans migrated to the U.S., mainly to Florida, the most proximate state to the island. Following to Pérez (2001) this figure is dwarfed by the approximately 1,25 million Cubans who have migrated to United States since 1959. This exodus has also been mainly to Florida, particu larly Miami, where approximately three out of every five Cubans in the U.S. reside. Unlike most ethnic groups in other cities, overrepresented in only a few sectors of the economy, the presence of Cubans has been widespread in the Miami economy, Portes and Shafer (2007) state.
There have been five distinct periods of Cuban migration: The first was from 1959-1962, when around 200.000 Cubans migrated generally from the upper classes; the second period lasted from 1965-1973, when the U.S. conducted "freedom flights" and 260.500 applicants were permitted to leave the country in airlifts sponsored by the U.S.
According to Pérez (2001), the third wave came in 1980-1981, known as the Mariel boatlift period when some 125.000 Cubans migrated, including those from lower socioeconomic classes. Then came the so-called Balseros Crisis of 1994, when Cuba announced it would not restrain Cubans from leaving the country by raft or sea vessels and, consequently, over 40.000 balseros successfully migrated to the U.S.
The final period, from 1995 to the present, has been characterized by a steady outflow of some 20-30.000 Cubans migrating legally every year to the U.S. under accords reached following the Balseros Crisis. In addition, there has been a smaller, but unquantifiable number of Cubans migrating illegally1. Krogstad (2016) assures that the number of Cubans who have entered the U.S. has spiked dramatically since President Obama announced the renewal of relations with Cuba in 2014 with a 78 % increase in the number of illegal Cuban migrants entering the U.S., reportedly the largest wave of migration since the 1990s.
Thus around 1,5 million Cubans have immigrated to the United States since the 19th Century, settling predominately in Miami. They dominate the political scene in South Florida and have more recently become a leading demographic in political elections in the twenty-first century. Their success in the United States, according to Krogstad (2016), is partly due to the favorable treatment that Cubans receive as a consequence of The Cuban Adjustment Act, U.S. legislation passed during the Cold War to assist Cubans fleeing from communism and partly due to factors inherent in their distinct demographic characteristics.
Following the study of Pérez (2001), until the end of the 1990s, Cubans were the oldest, wealthiest and most educated Latino immigrants in the United States. In 2000, the median age of Cubans was 41,3 years, more than ten years older than the next eldest who were Central and South Americans, averaging 29,9 years of age. Furthermore, 24,8 percent of Cubans held a bachelor's degree compared with 18,0 percent of Central and South Americans. One in four Cubans 25 and older is a college graduate, more than double the rate of other Hispanics.
Finally, Suárez-Orozco and Páez (2002) state that Cuban Americans are more educated than other Latinos, and native-born Cubans show higher rates of college graduation than non-Hispanic whites. Otherwise, Hernández and Foladori (2014) assure since the turn of the century, the trend has been for younger (aged 20-40) and more highly qualified professional Cubans to migrate.
2. METHODOLOGY
Many of the issues that social researchers are currently analyzing, are related to the values of variables of interest located at the tails of the distribution of the variable of interest. The Quantile Regression (QR) method proposed in this paper measures the effect of the explanatory variables at various points of the (log) hourly earnings distribution. The difference at various quantiles provides significant information about the effects of the covariates considered on the hourly earnings spread. A linear quantile regression model2 links the conditional quantiles of the response variable to the covariates linearly.
With this method it is possible to study the conditional distribution of the (log) hourly earnings over the different socioeconomic variables at different locations and thus provide an overview of the links between (log) hourly earnings and the socioeconomic variables selected for the study.
As pointed out in Machado and Mata (2005) the estimated Quantile Regression coefficients can be interpreted as rates of return of labor market skills at different points of the conditional wage distribution. In general, the parameter estimated in linear quantile regression models have the same interpretation as those in any other linear regression model but now defined for a specified quantile, as Koenker (2005) says.
A robust OLS estimation on Equation (1) employing the heteroskedasticity robust HC4 estimator of Cribari-Neto (2004) was initially carried out. This estimator improves sample performance when the term of error is independent but heteroskedastic, especially in the presence of influential observations (Zeileis, 2004).
Where wi is the logarithm of gross hourly earnings for individual i, using total pre-tax wage and salary income (expressed in contemporary dollars), i.e, money received as an employee for the previous year as the measure of earnings; xi is a vector of socioeconomics characteristics of Cuban immigrants in the U.S. including an intercept, εi is the error term and is a vector of unknown parameters.
Next, the proposed model in Equation (1) under the conditional Quantile Re gression (QR) is estimated. For any t v (0,1) a linear quantile regression model can be written as
The quantile function of the response variable wi conditional on covariate vector xi at a given quantile parameter τ is given by
No specific assumptions are made for the error term, apart from ετi and ετj being independent for i ≠ j and to consider that the distribution function at 0 is τ. The estimation of regression quantiles coefficients, ßτ, can differ across the different τ -quantiles τ, so the marginal effect of a particular explanatory variable may not be homogeneous across the quantiles.
Since all continuous explanatory variables are centered at their median value, we use the centercept concept of Wainer (2000).
3. EMPIRICAL RESULTS
The data used in this paper come from the random sample of 1% of the 2011 American Community Survey (ACS) provided by Integrated Public Use Microdata Series (2011)3. This sample includes only individuals who entered the U.S. at the age of 17 years or over. This approach is intended to exclude people who completed their education in U.S, as Lowell, Pederzini and Passel (2008) state.
In addition, only individuals who entered the U.S. when less than 50 years old are considered, since the assumption is that the group aged between 16 and 49 years consists of those most likely to migrate for economic reasons, as Bertoli, Fernández-Huertas and Ortega (2013) point out in their study. Only individuals in work and those aged between 25-64 at the time of the survey are considered.
The sample consists of 19.079 observations. Table 1 gives a summary of the variables considered in the estimation models for Cubans immigrants in the U.S. As can be seen, 43 % of the people in the sample are women, 63 % are married, 45 % are nationalized in the U.S., 53 % speak English well or very well and approximately 3 % are black. 50 % of Cubans in the U.S. were 30 years old or more at the time of their entry into the U.S., have 12 or more years of education and earn $8,33 per hour or more. On average they have 27 years of potential experience.
Table 1 Statistics of covariates: Mean and Standard Deviation.4
Variable | Description | Mean | Standard Deviation |
---|---|---|---|
Woman | Dummy variable: 1 if woman | 0,4264 | 0,4946 |
Black | Dummy variable: 1 if black | 0,0309 | 0,1730 |
Married | Dummy variable: 1 if married | 0,6283 | 0,4833 |
AmericanCitizen | Dummy variable: 1 if an American citizen | 0,4461 | 0,4971 |
EnglishProficiency | Dummy variable: 1 if proficiency in English | 0,5314 | 0,4990 |
AgeImm | Age of the individual (in years) at time of migration | 29,81 | 8,2119 |
YearsEducation | Years of Education of the individual | 12,46 | 2,9781 |
Experience | Potential Experience of the individual 4 | 27,43 | 11,5426 |
Log Hourly Earnings | Response Variable | 2,6219 | 0,7724 |
Source: Author's own elaboration based on the 2011 American Community Survey (ACS) in the U.S. provided by IPUMS (2011).
Figure 1 and 2 show where individuals work depending on their level of education. We considered whether individuals in Cuba and in the U.S. held jobs with a profes sional category in line with their education level. A comparison of the occupation of Cubans in Cuba and Cubans in the United States reveals that a higher percentage of Cubans in U .S. with 13 or more years of education work in jobs that require lower skills than they actually possess.

Source: Author's own elaboration based on the Population and Housing Census of Cuba 2002 provided by IPUMS-International (2011).
Figure 1 Job Category in Cuba

Source: Author's own elaboration based on the 2011 American Community Survey (ACS) in the U.S. provided by IPUMS (2011).
Figure 2 Job Category in the U.S.
Figure 3 presents a histogram and box plot of (log) hourly earnings for Cuban immigrants. The histogram of (log) hourly earnings gives a good picture of the pres ence of positive skewness.

Source: Author's own elaboration based on the 2011 American Community Survey (ACS) in the U.S. provided by IPUMS (2011).
Figure 3 Histogram and Box Plot of (log) Hourly Earnings.
Following to Cobas and Fernández (2014), nearly 60 % of Cubans in the United States who have 13 or more years of education take jobs with low skill requirements and 78 % of those with 9-12 years of education work as skilled workers. In the same line, Cattan (1993) assures that many Cubans begin their working life in U.S. in jobs with skill levels much lower than and very different from those that they held in Cuba.
Table 2 contains the sample correlations between the explanatory variables. As can be seen, there are significant correlations between Proficiency in English and Citizenship Status (r = 0,3387), that is, Cubans with greater Proficiency in English are more likely to be citizens. Age on arrival in the U.S. and proficiency in English are negatively associated (r = - 0,3611); this means that younger individuals seem to be the most predisposed to learn English.
Table 2 Sample correlation of variables

Note: All correlations are significant at the 1% significance level. * means significant only at the 10% significance level. Source: Author's own elaboration based on the 2011 American Community Survey (ACS) in the U.S. provided by IPUMS (2011).
As can be seen in the first column of Table 3, in the OLS estimation there is only one non-significant explanatory variable (Black) to explain the (log) hourly earnings, at the significance leel of 5%. The remaining explanatory variables are statistically significant. Note that being married, being a naturalized American, speaking English well or very well, years of education and potential job experience show a positive link to (log) hourly earnings, while being a woman and age on entry in the U.S. show a negative link to the response variable.
Table 3 OLS and Linear Quantile Regression.

Significance Codes.*** 0,001 ** 0,05 * 0,10. Standard Errors in parentheses. Source: Author's own elaboration based on the 2011 American Community Survey (ACS) in the U.S. provided by IPUMS (2011).
The estimate of the return to years of education on the mean earnings is lower than expected, equal to 6,23 %. An additional year of potential job experience is shown to increase mean earnings by 0,46 %. Being a woman decreases mean earn ings by 28,39 % which is the greatest decrease in mean earnings. The model explains approximately 15 % of the variance in (log) hourly earnings.

Source: Author's own elaboration based on the results of QR estimation
Figure 4 Quantile Regression.
Figure 4 shows the Quantile Regression results and Table 3 presents the Quantile regression estimations for five values of τ-quantiles: 0,10; 0,25; 0,50; 0,75 and 0,90. Increments in earnings associated with the different socioeconomic characteristics vary across the earnings distribution5. Being a woman causes a decrease in wages in all quantiles of the distribution, the decrease is highest from the 25th percentile to the 75th percentile. On average, women in the United States are paid just 77 cents for every dollar paid to men. For Latinas, the gap is larger. Latinas in the United States are paid, on average, just 54 cents for every dollar paid to white, non-Hispanic men (Fry and Taylor, 2013).
Being black is not a variable that explains the variability of (log) hourly earnings from the central part of the distribution to the upper part, since it is not a significant predictor in any percentile from the median at a significance level of 5%. For people who earn less it is a significant predictor and it causes decreases in earnings. Being married is a variable that produces a homogeneous effect on the different quantiles of the distribution. Being an American citizen has little effect on hourly earnings and only a minor effect in the upper quantiles.
The return to proficiency in English is higher for people who earn more. For the 90-th percentile it gives rise to an increase in hourly earnings of about 18% compared to around 10 % for the 10-th percentile. In line with Lazear (1999) the payoff for immigrants from learning the English language is likely to depend on the frequency with which they will use those skills in their everyday interactions. Borjas (2015) shows that more recent immigrants to the U.S. are improving their English language skills at a far lower rate than earlier immigrants.
Regarding age on entry in the U.S, being younger is more important in the upper quantiles of the distribution of earnings. The return to potential experience is almost zero across the distribution of earnings of workers, but it is greater for people who earn more.
The number of years of education has a smaller impact on earnings in the lower quantiles (4,25 % for 10th percentile and 5,25 % for 25th percentile) than in the upper quantiles (7,27 % for the 75th percentile and 7,51 % for the 90th percentile).
The returns to education are higher at the top of the conditional earnings distribu tion but lower than expected. One possible explanation for this situation is related to over-education. In this line, Martins and Pereira (2004) state lower earnings for over-educated workers increase the skill dispersion of pay by extending the lower tail of the wage distribution of the highly educated. In the same way, Bohon (2005) assures immigrants may also encounter difficulties in obtaining good jobs due to language barriers, less extensive network connections and lags in cultural adjustment.
Finally, Machado and Mata (2005) adds that another reason may be the fact that Cuban workers are not as highly valued in the U.S market now as they were years ago, due to increases in the emigration of highly skilled people. Skilled workers, earning relatively higher wages, have become relatively more abundant and, as a result, their relative wages have decreased.
4. CONCLUSIONS
This paper has analyzed the distribution of earnings of Cuban immigrants in the U.S in terms of certain observable characteristics: years of education, potential job experience, age at time of emigration, ethnicity, marital status, sex, citizenship status and proficiency in English. It uses the 2011 American Community Survey (ACS) of the U.S. provided by IPUMS (2011a). Within the sample, only workers aged between 25 and 64 years who immigrated to the United States when they were between 17 and 49 years old are considered.
For the analysis, Ordinary Least Squares (OLS) estimation with standard er rors calculated using the heteroskedasticity robust HC-4 estimator of Cribari-Neto (2004) was first employed and then Quantile Regression, a technique which allows us to characterize the whole distribution of earnings of Cuban immigrants in U.S was proposed.
When we use OLS estimation with heteroskedasticy-robust standard errors, all explanatory variables except the fact of being black prove to be significant at the 5 % significance level. All other variables being constant, speaking English well or very well is the variable that produces the biggest increase in mean hourly earnings at about 15,64 %; being a woman produces a decrease in mean hourly earnings of 28,39 %; being married produces an increase of about 6,54 %,; being a U.S citizen produces an increase of 8,84 %; one more year of study increases expected hourly earnings by 6,23 % and one year of potential experience increases expected hourly earnings by 0,46 %.
However, the application of Quantile Regressions shows how the influence of the different variables considered on hourly earnings varies across the earnings distribution. With this type of estimation method differences between highly-skilled and low-skilled Cuban immigrants in the labour market can be detected.
The main conclusions of this article are the following: being a woman decreases hourly earnings at all points of the distribution, with the decrease being greater for individuals in the central part of the earnings distribution. The return to proficiency in English is greater for those people who earn more and the returns to education has a smaller impact on earnings in the lower quantiles and a greater impact at the top of the conditional earnings distribution but even then it is lower than expected. In particular, this article demonstrates how the return to education on earnings is less for Cuban born workers than expected.