EXPLORING THE RELEVANCE OF TWO-PART MODELS IN INNOVATION RESEARCH: TOWARDS A BETTER UNDERSTANDING OF INNOVATION SALES
Abstract
In empirical studies, firms’ innovation is often measured as the proportion of sales that can be attributed to new products or services. In a review of 20 years of research on innovation, we find this variable to be used in 141 studies providing a useful common ground to compare and contrast findings. We argue that two-part models are attractive when modelling innovation as they separate the decision to innovate from the amount of sales generated by innovation. Our review shows that only a minority of studies employ two-part models, most often in sub-optimal ways. We develop a two-part modelling framework that is suitable for innovation research and different from the conventional methods prevalent in econometrics. Furthermore, we provide a worked example of how it can be implemented. This paper contributes new knowledge about two-part models for innovation research and hopes to lay the ground for new theorising in the future.
Introduction
Innovation is a widely studied outcome in business and management research in the past decades (Damanpour et al., 2012). Commonly, innovation is operationalised as what we under a collective term choose to call “innovation sales”, described as, e.g., the proportion of sales attributable to new and improved products (Love and Roper, 2001), the fraction of the firm’s turnover relating to products new to the world market (Laursen and Salter, 2006), or the share of sales derived from new services (Leiponen, 2006). The proportion of innovation sales to total sales indicates the importance of innovative products in a firm’s portfolio (Love et al., 2014). One argument frequently used in the innovation literature is that it reflects not only firms’ ability to introduce innovative products to the market but also reflects their short-term commercial success (e.g., Berchicci, 2013; Hashi and Stojčić, 2013; Roper et al., 2013). The share of turnover from innovation sales tends to show the actual level of innovativeness because it assesses the end result of the innovation process and includes the entire innovation process (Mihalache et al., 2012). Thus, despite the diverse foci of the innovation literature, empirical research is often relying on standardised and comparable measures of innovation as a dependent variable. Using a standardised, commonly accepted dependent variable allows for cumulative knowledge building as authors can easily relate new research to the existing one and compare and contrast research findings.
While innovation research for many years have relied on proportional indicators of innovation’s contributions to an aggregate total outcome, researchers have used remarkedly varying approaches when it comes to statistically estimating innovation sales. This is important as using different statistical approaches to model the same outcome reduces the comparability of research results and may reflect different underlying theories about the determinants of innovation. One particular issue is whether innovation sales should be considered as a singular or two-part process (Ramalho and da Silva (2009), see also Mullahy (1986)). The former approach treats innovation as a bounded continuous variable (Hecker, 2016), whereas the latter considers it a divided process where firms initially either introduce a new product or not, and conditional on a new product introduction attempt to derive sales from the new product (Weterings and Boschma, 2009). That is, the process generating participation in innovation is separated from the process generating the magnitude, i.e., the commercial success. In this paper, we set out to outline how innovation is currently studied in research using the fractional innovation-to-sales ratio as the outcome and explore when and how innovation should be studied as a two-part process and how this approach can leverage new opportunities for theorising innovation as a multi-step process.
To investigate how innovation scholars analyse the share of sales due to innovative products or services, we conduct a systematic methodological review (Aguinis et al., 2023) of 20 years of research on innovation using innovation sales as the dependent variable. We isolate our search to studies where firm size is among the covariates in order to create comparability and follow how the relationship between the same variables changes across different choices of estimations. For our purposes, firm size is relevant as the firm size-innovation sales relationship can potentially be described using two-part process logic. Larger firms have greater access to financial and human resources (see detailed descriptions of this relationship in Vaona and Pianta (2008) and Audretsch et al. (2014). These factors might be related to explaining the decision to innovate or not, and, conditioned on that, the degree of innovation. While easier access to resources such as capital may help the firm overcome the threshold of introducing new products, it might not affect the short-term commercial success of the newly introduced product in terms of innovation sales. Firm size also serves as an example of a variable for which competing hypotheses might be posited. For instance, large companies may have higher marketing budgets leading to higher sales revenue from new products. Therefore, firm size serves as a useful illustrative example of how a two-part framework allows for directly testing competing hypotheses about the influence of a well-known variable on innovation sales. Further, firm size is a variable that is often found in studies as an independent or control variable. We identify not less than 141 articles using innovation sales as the dependent variable. Among these studies, a total of 27 employ a two-part model, however, only a subset of these use a justification related to understanding how different factors are related to different parts of the innovation process (see, for instance, Weterings and Boschma, 2009). Our comprehensive screening of the literature reveals considerable heterogeneity in the estimation techniques used and disagreement about the fundamental question of the choice between a singular and a two-part process that reflects not only a statistical practice but an underlying theoretical idea about how firms’ innovation should be conceptualised.
The state of this research reflects a limited recognition of the significant potential of using two-part models in innovation research. In this paper, we develop a two-part modelling framework suitable for innovation sales. Contrary to other research fields, it is central to understanding innovation sales as a two-part process that zero innovation sales can arise in two different ways: (1) A firm did not introduce a new product, or (2) a firm introduced a new product, but this product failed to generate any sales. We argue that a two-part model for innovation sales needs to be able to make this distinction. In this paper, we propose a framework that is different from the ones usually used in econometrics. The framework differentiates between zeroes originating from the innovation introduction process and zeroes from the sales process and we demonstrate its usefulness using simulations and an empirical example. The empirical example serves to illustrate central points about two-part processes and as an easy-to-use guide for future research. Our objective is not to propose new methods of measuring complex innovation processes, but rather to illuminate how two-part statistical approaches can help further leverage the insights obtained from innovation indicators that are already well known to innovation researchers and widely used in the field.
We aim to make several important contributions to the innovation literature and the methodological literature on two-part models. For research on innovation, we show the relevance of considering a two-part model when modelling innovation sales relative to total sales. While some theoretical frameworks, such as those proposed by Crepon et al. (1998) and Ramalho et al. (2011), offer insights, there is a concurrent need for empirical models adept at accurately handling this distinct type of innovation output. We develop a suitable two-part framework that can handle the kind of data arising when analysing innovation sales and thereby hope to contribute to improving the statistical practices used to estimate innovation to improve the trustworthiness of results. We further speculate that increased use of two-part models enables researchers to generate new theoretical insights about when firms engage in innovation efforts and how successful they are.
This paper is structured as follows. First, we introduce the intuition behind two-part models for studying innovation sales. Next, we present our review of the existing research and identify this research practice. Based on this, we build our framework for understanding innovation sales as a two-part process and illustrate it using an empirical example.
Understanding Innovation Sales as a Two-Part Process
In this section, we address in a non-technical fashion why two-part processes are often be relevant to describe innovation.
How to think about two-part processes for innovation sales
When we measure innovation sales, the innovation sales data are a mixture of two processes where there are different causes for the same observation, namely, observations at zero. First, zeroes can happen structurally. Zero innovation sales mean that nothing happened. And nothing can happen either because the probability of making an innovation sale is low, or because the process that generates innovation sales never got started. Zero-observations occurring due to the firm not developing any new products are called structural zeroes (Tang et al., 2018). Alternatively, zeroes may occur naturally when the probability of selling the new product is low. For instance, a firm has developed a new product, but three years later this product has not been commercialised and has yet to make any sales. These firms developed and possibly even introduced a new product to the market, but their innovation sales just happened to be zero. These zeroes are also known as random zeroes (Tang et al., 2018). To summarise, zeroes can arise from two separate data-generating processes: (1) either a firm does not introduce new products or (2) a firm innovates, but the innovation failed to generate any sales.
Data for the share of sales derived from new products that have been collected without excluding non-innovators, i.e., firms who did not introduce any innovative products, often contain a large number of observations at zero. For example, D’Angelo and Baroncelli (2020) found that 60% of the firms in their sample have no sales of innovative products. Similarly, Berchicci (2013) reports that one-third of the observations have a value equal to zero for the dependent variable innovation sales. If a relatively large proportion of the firms in a sample report zero innovation sales, it seems implausible that such a large share of firms would have no success at all with selling their new products. Instead, innovation data with a large amount of zeroes should make us suspect that there are different causes for the same observation. It seems more logical that the overwhelming majority of firms with zero innovation sales have not developed any new products. Structural zeros may account for most of the zero observations, but the amount of random zeros is not insignificant. For example, Weterings and Boschma (2009) found that out of the 219 firms in their sample, 42 did not introduce any new products (structural zeros) and 12 firms did not yet gain any turnover from the sales of the new products that they developed (random zeros).
The logic of qualitatively different zeroes is illustrated in Fig. 1, where y is the amount of innovation sales revenue, t is the total sales, and s is the fraction of sales derived from the innovative products, i.e., innovation sales revenue divided by total revenue. Each firm introduces a new innovation with probability p1 and does not innovate with probability 1−p1. When a firm does not introduce a new product it always produces zero innovation sales, s=0 (a structural zero). Firms that innovate may produce either s=0 (a random zero) or 1≥s>0. Note that this structure implies that we have two distinct processes, where the second process is conditional on the outcome of the first. Generating innovation sales revenue, whether zero or a positive amount, is naturally conditional on the firm bringing a new product to the market in the first place. As the name signifies, a two-part process has two parts. The first process of introducing an innovation is commonly referred to as the participation decision, whereas the second process resulting in the amount of innovation sales is called the magnitude decision (Wooldridge, 2010b, Chap. 17.6).

Fig. 1. The logic of a two-part process for innovation sales.
Interpreting the zeroes as arising from two different processes is not a challenge encountered in adjacent fields. In finance, a common example is leverage, i.e., the question of whether a firm should issue debt or not and if yes how much debt it should issue (Cook et al., 2008). Once a firm has decided to issue debt, leverage is never zero, but strictly positive (Ramalho and da Silva, 2009). In international business, another example is globalisation, i.e., the share of business done abroad (Wulff and Villadsen, 2020). If a firm is global then as per definition has some degree of globalisation (e.g., Ganotakis and Love, 2012), which causes zeroes to be generated solely by the first process of deciding whether to go global or not. For these more traditional two-part processes, it is only necessary to treat strictly positive outcomes in the magnitude part. However, it is completely possible for a firm that has introduced a new product to experience zero sales and it is important to distinguish between the often notable amount of observations at zero.
Several studies of innovation sales use large-scale survey data from the European Community Innovation Survey (CIS) or surveys using similar measurements.1 CIS is based on the definitions and measurement concepts for innovation data as laid down in the Oslo Manual (OECD and Eurostat, 2018) and constitutes the official innovation statistics for the EU. It is designed to measure innovation inputs, innovation outputs, and innovation-relevant characteristics of firms and their market environment. The CIS questionnaire is set up in a way that gives rise to a selection problem. Specific filters ensure that non-innovators do not have to answer detailed questions about innovation activities such as their share of sales of innovative products. Thus, in this case, the magnitude part can only be estimated for firms that report at least one innovation. This also means that by construction, only the two-part process fits the understanding of innovation as detailed in the CIS data collection. If this selection is not properly taken into account, the estimation of the innovation intensity equation will be biased.
If the data at hand contain a large share of zeros (or other boundary values) practitioners should consider the explanation, resonate whether separate processes are in place, and consider using two-part frameworks thereby allowing the two processes to be explained differently (Ramalho and da Silva, 2013). Having two separate processes enables researchers to investigate whether and how variables associated with engaging in the early phases of innovation, e.g., new product development are also associated with the amount of innovation sales, i.e., commercial success. While the process of innovation is not independent from the commercial success of new products or services, the same explanatory variable may have a different influence on the introduction of an innovative product and the share of sales derived from innovations, both in effect size, magnitude, and functional form. Current approaches are not well suited to understanding the complexities and interactions between different aspects of firm innovations.
Hypothesising about a two-part process for innovation sales
Innovation researchers have argued that a two-part modelling approach allows us to obtain a fuller picture of innovation success (Leiponen and Helfat, 2010). The first part provides information about whether the firm engages with innovation or not. If a firm has introduced any new or improved products it reflects a baseline engagement with product innovation. The second part reflects not only the firm’s ability to introduce new products to the market but also the new product’s short-term commercial success (Roper and Hewitt-Dundas, 2015). Although both parts reflect some innovation output, the existence of a novel product is hardly a good predictor of the economic performance of an innovation. A product innovation represents only a premarket result of innovation activity (Barlet et al., 2000). In other words, firms that generate innovative products are not guaranteed to have their products welcomed by the marketplace. Because only the second part contains information about market acceptance and actual performance, it is crucial to untangle the effects from the two parts, when we hypothesise about an explanatory variable having an effect on innovation.
As discussed in the previous section, innovation sales data are often characterised by a large number of observations at zero. In such cases, researchers have to decide whether one- or two-part models should be used. Clearly, this decision depends crucially on the assumptions and interpretations of the source of the observed zeros. In cases with structural zeros only, if the zeros may be interpreted as a result of a utility-maximising decision, a one-part model is appropriate. For example, Wagner (2001) argued that firms choose the profit-maximising volume of exports, which might be zero or a positive quantity, and therefore uses a one-part model to explain the exports-to-sales ratio. If the zeros and the positive values may be best described by different mechanisms it is more reasonable to model separately the participation and the amount decisions using two-part models (Ramalho et al., 2011). In other words, this idea that the separation of a participation and magnitude part would reflect two different choices also means that they could be affected in different ways or by different explanatory variables.
Mairesse and Mohnen (2002) explained the innovation production function as a simplified representation of the innovation processes which links innovation outputs, such as innovation sales, to innovation inputs. The authors base their work on CIS data and specify the innovation production function with a two-equation model, where the first equation, accounting for the propensity to innovate, includes explanatory variables such as industry dummies and firm size, and the second equation, accounting for the share of innovative sales, includes additional variables such as variables related to R&D efforts and the competitive environment in which the firm operates. Choosing the relevant output, input, and control variables for the specification and estimation of such a function requires a solid understanding of actual innovation processes at work within firms (Robin and Schubert, 2013). As exemplified by Mairesse and Mohnen (2002), the inputs for the two distinct processes may differ. The construction of the CIS questionnaire sets some practical constraints for the inputs, as for the first equation, the independent variables need to be available for all firms, whereas, for the second equation, independent variables only need to be available for firms that introduced a new product (Kampik and Dachs, 2011). More interestingly, a theory may lay the ground for the authors to argue a two-part setup where the explanatory variables play different roles in each process. For instance, Roper et al. (2013) tested whether externalities of openness have different effects on the propensity and intensity of innovation. Similarly, Weterings and Boschma (2009) examined which characteristics of relationships with customers may affect the likelihood that a firm brings a new product or service to the market and which characteristics affect the innovative performance. Thinking in terms of fixed cost barriers to innovation may also be a way to generate hypotheses that motivate a two-part model. A two-part model arises naturally when fixed costs affect the decision to enter a particular state (Wooldridge, 2010b, Chap. 17.6). For a firm, the decision to innovate a new product may depend on several considerations, including the availability of sufficient capital to invest. The way that capital availability affects the decision to introduce a new product may be quite different from how it affects the relative success of the innovation once it starts selling.
In the empirical application at the end of our paper, we provide an example where we suspect a different impact of firm size on the introduction of an innovation and the short-term commercial success measured by the share of innovation sales.
Current Practices for Modelling Innovation Sales
In order to understand how the existing research has modelled innovation sales as an outcome, we conducted a systematic literature review (Aguinis et al., 2020). Our objective with the review is two-fold. First, we want to create an overview of how frequently a two-part model is employed and how the choice of a two-part model is justified in the literature. While innovation processes can legitimately be theorised as one-part or two-part processes, we aim to explore the potential two-part logic used when scholars hypothesise about an explanatory variable having an effect on innovation sales. Second, we want to create an overview of the statistical modelling strategies used to model innovation sales as a two-part process and pinpoint potential pitfalls before we develop a two-part framework suitable for innovation sales in the following section.
To accomplish these objectives, it is useful to not only focus on the dependent variable. Focusing on an independent variable as well enables us to make more precise comparisons between the existing studies and follow how the relationship between the same variables changes across different estimation techniques. Therefore, we focus specifically on studies including organisation size as an explanatory variable. As we explain in detail later, the firm size-innovation sales relationship can potentially be described using two-part process logic: Larger firms have greater access to financial and human resources. These factors might be related to explaining the decision to innovate or not, but not necessarily to the amount of innovative sales. Further, the size of the firm is commonly used as a covariate in innovation studies. Among internal organisational factors determining an innovation process, firm size measured as the number of employees is one of the most prominent and has been presented as a key determinant of innovation since Schumpeter (1983). We note that firm size measured as the number of employees is such a widely used variable that we effectively include most research on innovation sales. In our initial comprehensive survey of top journals, we include around 82% of articles focused on innovation sales, therefore, only excluding a minority because they do not include firm size as a covariate.
To establish an overview of the literature using innovation sales as the dependent variable, we scoped a broad review aiming to include as many studies as possible (Aguinis et al., 2020). We focus on the recent research published between 2000 and 2019. We followed a two-step sequence for identifying relevant studies. Initially, we did a thorough screening of the four top journals Research Policy, Journal of Product Innovation Management, Strategic Management Journal, and Organization Science. Based on this, we identified relevant search terms and conducted a broader search in the Web of Science database. In total, we identified 141 studies fulfilling our search criteria of studying innovation sales and including size among the covariates. A detailed description of the search for relevant research articles is available in Table A.1, the specific search terms can be seen in Table A.2, and an overview of all studies can be found in Table A.3. By reviewing the papers, we identified two main areas for improvement: (1) Theoretical justification and hypothesising of a two-part process for innovation sales, and (2) statistical modelling of a two-part model suitable for innovation sales.
Theoretical reasoning and hypothesising
In the 27 two-part papers, we identify five main justification practices used by practitioners to justify their use of a two-part model. In Table 1, we summarise how practitioners justify their use of a two-part model using one or more types of justification. Only relatively few papers relied on the main arguments discussed in the second section for using a two-part setup to model innovation sales: If the data at hand contain a large share of boundary values or if theory dictates substantial and/or directional effect differences, we should consider two-part models. No studies explicitly link recognition of a significant proportion of zero-observations in the specific dataset to the choice of a two-part model, although several studies recognise a large share of observations at zero. About 29.6% of the studies, on the other hand, resonate with the possibility of explanatory variables having different effects on the propensity to innovation and fractional innovation sales outcome and relate this to their separate estimation method. This justification practice has no overlap with the practice used by almost two-thirds of the papers describing the two outcome measures as complementary. Whereas the justification practice resonating effect differences is an acknowledgement of the innovation sales process as two different data-generating processes, the latter is merely an argument of using two outcome variables intended to reflect different aspects of innovation outcome.
About 22.2% of the papers referred to selectivity bias as an argument for separate estimations, but only half of these studies also considered potential effect differences. About 33.3% of the studies partly justified their choice of using two innovation outcomes due to the availability of measures in the data source, e.g., CIS or IIP. Further, all of these papers also relied on the justification practice of describing how the two measures complement each other. About 7.4% of the papers in our review used one of the outcome measures in a robustness test.
Overall, scholars do not seem to be aware of the stream of the two-part literature and none of the 27 studies refer to their setup as a “two-part model”.
Justification practice | Examples | |
---|---|---|
Reflected that the effects of explanatory variables on the propensity of innovation may be different from the effects on share of innovation sales | 29.6% | “In this way, it can be more easily interpreted which characteristics of the relationship with customers may affect the likelihood that a firm brings a new product or service to the market and which characteristics affect the innovative performance” (Weterings and Boschma, 2009) |
Described the propensity of innovation and share of innovation sales as complementary innovation output measures | 63.0% | “By using both the binary innovation and product sales variables, we obtain a fuller picture of innovation success” (Leiponen and Helfat, 2010) |
Referred to selectivity bias as a consequence of not using both propensity of innovation and share of innovation sales | 22.2% | “In the CIS questionnaires, specific filters ensure that non-innovators do not have to answer detailed questions about innovation activities. Thus, the intensity equation can only be estimated for firms that report at least one innovation. If this selection is not properly taken into account, the estimation of the intensity equation will be biased” (Robin and Schubert, 2013) |
Referred to the availability of innovation output measures in the data source | 33.3% | “Innovation in the IIP is represented by three main variables” (Woerter and Roper, 2010) |
Used either propensity of innovation or share of innovation sales as robustness | 7.4% | “We perform two robustness checks. In the first case we perform the estimation only on innovating firms, and in the second we replace the dependent variable with a dummy product innovator variable.” (Love et al., 2014) |
Although more than one-fourth of the papers reflect potential effect differences as seen in Table 1, our review revealed that this is not necessarily apparent in the hypothesis statements. Among those studies formulating hypotheses, only three studies explicitly mention both of the dependent variables, the propensity to innovate and the share of innovative sales, in their hypotheses. Only in the study by Spithoven et al. (2013) do the hypotheses hold a different association between the explanatory variable in question and the two different outcomes.
Regression strategies and treatment of zero-observations
So far, we have not addressed the statistical modelling strategies for the two-part model for innovation sales. Despite studying the same dependent variable, we observe that the researchers employ a range of different estimation techniques in their studies.
Researchers have to make other decisions than whether to treat innovation sales as a one or two-part process. An important choice relates to modelling the specific outcome, innovation sales. As described, it is fractional in nature and our review shows that authors use a variety of statistical techniques. We found that the Tobit model is the dominating choice for modelling the magnitude part of a two-part model. About 21 studies relied on the Tobit model while five used linear regression. Some studies also combine methods in different two-stage IV-methods such as Heckman models. No studies used a binomial regression model and only one study used a fractional regression model to model the magnitude part, although these methods are designed for fractional outcomes (Papke and Wooldridge, 2008; Villadsen and Wulff, 2021a). Like it is the case in international business (Wulff and Villadsen, 2020), innovation scholars are likely unaware that the Tobit makes distributional assumptions about the conditional density. This means that if the errors are heteroskedastic or non-normal, the Tobit estimator is inconsistent (Newey, 1987). As we explain later, innovation sales data is by nature heteroskedastic. Innovation scholars are therefore currently placing the bulk of their empirical two-part evidence on a potentially inconsistent estimator. A linear regression strategy is less problematic, but still not optimal. A linear regression might provide a good approximation of the average partial effect, but there is no guarantee that we obtain good estimates of the partial effect for a wide range of covariate values (Wooldridge, 2010b, Chap. 15.2). If researchers are interested in valid estimates of partial effects across the range of the variable of interest, they should instead rely on fractional or binomial regression.2
Next, we turn to the processing of random and structural zeros in this research. We intended to summarise the treatment of zero-observations in the two-part papers. However, as it turned out, in many studies it is unclear how the random and structural zeros are treated. For example, some studies describe how the regression for the magnitude part includes “innovators only”. Still, they estimate two separate models for the participation and the magnitude part with the same sample size making it difficult to discern how they treat zero-observations. Also, whereas some studies only consider firms with a positive amount of innovation output as innovators, others include firms that did not yet gain any turnover from the sales of the new products that they developed as innovators. Accordingly, there seems to be no consensus in the literature about whether firms producing a random zero are considered innovators or not. We identified three different versions of two-part models for innovation sales: (1) No zeros were included in the magnitude part, (2) random zeros were included in the magnitude part, and (3) both types of zeros are included in the magnitude part. To exclude firms that introduced new products, but had no sales attributable to innovations as in the first scenario is potentially problematic. These studies may be wrongly classifying innovators as non-innovators thereby potentially distorting the two-part model results. This corresponds to practitioners following the more traditional two-part setup of looking at strictly positive outcomes in the magnitude part. The second scenario is ideal and follows the ideas presented in the second section. The third scenario is equivalent to a one-part regression, but the authors still acknowledge the participation state by including the binary regression.
What can we learn from the studies using two-part models?
As discussed, two-part models separate the decision to participate in innovation from the process generating the magnitude of innovation sales. One-part models that do not make this separation implicitly assume that both processes are generated by the same mechanisms. One way to assess whether a two-part model is relevant is by comparing the coefficients for the regression predicting innovation participation with the coefficient predicting the magnitude of innovation. One-part models make the implicit assumption that the signs of the coefficients for the explanatory variables’ effect are the same for the probability of being an innovator and the extent of innovation sales. If these coefficients are in a different direction, it would indicate a serious threat to the assumptions behind one-part models. In the 27 studies uncovered in our literature review using two-part models, we identified 35 separate tests where we could identify the coefficient of firm size for both the first-part and second-part regression.
Only in two studies are the coefficients for innovation participation and magnitude both positive and significant. They are both negative in an additional one study. This suggests that only around 9% of the tests do not violate the assumptions behind a one-part model. In 22 tests (63%) the coefficient either changes from positive to negative or is insignificant in one regression and significant in the other. These results suggest that the mechanisms generating participation in innovation may be different from the mechanisms explaining the magnitude, and indicate that researchers should often consider the relationship between firm size and innovation sales a two-part process. While we do not have this information available for the majority of studies using one-part models, we have no reason to believe the picture would be radically different. We found none of the one-part studies describing that they have tested the premises for the assumption to treat innovation sales as a singular process. We summarise these results in Table 2.
Coefficient of Firm Size | No. of Tests |
---|---|
Both participation and magnitude positive and significant | 2 |
Both participation and magnitude negative and significant | 1 |
Both non-significant | 10 |
Participation positive, magnitude negative | 3 |
Participation negative, magnitude positive | 0 |
Participation significant, magnitude non-significant | 13 |
Participation non-significant, magnitude significant | 6 |
In sum, our review reveals an unexploited potential of two-part models in management and business research. Scholars seem to lack arguments as to why a two-part model is even necessary or theoretically relevant. The literature review also indicates confusion about how to properly model a two-part innovation sales process.
Statistical Modelling of a Two-Part Model for Innovation Sales
In this section, we start building a two-part framework suitable for innovation sales.
The basic innovation sales model
Before we move to describe the two-part model of innovation sales, it is important to consider the basic statistical properties of the innovation sales variable. The model we build for innovation sales should be able to tell us about how a variable of interest is related to innovation sales. Partial effects are a very useful tool for this. A partial effect describes how the fraction or amount of innovation sales changes when some variables of interest change while others are held fixed. For each measurement, we can think of there being some proportion, p, of the total amount of sales, t, that is attributable to recent innovations. The higher this proportion, p, the more of a firm’s sales are attributable to innovation. This makes modelling p of great interest: If we can figure out which factors are related to p, firms will know which factors to manipulate to increase their innovation performance.
The model we build will make it possible to obtain valid estimates of the partial effects. By valid, we mean that (i) the estimator of partial effects is consistent (i.e., that the estimated partial effect converges to the true value as the sample size increases) and (ii) that inferences based on p-values and confidence intervals are correct (i.e., that the 95% confidence interval contains the true value 95% of the time). The model will also respect the bounded nature of the outcome allowing the partial effects to level off towards the bounds. In this way, we can obtain valid partial effects not just at the average, but for values across the range of the variable of interest.
Statistically, we seek to connect observed amounts of innovation sales, say in dollars, to variables associated with different average innovation sales. A good place to start is to think of this as a binomial probability model.3 Usually, we use the binomial distribution to model the number of “successes” out of a fixed number of possibilities. Here, we can think of it as drawing dollars (revenue) from a big pile of sales. Each dollar can either be attributable to recent innovations (success) or not. The model can be stated formally like this
In real data, yi, ti, and si vary depending on firm and industry characteristics. For instance, larger firms will tend to have higher sales, but usually also a higher proportion of innovation sales (Camisón-Zornoza et al., 2004). As our interest lies in the latter, the last line in Eq. (1) sets up a regression model that allows the proportion to vary across relevant characteristics. Specifically, F maps the probability parameter, p, onto the linear model space of Xβ where X is a matrix containing the relevant characteristics and β is a vector of regression weights. F is any link function that compresses the linear model space to make it fit within the probability space. Interestingly, we do not even need to know ti in order to learn about the relationship between X and yi.4
The two-part model
We now extend our basic model into a two-part model. For such cases, where the observations at one or both boundaries of a fractional dependent variable occur with a too large frequency than seems to be consistent with a simple model, or if theory dictates potential effect differences, a better approach may be the employment of a two-part model with a participation and magnitude part. Here, the discrete component is modelled as a binary model and the continuous component as a fractional regression model (Ramalho et al., 2011). The magnitude decision is what we have covered in the basic model. We combine the two parts in one two-part setup in the following way:
With a two-part model, we can connect a linear model and link function to each process. The parameters differ because any one predictor may be associated differently with each part of the model. Even the link functions and the variables that are present in each model do not need to be the same. In the two-part specification above we allow the variables that predict innovation participation and magnitude to be different. Further, if an explanatory variable, x is contained in both Zi and Xi, the two-part model allows for x to be distinctively related to the two processes, both in terms of sign and magnitude. As discussed previously, this is very advantageous for modelling innovation sales. As we show later in the empirical example, we might have variables that are associated with one or both processes inside the zero-inflated binomial mixture.
Two-part estimation
The proposed two-part model uses one part to predict participation and one part to predict the magnitude, but where the magnitude part is allowed to be zero. Participation is simply a binary choice model of the probability of introducing a product or innovation:
The second part of the model governs the magnitude of innovation success. If ti and yi are known, we assume the following conditional mean specification based on our binomial model:
To consistently estimate partial effects, we use quasi-maximum likelihood estimation (QMLE). QMLE fits very well with our desire to invoke as few restrictions as possible and still obtain valid estimates. As long as we pay careful attention to specify the conditional mean correctly, QMLE is consistent even if our distributional assumptions are completely wrong (Gourieroux et al., 1984; Papke and Wooldridge, 1996). Estimation of Eq. (4) by Binomial QMLE is also known as (aggregated) binomial regression, while estimation of Eq. (5) by Bernouilli QMLE is called fractional regression (Wooldridge, 2010b, Chap. 18). Once β has been estimated, average partial effects (APEs) can be estimated by plugging in the estimated parameters and computing the partial effect for each firm in the sample. Finally, we take the average thereby obtaining a consistent estimator for the APE.
When each part has been estimated, other quantities may also be computed. It is possible to compute the total expectation based on both processes similar to the traditional two-part model as described in Ramalho et al. (2011). It can be shown that the total expectation is simply F2(Xiβ2)×F1(Ziβi).
A major advantage of the proposed model is its simplicity. Assuming that the two parts are independent given the covariates, the estimation and interpretation of the two can be done independently. Because these models are standard in modern statistical software, the two-part model is straightforward to implement in practice.
Our model framework’s superiority over the more common two-part setup not taking random zeros into account is demonstrated in Fig. 2. Again, we simulate 20,000 repetitions, this time only from a beta-binomial process with N=400. We vary the number of random zeros and compare the bias and 95% CI coverage for our model to the traditional specification. The participation part of the traditional model suffers the largest bias (D1) increasing steadily as the number of random zeroes increases. This makes sense because the traditional specification does not distinguish between random and structural zeroes. Increasing the share of zeroes that are misclassified as structural, increases the APE bias. The magnitude part shows a slight bias, but it is mostly affected in terms of CI coverage. Importantly, the proposed two-part framework is unbiased with 95% coverage and is completely unaffected by the amount of random zeroes. This demonstrates the clear advantage of implementing a two-part model that can tell the difference between when a firm did not introduce a new product and when it did, but could not attribute any sales to the product.

Fig. 2. Simulation of the bias and coverage of the two-part innovation model versus the traditional two-part model.
Empirical Application: The Effect of Firm Size
In this section, we present an empirical application of the use of the proposed two-part framework for modelling innovation sales. The purpose is to exemplify how a two-part model of innovation sales can be justified and modelled and provide a practical guide for future research. We also show how the results of the two-part model can be interpreted and yield insights not available in a one-part model.
The firm size-innovation sales relationship
The two-part model we propose is still just a statistical model. If you wish to draw valid causal inferences, it is recommended to base the choice of variables on theory using causal graphs (Hünermund et al., 2022; Hünermund and Louw, 2023; Mändli and Rönkkö, 2023). Earlier, we argued that the firm size-innovation sales relation might best be described in terms of a two-part process. This process is depicted as a directed acyclic graph (DAG) in Fig. 3.

Fig. 3. DAG of the causal effect of firm size on innovation sales.
Larger firms have access to greater financial and human resources. Such resources may be regarded as barriers to innovation. Without the necessary financial muscle and know-how, introducing new products becomes difficult for the firm. Thus, larger firms have a greater ability to achieve at least one innovation (Leiponen and Helfat, 2010). This is an example of the fixed-cost barrier that we discussed earlier. Also, recall from our previous discussion that innovation magnitude differs by also indicating short-term commercial success. While easier access to resources such as capital may help the firm overcome the threshold of introducing new products, it might not affect the short-term commercial success of the newly introduced product.
With respect to the direct effect of firm size on innovation magnitude, one may pose two competing hypotheses. On the one hand, the resource-based view of the firm posits that a resource needs to have certain characteristics in order to provide a competitive advantage (Dierickx and Cool, 1989). Since firm size is more a proxy of resource quantity than quality, we suggest that firm size is likely to matter little for short-term commercial success as indicated by innovation magnitude. On the other hand, larger firms may have higher marketing budgets resulting in a stronger brand presence and hence higher sales revenues. In sum, we may pose two competing hypotheses about the effect of firm size on innovations sales:
H1: | Firm size has a positive effect on innovation participation, but no effect on innovation magnitude. | ||||
H2: | Firm size has a positive effect on innovation participation and a positive effect on innovation magnitude. |
Note how these competing hypotheses both have two conditions that must hold to gain support: a positive effect on innovation participation and a positive (H2) or no (H1) effect on innovation magnitude. Having two components makes our predictions more precise thereby putting them at a higher risk of falsification (Aguinis and Edwards, 2014).
We limit our ambition of the empirical application to controlling for two important possible confounding variables driving innovation in general, namely, industry and country dummies. We cannot rule out that some remaining unobserved factors may drive innovation outcomes or firm size. The goal of this empirical example is not to establish any evidence of causality. We see our empirical application as an example where we suggest a potential two-part process relationship to demonstrate the possibilities and usefulness of our proposed two-part framework.
Because H1 implies that firm size is not a direct cause of the magnitude of innovation sales, we set up and perform an equivalence test. In essence, an equivalence test helps determine whether an observed effect is too small to matter and therefore practically equivalent to zero (Lakens, 2017). Equivalence testing is a procedure that enables evaluating informative null results (Harms and Lakens, 2018). First, we need to quantify the smallest effect size of interest (SESOI, Lakens, 2017). We choose the lower-end confidence bound of 0.0467 around the meta-analytic effect size documented by Camisón-Zornoza et al. (2004). For simplicity, we round the SESOI to 0.05. We then use the two one-sided tests (TOST) procedure to test two composite null hypotheses (Schuirmann, 1987): H01:APE≤−0.05 and H02:APE≥0.05. Rejecting both these one-sided tests enables us to conclude that the APE falls within the equivalence bounds and is close enough to zero to practically equivalent (Seaman and Serlin, 1998).
Data and variables
The empirical application relies on data from 2009 from the Management, Organisation, and Innovation Survey, a joint initiative by the European Bank for Reconstruction and Development, and the World Bank Group (EBRD-World Bank, 2010). Descriptive statistics are available in Table 3. After deletion of missing data,5 we have 1458 firm-level observations from 12 countries and 11 industries.
Did not innovate (N=510) | Innovated (N=948) | Total (N=1458) | |
---|---|---|---|
Innovation sales | |||
Mean (SD) | 0 (0) | 0.279 (0.238) | 0.183 (0.234) |
Median | 0 | 0.200 | 0.100 |
[Min, Max] | [0, 0] | [0, 1.00] | [0, 1.00] |
Firm size | |||
Mean (SD) | 218 (334) | 322 (516) | 286 (464) |
Median | 111 | 165 | 150 |
[Min, Max] | [10.0, 3700] | [10.0, 6000] | [10.0, 6000] |
Log of firm size | |||
Mean (SD) | 4.90 (0.880) | 5.22 (0.949) | 5.11 (0.938) |
Median | 4.71 | 5.11 | 5.01 |
[Min, Max] | [2.30, 8.22] | [2.30, 8.70] | [2.30, 8.70] |
Country | |||
Belarus | 20 (4.0%) | 56 (5.9%) | 76 (5.2%) |
Ukraine | 50 (10.0%) | 86 (9.0%) | 136 (9.3%) |
Russia | 56 (11.2%) | 141 (14.7%) | 197 (13.5%) |
Poland | 18 (3.6%) | 74 (7.7%) | 92 (6.3%) |
Romania | 64 (12.7%) | 73 (7.6%) | 137 (9.4%) |
Serbia | 52 (10.4%) | 77 (8.1%) | 129 (8.8%) |
Kazakhstan | 50 (10.0%) | 68 (7.1%) | 118 (8.1%) |
Lithuania | 14 (2.8%) | 78 (8.2%) | 92 (6.3%) |
Bulgaria | 61 (12.2%) | 75 (7.8%) | 136 (9.3%) |
Germany | 38 (7.6%) | 174 (18.2%) | 212 (14.5%) |
India | 79 (15.7%) | 54 (5.6%) | 133 (9.1%) |
Industry | |||
other manufac. | 162 (32.3%) | 355 (37.1%) | 517 (35.5%) |
food | 44 (8.8%) | 132 (13.8%) | 176 (12.1%) |
textiles | 47 (9.4%) | 41 (4.3%) | 88 (6.0%) |
garments | 64 (12.7%) | 44 (4.6%) | 108 (7.4%) |
chemicals | 20 (4.0%) | 47 (4.9%) | 67 (4.6%) |
plastics | 18 (3.6%) | 47 (4.9%) | 65 (4.5%) |
non-metallics | 25 (5.0%) | 33 (3.5%) | 58 (4.0%) |
basic metals | 16 (3.2%) | 15 (1.6%) | 31 (2.1%) |
fabricate metal | 51 (10.2%) | 107 (11.2%) | 158 (10.8%) |
machinery | 31 (6.2%) | 83 (8.7%) | 114 (7.8%) |
electronics | 24 (4.8%) | 52 (5.4%) | 76 (5.2%) |
Respondents are asked whether their firm has introduced any new products and services in the last three years. This question acts an indicator variable of participation. In this survey, 948 (65%) introduced new products or services and are therefore asked about the proportion of sales over the past three years attributable to new products and services. This is effectively the magnitude part of innovation sales. In Table 3, we can see that the minimum innovation sales for the innovators is zero. Eight firms introduced new products but had these new products account for zero proportion of their total sales. As explained earlier, it is expected that some firms with a low probability of making innovation sales will experience no short-term commercial success. Thirty three firms had all their newly introduced innovation account for all of their total sales. The median innovation sales among the innovators is 0.2 indicating that newly introduced products and services account for a minority of the total sales.
Firm size is measured as the number of employees. From Table 3, it is clear that firms that introduced new products are generally larger than non-innovators. The median number of employees for non-innovators is 111 while it is 165 for the innovators. This is an early indication that firm size seems to matter for the propensity to introduce new products or services. The descriptive statistics on the two possible confounding variables country and industry are also available from Table 3.
Data analysis
Our data analysis follows the same basic steps outlined in Villadsen and Wulff (2021a) for fractional one-part models. After having specified the model, we perform specification tests of the conditional mean. Finally, we analyse the results relying on APEs and graphical plots.
Step 1: Model formulation
Based on our DAG (Fig. 3), we specify the following conditional mean specification for the fractional part:
Our DAG is agnostic about the directional and functional form relationship between firm size and innovation sales. We expect that firm size is positively and logarithmically related to introducing new products. While larger firms are more likely to introduce a new product, the effect of adding additional size once a firm has reached a certain size is likely to wear off. Eventually, the positive effect of being larger wears off due to inertia (Mihalache et al., 2012). In other words, we expect the effect to increase at a decreasing rate. To allow diminishing returns to size, we log-transform firm size (Villadsen and Wulff, 2021b).
Step 2: Specification tests
Because the consistency of our estimates depends on the conditional mean specifications being correct, we test the specifications. We use Ramsey’s (1969) RESET test using two powers (Ramalho and Ramalho, 2012) and the generalised goodness of functional form (GGOFF) test (Ramalho et al., 2014). The test results are available in Table 4. Neither the RESET (Participation: χ2=3.63, p=0.057; Magnitude: χ2=0.60, p=0.441) nor the GGOFF (Participation: χ2=3.72, p=0.156; Magnitude: χ2=0.81, p=0.667) tests reject the null hypothesis that the models are correctly specified at an alpha of 5%.
Participation | Magnitude | Total | |||
---|---|---|---|---|---|
coef | APE | coef | APE | APE | |
Log firm size | 0.205*** | 0.0674*** | 0.0412 | 0.0136 | 0.0281*** |
(4.96) | (5.08) | (1.55) | (1.55) | (3.94) | |
TOST (H02:APE≥0.05) | 1.09 | 17.05*** | 3.06*** | ||
(0.8518) | (0.0000) | (0.0011) | |||
RESET test | 3.633 | 0.592 | |||
(0.057) | (0.441) | ||||
GGOFF test | 3.720 | 0.811 | |||
(0.156) | (0.667) | ||||
Observations | 1458 | 1458 | 956 | 956 | 1458 |
Step 3: Results
Coefficients from binary (Mood, 2010) and fractional (Ramalho and Ramalho, 2010) models are sensitive to unobserved heterogeneity. Consequently, we should not compare coefficients across models or assess their magnitude (Hoetker, 2007). Instead, we avoid direct interpretation of coefficients by relying on summary measures and graphical illustrations of APEs (Wulff, 2015). Table 4 includes the main results. We observe a clear difference in the APEs when comparing the participation and magnitude parts. Firm size is positively associated with the probability of introducing new products or services. Increasing the size of log firm size by one unit is associated with around a 0.07 increase in the probability of new introduction. We can reject the null of no effect (z=5.08, p=0.000) and cannot reject that the effect is larger than the SESOI of 0.05 (z=1.31, p=0.9051). We thus conclude that the effect of log firm size on participation is substantial. For the firms that introduced a new product, the APE is around 0.01, meaning that a one-unit increase in log firm size is associated with around a 0.01 increase in the proportion of innovation sales. We cannot reject the null of no effect (z=1.55, p=0.121) and can reject that the effect is larger than the SESOI of 0.05 (z=4.13, p=0.0000). We thus conclude that the effect of log firm size on the magnitude part is practically equivalent to zero. This lends support to H1, but not H2.
Finally, we can compute the total change in innovation sales participation and magnitude. The estimated APE is about 0.03 and while we can reject the null of no effect (z=3.94, p=0.000), we also reject that the APE is large enough to be interesting (z=3.06, p=0.0011). Thus, we can conclude that the total change is different from zero, but not large enough to be of interest. Graphical illustrations of APEs across the range of log firm size are available in Fig. 4. Figure 4(a) shows that the APEs of log firm size are non-constant across the range of log firm size. As expected, the APEs decrease as firms get larger. This is consistent with the idea of diminishing returns to firm size The gained access to resources matters most when firms are small and matters less when firms are already large. Figure 4(b) shows that the APEs are statistically equivalent to zero across the entire range of log firm size. This enforces the conclusion that the effect of log firm size on the magnitude part of innovation sales is equivalent to zero.

Fig. 4. Average partial effect surrounded by 95% CIs.
While log firm size has a substantial effect on introducing new products, log firm size has no practical significance on the magnitude of innovation sales for firms that made new product introductions. In other words, firm size might matter quite a bit for getting a new product launched, but it might not matter for the short-term commercial success of the product. Our analysis also demonstrates the importance of separating the two processes to entangle product introduction effects from actual performance effects. In this case, we reach three different conclusions about the impact of firm size depending on how we compute the APE. This information cannot be obtained by the use of a one-part model.
Examples of traditional models
In this section, we briefly examine what would have happened had we used traditional approaches on the example dataset (Table 5). First, we examine what happens if we drop the random zeroes and estimate the fractional probit model on the magnitude part. The estimates are very close to our proper two-part specification from before. This is not surprising, as this datasets contain only 8 (0.8%) random zeroes limiting the amount of damage an improper handling of random zero can afflict. However, as shown in our simulation earlier, in datasets with a higher percentage of random zeroes this approach can result in a severe loss of statistical power.
Magnitude zeros dropped | Magnitude zero mix | |||
---|---|---|---|---|
coef | APE | coef | APE | |
Log firm size | 0.0388 | 0.0129 | 0.104*** | 0.0270*** |
(1.46) | (1.46) | (3.84) | (3.87) | |
TOST (H02:APE≥0.05) | 17.63*** | 10.81*** | ||
(0.0000) | (0.0005) | |||
Observations | 948 | 948 | 1458 | 1458 |
Second, we examine what would happen if we did not discriminate between random and structural zeroes in the magnitude part. In this model, the estimated APE of 0.0270 is twice as large compared to the proper two-part specification. Furthermore, the estimate is now significant (p=0.0000) suggesting a nonzero effect. This leaves us with a different conclusion than earlier and highlights the benefits of our two-part specification.
Discussion
In a systematic literature review of two decades of management and innovation research, we focused distinctively on the relation between firm size and innovation sales to understand if this widely studied relationship should be understood as a two-part process and to illustrate the value of this estimation technique. We observe a lack of justification for the choice between singular and two-part models, as well as significant heterogeneity in the estimation techniques. To address this gap, we develop a two-part modelling framework specifically designed for innovation sales. The framework presented in this paper is a simple way for researchers to model two-part innovation sales processes while correctly distinguishing structural from random zeroes. It builds on rigorously tested ideas and is easy to implement using modern statistical software. By applying simulations and an empirical example, we demonstrate the usefulness and practicality of our framework, offering guidance for future research endeavours. Our contributions extend to both the innovation literature and the methodological literature on two-part models. We emphasise the relevance of considering a two-part modelling approach when analysing innovation sales relative to total sales, aiming to enhance statistical practices and improve the trustworthiness of research outcomes. To our knowledge, no prior studies have examined the use of two-part models for fractional outcomes with qualitatively different zeros such as innovation sales. This study represents a novel exploration of two-part models for fractional outcomes such as innovation sales, where zero-observations can occur due to two distinct data-generating processes: (1) a firm does not introduce an innovation, or (2) a firm innovates, but the resulting innovation fails to generate any sales.
Practical implications
While our core focus of this paper is to illustrate the underutilised research potential in two-part models for innovation research, our analysis also holds important practical implications. The two-part modelling framework has several implications for advancing our understanding of innovation for both researchers and managers. Relying on the logic of two-part models, managers are provided with valuable insights into the factors influencing the decision to engage in innovation efforts and the subsequent success of those efforts. This potentially provides a more comprehensive understanding of their firm’s innovation outcomes. By applying this framework, managers can identify key factors that influence both the initiation and success of innovation efforts. This can be particularly valuable for strategic planning, as it allows firms to allocate resources more effectively between innovation activities and commercialisation strategies. A focus on how different factors may contribute to the success of internal innovation activities and the external market performance of new products or services allows for more fine-grained decision-processes and a better understanding of firms’ strengths and weaknesses concerning fundamental business processes. By paying attention to the qualitatively different zeroes and distinguishing between the presence of innovation and its commercial success, managers can assess the effectiveness of their innovation strategies more accurately. This distinction helps firms tailor their strategies, whether by fostering a more innovation-conducive environment or by addressing market-entry barriers for new products and services. We suggest that increased utilisation of two-part models opens up avenues for generating new theoretical insights into firms’ engagement in innovation efforts and their levels of success.
Methodological implications for research
The model framework presented in this paper is a simple way for researchers to model two-part innovation sales processes while correctly distinguishing structural from random zeroes. It builds on rigorously tested ideas and is easy to implement using modern statistical software. In our review, we identified a total of 27 papers that relied on a two-part process. However, many did not follow the practices we recommend in this paper. In the remainder of the discussion, we focus on the methodological implications of the analysis and consider some possible extensions and alternative approaches.
Our model framework assumes that we have data that can distinguish between structural and random zeroes. However, this difference might not be directly observable based on the data we have. In such cases, it would be possible to rephrase the magnitude part by including into y the structural zeroes leaving us with ym. Next, we include an indicator variable, ri, on the right-hand side to tease out the effect of the structural zeroes: E(ymi|Xi)=F2(Xiβ2+αri). Following Tang et al. (2018), we could treat ri as a latent variable to be estimated by a different model. Estimation could possibly be done by quasi-limited information likelihood (Wooldridge, 2014), or maybe even the less computationally demanding control function approach (Wooldridge, 2015). Further, we would need the specification of a model for ri, which is likely to be challenging. In sum, the proposed model could potentially be expanded to account for cases where the observed data does not contain information on the difference between structural and random zeroes.
Another possible extension is to assume that the two processes are dependent. Two-part dependency happens when a separate, unobserved process governs the types of firms that innovate. This problem is analogous to the well-known sample selection problem often encountered in management (Clougherty et al., 2016). If unobserved factors affecting the participation process are correlated with factors that influence the magnitude process, the framework proposed here will fail. Schwiebert and Wagner (2015) discussed a generalised approach to accounting for selection effects in fractional models. If a valid instrument is available, researchers may rely on the implementation in Wulff (2019) to cover the magnitude part thereby accounting for process dependence.
The model framework presented in this paper is built around traditional econometric principles. A popular alternative approach for mixture modelling is to expand the focus from the conditional mean to the full distribution of the outcome (Kneib et al., 2023). A distributional approach for innovation sales would assume a distribution consistent with p2, typically a beta distribution, and estimate the model parameters using maximum likelihood (Kieschnick and McCullough, 2003). When combined with a binary model for the participation part, the model is known as the zero-inflated beta (ZIB) model (Cook et al., 2008). However, a ZIB model is likely not a good approach for modelling two-part process innovation sales. First, the beta distribution is not defined for values at the boundaries, i.e., 0 and 1, and as in our empirical example, real data often includes both these values in the magnitude part. Second, even if our data contains no boundary values, the ZIB is inconsistent if yi follows a different distribution than the beta distribution (Wooldridge, 2010a, Chap. 18.6). A feature of the approach proposed in this paper is that it is based on QMLE and thus consistent no matter the true distribution of yi. If innovation researchers wish to apply the ZIB, they should have an actual interest in the conditional distribution. Otherwise, they might be giving up the feature of consistency provided by QMLE.
In summary, our study advances the understanding of innovation by suggesting a framework for analysing innovation sales as a two-part process. This approach not only deepens insights into firms’ innovation decisions and outcomes but also emphasises the value of distinguishing between the intent to innovate and its commercial success. Relying on this understanding, researchers and managers can gain a clearer picture of the complexities of innovation. We hope an increasing use of two-part frameworks can help inspire future innovation management research and ultimately support more effective innovation strategies in practice.
ORCID
Camille Pedersen https://orcid.org/0009-0002-4722-0142
JESPER N. Wulff https://orcid.org/0000-0002-7976-0939
Anders R. Villadsen https://orcid.org/0000-0001-9935-6151
Notes
1 For example, data from Irish Innovation Panel (IIP), Swiss Innovation Panel (SIP), UK Innovation Surveys (UKIS), and the Spanish Technological Innovation Panel (PITEC) survey use measures corresponding to the European Community Innovation Survey.
2 For a detailed discussion of alternative estimators for fractional outcomes, see Wulff and Villadsen (2020) and Villadsen and Wulff (2021a).
3 The binomial assumption that we build our model on is almost always too restrictive for practical applications (Gelman et al., 2020, Chap. 3). In fact, the binomial assumption fails if unobserved firm effects play an important role in the firm’s amount of innovation sales. This phenomenon is also known as unobserved heterogeneity and we suspect that it is unlikely that researchers can convincingly rule it out when modelling innovation sales. We can add this unobserved heterogeneity to the model by applying a beta-binomial density assuming that each firm has its own unique, unobserved proportion of innovation sales As indicated by its name, the beta-binomial is a mixture of a binomial and a beta distribution. We can think of the beta-binomial as a hierarchical model where we start with a binomial distribution and then let p follow a beta distribution. Because of the extra distributional layer, the beta-binomial distribution has a larger variance for a fixed p. In Fig. B.1, we visualise the characteristics of innovation sales as a bounded variable with natural limits and compare the binomial density to the more realistic beta-binomial density for the basic innovation sales model. In Fig. B.2, we demonstrate the more realistic case with unobserved heterogeneity in the two-part model by visualising the zero-inflated beta-binomial (ZIBeta-Binomial). In Fig. B.3, we show the QMLE is consistent even in the face of unobserved heterogeneity.
4 As long as ti is exogenous, we can focus exclusively on the fractional variable, si (Wooldridge, 2010b, Chap. 18). While we would prefer to know both yi and ti, knowing the proportion of innovation sales, si, is enough. This is very convenient since some data sources only contain data on si, not on yi and ti.
5 Data are missing for around 7.8% of respondents with respect to the dependent variable, while around 10.4% are missing information on the size of the firm. Due to the low level of missingness, these observations are removed using listwise deletion.
Appendix A
We began by searching all articles published in a 13-year period from 2007 to 2019 in four leading journals publishing research on innovation: Research Policy, Journal of Product Innovation Management, Strategic Management Journal, and Organization Science. This search resulted in 56 studies and gave us an initial understanding of the relevant research in the area and allowed us to develop search terms for the next steps. Second, we made two searches in Web of Science (WoS) using the search terms. The specific search terms can be seen in Table 7. The first search added studies published between 2000 and 2006 in the four reviewed journals. A second search was made to locate studies published over the entire year range from 2000 to 2019 in all other journals in the “Business” or “Management” categories in WoS. In total, our search yielded 2,976 hits that we screened in further detail. In the end, 84 relevant studies were added. Third, we made a small citation search using a backward search in the articles located in the systematic journal search. We found 17 relevant studies of which only one relevant paper was not a duplicate. In total, 141 studies in our review include one or more estimates of how firm size relates to innovation sales. A total of 27 papers (19%) used a two-part model. See an overview of all studies in Table A.3.
Step | Search strategy | No. of papers | No. of papers using two-part model |
---|---|---|---|
1 | Systematic journal search of all articles in Research Policy, The Journal of Product Innovation Management, Strategic Management Journal and Organization Science from 2007 to 2019 | 56 | 13 |
2 | Web of Science search in Research Policy, The Journal of Product Innovation Management, Strategic Management Journal and Organization Science from 2000 to 2006 | 4 | 1 |
3 | Web of Science Search in Business and Management journals from 2000 to 2019 | 80 | 12 |
4 | Backwards citation search in relevant papers located in Step 1 | 1 | 1 |
Total | 141 | 27 |
Search hits | |||||
---|---|---|---|---|---|
Search Words | SMJa | JPIMa | OSa | RPa | WOSb |
Broad innovation search words | |||||
innovat* output | 11 | 7 | 2 | 29 | 164 |
innovat* performance | 33 | 57 | 9 | 121 | 1.566 |
innovat* success | 2 | 13 | 0 | 13 | 141 |
innovat* sales | 0 | 0 | 0 | 2 | 19 |
innovat* intensity | 0 | 0 | 0 | 7 | 45 |
innovat* indicator | 0 | 0 | 0 | 1 | 6 |
firm innovativeness | 2 | 7 | 3 | 4 | 96 |
innovat* revenue | 0 | 1 | 0 | 0 | 5 |
Variable specific search words | |||||
share of turnover | 0 | 0 | 0 | 1 | 1 |
share* of sales | 0 | 0 | 0 | 0 | 13 |
percent* of total sales | 0 | 0 | 0 | 1 | 6 |
percent* of sales | 0 | 0 | 0 | 0 | 24 |
share of new products | 0 | 0 | 0 | 0 | 2 |
proportion of turnover | 0 | 0 | 0 | 0 | 1 |
percent* sales | 0 | 0 | 0 | 0 | 3 |
percent* of turnover | 0 | 0 | 0 | 1 | 4 |
percent* of firm* turnover | 0 | 0 | 0 | 0 | 0 |
share* of total turnover | 0 | 0 | 0 | 0 | 0 |
percent* of total turnover | 0 | 0 | 0 | 0 | 0 |
sales share* | 0 | 0 | 0 | 0 | 9 |
sales of new product* | 1 | 0 | 0 | 0 | 9 |
sales from new product* | 0 | 1 | 0 | 1 | 3 |
percent* sales | 0 | 0 | 0 | 0 | 3 |
proportion of total sales | 0 | 0 | 0 | 0 | 1 |
proportion of firms’ total sales | 0 | 0 | 0 | 0 | 0 |
product percent* | 0 | 0 | 0 | 0 | 1 |
proportion of sales | 0 | 1 | 0 | 1 | 8 |
percent* share* | 0 | 0 | 0 | 0 | 8 |
product performance | 2 | 77 | 3 | 3 | 396 |
product share | 0 | 0 | 0 | 0 | 0 |
product sales | 2 | 11 | 0 | 3 | 196 |
Authors and year | Journal | One-part | Two-part |
---|---|---|---|
Love et al. (2014) | Strategic Management Journal | x | |
Garriga et al. (2013) | Strategic Management Journal | x | |
Mihalache et al. (2012) | Strategic Management Journal | x | |
Leiponen and Helfat (2010) | Strategic Management Journal | x | |
Roper et al. (2016) | Research Policy | x | |
Beck et al. (2016) | Research Policy | x | |
Higon (2016) | Research Policy | x | |
Arora et al. (2016) | Research Policy | x | |
Howell (2016) | Research Policy | x | |
Grimpe and Sofka (2016) | Research Policy | x | |
Kafouros et al. (2015) | Research Policy | x | |
Roper and Hewitt-Dundas (2015) | Research Policy | x | |
Cappelli et al. (2014) | Research Policy | x | |
Hottenrott and Lopes-Bento (2014) | Research Policy | x | |
Love et al. (2014) | Research Policy | x | |
Berchicci (2013) | Research Policy | x | |
Robin and Schubert (2013) | Research Policy | x | |
Hashi and Stojčić (2013) | Research Policy | x | |
Keupp and Gassmann (2013) | Research Policy | x | |
Roper et al. (2013) | Research Policy | x | |
Sharif and Huang (2012) | Research Policy | x | |
Wu (2012) | Research Policy | x | |
Fu (2012) | Research Policy | x | |
Roper and Arvantitis (2012) | Research Policy | x | |
Lokshin et al. (2011) | Research Policy | x | |
Cuijper et al. (2011) | Research Policy | x | |
Love et al. (2011) | Research Policy | x | |
Cantner et al. (2011) | Research Policy | x | |
Woerter and Roper (2010) | Research Policy | x | |
Eom and Lee (2010) | Research Policy | x | |
Escribano et al. (2009) | Research Policy | x | |
Kirner et al. (2009) | Research Policy | x | |
Weterings and Boschma (2009) | Research Policy | x | |
Aschoff and Sofka (2009) | Research Policy | x | |
Roper et al. (2008) | Research Policy | x | |
Weterings and Koster (2007) | Research Policy | x | |
Sidhu et al. (2007) | Organization Science | x | |
Wadhwa et al. (2017) | Organization Science | x | |
Roper et al. (2017) | Research Policy | x | |
Bourke and Roper (2017) | Research Policy | x | |
Steinberg et al. (2017) | Research Policy | x | |
Hussinger et al. (2018) | Research Policy | x | |
Hewitt-Dundas et al. (2019) | Research Policy | x | |
Tsinopoulos et al. (2019) | Research Policy | x | |
Haus-Reve et al. (2019) | Research Policy | x | |
Bei (2019) | Research Policy | x | |
Spithoven et al. (2010) | The Journal of Product Innovation Management | x | |
Faems et al. (2010) | The Journal of Product Innovation Management | x | |
Duysters and Lokshin (2011) | The Journal of Product Innovation Management | x | |
Ganotakis and Love (2012) | The Journal of Product Innovation Management | x | |
Matsuno et al. (2014) | The Journal of Product Innovation Management | x | |
Dul and Ceylan (2014) | The Journal of Product Innovation Management | x | |
Grimpe et al. (2017) | The Journal of Product Innovation Management | x | |
Mohammadi et al. (2017) | The Journal of Product Innovation Management | x | |
Maria et al. (2018) | The Journal of Product Innovation Management | x | |
Stephan et al. (2019) | The Journal of Product Innovation Management | x | |
Love and Roper (2001) | Research Policy | x | |
Laursen and Salter (2006) | Strategic Management Journal | x | |
Faems et al. (2005) | The Journal of Product Innovation Management | x | |
Leiponen (2006) | The Journal of Product Innovation Management | x | |
Czarnitzki and Kraft (2004) | Small Business Economics | x | |
Caloghirou et al. (2004) | Technovation | x | |
Marsili and Salter (2006) | Technology Analysis and StrategicManagement | x | |
Aschoff and Schmidt (2008) | Review of Industrial Organization | x | |
Liu and Zou (2008) | Journal of World Business | x | |
Fosfuri and Tribó (2008) | Omega-The International Journal of Management Science | x | |
Love and Roper (2009) | Technovation | x | |
Grimpe and Kaiser (2010) | Journal of Management Studies | x | |
Schneider and Veugelers (2010) | Industrial and Corporate Change | x | |
Cockburn et al. (2010) | Industrial and Corporate Change | x | |
Chiang and Hung (2010) | R & D Management | x | |
de Visser et al. (2010) | Technovation | x | |
Neyens et al. (2010) | International Journal of Technology Management x | ||
Laursen (2011) | Science and Public Policy | x | |
Berchicci (2011) | IEEE TRANSACTIONS ON ENGINEERING MANAGEMENT | x | |
Wang (2011) | Technological Forecasting and Social Change | x | |
Kampik and Dachs (2011) | Industrial and Corporate Change | x | |
Cosh et al. (2012) | Small Business Economics | x | |
Andries and Wastyn (2012) | Journal of Knowledge Management | x | |
Grimpe and Hussinger (2013) | Industry and Innovation | x | |
Spithoven et al. (2013) | Small Business Economics | x | |
Wagner (2013) | IEEE TRANSACTIONS ON ENGINEERING MANAGEMENT | x | |
Love and Ganotakis (2013) | International Business Review | x | |
Vahter et al. (2014) | Industry and Innovation | x | |
de Leeuw et al. (2014) | Journal of Business Research | x | |
Sabidussi et al. (2014) | Journal of Engineering and TechnologyManagement | x | |
Wu and Wu (2014) | International Business Review | x | |
Andries and Czarnitzki (2014) | Small Business Economics | x | |
Beck and Schenker-Wicki (2014) | European Journal of International Management | x | |
Gkypali et al. (2015) | Eurasian Business Review | x | |
Yang et al. (2015) | Industrial Marketing Management | x | |
Arvanitis et al. (2015) | Review of Industrial Organization | x | |
Popadic et al. (2015) | Organizacija | x | |
Arvanitis et al. (2015) | International Journal of Innovation Management | x | |
Wang et al. (2015) | Journal of Business Research | x | |
Waldner et al. (2015) | Business Models and Modelling | x | |
Wubben et al. (2015) | International Journal of Technology Management | x | |
Yun et al. (2016) | Science, Technology and Society | x | |
Hecker (2016) | International Journal of Innovation Management | x | |
Chen et al. (2016) | R & D Management | x | |
Wu et al. (2016) | Journal of Business Research | x | |
Bourke and Roper (2016) | Technovation | x | |
Berchicci et al. (2016) | Industrial and Corporate Change | x | |
Alcalde and Guerrero (2016) | International Entrepreneurship and Management Journal | x | |
Piening et al. (2016) | Journal of World Business | x | |
Estrada et al. (2016) | Industrial Marketing Management | x | |
Xia and Liu (2017) | Journal of International Business Studies | x | |
Fernández-Olmos and Ramírez-Alesón (2017) | Technovation | x | |
Monteiro et al. (2017) | Long Range Planning | x | |
Yi et al. (2017) | Technovation | x | |
Teng and Yi (2017) | Frontiers of Business Research in China | x | |
Martinez et al. (2017) | R & D Management | x | |
Sun et al. (2017) | Asian Journal of Technology Innovation | x | |
Gimenez-Fernandez and Sandulli (2017) | Industry and Innovation | x | |
Bayona-Saez et al. (2017) | Management Decision | x | |
Martinez et al. (2017) | Technovation | x | |
Sun et al. (2017) | Management Decision | x | |
Zhang et al. (2017) | Management Decision | x | |
Cordón-Pozo et al. (2017) | International Journal of Technology Management | x | |
Peeters and Martin (2017) | R & D Management | x | |
Xia and Liu (2018) | British Journal of Management | x | |
Hagedoorn et al. (2018) | Journal of Management Studies | x | |
Zouaghi et al. (2018) | Technological Forecasting and Social Change | x | |
Ramírez-Alesón and Fernández-Olmos (2018) | The Journal of Technology Transfer | x | |
D’Agostino and Moreno (2018) | Industrial and Corporate Change | x | |
Raymond et al. (2018) | Information Technology and Management | x | |
Criscuolo et al. (2018) | Industry and Innovation | x | |
Tsai and Huang (2019) | Journal of Service Management | x | |
Medase and Barasa (2019) | European Journal of Innovation Management | x | |
D’Angelo and Baroncelli (2020) | Technology Analysis and Strategic Management | x | |
Carree et al. (2019) | Journal of Engineering and Technology Management | x | |
Hartono and Kusumawardhani (2019) | Global Business Review | x | |
Martinez et al. (2019) | British Journal of Management | x | |
Koo and Lee (2019) | R and D Management | x | |
Fan et al. (2020) | Group and Organization Management | x | |
Christensen et al. (2019) | Small Business Economics | x | |
Adams et al. (2019) | Journal of Business Research | x | |
Bernal et al. (2019) | Industry and Innovation | x | |
Ubeda et al. (2019) | The Journal of Technology Transfer | x | |
Crass et al. (2019) | International Journal of the Economics of Business | x | |
Leiponen (2005) | Industry and Innovation | x | |
Total | 114 | 27 |
Appendix B

Fig. B.1. Illustration of a one-part process for innovation sales relative to total sales. In (A1), we have set the maximum total sales, t, to 100 and plotted the binomial density for selected values of p. The end result of compressing the proportion to the probability space using the logistic transform is what we observe in (A2)–(A3). In (A2) we show how random draws of innovation sales from a binomial distribution might look like. We have let total sales follow a Poisson process where the rate decreases as x increases. In (A3), we observe how the simulated data example shown in (A2) looks when converted to a proportion. Because innovation sales never exceed total sales, the proportion is bounded from above while the non-negative constraints secure a firm bound from below. In the bottom row, (B1)–(B3), the same plots are made for a beta-binomial distribution. Comparing (A1) to (B1), we observe that there is a higher number of ways that a beta-binomial can produce different amounts of innovation sales for any given p resulting in much wider distributions.

Fig. B.2. Illustration of a two-part process for innovation sales relative to total sales. In (A1), we show the density function for ZIBeta-Binomial. For every p each distribution has a spike at zero. This spike arises from the extra capacity for generating zeroes. Zero-inflated distributions do not need a low p to generate zeroes. The higher the proportion of zeroes generated, the more density mass will be distributed towards the spike at zero. The consequence of simulating innovation sales is evident in (A2). Many extra zeroes added by the participation process are made visible as gaps in the innovation sales across the range of the explanatory variable. In the example in (A2)–(A3), the innovation participation does not depend on the explanatory variable but is scattered randomly. In the bottom row, (B1)–(B3), we have simulated a positive effect of x on participation. While the density does not change (B1), the distribution of the zeroes across the range of the explanatory variable is clearly affected (B2). While hardly any firms have innovation sales at low values of x, the zeroes get rarer as x increases and disappear almost entirely at high values. In (B3), we show a similar picture where proportions larger than zero get more frequent as x increases.

Fig. B.3. Simulation of the bias and coverage of QMLE. QMLE can consistently estimate the APE even under distributional misspecification. In this (A1)–(B1), we demonstrate this with a simple simulation for the magnitude part. We simulate 20,000 repetitions from a binomial (A1) and beta-binomial process (B1), respectively. The beta-binomial adds unobserved heterogeneity to the model by assuming that each firm has its own unique, unobserved proportion of innovation sales. From the plot, we can clearly see that QMLE consistently estimates the true APE. As the sample size increases, we can observe the estimated APE converges to the true value. The convergence is exactly the same no matter if the conditional distribution is correctly specified as binomial or incorrectly specified as beta-binomial (A1)–(B1). Note also that the plot is the same for binomial and fractional regression if we do not scale the APE by ti (A1)–(B1). As long as we know the fraction, s, we can compute the APE. The failure of the binomial assumption needs fixing only when it comes to the standard errors. Fortunately, this can easily be fixed by using fully robust sandwich errors, which are computed using standard statistical software. In (A2)–(B2), we demonstrate that we can obtain 95% coverage of the true APE when using robust standard errors to compute 95% confidence intervals. Already at low sample sizes, we can achieve coverage very close to the desired 95%. When we use the beta-binomial to simulate the presence of unobserved firm characteristics, we can observe that coverage suffers mildly at lower sample sizes. This reflects that the data generated are much noisier and more samples are needed to obtain good coverage.