Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Management control system design and effectiveness NYENRODE BUSINESS UNIVERSITEIT Management control system design and effectiveness Proefschrift ter verkrijging van het doctoraat aan de Nyenrode Business Universiteit op gezag van de Rector Magnificus, prof. dr. E.A. de Groot en volgens besluit van het College voor Promoties. De openbare verdediging zal plaatsvinden op woensdag 16 april 2008 des namiddags om vier uur precies door Anne-Marie Kruis geboren op 10 februari 1977 te Leidschendam Leescommissie Promotoren: Prof. dr. R.F. Speklé Prof. dr. G.W.J. Hendrikse (Erasmus Universiteit Rotterdam) Overige leden: Prof. dr. J. Bouwens (Universiteit van Tilburg) Prof. dr. E.A.G. Groenland Prof. dr. F.G.H. Hartmann (Erasmus Universiteit Rotterdam) Nyenrode Research Group (NRG) Internet: www.nyenrode.nl/nrg ISBN 978-90-8980-001-5 Cover lay out by Bingo Graphic Design, Rotterdam Printed by HAVEKA, Alblasserdam © 2008, Anne-Marie Kruis All rights reserved. No part of this publication may be reproduced or transmitted in any form or by any means electronically or mechanically, including photocopying, recording, or by any information storage and retrieval system, without prior permission in writing from the author. Contents Preface.............................................................................................................................................. i Summary.........................................................................................................................................ii Chapter one – Introduction ......................................................................................................... 1 1.1 Point of departure...................................................................................................................... 1 1.2 The main themes ....................................................................................................................... 4 1.3 The nature of this study ............................................................................................................ 6 1.4 Outline of this thesis ................................................................................................................. 8 Chapter two – Configurations of control ................................................................................ 11 2.1 Management Control Systems ............................................................................................... 11 2.1.1 Purposes and dimensions of control............................................................................... 11 2.1.2 Complementarities........................................................................................................... 13 2.2 The transaction cost approach to management control......................................................... 15 2.2.1 The general argument...................................................................................................... 16 2.2.2 Control problems: activities and contingency variables............................................... 18 2.2.3 Control solutions: archetypes ......................................................................................... 20 2.2.4 Matching problems and solutions: effectiveness ........................................................... 22 2.3 Reflections on the theoretical ideas ....................................................................................... 25 2.3.1 The configurational approach ........................................................................................ 25 2.3.2 Occurrence of misfits....................................................................................................... 27 2.4 Concluding remarks ................................................................................................................ 29 Chapter three – Methodology ................................................................................................... 31 3.1 The methodological approach and the choice for a survey .................................................. 31 3.1.1 The methodological framework ...................................................................................... 31 3.1.2 Survey research: method and critique............................................................................ 33 3.2 Design of the questionnaire .................................................................................................... 35 3.2.1 The questionnaire’s set-up – matching the theory......................................................... 35 3.2.2 Focus on activities - matching the level of analysis ...................................................... 37 3.3 Validity-feedback interviews ................................................................................................. 39 3.4 The pre-test of the questionnaire............................................................................................ 41 3.4.1 Purposes of pre-testing.................................................................................................... 42 3.4.2 The Three-Step Test-Interview........................................................................................ 42 3.4.3 Pre-test execution and results......................................................................................... 44 3.5 Sampling and survey administration...................................................................................... 45 3.6 Description of the dataset ....................................................................................................... 47 3.7 Concluding remarks ................................................................................................................ 50 Chapter four - Operationalizing the framework.................................................................... 53 4.1 Mapping management control systems ................................................................................. 54 4.2 MCS Effectiveness.................................................................................................................. 61 4.3 Characteristics of the activities .............................................................................................. 63 4.3.1 Uncertainty....................................................................................................................... 64 4.3.2 Asset specificity................................................................................................................ 69 4.3.3 Ex-post information asymmetry ...................................................................................... 73 4.4 Measuring misfit ..................................................................................................................... 73 4.4.1 Defining the archetypes................................................................................................... 74 4.4.2 Calculating misfit............................................................................................................. 77 4.4.3 The misfit variables ......................................................................................................... 80 4.5 Control variables ..................................................................................................................... 81 4.6 Closing remarks ...................................................................................................................... 82 Chapter five - Analyses and results .......................................................................................... 83 5.1 The relevant archetype............................................................................................................ 84 5.2 A comparison of the subsamples ........................................................................................... 89 5.3 The occurrence of fit and misfit ............................................................................................. 93 5.4 Testing the MCtce..................................................................................................................... 96 5.4.1 The general model ........................................................................................................... 97 5.4.2 Arm’s length control........................................................................................................ 99 5.4.3 Result Oriented Machine Control.................................................................................104 5.4.4 Action Oriented Machine Control ................................................................................107 5.4.5 Exploratory control ....................................................................................................... 111 5.4.6 Boundary control ...........................................................................................................113 5.5 Misfit and the contingency variables ...................................................................................114 5.6 Effectiveness of control and organizational effectiveness .................................................118 5.7 Summary of findings ............................................................................................................120 Chapter six - Conclusion and discussion ...............................................................................123 6.1 The effectiveness of control systems ...................................................................................123 6.1.1 Summary of findings ...................................................................................................... 124 6.1.2 Limitations......................................................................................................................129 6.2 Results related to methodology............................................................................................132 6.3 Future research - another point of departure ....................................................................... 133 6.3.1 Configurational fit and complementarities .................................................................. 134 6.3.2 Uncertainty and the perception of control ................................................................... 135 6.3.3 Effectiveness of control and effectiveness of the organization ................................... 137 6.3.4 MCS design and effectiveness .......................................................................................138 References................................................................................................................................... 139 Appendix A - Description of a management control system ...................................................I Appendix B - Overview of survey questions ......................................................................... VII Samenvatting........................................................................................................................... XVII Curriculum Vitae .....................................................................................................................XIX Preface People who dedicate their lives to scientific research are curious. However, it was more than sheer curiosity that led me to decide on becoming a PhD student. After finishing my studies I felt there was still much more to learn and discover; what better place to do that than at university. I have very much enjoyed my time as a PhD student. I have learned many things, visited great places, and met wonderful people. At this point I would like to thank everyone who has contributed to this research project, each in his or her own way. First of all, many thanks to the students of group 1 MAC3.2/3.3 (2006 and 2007) for collecting the data. I very much appreciate your efforts. Also those people who participated in the pre-test of the questionnaire deserve a warm ‘thank you’. Tony Hak, thank you for discussing the TSTI method with me. I guess we share a passion for methodological issues. Furthermore, I would like to thank my colleagues and fellow PhDs from the Netherlands as well as abroad, for showing me that life in academia is both challenging and fun. Both at Erasmus University Rotterdam and at Nyenrode Business University, I was very lucky to be part of a great team and I thank my colleagues for making me feel at home at university. To Hilco, my officemate, I would like to say thanks for making the office a good place to be. Some people deserve more than a general acknowledgement. I thank Marcel Bonnet for his valuable advice on many issues and for anticipating a perfect fit between PhD student, project, and supervisor. I thank George Hendrikse, my thesis supervisor, for always revealing different lines of thought and showing me alternative perspectives. Special thanks to Roland Speklé, my thesis supervisor, for teaching me so many things about academia and about doing research. Thank you for giving me room to explore and grow, while being there for me providing support and giving advice. I also thank all my friends for believing in me, being there, and distracting me with lots of fun stuff. My parents, my brother, and my sister-in-law I would like to thank for their support, trust, and enthusiasm and for listening to all my stories. Finally, to those of you who wonder what will be next: I intend to keep on doing those things I enjoy and that I am reasonably good at. I am a researcher, I am a teacher, and I am still curious. Therefore, I will stay in academia. Anne-Marie Kruis, Feb. 2008 Summary1 This doctoral thesis describes the set-up and findings of an empirical study of management control system design and effectiveness. Management control systems are packages of control instruments that are used to coordinate the work and to provide incentives through rewarding and punishment. Examples of control instruments are rules and regulations, budgets, performance targets, and bonuses. Organizations choose from a variety of instruments to set up their control systems. However, not every possible combination of instruments will be effective. The workings of one control instrument depend on its combination with the others. They are interrelated. Numerous studies focus on individual control instruments, but there is little empirical work that researches control systems in their entirety. Therefore a lot remains to be learned about manifestations of these systems as packages of interrelated control instruments. Even less is known about the effectiveness of control. With this study I contribute to the existing knowledge base by specifically concentrating on management control systems in their entirety and assessing their relative effectiveness. My central problem is to find out which control systems are effective given the circumstances. The Transaction Cost Theory of Management Control (Speklé, 2001a, 2001b, 2004) provides insights into these matters and is taken as the theoretical basis of this study. The theory submits that five effective forms of control systems (so-called archetypes) exist: arm’s length control, result oriented machine control, action oriented machine control, exploratory control, and boundary control. Each archetype is effective for the control of specific activities, but not for others. Examples of activities are the production of a good or service, human resources, or research and development. These activities are characterised by their level of uncertainty, asset specificity, and ex-post information asymmetry. The theory specifies, for instance, that the archetype arm’s length control is the most effective control system for activities with low levels of uncertainty and low levels of asset specificity. The main claim of the theory is that if the management control system of an organizational unit closely resembles its relevant archetype, it will be more effective in controlling its activities than other management control systems would. The relevance of the archetype depends on the activity of the organizational unit. Using survey data on 258 organizational units, I test these claims empirically. 1 Een Nederlandse samenvatting vindt u achterin dit proefschrift. To enable hypotheses testing, I first need to determine the relevant archetype for each case. Therefore each observation is allocated to one of five subsamples (one for each archetype) according to the characteristics of its activities. The results from a regression analysis show that resemblance of a management control system with the archetype arm’s length control has a positive impact on the perceived effectiveness of the control system. As predicted by the theory, this effect is confined to the group of cases that experience low levels of both uncertainty and asset specificity. For those cases that have result oriented machine control as their relevant archetype, regression analysis shows a null-effect: resemblance of a management control system with the archetype does not affect the effectiveness of the control system. For action oriented machine control, exploratory control, and boundary control the respective subsamples are too small to allow regression analysis. The bivariate analyses used instead provide tentative results that do not support the hypotheses. Other results relate to the research topic more broadly. First of all, I find a positive correlation between the effectiveness of the management control system and the effectiveness of the organizational unit. The strength of this relationship differs across the subsamples that represent the archetypes. Secondly, evidence suggests that uncertainty has a direct negative impact on the perceived effectiveness of the control system. Because uncertainty also influences the design of these systems, its overall role is complex. A final contribution of my study is a methodological one. I explicate all necessary steps to operationalize the theory and to map management control systems in their entirety. Therefore, this thesis exemplifies one approach that enables the study of management control system design and effectiveness as such. I also introduce ‘validity-feedback’ interviews as a powerful method to pre-test the quality of survey instruments. It is generally applicable and provides feedback on the quality of questionnaires beyond that retrieved by other pre-test methods. Finally, to enhance the measurement of uncertainty I set up a general measurement model with formative indicators (instead of reflective ones). This improves the construct validity. The approach can also benefit the future measurement of asset specificity. Chapter one – Introduction 1.1 Point of departure In this doctoral thesis I build directly on the work by Speklé in which he applies Transaction Cost Economics (TCE; Williamson, 1975, 1985, 1991) to study management control problems. He has created the Transaction Cost Theory of Management Control (Speklé, 2001a, 2001b, 2004) and claims that control structures are coherent combinations of control instruments that must be studied simultaneously. Not only does he show what these systems might look like, by articulating a comprehensive framework with several distinct control archetypes, but -using TCE logic- he also explicates the circumstances under which each archetype exhibits the largest relative effectiveness. His theory can be characterised as a contingent configuration theory (Speklé, 2004). Speklé is not the only one who takes on a configurational approach to study control, nor is he the first to apply TCE in this light2. Hofstede (1981), for instance, implicitly adopts the idea of control packages and presents a typology of six different control types: routine control, expert control, trial and error control, intuitive control, judgemental control, and political control. He relates the applicability of each to characteristics of organizational activities. Ouchi (1979, 1980) relies partly on TCE to study three mechanisms of control within organizations: markets, hierarchies, and clans. He explains that the mechanisms themselves overlap in organizations and he identifies several contingency variables that determine which one should be emphasized. Snell (1992) builds from these works and studies use of several human resource management practices by executives. He differentiates between behaviour control systems, input control systems, and output control systems. He explicitly describes these types as combinations of practices that, as a set, establish control. The common thread running through these studies is the identification of several generic control forms. The authors choose similar contingency variables that determine the applicability of these control forms: (1) knowledge about desirable actions/transformation processes, (2) measurability of outputs/results, and (3) ambiguity of goals. These contingency variables are also represented in Speklé’s theory, be it under different names. The generic 2 The aim here is merely to illustrate some main themes of interest by referring to several studies. My approach is eclectic and focuses on management control literature. For extensive reviews of research within the MC literature see, for instance, Otley, Broadbent, and Berry, 1995; Fisher, 1995; Luft and Shields, 2003; Chenhall, 2003. 1 Chapter one control forms represent combinations of multiple control instruments. What remains unclear from the studies, however, is how to balance the different control forms/instruments in a given situation in order to establish an effective control system. It is unknown, for instance, which combination of hierarchical and clan controls (Ouchi, 1980) is effective for the control of a certain organization. Speklé elucidates these matters by specifying five archetypes that represent these balances. He provides rich descriptions of the characteristics of each archetype. This enables the comparison of observed management control systems with the archetypes. Moreover, he makes explicit claims about their relative effectiveness. Thus, building from and combining prevalent knowledge, he adds to our understanding of control the explicit propositions on effective combinations of control instruments. With this study, I test the theoretical claims of the Transaction Cost Theory of Management Control (henceforth: MCtce). This will provide insights into the empirical power of the theory and involves theory refinement. Since its development in 2001, parts of the MCtce have been used in several studies (see for instance, Nicholson, Jones, and Espenlaub, 2006; Widener, 2004; Langfield-Smith and Smith, 2003; Van den Bogaard and Speklé, 2003). Some helpfully apply the theory to explain their findings of case studies (Nicholson et al., 2006; Van den Bogaard and Speklé, 2003). Those authors find their evidence consistent with patterns and control forms as predicted by the theory. Others want to extend the framework either by adding additional contingency variables related to the strategy of the organization (Widener, 2004), or by taking in trust to represent the social context of control (Langfield-Smith and Smith, 2003). As of yet, however, little is known about the empirical power of the MCtce. This doctoral thesis encompasses the first large-scale quantitative study of the theory. Therefore challenges lie ahead in operationalizing the ideas. The goal of my research project is to study control system design and effectiveness. The central problem is to find out which management control systems are effective. Specifically concentrating on management control systems in their entirety, I study about 20 different aspects of control simultaneously. Moreover, I assess the effectiveness of these systems using a newly developed measurement instrument. Studying management control systems, as opposed to studying single control instruments, receives ever more attention. The conceptualization of management control systems as packages of interrelated elements (see for instance, Otley, 1980; Flamholtz, 1983; Abernethy and Chua, 1996) is generally accepted. Moreover, authors stress the importance of assessing these elements of control jointly (Otley, 1980; Holmstrom and Milgrom, 1994) and warn that studying specific instruments in isolation likely leaves models underspecified (Chenhall, 2003). Despite these regular calls for a more holistic approach in studying control systems, little empirical work emerged that 2 Introduction focuses specifically on multiple control instruments and their interrelationships (Otley, Broadbent, and Berry, 1995; Chenhall, 2003). In his review of contingency-based research on management control systems, Fisher (1995) concludes that most of the empirical work focuses on a single contingency variable and a single control variable. Two recent studies form notable exceptions: both Widener (in press) and Bedford (2005) specifically take on a more holistic approach towards control systems. Both studies will be introduced shortly; a detailed discussion follows in later chapters. Using survey data, Widener (in press) studies Simons’ levers of control framework (Simons, 1995) at the company level. She takes in both antecedents and outcomes of control in her models to explain organizational effectiveness. She also explores complementarities among the beliefs, boundary, diagnostic, and interactive systems of control. Bedford (2005) uses cluster analysis to research which combinations of cybernetic controls, administrative controls, and socio-ideological controls are present at the level of the strategic business unit. In this study, I also focus on control systems in their entirety and add to the empirical evidence on control system design. Building from the MCtce, my study starts with explicit theoretical expectations on what combinations of different control elements will be effective. Therefore, I can study the balance of control elements more rigorously than the before-mentioned authors could. Moreover, I focus on effectiveness of control directly as the dependent variable. A lot remains to be learned about manifestations of management control systems, but even less is known about the effectiveness of control. First of all, researchers typically relate aspects of the management control system to one or more contingency variables, assuming effectiveness rather than testing for it (Chenhall, 2003). Secondly, researchers who aim to test control effectiveness as a separate variable take the effectiveness of the organization as a proxy and do not actually measure the effectiveness of the management control system. In so doing, however, they assume linkages between the workings of the management control system and the organizational outcomes, for which there is no compelling evidence (Chenhall, 2003). The only exception is Ferreira and Otley (2005) who develop a measure of the perceived effectiveness of a control system at the company level. Inspired by their work, I set up a new measure of control effectiveness that covers more aspects of control and that is more generally applicable. I also measure effectiveness of organizational units and can thus shed some light on the linkages between the performance of the management control system and unit effectiveness. This study contributes to research within the field in several ways. First of all, we gain knowledge about the empirical power of the Transaction Cost Theory of Management Control 3 Chapter one and engage in theory refinement. Secondly, this large-scale empirical study on management control systems and their effectiveness contributes to our understanding of the workings of control. Moreover, I develop the measurement instruments needed to actually conduct this type of study, working with generally applicable measures and mapping control systems in their entirety. Therefore my study also contributes by exemplifying one approach to conduct this type of study. Finally, along the way, I address a number of methodological issues of broader importance. These issues relate, for instance, to use of the survey method and to scale construction. The following paragraphs describe the main themes of this research project and the nature of my work. This chapter ends with an outline of the thesis. 1.2 The main themes This study is about design and effectiveness of management control systems (MCSs). My main concern is to learn about manifestations of control systems, their resemblance (or lack thereof) with the theoretical archetypes as specified by Speklé, and their contingent effectiveness. The general research problem is to find out which control systems are effective given the circumstances. An introduction to management control systems as such, the theory, and effectiveness of control follows. All managers experience the workings of management control systems and face choices inherent in the design of these systems. A variety of control instruments is at their disposal, for instance financial rewards, budgets, and rules or behavioural guidelines. These instruments should further the accomplishment of organizational goals through their influence on employee behaviour within the organization. We refer to their combination as a management control system3. The systems are used to motivate employees, to communicate organizational goals, and to stimulate or prevent certain behaviour. Because of their influence on employee behaviour, MCSs affect organizational performance. An MCS that does not function well can negatively impact the organization. At best, a dysfunctional control system keeps the organization from obtaining results. The worst case scenario, however, encompasses theft or even bankruptcy when managers exploit resources for their own sake. The application of control instruments serves several purposes. Speklé (2004) indicates as the main purposes (1) knowledge capturing, (2) coordination, and (3) providing incentives through rewarding and punishment (see also Jensen and Meckling, 1992, and Zimmerman, 3 The word ‘system’ here is just a synonym for ‘structure’. This use of terms is consistent with research within the field of management accounting and control. 4 Introduction 2000). The first purpose concerns making knowledge that rests in several parts of the organization available for decision making. The control instruments used in this light are related to the allocation of decision rights. Coordination can be established, for instance, by communicating goals through performance targets. Providing incentives concerns both furtherance of positive behaviour and prevention of behaviour that could harm the company. This can be done, for instance, by using financial rewards and setting boundaries. The control purposes directly relate to the raison d’être of the organization: they further the accomplishment of organizational goals. Management control systems are packages of control instruments. The effects of single control instruments depend on how these are being used as well as on their combination with other instruments. The packages are thought to inhibit synergies (Milgrom and Roberts, 1990, 1995). More importantly, choices already made for certain control instruments within a management control system restrict the available alternatives for choosing additional ones (Speklé, 2004, p11). The control instruments cannot be used in all possible combinations. Therefore, researchers within the field adhere more and more to the study of packages of control instruments or control systems, rather than concentrating on single control instruments. But, what do we know about effective design of control systems? Are some of these systems effective, whereas others are not? Or does it depend on the specifics of the situation which systems best mitigate control problems? A management control system is expected to be effective when it can manage the specific control problems an organization or organizational unit faces. This is the central tenet of the Transaction Cost Theory of Management Control. The theory submits that specific groups of control problems can be predicted to prevail by examining the characteristics of the activities of an organization. As a consequence, the relevant level of analysis is the activity and I study the management control systems that surround these. Looking at organizational units, for instance a business unit or a production department, activities can be a multiplicity of things ranging from the production of a good or service, to research and development activities or human resources. Therefore, to study activities identification of organizational units as such will not suffice. The theory identifies three dimensions over which activities differ: the associated level of uncertainty, asset specificity, and ex-post information asymmetry. Once an activity is characterized over these dimensions control problems associated with the activity can be foreseen. It makes sense to match expected control problems with possible control solutions, which management control systems provide. Not every MCS will be an effective match 5 Chapter one though. The MCtce shows that five theoretical constructs of MCSs (archetypes) differ in their problem solving capabilities and, consequently, that each can handle best a different group of control problems. Thus, the archetypes represent effective control structures, each effective for the control of certain activities, but not for others. This implies that if an MCS of an organizational unit closely resembles its relevant archetype, it will be more effective in controlling its activities than other forms of MCSs would. Using survey data I test these theoretical claims empirically by mapping the MCSs of a large number of organizational units and assessing their contingent effectiveness. 1.3 The nature of this study To learn about the effectiveness of management control systems I draw hypotheses from the Transaction Cost Theory of Management Control. As already mentioned, this is the first large-scale quantitative study of this theory, and overall there is little empirical work on management control systems in their entirety and the effectiveness thereof. Thus, although specific hypotheses drawn from theory will be tested in the chapters to come, for two reasons this study is (at least partly) exploratory in nature. First of all, decisions have to be made regarding the operationalization of the theory. The archetypes, for instance, are described qualitatively and empirical profiles have to be set up. All decisions regarding operationalization still have to be made as part of this research project. Secondly, the theory introduces five archetypes, each effective for the control of specific activities. Therefore, to enable hypotheses testing for all archetypes a variety of activities must be studied. I will compare, for instance, the MCS of a production unit with that of a research and development unit, or the MCS of a sales unit with that of a recruitment unit. I extend prior research by focussing explicitly on control systems in their entirety and deviate from it by studying a range of different activities. A lack of examples from prior research necessitates the development of new measures. This has consequences for the methodological approach underlying my work. In the remaining part of this paragraph I will introduce and illustrate some of the main issues. Regarding data collection, three things are important: first of all, it is necessary to map management control systems (1). Detail is important to pick up a control system in its entirety without loosing track of the nuances of control. At the same time though, only a large number of observations can enable hypotheses testing (2). Moreover, to test hypotheses for all five archetypes I need data on various activities (3). Such variety in activities can be found throughout different industries and companies, and within companies at different hierarchical 6 Introduction levels. This type of data collection necessitates the use of a survey, which can reach all kinds of organizational units each with its own activities, and meet the requirements of providing a large quantitative dataset. To compare such a diverse set of activities I abstain from studying the specifics of the control system and focus on generic forms of control. For instance, I do not document the specific rules or procedures used within an organizational unit, like a rule on handling toxic fluids or rules on private use of the internet. Rather I will ask about whether rules or regulations apply as such and about the consequences of non-compliance. Taking on this abstract view towards control enables the study of generic control structures and hence the comparison of control systems from distinct organizational units. Of course, as already shown above, studying generic control instruments as such is not new. However, the possibilities for building from prior research are limited, because there are hardly any studies that focus on MCSs in their entirety. Similar things go for studying the characteristics of the activities of organizational units. These must also be measured in a way that enables comparison among a large variety of them. Activities are characterised by their level of uncertainty, asset specificity, and ex-post information asymmetry. These variables are difficult to measure, especially in a way that enables comparison across industries (Shelanski and Klein, 1995). Consequently, researchers applying TCE typically study a selected industry or one specific type of organizational unit, such as sales departments (e.g. Anderson, 1985) or internal audit departments (e.g. Speklé, Van Elten, and Kruis, 2007). Again, the possibilities for building from prior research are limited. Opting for a survey necessitates the development of new measurement instruments able to pick up MCSs and to assess their quality in a generally applicable way that enables comparing a variety of organizational units. Verification of the content validity will be of utmost importance. Several questions have to be answered to assess this validity: can we actually grasp general strands of MCSs as wished-for using solely the information obtained by the questionnaire? And, is this picture more or less complete, i.e. will it suffice for our purposes of comparing configurations of control? As a means to address these problems, pre-testing the quality of the questionnaire receives ample attention. I introduce a new technique, referred to as ‘validity-feedback interviews’, as a powerful method to obtain valuable feedback on the content validity of survey instruments. Measurement issues are also dealt with explicitly throughout this work, especially regarding formative versus reflective indicator models. Researchers from marketing and organization recently expressed their concerns about the lack 7 Chapter one of consistent scale construction and the serious consequences thereof (see for instance MacKenzie, Podsakoff, and Jarvis, 2005; Diamantopoulos and Siguaw, 2006). Bisbe, BatistaFoguet, and Chenhall (in press) conclude that in empirical research into management accounting and control systems the issue is rarely addressed and that it is common practice to rely uncritically on reflective models. All in all, two chapters deal with methodological and statistical issues. 1.4 Outline of this thesis The problem of management control system design and effectiveness and the empirical test of the Transaction Cost Theory of Management Control form the core of this thesis. Chapter two starts by elucidating the theoretical ideas. It introduces the Transaction Cost approach to studying control problems and discusses configurational theories. Management control systems are described and their substance is defined. Moreover, in this chapter I set out the research question and hypotheses, and address the underlying assumptions. Chapter three explains the methodology and introduces a methodological framework. The framework highlights a step by step approach to test the hypotheses. Each step involves choices regarding methods. This chapter discusses the general methodology. The development of the questionnaire is the central topic and testing the instrument’s quality receives considerable attention. Chapter three also covers the data collection process and holds a description of the dataset. Next, chapter four provides details on the operationalization of the methodological framework. Separate paragraphs focus on the measurement of every variable and provide descriptive statistics. Chapter five contains the analyses and results of this study. Firstly, some final preparatory steps are taken. Then, I present the results of hypotheses testing, as well as results on additional analyses regarding the resemblance (or ‘fit’) of observed control systems and the archetypes, and the relationship between fit and effectiveness. Moreover, the role of uncertainty and relationships between the contingency variables and MCS design are objects of study. Chapter six presents the overall conclusions and discusses the results, limitations, and future research avenues. Two appendices are added to this thesis. Appendix A holds a description of the management control system of an organizational unit to illustrate the type of information as well as the amount of detail the questionnaire generates. Such descriptions were used during interviews as part of the pre-test of the questionnaire to assess the validity of the instrument. 8 Introduction Appendix B gives an overview of all questions posed. Along with details on each question, like answer categories and sources, I provide notes and guidance on use of the questionnaire. Reading this appendix should enable other researchers to work with (parts of) the instrument themselves should they choose to do so. 9 Chapter two – Configurations of control In this thesis, I adhere to the idea of control configurations and study management control systems in their entirety rather than single control instruments. Explaining contingent effectiveness of control systems is my central concern. This chapter discusses the theoretical basis. First of all, paragraph 2.1 provides a working definition of a management control system. Claiming to study control in its entirety requires a broad view of the MCS as a package of interrelated control instruments. The Transaction Cost Theory of Management Control plays a central role in this research project and will be introduced next in paragraph 2.2. I will discuss the general arguments of the theory as well as the claims it makes regarding MCS effectiveness. Moreover, hypotheses are drawn from this theory. Paragraph 2.3 contains reflections on the theoretical ideas and discusses the configurational approach. It deals with the notion of fit, the occurrence of misfits, and the assumptions about effectiveness. Finally, paragraph 2.4 ends this chapter with concluding remarks and some thoughts on the testing of the theoretical ideas. 2.1 Management Control Systems In the introduction, a management control system was simply referred to as a package of control instruments. Examples of control instruments are the budget, work instructions, performance targets, and bonuses. However, there is ambiguity about what precisely constitutes a control system (Fisher, 1995). In this paragraph, therefore, I introduce the working definition of an MCS that underlies this study. This working definition stems from Speklé (2004) and integrates ideas from the accounting literature and findings from prior research. Firstly, starting from the purposes of the management control system, four dimensions of control are described (2.1.1). Then focus shifts to the interrelationships between them (2.1.2). 2.1.1 Purposes and dimensions of control Within organizations the management control system serves several purposes. Jensen and Meckling (1992) state that the control system provides incentives through performance measurement and evaluation, and through a system for rewarding and punishment. The authors further submit that together with a system that partitions decision rights, the organization’s control system compensates for the lacking ‘invisible hand’ (Smith, 1776) that coordinates activities in the market place. Speklé (2004) adheres to a similarly broad view of 11 Chapter two management control systems. He summarizes the main purposes of the MCS as knowledge capturing, coordination, and providing incentives through rewarding and punishment. To achieve these control purposes combinations of control instruments are used. For instance, making knowledge that rests within the organization available for decision making, can be achieved by delegation of tasks and responsibilities to those who possess the knowledge. Also, use of financial rewards provides incentives and could further certain behaviour. Speklé (2004) refers to four dimensions of control to describe comprehensively the control structure: (1) the allocation of decision rights, (2) use of standards, rules, and regulations, (3) performance evaluation, and (4) rewards and incentives. This is consistent with the prior literature (for instance Zimmerman, 2000; Jensen and Meckling, 1992; Flamholz, 1983), but underlines the role of standards and rules by making that a separate dimension. A specification of each control dimension follows illustrating the control instruments that belong to the dimension. Throughout, I will discuss control from the perspective of a supervisor who exerts control on a manager. The first dimension ‘allocation of decision rights’ incorporates the delegation of tasks and responsibilities by the supervisor to the manager. Choices have to be made regarding how many and which responsibilities to involve. Key control instruments are the level of decentralization and the amount of work autonomy a manager has. Use of standards, rules, and regulations make up a broad and diverse second dimension. For instance, a supervisor can rely on standard operating procedures and work instructions to coordinate the work. This is one type of standardization, commonly referred to as action control (see Merchant, 1982). Other types of rules and standards that focus on employee behaviour, specify what not to do and act as boundary systems4. Rules and regulations thus play different roles within the organization. Another means of coordinating the work is provided by use of performance targets (compare with result control; Merchant, 1982). Different types of these targets belong to this control dimension: financial targets, nonfinancial targets, emergent targets, and (market) benchmarks. A final control instrument that fits this dimension is the budget. Like rules and regulations, budgets can play different roles within organizations. They function, for instance, as a performance target, or serve primarily as a coordination device. 4 The term boundary systems stems from the work by Simons (1995). He explains that boundary systems “delineate the acceptable domain of activity for organizational participants” p39. 12 Configurations of control Regarding performance evaluation, the third dimension, a supervisor’s central concern is the choice of a basis for evaluation. Evaluation can strictly be based on compliance with rules and standards, or hinge on target achievement. For purposes of performance evaluation, a supervisor can also use subjective judgement, or choose to benchmark achievements to long term organizational performance. Naturally, different evaluation bases are present. The final dimension of control represents the reward system and incentives. Both the reward itself and the conditions for being granted one are important. Again, a whole range of possibilities exists: organizations work with a fixed reward, a variable reward, or both. Moreover, circumstances under which a reward is being granted can differ widely. Apart from focusing on the short term benefits related to a bonus, career perspectives play an important role when it comes to rewards and incentives in the long run. The four dimensions of control form the basis of a working definition to study MCSs. Together they represent a broad view of control that is a prerequisite for the study of control in its entirety. Considering these four dimensions each with numerous control instruments, the potential number of distinct MCSs seems endless. However, because of interrelationships between the control elements, only few combinations are expected to be effective. This will be discussed next. 2.1.2 Complementarities Above a management control system is presented as a package of multiple control instruments that together cover four dimensions. However, the effects of control instruments depend on how these are being used as well as on their combination with the other instruments. This idea is consistent with the conceptualization of management control systems as packages of interrelated elements (Otley, 1980; Abernethy and Chua, 1996) that as a set establish control. The following example illustrates how the dimensions connect5. Imagine a situation in which a supervisor delegates tasks and responsibilities to a manager (the first dimension). The manager can only act upon this when he/she is not at the same time required to comply with specific work instructions or regulations (the second dimension). Rather, to guide the manager’s behaviour performance targets may be agreed upon. If the manager’s performance 5 This example only illustrates how the dimensions could interrelate and complement one another. How this works exactly and why, will be explained when discussing the theory in the next paragraphs. Among other things, the explanation hinges on the assumptions made about human behaviour. 13 Chapter two evaluation is based on target achievement, the manager will likely take the targets into account when making decisions (the third dimension). However, if the achievements are not linked to rewards (the fourth dimension), the manager will not face the consequences of his/her actions and might make decisions that do not fit the organization’s goals. The example shows that one dimension of control is not enough to establish effective control within an organization and that control instruments must be balanced somehow. Not every potential combination of control instruments will be effective. As Zimmerman (2000) points out, the systems that partition decision rights, evaluate performance, and reward and punish, compose a three-legged stool. The dimensions must be balanced. The reason is that complementarities among the elements of control exist. Holmstrom and Milgrom (1994) stress the importance of complementarities among elements of an organization’s structure. In their analytical study, they show that elements of the incentive system complement each other and influence each other’s performance. Hence interrelationships are believed to exist between the different elements of control systems. However, due to a lack of studies that address the workings of packages of control instruments (Otley et al., 1995), there is little empirical evidence. Two recent studies (Bedford, 2005, and Widener, in press) provide us with some insights. Both focus explicitly on management control systems, instead of on separate elements of control. Bedford studies management control configurations of strategic business units. He incorporates cybernetic controls, administrative controls, and socio-ideological controls in his study. Using cluster analysis he can distinguish and interpret meaningfully four distinct groups of control systems. He cannot relate the groups systematically to the strategy, environment, or size of the strategic business units. However, the evidence suggests that configurations of control exist, i.e. not every possible combination of control instruments is present in his sample. In her study of Simons’ levers of control framework, Widener (in press) finds evidence that suggests the levers are complementary. Using survey data obtained from chief financial officers, she finds a positive association between the beliefs system and each of the other levers of control (the boundary, the interactive, and the diagnostic systems; Simons, 1995). More emphasis on the beliefs system implies more emphasis on the other levers as well. Moreover, she finds that the joint effect of the four levers on firm performance is stronger than the sum of their separate effects. This suggests that the systems are complementary and should be studied simultaneously to understand performance effects. However, whereas Widener’s study suggests that the joint effect on performance might be stronger, Abernethy and Chua (1996) stress in their longitudinal case study on control system design, the risk of 14 Configurations of control misinterpreting performance effects of single controls when several instruments act as substitutes. The studies mentioned above together underline the importance of studying control elements in combination. If we expect from the large number of control instruments that a variety of MCSs might be used by organizations, but that because of complementarities within control systems, only few will be effective, the key question is which combinations are effective. The theoretical answer to this question is provided by the Transaction Cost Theory of Management Control (Speklé, 2001a, 2001b, 2004). The theory submits that only a few combinations of control instruments are effective. The actual number of distinct MCSs will be limited, because choices already made for certain control instruments within one of the dimensions restrict the available alternatives for choosing additional ones from the other dimensions (Speklé, 2004, p11). From the theory it becomes clear how the elements of control must be balanced to establish effective control. The next paragraph introduces the theory. 2.2 The transaction cost approach to management control The Transaction Cost Theory of Management Control (Speklé, 2001a, 2001b, 2004) provides insights into which control systems are effective and under what circumstances. In his theory Speklé applies Transaction Cost Economics logic (TCE; Williamson, 1975, 1985, 1991) to management control problems. The general approach is the same as in regular TCE and we come across familiar concepts like uncertainty and asset specificity. The two perspectives share a concern for the relative effectiveness of alternative governance structures. As Williamson (1985) states, effectiveness is established by assigning transactions (which differ in their attributes) to governance structures (which differ in their adaptive capacities and the associated costs) in a discriminating way. In terms of management control this translates into the discriminating alignment of activities (i.e. transactions) and management control systems (i.e. governance structures). An introduction to the Transaction Cost Theory of Management Control (henceforth: MCtce) follows, starting with the general argument. I explain the contents of the theory in three steps. Firstly, I focus on control problems as they exist in activities executed within organizational units. Secondly, I discuss management control systems as devices that provide control solutions. Then, the last paragraph brings it all together matching problems with solutions. I will confine myself to those parts of the theory relevant to the study of MCSs at the level of the organizational unit. The MCtce, for instance, also deals with hybrids (i.e. outsourcing of 15 Chapter two activities) which fall outside the scope of this study (for a detailed overview of the complete theory see Speklé 2001a). 2.2.1 The general argument The MCtce revolves around the idea of matching activities and their associated control problems with management control systems that provide control solutions in an economizing way. The theory submits that control problems are inherent in the carrying out of the activities of an organization or organizational unit (Speklé, 2004). Specific activities will induce specific control problems and by studying activities it can therefore be predicted which control problems will occur. Moreover, the theory introduces five control archetypes as effective forms of control. Each archetype is relatively the most effective in controlling certain groups of activities. Establishing effective control comes down to matching control problems and control solutions. Thus, control problems are related to the activities of an organizational unit. For the purposes of this study, I define an organizational unit as consisting of a manager and all employees reporting (in)directly to him/her. Organizational units exist at different hierarchical levels throughout the organization. Activities of these units can be a multiplicity of things ranging from the production of a good or service, to marketing or Human Resources, to finance or ICT. Control problems originate from the interaction of the specifics of an activity and human traits. Two characteristics of human beings come into play: bounded rationality and opportunism. Bounded rationality refers to people’s inability to take in and process all available information (Simon, 1957). It prevents people from taking the most efficient actions possible and from making optimal decisions. Opportunism is defined as “self interest seeking with guile” (Williamson, 1975, p255). People acting opportunistically can cause loss of resources or damage the organization’s reputation. The role of these characteristics and the severity of their consequences depend on the situation, i.e. the activity of the unit. Different activities incorporate different information needs and bring about diverse opportunities for opportunistic behaviour. Hence different activities cause various control problems. On the one hand, these control problems involve a lack of knowledge dissemination throughout the organization, so that information cannot be made available for decision making or performance evaluation. On the other hand, they are associated with waste of resources when employees act in their own interest instead of applying themselves to organizational goal attainment. Taking into account the two human traits, the MCtce predicts which control 16 Configurations of control problems will arise given the specifics of the unit’s activity. In sum, under the assumptions of bounded rationality and opportunism, variation in activities gives rise to variation in control problems. Control solutions are provided by management control systems. MCSs mitigate control problems, for instance, by providing means of knowledge capturing, and by curbing opportunistic behaviour through incentives. However, specific control systems provide solutions only to certain control problems, but not to all. Their relative capabilities to confront control problems differ. Key to effective control, then, is to create as close a match as possible between activities and management control systems. The theory introduces five ideal types or archetypes as effective forms of control and relates those to specific activities. By characterizing the activity of an organizational unit, the relevant archetype for that unit can be determined. Because the archetypes represent effective forms of control, MCSs that resemble their relevant archetype will be more effective than those that do not. Figure 2.1 gives an overview of the theoretical ideas. Figure 2.1 Matching control problems with control solutions bounded rationality opportunism activity of organizational unit (UNC, AS, IA) control problems control solutions MCS of organizational unit archetypes represent an effective match Note: UNC = uncertainty, AS = asset specificity, and IA = ex-post information asymmetry Source: Speklé 2001b, modified The MCtce can be characterised as a contingent configuration theory. Doty and Glick (1994, p232) define configurational theories as follows: “… (configurational theories) identify multiple ideal types, each of which represents a unique combination of the organizational attributes that are believed to determine the relevant outcome(s).” The MCtce introduces five archetypes, and explicates the circumstances under which each is effective thus taking on a contingent configurational approach. In the next paragraphs I will elaborate on both control problems associated with activities and their solutions as provided by the archetypes. Moreover, I will explain the main argument regarding MCS effectiveness. 17 Chapter two 2.2.2 Control problems: activities and contingency variables From the general argument it follows that control problems are inherent in the activities of an organizational unit and occur because human beings characterised by bounded rationality and opportunism carry out the activities. The MCtce compares activities over three characteristics: their level of (1) uncertainty, (2) asset specificity, and (3) ex-post information asymmetry. Scores on these three variables together characterize the activity and give rise to specific control problems, whereas each aspect has its own unique influence. Uncertainty or complexity surrounding an activity creates a lack of ex-ante knowledge about the activity. This knowledge concerns both processes (how best to carry out an activity) and outcomes (what results can we expect). Within TCE-based research, we encounter multiple guises of uncertainty. Generally, researchers distinguish environmental uncertainty and behavioural uncertainty (see Rindfleisch and Heide, 1997; Boerner and Macher, 2001). Regardless of the type, the consequences are always a diminished capability to program activities (Speklé, 2004). When uncertainty is low, programming of activities is possible. Hence, supervisors are able to identify the tasks to be performed and can set up specific instructions. In case of high uncertainty, predefining tasks appears difficult or even impossible and control has to be exercised in alternative ways. Control problems related to dissemination of information and (hence) goal alignment will be present. Note that high uncertainty is problematic because of bounded rationality. In the presence of uncertainty, the limits of rationality could be reached, so that it becomes too costly or impossible to obtain the information required (cf. Williamson, 1975). If it were possible, for instance, to document all potential scenarios and establish detailed procedures for each, control problems surrounding uncertainty would vanish. Uncertainty becomes problematic only in the presence of the second characteristic of an activity, asset specificity. In case of low asset specificity the market provides all required knowledge about transactions through the price mechanism, regardless of the level of uncertainty. However, if asset specificity is present to a nontrivial degree (Williamson, 1985, p60), the market cannot provide the required knowledge and the effects of uncertainty cannot be averted. Studying the hierarchy, only moderate and high levels of asset specificity are considered here. Asset specificity concerns opportunity costs of investments made to support the activity (Speklé, 2004, p12). Central concern is whether investments made in specific assets can be redeployed (Williamson, 1985). If the assets cannot be redeployed, the investments will be 18 Configurations of control considered a loss in case of determination of those activities that exploit the assets. Imagine, for example, machinery that can only produce one component. If it cannot be used for the production of anything else, nor be of any use in a similar organizational unit of another company, opportunity costs of investing in this machinery are significant and asset specificity is high. This situation gives rise to control problems. The organizational unit that produces the component cannot rely on multiple potential buyers. Moreover, the unit that needs the component cannot obtain it elsewhere. The investments, thus, create dependencies that in turn leave room for opportunistic behaviour, which consequently has implications for control needs. Six kinds of asset specificity have been distinguished (Williamson, 1991): site specificity, physical asset specificity, human asset specificity, brand name capital, dedicated assets, and temporal specificity. One or more of these forms could be relevant for activities of an organizational unit, determining asset specificity as such. The presence of asset specificity creates control problems related to opportunistic behaviour. In case of moderate levels of asset specificity, market benchmarks are available to assess performance (Speklé, 2004). Despite some tailoring to specific company needs, supervisors can more or less compare the activity of the organizational unit to alternatives in the market and curtail opportunism to some extent. Controlling activities with high asset specificity calls for a different approach. These activities simply do not have a market alternative that could function as a disciplining device benchmarking performance. Supervisors have to turn to administrative performance targets or rules and regulations to confront opportunism. Finally, activities inhibit different amounts of ex-post information asymmetry6. In organizational and management control literature alike, information differences between actors are a familiar concept. The MCtce, however, assigns a distinctive role to information asymmetries that exist at the end of a period (i.e. when performance evaluation takes place)7. Once an activity has been carried out, a supervisor might not be able to evaluate actually delivered performance due to substantive information differences. What exactly has been achieved? And, did the manager achieve the best results possible or not? Like before, this situation offers possibilities for opportunistic behaviour and calls for specific control approaches. 6 Speklé uses both the terms information impactedness and information asymmetry. The former implies that it is either too costly or impossible to transfer information (Williamson, 1975). Thus, information asymmetry is a direct consequence. I will only focus on information asymmetry because of the more regular use of this terminology within MC and organizational research. 7 Ex-ante information asymmetries are part of uncertainty and not treated separately. 19 Chapter two Uncertainty, asset specificity, and ex-post information asymmetry jointly define the activity. Each characteristic has its own effect on control needs, but its relevance depends on the combination of all three. For instance, by definition information asymmetries are irrelevant in case of low uncertainty. These variables are treated as contingency variables in this study. They are three different aspects of activities that interact and therefore need to be dealt with simultaneously. Like standard TCE, the MCtce relies on two traits of human beings, -bounded rationality and opportunism-, to explain why control problems occur. This might seem very strict. Not all human beings will act opportunistically at every opportunity. However, it is unnecessary to assume that this is the case. It suffices to claim that at least some people are opportunistic sometimes and therefore control problems exist. Note that the MCtce deviates somewhat from standard TCE based research in its choice for the contingency variables. Instead of focusing on frequency as the third variable, the theory stresses the role of ex-post information asymmetry. Frequency cannot play a distinguishing role in explaining different forms of control within the hierarchy (Speklé, 2004). Studying management control design choices within the hierarchy, frequency will always be relatively high because activities with a low frequency will be left to the market (Williamson, 1985). Therefore, to explain differences in the effectiveness of management control structures, the MCtce gives ex-post information asymmetries a more prominent role. This variable also stems directly from TCE (see for instance Williamson, 1975), and is characterised as a derivative condition that arises from uncertainty and opportunism. 2.2.3 Control solutions: archetypes Generally speaking, management control systems provide solutions to control problems. However, not all control systems are effective under all circumstances. The Transaction Cost Theory of Management Control articulates a framework of five archetypes: arm’s length control, result oriented machine control, action oriented machine control, exploratory control, and boundary control. The archetypes exist of a combination of different control instruments that as a set provide effective control solutions. The archetypes derive their effectiveness from their ability to deal with specific control problems in a coherent way. The archetypes are not necessarily equally effective (also see 2.3.1), but each archetype is the relatively most effective for the control of specific activities. The theory gives rich descriptions of all archetypes covering the four dimensions of control. An introduction to the archetypes follows. Table 2.1 gives an overview of their main characteristics. 20 Configurations of control Table 2.1 Characteristics of the control archetypes Arm’s Length Control Machine Control (Result Oriented) Machine Control (Action Oriented) Structure Relative autonomy; involvement higher level management limited as long as performance is satisfactory Decentralized with clearly defined areas of responsibility and accountability Standardization Market-related outcome requirements; external performance benchmarks Performance assessment relative to ‘the market’ Monitoring and performance evaluation Reward & incentive structure Performance dependent bonuses Exploratory Control Boundary Control Well-defined tasks; strict hierarchy; limited room for discretionary behaviour Relatively flat hierarchy; fluid and permeable matrix-like project structures; vague responsibilities Relative autonomy within defined boundaries Predefined performance targets of administrative origins Standardization of behaviour; detailed rules, norms, and instructions No ex ante standards and targets; ‘do your best’; emerging standards Proscriptive codes of conduct; boundary systems; emphasis on behaviour to be avoided Monitoring focused on target achievement; performance assessment relative to targets Monitoring and supervision to ensure compliance to norms and standards Based on emerging standards; subjectively assessed contributions to long term organizational performance Focused on compliance; observance of interdictions; external audits Performance dependent bonuses No direct link between performance and rewards Career prospects dependent on long term past performance; peer pressure Emphasis on threat of punishment of rulebreaking behaviour; tie-in through ‘hostages’ Source: Speklé, 2004 Discussing control again from the perspective of a supervisor who exerts control on a manager, with arm’s length control the supervisor keeps at a distance. The manager has a lot of autonomy and is not subjected to all kinds of rules and regulations specifying what to do. Performance evaluation takes place relative to the market. Thus market benchmarks act as performance targets. In case of satisfactory performance the manager receives a bonus. The second and third archetypes are two variants of machine control: action oriented machine control and result oriented machine control. Standardization of behaviour and detailed work instructions epitomize the action oriented variant. The manager has little autonomy and his/her compliance with rules and standards will be monitored and provides the basis for performance evaluation. There is no direct link between performance and rewards. In stark contrast to this is machine control of the result oriented kind. With this archetype, the manager also has clearly defined responsibilities, but focus is on achievement of performance targets instead of on compliance with rules and regulations. However, performance targets as part of machine control are administrative targets, internally determined. Within this result oriented variant the manager will be rewarded a bonus when he/she achieves the targets. 21 Chapter two The fourth archetype is exploratory control. A main characteristic of this archetype is a lack of ex-ante work instructions and rules. The supervisor defines general goals and gives the manager considerable leeway to act upon them. Moreover, preset performance targets are absent and emergent targets take over. During the carrying out of an activity, information about what can be achieved becomes available for evaluation purposes. The supervisor uses subjective judgements to assess performance and looks at performance in the long run. The manager awaits good career prospects instead of short term bonuses as a reward for his or her job. Boundary control, the fifth archetype, provides the manager with a very high autonomy, restricted only by prohibitive guidelines or boundary systems. There are no performance targets or work related instructions to coordinate the work. The supervisor keeps at a distance and the only thing monitored is compliance with the proscriptive rules. Non-compliance has serious consequences for the manager’s career. 2.2.4 Matching problems and solutions: effectiveness To summarize the argument: uncertainty, asset specificity, and ex-post information asymmetry jointly characterize activities of organizational units. The carrying out of these activities by human beings characterised by both bounded rationality and opportunism creates control problems. Different MCSs can handle different control problems. Therefore some systems control certain activities more effectively than others. Five archetypes represent effective control forms. Each archetype is the relatively most effective for the control of specific activities. From these theoretical insights it follows that a management control system of an organizational unit should resemble its relevant archetype in order to be effective. Which archetype is relevant depends on the activity of the organizational unit. The main claim is: the closer an observed MCS of an organizational unit resembles its relevant archetype, the more effective it will be. Figure 2.2 gives an overview of the archetypes and the activities they control best. 22 Configurations of control BOUNDARY CONTROL Low ARM’S LENGTH CONTROL MACHINE CONTROL action oriented and result oriented Moderate High Low High Uncertainty EXPLORATORY CONTROL Ex-Post Information Asymmetry High Figure 2.2 Archetypes of control and their habitat Asset Specificity Source: Speklé, 2001b, p76 In case of low uncertainty and moderate asset specificity, arm’s length control provides an effective solution. Activities are programmable and market-based standards are available. Use of market benchmarks in guiding behaviour and evaluating performance is thus both feasible and natural. From this the first hypothesis follows (cf. Speklé, 2004): H1: In case of low uncertainty and moderate asset specificity, resemblance of an MCS with the archetype arm’s length control is positively associated with MCS effectiveness. Combined with a high level of asset specificity, low uncertainty calls for machine control. Whether the result oriented or the action oriented type prevails, depends on “the ability to define meaningful and sufficiently restrictive output targets” (Speklé, 2004, p 22). It is a forced choice because the available information will enable the one or the other form. To distinguish between the two types of machine control, I take on the measurability of outputs as an extra contingency variable. The variable functions as a proxy representing the definition of output targets. Researchers generally use this variable to indicate the appropriateness of action versus result controls (see for instance Merchant, 1982 and Ouchi, 1980). When asset specificity is high, market-based standards are no longer available to supervisors. However, if the outputs can be quantified, output control can still provide an effective solution using administrative targets instead of market benchmarks. Result oriented machine control is the relevant archetype. Therefore, the second hypothesis claims: 23 Chapter two H2: In case of low uncertainty, high asset specificity, and high measurability of outputs, resemblance of an MCS with the archetype result oriented machine control is positively associated with MCS effectiveness. When asset specificity is high, but it is impossible to quantify the outputs, result control is no longer feasible. Due to the programmability of activities though, rules and behavioural guidelines provide control solutions. Thus, action oriented machine control reigns. From this hypothesis three follows: H3: In case of low uncertainty, high asset specificity, and low measurability of outputs, resemblance of an MCS with the archetype action oriented machine control is positively associated with MCS effectiveness. High levels of uncertainty make programming of activities difficult or impossible. Standard operating procedures or specific performance targets cannot be used. Other forms of control will be effective depending on the level of ex-post information asymmetry. Sometimes it is possible to evaluate performance afterwards. This is the case when standards that can be used for evaluation purposes, emerge during the process of activity execution. Accordingly, expost information asymmetry is low and exploratory control is the best option. However, standards do not always emerge during the process and ex-post information asymmetry reaches a high level. Consequently, control is hardly possible at all and supervisors can only rely on prohibitive rules specifying what not to do rather than what to do. The archetype boundary control will be effective in this situation. From this line of reasoning hypotheses four and five follow: H4: In case of high uncertainty and low ex-post information asymmetry, resemblance of an MCS with the archetype exploratory control is positively associated with MCS effectiveness. H5: In case of high uncertainty and high ex-post information asymmetry, resemblance of an MCS with the archetype boundary control is positively associated with MCS effectiveness. 24 Configurations of control 2.3 Reflections on the theoretical ideas This paragraph highlights some aspects of the theory that deserve more attention, and firstly, elaborates on configurational thinking (2.3.1). I focus on the notion of fit, which is a central concept in configurational as well as contingency thinking. Next, subparagraph 2.3.2 gives explanations on why misfits would occur. The assumptions about effectiveness and its link to misfit also deserve our attention and will be discussed as well. 2.3.1 The configurational approach Previously, the Transaction Cost Theory of Management Control has been characterised as a contingent configuration theory. Configurations or archetypes represent internally consistent packages of organizational elements (Doty and Glick, 1994). Only a few configurations are effective, because of the interrelationships between the elements. These theoretical assumptions make the approach well suited for the study of management control systems as interrelated packages of control instruments. In this subparagraph I discuss the main features of the configurational approach and the implications for testing the hypotheses. The configurational approach incorporates multiple levels of theorizing (Doty and Glick, 1994). At the level of the grand theory, the link between contingency variables, archetypes, and effectiveness is relevant. The hypotheses stated above are direct tests of the theory at this level. Configurational approaches also incorporate middle-level theories that relate to the internal consistency of the archetypes. Doty and Glick (1994) explain that the configuration of organizational elements has a synergistic rather than an additive effect. These middle-level theoretical ideas will also be addressed in this study, be it in an exploratory way. The configurational approach incorporates a specific notion of fit. Generally speaking, ‘fit’ is the state that leads to effectiveness: there is a match between the organizational structure and the situation in which the organization operates (Donaldson, 2001). In this study, the level of fit depends on the resemblance of an observed management control system with its relevant archetype. Which archetype is relevant depends on the combination of the contingency variables. For instance, for the control of activities with low levels of uncertainty and asset specificity, arm’s length control is the relevant archetype. If the observed MCS in this situation does not resemble the archetype, there will be a misfit and a negative effect on performance. Different concepts of effectiveness exist and different notions of fit. The perception of fit central to this research project can be characterised as configuration contingency fit (in terms of Gerdin and Greve, 2004), as (contingent) ideal types fit (cf. Doty, 25 Chapter two Glick, and Huber, 1993), or as systems fit (in terms of Drazin and Van de Ven, 1985). Distinguishing features of this type of fit are (1) that it concerns multiple structure variables (combined in the archetype) as well as multiple contingency variables, and (2) only a discrete number of fit states exist. Misfit is always expected to influence performance negatively. When dealing with configurations one should compare the effectiveness among members that belong to the same archetype but differ in level of resemblance (Doty et al., 1993). It is predicted that members that marginally resemble the configuration are less effective than those that closely resemble it. Thus, to test configuration theories one should not assign cases to groups according to their relevant archetype and then compare effectiveness among the archetypes. The question arises whether one should compare effectiveness among the archetypes at all. Is the archetype arm’s length control as effective as boundary control if matched to the activities? Doty and colleagues state that configurational theories incorporate the equifinality8 assumption: “multiple equally effective organizational forms exist” (1993, p1199). They assume each ideal type is equally effective: there are more means to an end. Regarding the Transaction Cost Theory of Management Control, the equality or difference in effectiveness among the archetypes is not addressed explicitly, but the equifinality assumption seems inappropriate. Speklé (2001a) submits that the archetypes are each effective given the circumstances, but they represent also the best we can do, and boundary control, for instance, is characterized as “a control structure of last resort”. Assuming equifinality has consequences for the way configurational theories can be tested. For instance, Doty et al. (1993) test relationships between misfit and effectiveness for Mintzberg’s (1983) organizational types using the complete sample. For each case they firstly determine the relevant archetype and thus the relevant misfit score, but in their test they use all cases simultaneously. Under the assumption of equifinality this is perfectly sound. However, when the archetypes are not thought to have the same level of effectiveness, one can only test the theoretical claims for subsamples and compare only those cases that share the same relevant archetype, or alternatively, somehow correct for the differences in the levels of effectiveness each archetype can attain. In chapter five that sets out the analyses, I will explain how I have dealt with these issues in this study. 8 The concept of equifinality can be interpreted in different ways. For an overview and discussion, see Donaldson, 2001. 26 Configurations of control 2.3.2 Occurrence of misfits To test the hypotheses as formulated above, variation in the level of misfit as well as variation in the level of effectiveness must be observed. The underlying assumption is that the way in which organizational units establish control is not by definition effective. Some MCSs are more effective than others depending on the level of fit, in this case their resemblance with the relevant archetype. This implies that misfits will occur in practice. In his Transaction Cost Theory of Management Control, Speklé (2004) mentions a natural tendency towards matching MCSs to activities (because that is effective), yet suggests that mismatches occur routinely. There are several reasons to expect the occurrence of deviations from the archetypes. Regarding the alignment of transactions and governance structures, Williamson (1996) refers to the complexities of organization as well as men’s limited cognitive abilities to explain the occurrence of failures in alignment. It is also acknowledged that effectiveness is the most important, but not the only determinant of control structure. Besides, mismatches can occur during times of change. Contingency literature assumes that states of misfit occur and can last for some time, when one moves from one state of fit to another (Donaldson, 2001). In sum, there are multiple reasons to presume that misfits occur regularly. Thus from theory, I expect that there will be a general tendency towards matching the archetypes, but misfits will occur as well. This triggers another question: can misfits exist for single control elements? Or, put differently, can an organization gradually move from one archetype to another? Paragraph 2.1 described a management control system as a package of interrelated control elements, stressing complementarities among the elements. The MCtce submits that only specific combinations (represented by the archetypes) are effective. The theory does not explicate to what extent deviations from the archetypes are possible, or whether adjustments to the control system accord with changes in the contingency variables. However, in their review of contingency forms of fit in management accounting research, Gerdin and Greve (2004) state that researchers who advocate the configurational approach, assume change to occur by ‘quantum jumps’, rather than via gradual adjustments. Organizations are expected not to adjust their systems until the performance effects are severe enough to justify the costs associated with substantial change. This view differentiates the configurational approach from the contingency approach as illustrated in figure 2.3. 27 Chapter two Figure 2.3 Contingency versus configurational fit Panel A Panel B S T R U C T U R E S T R U C T U R E CONTINGENCY CONTINGENCY Source: Gerdin and Greve, 2004, p306; modified Panel A shows the contingency view to fit. Multiple states of fit exist that are all on the fitline. A deviation from the line implies a state of misfit, but organizations can move along the line and gradual adjustments of the organizational structure are possible (see also Donaldson, 2001). Panel B represents the configurational approach to fit. Only a discrete number of fit states are possible (indicated by the three circles). To move from one state of fit to another a quantum jump has to occur. In all cases, organizations that are in a state of fit show superior performance over those that are in misfit. Change of control systems as such is not under scrutiny in this research project, but assumptions about the occurrence of misfit and the relationship between misfit and effectiveness are relevant. These assumptions have implications for modelling the relationships (for a debate on these issues see Gerdin, 2005a/2005b, and Hartmann, 2005). The theoretical form of the relationship between misfit and effectiveness of control remains unclear. Does a single misfit immediately impact effectiveness? When the complementarities among the control elements are strong this might be the case. Or, is there a minimum level of misfit to be reached before performance effects occur? Burton, Lauridson, and Obel (2002) provide some empirical insights. They study a model with multiple misfits for small- and medium-sized enterprises. They distinguish between the situational context (environmental conditions and technology, for instance) and characteristics of the organization (decentralization and formalisation, for instance) to identify multiple states of fit. The authors show that firms with no misfits perform significantly better than firms with misfits. Moreover, they conclude that neither the type nor the number of misfits matter. A single misfit can lead to a loss in performance. The issues raised in this paragraph have implications for the tests of 28 Configurations of control the theoretical ideas and for the models used in the analyses. The next paragraph looks ahead and renders thoughts on the theory’s empirical test. 2.4 Concluding remarks To research the central question of this project – which MCSs are effective given the circumstances – I build from the Transaction Cost Theory of Management Control (Speklé, 2001a, 2001b, 2004). This theory provides a theoretical answer to the research question. It submits that there are five effective forms of management control systems referred to as archetypes of control: arm’s length control, result oriented machine control, action oriented machine control, exploratory control, and boundary control. Each is effective only for the control of certain activities, but not for others. In this project, I study organizational units with various activities. Examples are production, human resources, research and development, and sales. These activities are characterised over three dimensions: the level of uncertainty, asset specificity, and ex-post information asymmetry. The effectiveness of the archetypes is contingent on these characteristics. For instance, arm’s length control is the most effective control structure for activities with low uncertainty and low asset specificity. From this the main claim of the theory follows: the closer an observed MCS resembles its relevant archetype, the more effective it will be. Five hypotheses are formulated to test this main claim, one for each of the archetypes. The MCtce is a complex theory that studies multiple contingency variables and multiple archetypes and combines these to explain effectiveness of control. It is also a far-reaching theory, which pertains to all kinds of activities. Looking into different activities ultimately implies comparing organizational units as distinct as production and research and development. Therefore, to test the theory and learn about management control system design, I take on a generic approach towards control instead of trying to study all the specifics of control systems. To study control in its entirety I will work from the broad working definition of management control systems and map all four control dimensions. Apart from measurement issues, testing the theory involves the operationalization of the archetypes and the hypothesised relationships. The theory describes the archetypes and the situations in which each is effective in a qualitative manner. Choices have to be made regarding the definition of the archetypes, the boundaries to be set that distinguish high and low levels of the contingency variables, and the appropriate link between misfit and effectiveness. Establishing a match between the theoretical ideas and the empirical constructs 29 Chapter two will be a central concern throughout. The next two chapters deal with the methodology of this study (chapter three) and the operationalization of the concepts (chapter four). 30 Chapter three – Methodology Data form the basis of every empirical study and their quality is a necessary (but not sufficient) condition for making valid inferences about theoretical claims. Thus, to be able to conclude anything about management control system design and effectiveness, firstly reliable measurement of these variables must be ascertained. As I will argue below, the appropriate method for this study is the survey method. Survey research has received a lot of criticism as to the quality of the data obtained. Central concerns are matching the theory with the empirical level of analysis, applying the survey method correctly, and assuring content validity. This chapter shows how I have confronted this critique and assured the quality of my data. I introduce a new technique, referred to as ‘validity-feedback interviews’, as a powerful method to obtain valuable feedback on the content validity of survey instruments. This chapter provides firstly an overview of the general methodological approach. A methodological framework illustrates all necessary steps that enable hypotheses testing. Moreover, the choice for survey based research will be justified and the critique on survey studies discussed. Then, paragraph 3.2 describes the development of the questionnaire which led to a basic set-up. Next, the validity-feedback interviews are introduced (3.3) and the extensive pre-test the questionnaire was put through is described (3.4). In paragraph 3.5 I discuss sampling and the data gathering process. This chapter ends with a description of the dataset (3.6), and concluding remarks on the validity and reliability established through the methodological approach (3.7). Chapter four will present details on exact questions asked and deals with the measurement of individual variables. 3.1 The methodological approach and the choice for a survey This paragraph explicates the methodological steps needed to test the theoretical claims. From that the choice for the survey method follows. Next, I discuss the critique survey research has recently received. 3.1.1 The methodological framework The last chapter set out the theoretical ideas underlying this research project. To learn about MCS design and effectiveness I work with the Transaction Cost Theory of Management Control that explicates several archetypes as well as the circumstances under which each is effective. The main theoretical claim is that the effectiveness of a management control system 31 Chapter three increases with the resemblance of the control system to its relevant archetype. I focus on measuring misfit as the lack of resemblance. Testing hypotheses that relate misfit to effectiveness of control will provide insights into the empirical power of the theory. To enable such hypotheses testing I set up a framework that shows all necessary methodological steps (figure 3.1). Each step involves choices regarding methods, with central concerns the validity and reliability of data collection, scale construction, and measurement of variables. Figure 3.1 Methodological framework define define activity activity determine determine relevant relevant archetype architype compare compare MCS MCS with with relevant relevant archetype archetype map MCS link misfi misfitt to to link effectiveness effectiveness measure measure MCS MCS effectiveness effectiveness First of all, the framework indicates that data collection should return information on management control systems in their entirety and on their effectiveness. This will be done for various activities (for instance, production or human resources), since activities form the level of analysis and I study MCSs surrounding those. Therefore, as a separate step, it is necessary to identify the activities and capture the characteristics thereof. Scores on the three contingency variables, uncertainty, asset specificity, and ex-post information asymmetry, jointly define the activity. The theory indicates the relevant archetype for different groups of activities. In the next step I use the scores on the contingency variables to determine the relevant archetype for each case. For instance, for an activity, which scores high on uncertainty, moderate on asset specificity, and high on ex-post information asymmetry, the relevant archetype is boundary control. According to the theory, resemblance of an MCS with its relevant archetype leads to effectiveness. Therefore, once the relevant archetype for each case is determined, the next step comprises calculation of a misfit score expressing the lack of resemblance of the MCS to its relevant archetype. This step necessitates translation of the archetypes’ qualitative descriptions into quantitative scores for every single element. Finally, misfit and effectiveness of the management control system have to be brought together in a testable model. In sum regarding data collection, three things are important: first of all, it is necessary to map management control systems (1). Detail is important to pick up a control system in its entirety without loosing track of the nuances of control. At the same time, though, only a large 32 Methodology number of observations can enable hypotheses testing (2). Moreover, to test hypotheses for all five archetypes I need data on a vast variety in activities (3). Such variety in activities can be found throughout different industries and companies, and within companies at different hierarchical levels. A survey can reach all kinds of organizational units each with its own activities, and meet the requirements of providing a large quantitative dataset. Moreover, applying the survey method can return detailed information on numerous organizational aspects simultaneously. Archival data are generally unavailable for studies on management control systems (cf. Ittner and Larcker, 2001), especially at lower levels within the hierarchy (think, for instance, about the amount of work autonomy). Moreover, to be able to compare multiple activities and test the theoretical claims, I need to abstain from specifics of the control system and focus on generic forms of control. This generic approach limits the use of available archival data, which are tailored to the organization and difficult to compare over distinct organizational units. Case studies would easily provide enough detail, but lack volume. Thus, survey research can best serve the purposes of this study and is the logical choice. However, survey research requires the operationalization of the constructs, which brings some challenges. First of all, precise definition of an MCS ex-ante is necessary. The challenge is to balance the amount of detail desired against the number and complexity of questions feasible within the survey method. Moreover, taking on a generic approach towards control and focussing on variety in activities implies comparing, for instance, a production unit and a research and development unit in terms of their control structure and its effectiveness as well as the characteristics of their activities. This requires the use of generally applicable measurement instruments. As will become clear below, a lack of examples from prior research limits their use. Apart from having to deal with the critique survey research receives, this study thus faces the invention of new measurement instruments able to pick up MCSs in their entirety and to assess their quality in a generally applicable way that enables comparing a variety of organizational units. Therefore, verification of their content validity will be of utmost importance. 3.1.2 Survey research: method and critique The survey method provides researchers with a powerful tool for conducting empirical research. However, use of the method within accounting research receives much criticism. This critique relates to two main concerns. The first concern is the quality of the match between theory and empirical operationalization, specifically between the theoretically implied level of analysis and the measured level of analysis (Luft and Shields, 2003). A 33 Chapter three mismatch has serious consequences, because it leaves us unsure of whether indeed the theory has been tested or something else. Authors recommend to define precisely the constructs and to be specific about the relationships between them. The second concern with current survey research involves preventing both measurement error and sampling bias. Critique in this regard stresses the lack of effort put into using available techniques to prevent such biases (Ittner and Larcker, 2001; Zimmerman, 2001) leading to data of poor quality. Bias introduced through, for instance, non-random sampling impedes generalization of findings. Moreover, low response rates can distort neatly designed samples. Several techniques are available to curb these problems (for suggestions see Dillman, 2000). Furthermore, measurement error leaves us unsure of content validity: have we actually measured what we intended? Pre-testing a questionnaire proves an effective tool in addressing this problem. Multiple methods are available, yet only few researchers deploy them. Van der Stede, Young, and Chen (2005, p670) find only 23% of all studies part of their assessment on the quality of survey research indicate a pre-test of their instruments. Moreover, researchers report only sparingly on their pre-tests, leaving others not only in the dark about the quality and benefits of these tests, but also without examples of good practice. All in all, Van der Stede et al. hit the nail on the head in concluding that “the key issue with the survey method, (…), centres more on how it is deployed, rather than with the method itself” (2005, p656). The indicated problems with the survey method can thus be overcome by taking the design and execution of the method seriously. The rest of this chapter shows how I have confronted this criticism to obtain data of high quality. Assuring the link between theory and empirics through the operationalization of constructs is our first concern. To establish validity, it is important to be informed by the theory (1) and to put the questionnaire up to pre-testing (2). Both issues will be dealt with extensively in paragraphs 3.2, 3.3, and 3.4. Reliability can be anticipated by using multiple items/questions to measure a single construct, but is being assessed post hoc using statistics (for instance, Cronbach’s Alpha)9. The second issue in survey research concerns preventing selection bias and assuring a reasonable response rate. Both are part of the sampling strategy and the survey administration. I explain the strategy chosen and the results in terms of the dataset obtained in paragraphs 3.5 and 3.6. 9 The next chapter presents details on the measurement of all variables, together with an assessment of their construct validity and reliability. 34 Methodology 3.2 Design of the questionnaire This paragraph gives an overview of how the questionnaire came into existence and thereby addresses the first critical comment on survey research: assuring a match between theory, measurement of constructs, and empirical level of analysis. It starts with a description of the questionnaire’s set-up and explains how it was obtained. Next, attention shifts to creating a focus within the questionnaire to match the theoretical and empirical levels of analysis. The questionnaire’s design consisted of several stages. Starting from the theory, I set up working definitions for all the constructs. Firstly, careful translation of variables’ working definitions into measures led to a basic set-up, which experts reviewed. Two rounds of pre-testing the quality of the questionnaire subsequently resulted in a final version. Figure 3.2 summarizes the design process of the questionnaire. Figure 3.2 Questionnaire design phases translation of working definitions into measures and questions expert review; finish basic set-up (Tailored Design Method) pre-test with two phases (TSTI plus validityfeedback interviews; 16 interviews) finish questionnaire: start data collection first pre-test (TSTI; 6 interviews) revision, feed forward loop to theory second pre-test (TSTI plus validityfeedback interview; 10 interviews) Note: TSTI = Three-Step Test-Interview (explained in the next paragraph) 3.2.1 The questionnaire’s set-up – matching the theory The purpose of the survey is to map management control systems in their entirety, measure the contingency variables, and assess control system performance in order to test the theoretical claims. The information retrieved should enable comparison between activities as distinct as production and Research and Development. This calls for both a high correspondence to the Transaction Cost Theory of Management Control and generally applicable measures that focus on generic control forms. To assure a match with the theory, I build directly from the definitions given in chapter two and turn to prior research as much as possible. The next chapter provides details on individual questions and variables. The questionnaire’s basic set-up was subjected to an expert review in which correspondence between questions and theory was judged. 35 Chapter three Starting of with the rich qualitative descriptions of the archetypes (see table 2.1) I set up a working definition of a management control system covering all four dimensions of control. Next, I turned to both accounting and organization literature in search of measurement instruments. Books by Van de Ven and Ferry (1980) and Price (1997) were especially helpful in finding instruments for parts of the MCS. However, the possibilities for building from prior research are limited, not because researchers do not study general types of control instruments, but because there are hardly any studies that focus on MCSs in their entirety. This triggered the invention of new measures, especially to pick up on the interrelationships between the separate dimensions of control. Instead of focusing on every detail of an MCS, general strands of control form the object of study. For instance, I do not document the specific rules or procedures that are used within organizational units, but ask about whether rules or regulations apply and about the consequences of non-compliance. Taking on this abstract view towards control enables the study of generic control structures and hence makes comparison of various organizational units and their MCSs possible. The theory makes explicit claims about the effectiveness of control. Therefore, this variable should be picked up by the questionnaire as well. However, for MCS effectiveness there is no ready-made measurement instrument available either. Not only does the search for generally applicable measures complicate matters, but also a lack of studies that assess control effectiveness as such. Researchers do not actually measure effectiveness of management control systems, but use the effectiveness of the organization or organizational unit as a proxy (cf. Chenhall, 2003). In so doing, however, they assume linkages between the working of the management control system and the organizational outcomes, for which there is no compelling evidence (Chenhall, 2003). The only exception is Ferreira and Otley (2005) who develop a measure of the perceived effectiveness of a control system at the company level. Inspired by their work, I have developed a new measure of MCS effectiveness that covers more aspects of control effectiveness and is generally applicable. Apart from mapping the management control system and assessing its effectiveness, the questionnaire must be able to identify the activities. Generally applicable measures should pick up the characteristics of these activities in terms of uncertainty, asset specificity, and expost information asymmetry. Considering the enormous amount of empirical TCE-based studies (for recent overviews see David and Han, 2004 and Boerner and Macher, 2001), one would expect to find ready-made measures for these key variables. However, both asset specificity and uncertainty have been measured in very diverse ways (David and Han, 2004; Boerner and Macher, 2001) generally tailored to specific industries or types of organizational units. The variables are difficult to measure, especially in a way that enables comparison 36 Methodology across industries (Shelanski and Klein, 1995). Since I need generally applicable measures these examples are of limited use. Moreover, both asset specificity and uncertainty are broad constructs that can take on multiple guises. Most researchers focus on single aspects, typically human asset specificity and unpredictability (Rindfleisch and Heide, 1997; Boerner and Macher, 2001). If one can reasonably assume ex-ante that one particular type of, for instance, asset specificity will be most important, this approach seems justifiable. However, my study compares very distinct activities for which such assumptions cannot be made. To the contrary, the most important type of both asset specificity and uncertainty likely differs between activities. To establish a close resemblance between theory and operationalization, I measure multiple guises of both constructs. Although I build from prior research whenever possible, my approach deviates substantially and multiple new measurement instruments were developed. Overall this approach resulted in a basic set-up of the questionnaire that contained 81 questions, half of which are newly developed. The design was made in accordance with the Tailored Design Method (Dillman, 2000). The practical recommendations Dillman provides in his book were taken on for general design (how to pose a question), lay-out (navigation, back cover), and writing a cover letter. 3.2.2 Focus on activities - matching the level of analysis The relevant level of analysis is the activity and management control systems surrounding these should be mapped by means of the questionnaire. Examples of activities are sales, accounting, and the production of a certain good or service. To test the theoretical claims I study a variety of activities found at different hierarchical levels throughout the organization. This complicates the ex-ante identification of one activity to ask managers about, because it is entirely possible that managers head an organizational unit with multiple activities, where the unit is their initial point of reference, not the separate activities10. Consider what happens without maintaining an explicit focus on one activity. Managers could probably still answer the questions on management control instruments in use, but would take average scores for all the activities of their unit. Think for instance about the level of work autonomy of unit employees. This might differ between groups of employees working on different activities. In fact, theory predicts that it would. At the outset, it is difficult to assess the full scale and the impact of the problem. If there is only one activity in an organizational 10 Indeed in my sample, 88 percent of the respondents indicated that within their organizational unit multiple activities are carried out (see 3.6 for a description of the dataset). 37 Chapter three unit, there is no problem. If multiple activities are carried out in a unit, but do not differ in terms of the contingency variables, there is no problem either. It would still be possible to relate MCSs to certain types of activities. However, for cases in which dissimilar activities are combined, the theory cannot be tested. The empirical level of analysis would not fit the theoretical one and render tests of the theory invalid. Both the expert review and the pre-test returned anecdotal evidence showing that simply asking managers about control of their unit’s main activities without specifying them, cannot solve the problem. Not only did managers regularly identify several ‘main’ activities within their organizational units, they also indicated differences in control of these activities. This became most apparent during the first pre-test phase when a manager started answering questions on the coordination of work twice: one answer pertained to people working on administrative tasks (half his employees) and another to controllers working as internal consultants (the other half)11. The case is all the more interesting because the manager also indicated differences between these groups in programmability of tasks (an aspect of uncertainty) and in weeks of training needed to perform the job (an aspect of asset specificity). Clearly, when studying management control system design, taking the organizational unit as a level of analysis is different from taking the activity as the focal point. Moreover it is problematic to ask about aspects of control without specifying activities first. It is thus necessary firstly to identify for each organizational unit its main activity. After that it is possible to ask about the characteristics of that activity and the MCS surrounding the activity. To assure a focus on activities as the relevant level of analysis, the questionnaire starts by illustrating what is meant by an organizational unit. Later on managers are asked to list up to five activities of their unit and choose one to focus on throughout when answering questions on the MCS and the contingency variables. Several reminders help to maintain this focus. Figure 3.3 visualizes organizational units (in the questionnaire referred to as subunits). Recall that a manager and all employees reporting directly or indirectly to him or her comprise the unit. The same picture appears on the first page of the questionnaire. 11 The question posed was: “how do you coordinate the work within your unit?” with items: planning, standard operating procedures/work instructions, detailed assignments, assignments stated in general terms with little detail, proposals made by employees followed by consultation with you. Scale rang: 1 (not at all) – 5 (always). The question has been modified since and has number 54 in the questionnaire. 38 Methodology Figure 3.3 Organizational units and focus of research CEO manager manager subunit this is your subunit your supervisor manager (this is you) manager employee employee manager subunit manager employee employee 3.3 Validity-feedback interviews Central to this research is the study of management control systems in their entirety instead of the study of single control instruments. The questionnaire should thus be able to map these control systems. Moreover, it should do so for a whole range of different organizational units. Because I extend prior research by explicitly studying control systems in their entirety and deviate from it by focussing on a whole range of activities, two questions must be addressed: can we actually grasp general strands of MCSs as wished-for using solely the information obtained by the questionnaire? And, is this picture more or less complete, i.e. will it suffice for our purposes of comparing configurations? Such questions relate to the content validity of the survey instrument and can be partly assessed using existing methods for pre-testing like discussing the contents with peers or conducting interviews with respondents on clarity of questions. A more sophisticated method, like the Three-Step Test-Interview (Hak, Van der Veer, and Jansen, 2004; Jansen and Hak, 2005), provides more detailed information and feedback. However, these methods generally provide feedback on single questions, not on the content validity of the overall design. This is especially true when the overall survey is extensive and complex, which complicates maintaining an overview of all information. 39 Chapter three I want to obtain information beyond that retrieved by existing methods and apply a new powerful technique to assess the content validity of the survey as a whole: validity-feedback interviews. The basic idea of this technique is simply to describe the object of study (in casu the management control system) using solely the answers filled out on the questionnaire, and to check the validity and the completeness by comparing the description to the theory, and discussing it with the respondent. In the complete process, firstly, I ask a potential respondent to fill out the questionnaire. Then, purely based on his/her answers, I make a description of the MCS of his/her organizational unit and the effectiveness thereof as well as the characteristics of the unit’s activities. Numerical scores on the items are translated back into sentences. Consider, as an example, the following question: 14) How much does your supervisor use subjective judgements to evaluate your performance? scale: not at all (1), very little (2), somewhat (3), quite a bit (4), very much (5) The respondent circles 4. In the description this answer becomes: ”The unit supervisor uses subjective judgements quite a bit to assess the manager’s performance”. This provides an overview of information obtained when one can solely rely on the survey. During a follow-up meeting with the manager, I pose three general questions to discuss the description: 1. Do you recognize your unit from the description? 2. Do you feel important aspects/characteristics are missing that (also) have an influence on motivating employees and directing their behaviour? 3. Does the description hold any incorrect or false statements? The interviews return two types of feedback: first of all the descriptions provide information on whether we pick up on all aspects of interest. Moreover, the discussion with respondents ascertains we can actually make valid claims about practice. The validity-feedback interviews were held as part of the overall pre-test to be discussed next. Appendix A holds one of the descriptions made of a management control system to illustrate the type of information as well as the amount of detail the questionnaire generates. It is based on the final version of the questionnaire as used for data collection. 40 Methodology 3.4 The pre-test of the questionnaire As a next step in the development of the questionnaire, the basic set-up was put up to pretesting. As mentioned before, pre-tests help assess the quality of a questionnaire and are a prerequisite for solid survey research. One of the critical observations on survey research within accounting is the lack of pre-testing (Van der Stede et al., 2005). Researchers tend to rely on measures simply because they have been used before. However, often there is little known about measures except their use as such. When used in a new study measures are usually modified in numerous ways. General matters are different order of questions, their combination with other questions, and adjustments in wording (however small). Even in the rare occasion that one can use the exact same question or part of a questionnaire, different settings intrinsically change the situation. Researchers cannot predict the impact of these changes without testing. Therefore, all measures should be included in the pre-test, even when they have been used and tested before (cf. Van der Stede et al., 2005, p671, footnote 12). To illustrate this point, consider a result from my pre-test that well illustrates the type of problems addressed and the kind of information obtained through pre-testing. Starting from a study by Anderson (1988), I measure environmental unpredictability as one of the guises of uncertainty. She uses four items that relate to instability associated with environmental turbulences12. Respondents indicate on a seven-point scale whether the market for their product line was simple or complex (1), stable or volatile (2), easy or difficult to monitor (3), and certain or uncertain (4) (see Anderson, 1988, p616). In my questionnaire, I asked to describe the business environment of the organizational unit over the same four dimensions. Two managers of the same company who are faced with the same business judged their business environment completely different. One indicated it was simple, stable, and certain, while the other claimed the exact opposite. To explain this afterwards they compared their environment with the situation a hospital patient on intensive care is in. One can interpret this as uncertain and unstable, but when the patient’s situation is diagnosed ‘stable’ one can look at it from a different perspective as well. Because of this result, the question was deleted from the questionnaire for returning unreliable answers. 12 The original measure holds nine items. The other items are related to another type of uncertainty; hazards of entering new markets. These items are not generally applicable to the variety of activities I study. 41 Chapter three The pre-test conducted in this research project existed of two phases. Both phases and the results of the test will be described, but first subparagraph 3.4.1 examines the purposes of pretesting, and the subsequent paragraph discusses the pre-test method applied apart from the validity-feedback interviews. 3.4.1 Purposes of pre-testing Pre-testing a questionnaire encompasses a host of actions ranging from establishing face validity when discussing the contents with peers, to conducting interviews with respondents on clarity of questions. Purposes and methods of pre-tests vary accordingly. Here pre-testing serves several purposes. Firstly, the quality of individual questions needs to be assessed. Respondents must understand the questions. Wording might be ambiguous and answer categories might be missing. Apart from concentrating on single items, the quality of the overall design is assessed. Is there enough space for comments? Is it clear where to start and when to skip a question? How long does it take respondents to complete the questionnaire? These are general goals when pre-testing any survey. Addressing these issues will benefit response rates and data validity because it enhances the likelihood that questions are understood the way they are intended. On a different level, but equally important, is assessment of content validity. In pursuit of this pre-test goal the relevance of the questionnaire as a whole comes under scrutiny. I use two methods to test the quality of my questionnaire. Apart from the validity-feedback interviews already discussed, I use the Three-Step Test-Interview (Hak et al., 2004), which will be introduced next. 3.4.2 The Three-Step Test-Interview This study relies on the Three-Step Test-Interview (TSTI), a method especially designed for pre-testing self-completion questionnaires (Hak et al., 2004)13. Aim of the TSTI is to observe what happens when a respondent encounters a questionnaire as he would had he received it on his own desk. Characteristic to the method is concurrent think-aloud, a form of cognitive interviewing, which entails the process in which verbal information about responding to a draft version of a questionnaire is being collected (cf. Beatty, 2004, p8). 13 I want to thank Tony Hak for discussing my first experiences with the TSTI with me and providing helpful feedback and stimulating advice. 42 Methodology The TSTI consists of three steps. The first involves the concurrent think-aloud. During this step the interviewee provides insights into his thought process by verbalizing his thoughts when reading and responding to the questionnaire’s items. The researcher merely acts as an observer. This step returns detailed data on clarity of questions, on associations certain words elicit, on whether the interviewee can provide an answer (at all), at what point problems occur and many more. The second step involves completing this think-aloud data by filling gaps on information still missing. An interviewee might stop to express what he thinks at some point or might skip a question without giving the reason. Probing will return the required data at this stage. The type of question asked is: “You shortly paused at question 21. What were you thinking of?”14 The third and last step consists of a regular retrospective interview in which a respondent can be asked to give his overall opinion on the questionnaire, or discuss its contents and the relevance of single questions. In their 2004 article Hak et al. provide a manual with practical guidelines for use of the TSTI. This makes the method generally available for use. I closely followed the manual and opted for field interviews instead of a laboratory setting. At the start of a meeting with each participant of the pre-test, the think-aloud technique was explained and practiced. All interviews were recorded. There is an ongoing debate on both theoretical and empirical validity of cognitive interview techniques. Can thought processes really be revealed and under which circumstances? And, can the effectiveness of these interviews be proved? Willis (2005, Ch13) discusses these issues in his evaluation of cognitive interview techniques. His overall judgement is positive and he advocates their use. Beatty (2004) also reflects on the use of cognitive interviewing and concludes that there is no consensus on the best way to conduct it. He also points out a lack of practical guidelines, for example on how many interviews to conduct and whom to talk to. Consequently a lot is left to the researcher. Hak et al. (2004) provide a manual for use of the TSTI, but cannot solve these practical issues either. How many rounds of testing to conduct, which people to interview, and when to stop asking questions remain judgement calls. During its development, Hak and several other researchers tested the TSTI in three pilot studies (see Hak et al. 2004). In a separate study the productivity of the method is compared to an expert review in which the authors test a questionnaire on alcohol consumption (Jansen and Hak, 2005). Results show that the TSTI 14 Notice the difference with this question: “why did you give this answer?” The difference lies in asking about a respondent’s thoughts a moment ago when answering the question and asking what he is thinking now, in retrospect (cf. Hak et al., 2004, p7). 43 Chapter three produces data beyond that obtained using other methods like an expert review or interviews based on probing. 3.4.3 Pre-test execution and results The pre-test consisted of two consecutive interview phases. The first was held during October and November of 2005 and the second during January and February of 2006. A total of sixteen interviews have been conducted with managers of organizational units who are potential respondents. Both men and women from profit and not-for-profit organizations working at different hierarchical levels participated. Their units were production units, Research and Development, Human Resources, project units, Finance and Accounting, consultancy, recruitment, and Information Technology. While selecting pre-test participants, I specifically tried to cover those organizational units/activities for which each of the archetypes might be relevant. Each interview lasted one and a half hours. Throughout the TSTI was used as an instrument for pre-testing. During the second pre-test phase this technique was complemented with the validity-feedback interviews in a follow-up meeting with each manager. During the pre-test trajectory attention shifted deliberately from a focus on more general testing of clarity and completeness in the first phase to specific focus on validity of information obtained in the last. The first phase consisted of six interviews. Its purpose was a general one with central issues: do respondents experience any difficulty in answering questions (as such) and can they translate generic ideas on what management control is about to their own situation? After the first pre-test some key changes have been made to the questionnaire. The most important one is the addition of questions on activities of the unit. In the revised version managers are asked to specify up to five activities of their unit first, and then choose one to focus on throughout. Several reminders help managers to maintain this focus. This improvement enhances validity because respondents will give information about controls surrounding one activity as opposed to providing an average score for several activities. This, in turn, establishes a fit between the theoretical level of analysis and the empirical one. Moreover, returning to the working definition of an MCS, a structural problem became apparent. The current version of the questionnaire was unable to grasp the interrelationships between the four dimensions of control as defined in the theory. Gaining knowledge on these relations is important for learning about control systems in their entirety as opposed to studying separate instruments. Revision resulted in the deletion of about ten questions and at the same time the introduction of about ten new ones. Other changes involved choosing 44 Methodology different wording, adding an answer category, shortening questions, and adding more detail to the specification of the hierarchical level. The second pre-test stage involved meetings with five managers. Quality regarding clarity and completeness of questions is no longer of primary interest, but assessment of content validity of the questionnaire as a whole becomes the main concern. Will the questionnaire return information about the MCS as intended? Therefore this stage consisted of two interviews with each manager. During the first meeting I applied the TSTI again to assess the overall quality of the questionnaire and particularly the quality of new and modified questions. During a follow-up meeting, the validity-feedback interviews took place, discussing the description of the management control system solely based on the answers filled out on the questionnaire. Results of the validity-feedback interviews were quite encouraging. All managers recognized their units even though described in generic terms. Only one manager indicated a motivational aspect missing that in his opinion strongly guided employees in their daily behaviour. He referred to an explicit focus within the company on corporate responsibility. In the questionnaire this is only picked up in general when asking about the importance of behavioural guidelines in guiding employee behaviour. Few aspects indicated as being incorrect could be dealt with through minor revisions in wording. Changes to the questionnaire were made after each interview. However, two managers last in line responded to the same and final version. These last few interviews were held purely as a confirmatory step. 3.5 Sampling and survey administration In its final form the questionnaire contains 89 questions15. The pre-test revealed that managers need about 50 minutes to complete it by themselves, up to one and a half hours when being interviewed. Intended respondents are managers who head an organizational unit with at least ten employees. Recall that a manager and all employees reporting (in)directly to him/her comprise an organizational unit. This paragraph will firstly discuss the sampling strategy, and then the survey administration. 15 Only part of the questionnaire is used for this study. The complete questionnaire contains questions on control at the level of the organizational unit (control experienced by the manager) as well as on control within each unit (control exercised by the manager). In this study I focus on the former type, control of the organizational unit. 45 Chapter three In setting up a sampling strategy, three requirements for the sample emerged. Overall the sample should be reasonably large (at least 200 observations), it should represent enough variety in activities, and it should hold enough observations of those situations for which theory predicts the occurrence of each of the archetypes. A random sample is unlikely to entail enough information to enable analysis of each archetype because some activities will be carried out more frequently than others. Thus ideally, sampling is based on selection of activities. As different organizational units are likely to engage in different activities, I have defined sampling groups for different types of organizational units (see table 3.1). Dividing observations over these groups should return enough variety in activities. Note that this approach will not introduce selection bias because the activity’s characteristics are explanatory variables in this study (King, Keohane, and Verba, 1994, p137). Table 3.1 Sampling groups I divisions II finance & accounting III research & development business units production units marketing & sales human resources internal auditing shared service centres treasury project units internal consultancy ICT (maintenance) ICT (system development) As two basic ways to administer a survey, Brownell (1995, Ch3) indicates the written questionnaire and the personal interview. However, with such an extensive questionnaire as the one used in this study it is hard to get reasonable response rates. Moreover, in case of high non-response chances of finding enough potential observations for the analysis of each archetype are diminished. Therefore I am extremely grateful that students16 helped contact respondents and collect data, which made it possible to administer the survey by interview. In a meeting on site, the student interviewed the manager posing all questions directly from the questionnaire. The manager has his own questionnaire, can read along, and fill in the answers himself. Only after having completed the entire questionnaire, there was some time left for discussion. The data from these discussions are not available for analysis, but served as an additional learning opportunity for the students only. Meetings took approximately one and a half hours. There are several advantages of this approach: it provides the opportunity to correct misunderstandings, forces respondents to give thought to each question before answering, and 16 Students took part in the course Management Accounting and Control Research, which is taught in the part-time MSc programme in Controlling at Nyenrode Business University. The survey was integrated into the course material. The students already work as controllers or hold a management position. 46 Methodology minimizes chances of skipping a question. Moreover, this approach did not only assure minimum sample size, it also facilitated selection of respondents based on types of organizational units thereby enhancing chances of finding variety in activities. As the major disadvantage of the personal interview, Brownell (1995) justly states the risk of interviewer bias. Although present to some extent in every interview, this caveat can be mitigated – at least partly – by introducing clear protocols/instructions. Moreover, because 60 different students were involved in the interviews, chances of systematic bias are small. I explicitly addressed these issues when briefing students. They were instructed to read aloud each question exactly as formulated, and then pause to let the manager respond. Students were also instructed on how to respond to requests for clarification of questions and words. Managers could ask them to translate a word (from English to Dutch). This might introduce bias if students choose slightly different wordings. Therefore, students and I went through all questions together to clarify the meaning of each. 3.6 Description of the dataset Data were collected two years in a row (April 2006 and April 2007)17. The first round of data collection returned 134 questionnaires. The second round delivered another 124 cases. Data collection in two shifts created the possibility to fine-tune the variety in activities. Consequently, samples intentionally differ in the division of observations over types of organizational units. Both samples combine into a single dataset. There is no indication of any unintentional difference between them that might disturb the analyses. Most of the questionnaire’s items contain semantic differential scales with five categories (see Lattin, Carroll, and Green, 2003, p174). For instance, respondents choose from five possible scores: very easy, quite easy, neither easy nor difficult, quite difficult, and very difficult. An advantage of this type of scale is that respondents always get a reference point instead of just a number. This is especially helpful when a questionnaire contains lots of questions with differing answer categories. The questions related to the variables that measure effectiveness of both the MCS and the unit have a seven point scale, which increases the likelihood of finding enough variance for the statistical analyses. The data obtained are respondents’ perceptions measured on both nominal and ordinal scales. In the analyses, I treat ordinal data 17 I needed two shifts because this research project was integrated in a yearly course with approximately 30 students participating each year. 47 Chapter three as if they were interval data, since “the assumption of an interval scale is reasonable” (Lattin et al., 2003, p8) for my measures. Table 3.2 reports the descriptive statistics for the dataset, including hierarchical level, the number of unit employees, and the number of employees who work on the main activity18. The hierarchical level indicates how many steps away from the CEO the manager’s unit is, and how many steps are left to the bottom level within the hierarchy. Table 3.2 Descriptive Statistics Variable nr of steps to CEO N 258 minimum 0 maximum 10 mean 2.22 std. deviation 1.39 nr of steps down the hierarchy 258 nr of unit employees 257 1 7 2.36 0.99 4 10800 188 nr of main activity employees 802 258 2 3400 88 ratio main activity empl/ unit empl 325 257 0.27 100 63.24 28.41 The dataset shows a vast variety in hierarchical level (from CEO to ten levels down the hierarchy) and in the number of unit employees (from 4 to 10,800). Frequency tables reveal that about 80% of the companies are profit companies, 20% are not-for-profit. Regarding industry I find all sorts of industries represented. Financial services, government, and banking/insurance make up the largest part. None of these sectors are dominant, however, because their individual shares never exceed 11% of the total19. Managers also indicated the type of unit they work in, for instance, a production unit or a treasury department. Recall the goals from my sampling strategy (see 3.5): the dataset should be large, represent a variety in activities, and hold enough observations to represent each of the archetypes. The last goal will be assessed during the analyses, but descriptive statistics of the sample give clues on the first two. Table 3.3 shows the variety in types of units within the sample. Business units, production units, and units finance and accounting respectively form the largest groups. Overall, a multiplicity of various activities is represented in the database. 18 In some cases the number of employees is very small. I have run the analyses (chapter five) both with and without the smallest cases. Results are robust to these changes. 19 Note that there are multiple observations per company. 252 observations stem from 132 different companies (6 were anonymous). This will not introduce bias because the level of analysis is the activity of an organizational unit. Within companies differences between units exist, and within units multiple activities. Moreover, if for instance a company-wide reward system would be installed, I should be able to pick up performance differences for those units for which this is inappropriate and thus I should be able to assess MCS design and effectiveness as intended. Also, in none of my (sub)samples a single company dominates. The number of observations per company is simply too small. 48 Methodology Table 3.3 Description of the dataset type of unit frequency percentage Business unit 48 18.6 Production 32 12.4 Finance and Accounting 31 12.0 Division 19 7.4 ICT 19 7.4 Marketing or sales 17 6.6 Project unit 17 6.6 Human Resources 14 5.4 Research and Development 11 4.3 Company as a whole 8 3.1 Facility management 5 1.9 Treasury 4 1.5 Legal department 3 1.2 Other* 20 11.6 Total 258 100.0 *This group is quite diverse. Examples of units within the group are purchasing, internal consulting, quality management, and back office. In the questionnaire, managers specify the activities of their unit and choose one as the main activity to focus on throughout. Examples of activities specified by respondents are product development, provision of technical support, project management, reporting, and policy making20. The pre-test revealed the importance of asking to identify the activities, which makes it possible to link an MCS to a specific activity and establish a fit between the theoretical and empirical level of analysis. Only 32 out of 258 managers (12%) identified one activity. This underlines the importance of creating the focus in the questionnaire, because without it 226 managers would have provided average scores for several unknown activities. Asking people to choose the main or most characteristic activity for their unit does not automatically imply they choose the activity the largest number of employees work on. Therefore I asked about the number of unit employees as well as about the number of employees working on the main activity as specified. The ratio between employees working on the main activity and total unit employees ranges from 0.27% to 100% with a mean of 63%. Of all managers, 67% choose an activity on which at least half their employees work. 20% indicate all employees (also) work on the main activity. Table 3.4 below gives an overview. 20 There is a large variety in the qualitative descriptions of the activities. Therefore, translating these into keywords and summarising those, is little informative when looking at the complete sample. There are either too many categories or there is too little extra information over that already given here by the division of cases over unit types. In chapter five I will again address the issue, compare subsamples, and illustrate a number of differences in activities for individual unit types. 49 Chapter three Table 3.4 Percentage of employees who work on the main activity percentage employees frequency percent cumulative percentage 0 – 10 % 10 – 50 % 9 76 3.5 29.6 3.5 33.1 50 – 95 % 120 46.7 79.8 95 – 100 % 52 20.2 100 Total 257 100 3.7 Concluding remarks Because I extend prior research by explicitly studying control systems in their entirety and deviate from it by focussing on a whole range of activities, confidence as to the quality of the survey data must be given before running the analyses. Survey research is widely applied yet frequently criticised. Some critique relates to problems inherent to the method, which cannot be resolved: once the questionnaire leaves your desk it’s out of your hands. Other methods also suffer from their own inherent limitations though. However, most of the criticism on survey research within accounting relates to improper use of the method and the resulting poor quality data. This critique can be confronted successfully as I have shown in this chapter. Attention has been paid to assuring a match between theory and operationalization in terms of constructs and level of analysis. Two questions were central: can we actually grasp general strands of MCSs as wished-for using solely the information obtained by the questionnaire? And, is this picture more or less complete, i.e. will it suffice for our purposes of comparing configurations? The extensive pre-test helped identify lots of small and big problems, which have been addressed to the benefit of the questionnaire’s quality. It seems safe to conclude that respondents will understand the questions and can answer these. The questions, moreover, inform us about critical aspects of the theory and can therefore potentially help answer the research problem. From the validity-feedback interviews, we know that it is possible to pick up control practices as experienced by managers and to be quite complete in doing so. The description of a unit’s MCS illustrates this. This new technique proves a powerful method to effectively obtain feedback on the content validity of questionnaires. In sum, the questionnaire is able to map different aspects of an MCS and obtain a detailed yet broad overview of how control takes form at the level of the organizational unit. Moreover, it 50 Methodology makes possible identification and characterisation of activities of the unit as well as assessment of perceived effectiveness of both the unit and the management control system. As for the prevention of selection bias and non-response, survey administration by interview enabled data collection for 258 organizational units with a vast variety in activities. Nonresponse therefore was not an issue. 51 Chapter four - Operationalizing the framework The previous chapter presented an overview of the methodology, introducing the questionnaire, mode of data collection, and a description of the dataset, without providing details on the measurement of the variables. This chapter fills that void and describes the operationalization of the methodological framework (figure 4.1). Separate paragraphs deal with each of the highlighted steps. As in the prior chapter, of central concern is the establishment of a match between the theoretical constructs and their measures. The dataset under study belongs to a larger dataset, which holds information on both control of organizational units (control experienced by the manager), and control within these organizational units (control exercised by the manager)21. In this thesis the former is under research. Firstly, in separate paragraphs I shed light on the mapping of management control systems (4.1) as well as their effectiveness (4.2). Next, paragraph 4.3 describes the measurement of the contingency variables and discusses formative measurement models. Paragraph 4.4 deals with both the operationalization of the archetypes and misfit calculation. Translating the theoretically described archetypes into measurable profiles on which the measurement of misfit depends is an important step in testing configurational theories (see Doty et al. 1993; Doty and Glick, 1994). This step carries a potential risk, because if the profiles do not correspond to the theoretical description, the test of the theory might not be valid (Doty and Glick, 1994). Describing exactly how this step was taken, I show how I have minimized this risk. Finally, paragraph 4.5 presents several control variables and paragraph 4.6 closes this chapter with concluding remarks. The two methodological steps left (determining the relevant archetype and linking misfit to effectiveness) are part of the analyses and will be dealt with in the next chapter. Figure 4.1 Methodological framework define define activity determine determine relevant relevant archetype architype compare MCS with relevant relevant archetype archetype map MCS link misfi misfitt to to link effectiveness effectiveness measure measure MCS MCS effectiveness effectiveness 21 Data has been collected all at once by means of the questionnaire introduced in chapter three. 53 Chapter four Appendix B gives an overview of the exact questions posed, along with notes and guidance on their use. Apart from describing the measurement of the variables and introducing new measurement instruments with a general applicability, some issues of broader importance will be addressed in the next paragraphs. First of all, this chapter exemplifies one approach to operationalize a configurational theory and to study control systems in their entirety as well as the effectiveness thereof. Moreover, I suggest ways to improve general measurement of two main TCE variables: uncertainty and asset specificity. To better grasp the essence of these constructs, I build a formative indicator model (see 4.3). 4.1 Mapping management control systems This paragraph presents the first step of the methodological framework: mapping management control systems. Referring both to theory and prior research, relevant measures and variables will be identified and described, and new measures will be introduced. Most questions contain semantic differential scales with five categories (see Lattin et al., 2003, p174). For instance, respondents choose from five possible scores: very easy, quite easy, neither easy nor difficult, quite difficult, and very difficult. Several questions stem from the work by Van de Ven and Ferry (1980). In their book ‘measuring and assessing organizations’ they develop an extensive survey instrument which enables them to map a large number of organizational aspects at different hierarchical levels. Also noteworthy is the ‘handbook of organizational measurement’ by Price (1997). His overview of frequently studied variables within the organization literature and their measures along with an assessment of their quality was extremely helpful. Chapter two presented four control dimensions that together describe management control systems: allocation of decision rights (1), use of rules, standards, and regulations (2), performance evaluation (3), and the reward structure (4). To grasp control systems in their entirety and enable comparison to the archetypes, all dimensions must be represented in the dataset. I use a total of 20 variables to cover all four. Special attention has been paid to the identification of interrelationships between the dimensions. For instance, I do not only ask about whether performance targets are in use as such, but also about the importance of these targets for performance evaluation and rewarding. The clarification of each variable follows. Like before, I discuss control from the perspective of a supervisor who exerts control on a manager. Table 4.1 gives an overview of how the 20 variables cover the different dimensions of control and how each variable relates to the definition of the archetypes as provided previously in chapter two (2.2.3). 54 55 Source: Speklé, 2004; extended Performance dependent bonuses Reward & incentive structure Standardization Performance dependent bonuses Monitoring focused on target achievement; performance assessment relative to targets Predefined performance targets of administrative origins Market-related outcome requirements; external performance benchmarks Structure Performance assessment relative to ‘the market’ Decentralized with clearly defined areas of responsibility and accountability Relative autonomy; involvement higher level management limited as long as performance is satisfactory Monitoring and performance evaluation Machine Control (Result Oriented) Arm’s Length Control No direct link between performance and rewards Focused on compliance; observance of interdictions; external audits Emphasis on threat of punishment of rulebreaking behaviour; tiein through ‘hostages’ Career prospects dependent on long term past performance; peer pressure Proscriptive codes of conduct; boundary systems; emphasis on behaviour to be avoided Relative autonomy within defined boundaries Boundary Control Based on emerging standards; subjectively assessed contributions to long term organizational performance No ex ante standards and targets; ‘do your best’; emerging standards Standardization of behaviour; detailed rules, norms, and instructions Monitoring and supervision to ensure compliance to norms and standards Relatively flat hierarchy; fluid and permeable matrix-like project structures; vague responsibilities Exploratory Control Well-defined tasks; strict hierarchy; limited room for discretionary behaviour Machine Control (Action Oriented) Table 4.1 Variables covering the dimensions of control related to the control archetypes percentage variable reward; importance in rewarding and for career of: target achievement compliance with rules long term achievements professional skills - importance in performance evaluation of: target achievement compliance with rules long term achievements subjective judgements use of financial targets; use of non-financial targets; benchmarking; boundaries; role of the budget; emergent targets work autonomy; amount of work related discussion variables that map the MCS Operationalizing the framework Chapter four The first dimension of control (allocation of decision rights) is picked up by asking about the coordination of work and assessing the relative autonomy of the manager. Two variables represent this dimension: the amount of work related discussion, and the level of work autonomy. The amount of discussion indicates whether or not the manager works a lot on his own and whether the work is organized such that it requires mutual coordination. This variable is measured with a single question developed by Van de Ven and Ferry (1980). I ask about the frequency of discussions related to the coordination of work, between the manager and other managers or supervisors. To measure work autonomy, managers indicate their influence in several decisions (see table 4.2). The first four items stem from Van de Ven and Ferry (1980). I extended their measure with two items to learn also about work instructions and behavioural guidelines. The average of the six items forms the score on work autonomy. Factor analysis22 supports their combination into a single scale, and the reliability of the scale is satisfactory (Cronbach’s alpha = 0.736). Table 4.2 Items for work autonomy (Q42) Cronbach’s = 0.736 component loadings a deciding what work or tasks are to be performed in your unit b setting standards/targets for subunit performance 0.743 c deciding upon criteria for performance appraisal of employees 0.694 d designing the reward system 0.614 e deciding upon standard operating procedures/work instructions 0.703 f deciding upon other behavioural guidelines Source: items a-d; Van de Ven and Ferry, 1980; modified 0.542 0.672 A number of variables cover the second dimension of control and indicate the presence of standards, rules, and regulations. To cover those aspects of control that differ between the archetypes, the variables should identify use of performance targets, distinguish market related targets from administrative ones, and pick up use of emergent targets and boundary systems. The amount of rules and regulations, like work instructions and behavioural guidelines, is not measured as such. I study their importance only in light of performance evaluation and rewarding. First of all, managers indicate the extent to which several performance targets are in use for their jobs (see table 4.3). The distinction between the different types stems from Bouwens and Van Lent (2004, p15). Mapping management control systems at different hierarchical levels, different target types might be relevant. I am primarily interested in the intensity of target use. 22 Throughout I use principal components factor analysis (PCA) to assess whether items load on a single underlying construct. For my sample size, a minimum component loading of 0.5 is considered significant (Hair, Anderson, Tatham, and Black, 1998, p111/112). 56 Operationalizing the framework Therefore, as an indicator for the intensity of financial target use, I take the highest score out of items a-e. Item f asks directly about non-financial quantitative targets and will be used as such23. Moreover, to learn about benchmarks, managers indicate to what extent two types of performance benchmarks are in use (see table 4.3). Again, the highest of both forms the score on benchmarking. Table 4.3 Performance targets (Q7) and benchmarks (Q8) types of performance targets a stock price related targets d b return targets e revenue targets cost targets c profit targets f non-financial quantitative targets types of benchmarks a market benchmarks or benchmarks based on performance of external peer groups b benchmarks based on performance of internal peer groups or the performance of other subunits Source: types of performance targets based on Bouwens and Van Lent (2004, p15) I capture the presence of boundary systems by asking about the consequences of noncompliance with prohibitive behavioural guidelines. Managers indicate their agreement with the following statement: “Violating subunit-specific prohibitive guidelines will always have serious consequences for my career.” (Q18). It takes two introductory questions to create a focus on boundary systems. Firstly, one question helps to distinguish prohibitive guidelines from general behavioural guidelines (Q16). The distinction is important because only those guidelines that specify what not to do or what to avoid act as boundary systems. A second question is asked to distinguish general prohibitive guidelines that hold for all company employees from subunit-specific guidelines only relevant to the employees and the manager of the unit (Q17). This is important to separate those guidelines that relate to the organizational unit and its activities, from those that are unspecific. I want to learn about, for instance, authorisation levels and procedures instead of about general ethical norms or rules about private use of the internet. This way I conform to the level of analysis and map the MCS surrounding the activity of an organizational unit. Recoding the scores of question Q18 returns an ordinal scale with four values. Each value indicates an increasing importance of boundary controls. The lowest value implies a boundary system is absent. Another control instrument that I relate to ‘standards, rules, and regulations’ is the budget. A budget can have different roles. Respondents indicate the order of importance of three such roles (see table 4.4). Frequencies show how often a particular role was deemed the most 23 Question 7 has one more item, which is not used in the analysis. That item measures use of qualitative targets and represents a rest category. 57 Chapter four important one. Finally, a single question records the use of emergent targets. Managers choose from two statements the one that best describes how their performance targets come about (see table 4.4). Table 4.4 Roles of the budget (Q11) and emergent targets (Q9) a role of the budget the budget serves as a performance target to be met frequency 118 percentage 47 b the budget gives guidance about the way to go 74 29.5 c the budget sets limits to what can be done 59 23.5 251 100 total emergent targets performance targets are set at the beginning of a period and 187 72.8 will not change much under normal circumstances b performance targets come about steadily over time/evolve 70 27.2 during a period total 257 100 Note: both are nominal variables, N = 258, there are 6 missing values, in two cases there was no budget a All in all, eight variables cover two dimensions of control: allocation of decision rights and use of standards, rules, and regulations. Next, I describe the variables that cover performance evaluation, and the reward and incentive structure, as well as the interrelationships between the dimensions. I use groups of similar questions. Regarding performance evaluation, the questionnaire focuses on several aspects: skills, performance targets (including the budget), and compliance with rules/behavioural guidelines (see table 4.5). For each aspect managers indicate its importance to both positive (Q12) and negative (Q13) appraisal. In my sample, managers generally do not distinguish between the two and therefore I can combine their scores. Both questions are new, but the ideas about the set-up stem from studies by Bouwens and Van Lent (2004), Verbeeten (2005), and Van de Ven and Ferry (1980). Table 4.5 Items for performance evaluation (Q12 and Q13) a social skills b professional skills c achievement of performance targets d budget versus actuals = 0.820 e compliance with standard operating procedures/work instructions = 0.823 f compliance with other behavioural guidelines Source: inspired by Bouwens and Van Lent (2004), Verbeeten (2005), and Van de Ven and Ferry (1980) 58 Operationalizing the framework I form two variables: the importance of target achievement including the budget (the average of items c and d), and the importance of compliance with work instructions/behavioural guidelines (the average of items e and f). Factor analysis and reliability analysis confirm the appropriateness of combining these items into averaged summated scales (Cronbach’s alphas are 0.820 and 0.823 respectively). Note that these questions do not only provide information on performance evaluation as such, but that they enable the study of interrelationships between control dimensions. I have already measured use of performance targets as such, and now their role in performance evaluation becomes clear as well. Regarding performance evaluation I do not use the items related to skills. From a theoretical point of view professional skills are only relevant for performance evaluation in the long run or for career perspectives. Social skills do not play any role in distinguishing between the archetypes. During the pre-test, managers indicated that skills might be (or should be) important for long run performance evaluation. Thus, for completeness and consistency I ask about the role of social and professional skills in all questions related to performance evaluation, rewarding, and career perspectives. However, I only focus on items that are closely related to the theory, in casu achievement of performance targets/the budget, and compliance. Two more variables complete the information about performance evaluation. Managers indicate how much their supervisor takes into account long term performance of the unit (Q15a) and/or the company (Q15b). The scores on items a and b are averaged (Cronbach’s alpha = 0.772). Moreover, one question (Q14) picks up the use of the supervisor’s subjective judgements in performance appraisal. Regarding the reward structure, firstly, managers indicate the size of their variable financial reward (Q21). The original five answer categories are recoded (see table 4.6). Table 4.6 Financial variable reward (Q21) frequency percentage a no variable financial reward size of variable reward 58 22.6 b 1-15% 103 40 c more than 15% 96 37.4 total 257 100 Note: other answer categories referred to larger rewards, but these are rare in my sample. 12 managers indicated that 46-60% of their reward is variable, 7 people indicated that this is more than 60%. There is one missing value. 59 Chapter four Using the same aspects that might play a role in performance evaluation, one question (Q22) picks up the importance of several aspects for obtaining a variable financial reward (see table 4.7). Finally, the same question occurs once more (Q23), but now focus is on career prospects. Managers indicate the importance of each aspect for their future career. Again, factor analysis and reliability analysis indicate the appropriateness of combining compliance with work instructions and behavioural guidelines into a single variable for both questions (Cronbach’s alphas are 0.861 and 0.803 respectively). The achievement of performance targets and compliance with the budget, however, cannot be combined and form separate variables. Moreover, in light of career perspectives, the importance of professional skills is taken in as a separate variable as well (Q23b). Table 4.7 Items for obtaining a variable financial reward and career perspectives (Q22 and Q23) a social skills b professional skills c achievement of performance targets d budget versus actuals e compliance with standard operating procedures/work instructions items used separately f compliance with other behavioural guidelines Source: idem questions 12 and 13 (table 4.5) = 0.861 (Q22) = 0.803 (Q23) Together these 20 variables map management control systems over four dimensions considering their interrelationships. They provide a comprehensive yet detailed picture of control. Moreover, the overall measurement instrument is generally applicable to map MCSs of all kinds of organizational units. Similarly important, mapped in this way, the MCSs can be compared to the archetypes. The individual variables relate directly to the descriptions of the archetypes, but they also represent a combination of elements of control that are frequently studied. Traditional controls, like standard operating procedures, financial targets, and budgets, are represented. Moreover, I pick up the contrast between reliance on output controls versus action controls (Merchant, 1982) by assessing the importance of target achievement as well as compliance with rules in performance evaluation and rewarding. Finally, a number of variables represent the more interpersonal and subjective aspects of control, for instance the amount of discussion, use of non-financial targets, use of subjective judgements in performance evaluation, and the importance of professional skills for the career. Table 4.8 gives an overview of all variables that map an MCS and provides descriptive statistics. Appendix B provides the details on each question. 60 Operationalizing the framework Table 4.8 Descriptive statistics for all MCS variables 1 work autonomy 2 amount of discussion 3 use of financial targets 4 use of non-financial targets questions Q42 N 257 min 1.67 max 5.00 mean 3.80 std. deviation 0.66 Cronbach’s alpha 0.736 Q56c Q7a-e (highest) 254 1 5 2.93 1.10 - 258 1 5 4.29 0.90 - Q7f 258 1 5 3.06 1.35 - 257 1 5 3.31 1.27 - 258 0 3 0.81 1.11 - 5 benchmarking 6 boundaries Q8a,b (highest) Q18 7 role of budget Q11 253 0 3 nominal variable 8 9 emergent targets percentage variable reward Q9 Q21 257 257 0 1 1 3 nominal variable nominal variable 255 1.50 5.00 3.80 0.77 0.820 255 1.00 5.00 3.36 0.87 0.823 importance in performance evaluation: 10 target achievement 11 compliance with rules 12 long term achievements Q12c,d; Q13c,d Q12e,f; Q13e,f Q15a,b 255 1.00 5.00 3.48 0.97 0.772 13 subjective judgements Q14 255 1 5 3.39 0.84 - - importance in rewarding: 14 target achievement Q22c 257 1 5 3.63 1.65 15 budget versus actuals Q22d 257 1 5 3.16 1.64 - 16 compliance with rules Q22e,f 257 1.00 5.00 2.16 1.25 0.861 - importance for career: 17 target achievement Q23c 256 1 5 3.99 0.82 18 budget versus actuals Q23d 256 1 5 3.38 1.06 - 19 compliance with rules Q23e,f 256 1.00 5.00 3.19 0.95 0.803 20 professional skills Q23b 256 1 5 4.29 0.70 - 4.2 MCS Effectiveness To measure the effectiveness of a management control system, I have developed a new measurement instrument. Reviewing literature, only one study emerged that measures effectiveness of control directly: Ferreira and Otley (2005) develop a measure of the perceived effectiveness of the MCS at the company level. They focus on the benefits an MCS potentially offers and divide those over three dimensions: quality of generated information (accuracy, for instance) and cost-effectiveness of the system (1), contribution to overall firm performance (2), and satisfaction with meeting information requirements (3). The role of information generation by the control system dominates, whereas other purposes of control, 61 Chapter four like providing incentives, are lacking. I build from their ideas, but assess more aspects of control effectiveness. Organizational objectives form the accepted frame of reference in effectiveness literature (Hitt and Middlemist, 1979). Starting from this basic idea, I capture effectiveness of the control system directly by relating it to the achievement of control goals. In chapter two, several goals of control have been distinguished (2.1.1). To summarise, the control system enables knowledge capturing, supports coordination, and provides incentives through rewarding and punishment (cf. Speklé, 2004, p7). To measure perceived effectiveness, managers rate the performance of their MCSs on 11 aspects, each representing an underlying goal of the MCS (see table 4.9). A seven point scale is used that runs from ‘very poor’ to ‘very good’. The item ‘supporting decision making’ represents knowledge capturing. The items guiding behaviour, coordination, and monitoring belong to the goal of coordination. Items concerning motivation, perceived fairness, performance evaluation, and rewards relate to the goal of providing incentives. Finally, following Ferreira and Otley (2005), one item that captures the contribution of the control system to the performance of the organizational unit was added. This is considered to be an ultimate goal of the control system. The scale originally contained four more items. These items related to the amount of work autonomy managers and employees have, to the use of financial rewards (as opposed to nonfinancial rewards), and to the use of behavioural guidelines. Factor analysis revealed that these do not fit the same underlying construct as the other items and therefore they are no longer part of the scale. However, enough items are left to assess the effectiveness of the management control system. Table 4.9 Items for effectiveness of the MCS (Q85 and Q86) Cronbach’s = 0.891 component loadings a Supporting managers in decision making 0.729 b Guiding employees’ behaviour 0.648 0.690 c Guiding managers’ behaviour d Coordination of work 0.622 e Monitoring 0.624 f Motivating employees 0.717 g Motivating managers 0.769 h Perceived fairness 0.679 i Performance evaluation 0.681 j Use of non-financial rewards 0.629 k Contribution to overall subunit performance Source: item k: Ferreira and Otley, 2005 62 0.751 Operationalizing the framework The 11 items listed in the table load on a single factor and taking their average score returns a summated scale (‘MCS effectiveness’). Cronbach’s alpha for this construct is high (0.891). In cases where either a manager replied ‘not applicable’, or when a value is missing, the average of the items left is taken as the score for MCS effectiveness. Because the sample entails a diverse set of organizational units, items are not expected to be all relevant for each case. Therefore only those items that are relevant should stay on. Following this procedure, the smallest number of items averaged is six24. People indicated poor as well as good performance with a mean score of 4.5 and sufficient variance (table 4.10). Table 4.10 Descriptive statistics for MCS effectiveness scale 1-7 MCS effectiveness questions Q85/86 min 2.18 max 6.55 N 258 mean 4.50 std. deviation 0.79 Cronbach’s alpha 0.891 The focus on attainment of control goals to assess control effectiveness has two advantages: (1) a broad approach to assessing management control systems is taken on covering different purposes of control, and (2) the approach is suitable for all kinds of organizational units (production, R&D, profit, not-for-profit, et cetera). An apparent disadvantage of this approach is the inevitability of ending up measuring perceived effectiveness. Some argue that selfassessment of performance is inherently poor. Ittner and Larcker (2001), for instance, are sceptical about the use of perceived measures and suggest backing them up with ‘hard’ data. However, views on use of subjective measures differ. Van der Stede et al. (2005, p675) conclude that “subjective indicators should not be viewed as poor indicators of performance by virtue of being subjective”. Both have their merits and their use depends on the specific objectives and settings of the study. In my case, it is impossible to work with hard data, since I am comparing such a large variety of organizational units throughout the hierarchy. The only way to compare effectiveness of control among them is to ask the manager directly for an assessment. This approach fits my research objectives well, since I study the relative effectiveness of various MCSs. 4.3 Characteristics of the activities The previous paragraphs accounted for the first two steps of the methodological framework: mapping the MCS and assessing its effectiveness. The next step involves characterizing the activities. Activities obtain scores on the three contingency variables uncertainty, asset specificity, and ex-post information asymmetry. These scores subsequently help determine 24 One case only was left with six items. Its score on MCS effectiveness is not particularly high or low. 63 Chapter four which archetype is relevant for each case according to the theory. To distinguish between the action oriented and result oriented variants of machine control, measurability of outputs is taken on as a fourth contingency variable. Two issues deserve our attention. In the first place, TCE-based research has generated lots of measures for both uncertainty and asset specificity, yet their general applicability is limited. Most TCE research studies homogenous samples and tailors measurement of variables to the specific situation (see 3.2.1). Therefore there are no universally applicable measures available. Moreover, researchers tend to focus on single aspects of uncertainty and asset specificity, typically human asset specificity and unpredictability (Rindfleisch and Heide, 1997; Boerner and Macher, 2001). However, both asset specificity and uncertainty are broad constructs that can take on multiple guises. The diversity in organizational units studied here necessitates measurement of these different guises, because their relevance likely differs among the units. In some units, human asset specificity might dominate, whereas in other units physical asset specificity determines governance choice. Thus, measures need to be broad and generally applicable. This approach also ascertains a better fit with the theory. To combine different guises of uncertainty and asset specificity in a single measure, I set up measurement models with formative indicators. Regular averaged summated scales are inadequate for this approach (more on this below). In recent studies several authors have underlined the importance of distinguishing reflective from formative indicator constructs and they showed the implications of making the wrong choice (e.g. Diamantopoulos and Winklhofer, 2001; Jarvis, MacKenzie, and Podsakoff, 2003; Diamantopoulos and Siguaw, 2006). Together those studies illustrate how misspecification of the relationship between a construct and its indicators can have serious consequences both in terms of construct validity (we are basically measuring and thus studying different things) and in parameter estimation relating the constructs to other variables, thus emphasizing the dangers of drawing false conclusions. Bisbe et al. (in press) conclude that in empirical research into management accounting and control systems the issue is rarely addressed and that it is common practice to rely uncritically on reflective models. The next paragraphs provide details on the measurement of the contingency variables. 4.3.1 Uncertainty In chapter two, uncertainty was defined as a lack of ex-ante knowledge of the activity, which results in a diminished capability to program activities (2.2.2). This knowledge concerns both processes (how best to carry out an activity) and outcomes (what results to expect). 64 Operationalizing the framework Uncertainty arises from two main sources: unpredictability of circumstances surrounding the activity (environmental uncertainty) and lack of possibilities to assess performance afterwards (behavioural uncertainty) (Williamson, 1985). From several review articles (for instance, David and Han, 2004 and Rindfleisch and Heide, 1997), it becomes clear that researchers usually focus on one of these or take them on as separate variables. To measure uncertainty as one broad construct, I focus on three variables. To capture environmental uncertainty, or more generally the lack of ex-ante knowledge about processes, I measure unpredictability of the amount and kind of work. Apart from this first variable, I assess the inability to measure outputs, and goal ambiguity. Those two variables relate to behavioural uncertainty, or more generally, they indicate the lack of knowledge about the outcomes. These three are formative indicators, which together define in broad terms what uncertainty is. Formative indicator constructs are inherently different from reflective ones and thus require different treatment (Bollen and Lennox, 1991). Theory specifies which formative indicators relate to a construct. However, the items are not effects of the construct, but rather cause or define it. For instance, an increase in the level of uncertainty does not imply more goal ambiguity or less measurability of outputs. To the contrary, goal ambiguity leads to or creates uncertainty. Items used for formative constructs together define the construct, but they do not have to correlate (Bollen and Lennox, 1991). Consequently, if one would decide to leave one of the items out, that would alter the construct and thus the definition of uncertainty. Calculating a summated scale is inappropriate for this type of constructs, as are standard reliability and validity analyses based on Classical Test Theory (Bisbe et al., in press). Measurement of a construct with formative indicators involves the creation of an index (Diamantopoulos and Winklhofer, 2001). Basically, the index is a linear combination of the indicators. However, running an ordinary regression analysis is impossible because the construct is a latent construct. One way to solve this is to estimate a MIMIC model (Multiple Indicators of MultIple Causes; Jöreskog and Goldberger, 1975). Diamantopoulos and Winklhofer (2001) mention use of this model as one of the solutions in dealing with formative indicators. Jarvis, Mackenzie, and Podsakoff (2003) also advocate the use of this model as the best way to handle formative scale construction25. Figure 4.2 shows the general model. 25 See Diamantopoulos and Winklhofer (2001) for other possible strategies in handling formative indicator constructs. 65 Chapter four Figure 4.2 General form of a MIMIC model reflective indicator reflective indicator construct of interest 1 formative indicator 3 2 formative indicator formative indicator Source: Diamantopoulos and Winklhofer, 2001, p272 A MIMIC model incorporates both formative and reflective indicators. The formative indicators are combined through a multiple regression. The gamma’s, , indicate the regression weights of each on the construct. Their linear combination enables the calculation of the index. However, only incorporating formative indicators and the latent construct into the model leaves it underidentified: more than one possible solution exists. To solve for this problem, two reflective indicators are added to the model. Reflective indicators are manifestations of the construct and thus caused by it (Bollen and Lennox, 1991). Thinking about uncertainty as the construct of interest, this construct complicates the assessment of what has been achieved as well as the assessment of the output quality. Both are reflective indicators of uncertainty. Reflective indicators are expected to correlate. How to interpret the overall model? “If all (indicators) are conceptually appropriate measures of a single construct” (Jarvis et al., 2003, p214), one interpretation of the model is the measurement of a single construct with both formative and reflective indicators. To measure uncertainty I estimate a MIMIC model with three formative indicators (unpredictability, the inability to measure outputs, and goal ambiguity), and two reflective ones (difficulty to assess the work, and the difficulty to assess the output quality). Figure 4.3 shows the general structure. The outcome will be an index that combines all indicators to obtain a score on uncertainty. Before calculating the index, however, I will firstly introduce the indicators. 66 Operationalizing the framework Figure 4.3 Indicators for uncertainty: general model form unpredictability diff assess work not measure outputs uncertainty assess output quality goal ambiguity To measure unpredictability I combine three items (see table 4.11). The last two stem from Van de Ven and Ferry (1980). Unpredictability is the sum of these three variables. Thus, I treat this variable also as a formative construct. The combination of items defines unpredictability, the items measure different aspects of the construct, and they do not have to correlate. Therefore, Cronbach’s alpha cannot inform us about the reliability of the construct and is not calculated. Table 4.11 Items for unpredictability (Q37 and Q38) a unpredictable variation in the amount of work b number of exceptions that arise in the unit c differences in day-to-day situations Source: items b and c: Van de Ven and Ferry, 1980, Q6 (p433), Q3 (p432) items are summed Next, I measure goal ambiguity by combining several items that stem from a study by Rainey (1983) (see table 4.12). I extend the original measure with one more item asking about the specificity of goals. Table 4.12 Items for goal ambiguity (Q31-Q34) a clarity of goals b specificity of goals c clarity of goals to outsiders = 0.654 d goals known to insiders (reverse score) Source: items a,c,d: Rainey, 1983 The items are reflective indicators of goal ambiguity, which I thus measure with their average score. The items load on a single factor and their Cronbach’s alpha is acceptable (0.654). Finally, one question assesses how much of the outputs can be measured objectively and expressed in a number (Q82). Its reverse score forms the third formative indicator of uncertainty: the inability to measure outputs. 67 Chapter four Unpredictability, goal ambiguity, and the inability to measure outputs form the formative indicators of the MIMIC model, together defining uncertainty. Uncertainty in turn causes difficulty in assessing the outputs. Two questions measure this aspect. I pick up the difficulty the supervisor experiences in assessing the work of the manager and his/her employees (Q44). Moreover, I ask about the difficulty in reaching agreement with superiors when appraising the quality of unit performance (Q83). Both variables are direct consequences of uncertainty and thus the reflective indicators in my MIMIC model. Table 4.13 gives descriptive statistics on all variables related to uncertainty. Table 4.13 Descriptive statistics for all uncertainty indicators 1 unpredictability questions Q37a,b, RQ38* N 256 min 5 max 14 2 3 mean 9.65 goal ambiguity RQ31-34** 258 1.00 4.00 2.00 inability to measure RQ82 258 1 5 2.28 outputs 4 difficulty to assess Q44 256 1 5 2.59 work 5 agreement on output Q83 257 1 5 2.49 quality Note: the abbreviation RQ implies the reverse score for this question is used. *the sum of these items is taken **the average of these items is taken std. deviation 1.92 indicator type formative 0.62 1.09 formative formative 0.95 reflective 0.84 reflective Figure 4.4 shows how the variables that measure uncertainty combine into the MIMIC model 26. The formative indicators give rise to the latent construct uncertainty, which in turn causes the reflective indicators. Figure 4.4 MIMIC model for uncertainty unpredictability .10 .19 not measure outputs .59 .50 uncertainty .35 .39 .25 2 R = .71 .17 e1 diff assess work .12 .35 assess output quality e2 goal ambiguity e3 In AMOS 7, maximum likelihood estimation (MLE) renders a significant model (N = 254, Chi-square = 3.241, p = 0.198) with a good model fit (RMR = 0.017, RMSEA = 0.05, 26 When I incorporate the separate questions directly into the model instead of the constructs (like goal ambiguity), the model is distorted and overall model fit insignificant. Taking in the constructs is less precise measurement, but prevents these problems. 68 Operationalizing the framework NFIDelta1 = 0.975, CFI = 0.99). All coefficients are significant at the 10% level. Unpredictability gets the highest p-value of 0.086. Together the three formative indicators account for approximately 71% of the variance in uncertainty. Violations of multivariate normality are minor. There are no indications for multicollinearity and outliers do not pose any problems either. As suggested by Jarvis and colleagues (2003), I interpret this MIMIC model as a single construct measured with both reflective and formative indicators. Therefore it makes sense to combine the indicators and form a single measure for uncertainty. AMOS 7 returns factor loadings for every variable. I combine these in equation 4.1 to obtain a score for uncertainty. As a final step these uncertainty scores are rescaled to a range of 1 to 5. This simplifies the interpretation of scores on different variables throughout this study. Table 4.14 shows the descriptive statistics for uncertainty. Equation 4.1 Calculating uncertainty uncertainty = 0.038 * unpredictability + 0.234 * goal ambiguity + 0.227 * not measure outputs + 0.059 * assess output quality + 0.087 * diff assess work Table 4.14 Descriptive statistics for uncertainty scale 1-5 questions N min max mean std. deviation uncertainty Q37a,b; RQ38; RQ31-34; RQ82; Q44; Q83 254 1.28 4.05 2.39 0.57 4.3.2 Asset specificity Everything that I have mentioned while discussing uncertainty goes for asset specificity as well. This too is a construct that I measure with formative indicators and once more use of a MIMIC model to create an index is preferred. However, measurement problems for the reflective indicators distort the MIMIC model and I am left with an inferior solution: the construction of a summated scale. In chapter two, asset specificity was defined as the opportunity costs of investments made to support the activity (2.2.2). Multiple types of asset specificity were distinguished, but I focus on those three that seem able to grasp the essence of the construct in a general way and might be relevant for all kinds of organizational units: physical asset specificity, human asset specificity, and interdependencies. These variables are formative indicators that cause or create asset specificity and together define it. Causality runs from the items to the construct. An increase in human asset specificity creates more asset specificity. The opposite is not necessarily true: an increase in asset specificity does not increase human asset specificity. Figure 4.5 gives an overview of all variables related to asset specificity. Since I intended to 69 Chapter four combine them into a MIMIC model, both formative and reflective indicators are identified. An introduction to the variables follows. Figure 4.5 Indicators for asset specificity: general model form human asset specificity obtain elsewhere physical asset specificity asset specificity insourcing interdependencies Regarding physical asset specificity, I distinguish two types (see table 4.15). I use a modified measure developed by Coles and Hesterly (1998) to assess the uniqueness of equipment. I have added a question on uniqueness of systems. Managers specifically compare their unit to other companies’ units that handle similar activities. The two items are summed to measure physical asset specificity. Thus, again, I treat this variable as a formative construct because either one of both types might be relevant for an organizational unit and determine the amount of physical asset specificity. They do not have to correlate. Table 4.15 Physical asset specificity (Q29 and Q30) a uniqueness of equipment (e.g. machinery, tools, warehousing) b uniqueness of systems (e.g. software, communication systems) Source: item a: Coles and Hesterly, 1998; modified items are summed Three questions grasp human asset specificity (see table 4.16). The first two items stem from a study by Anderson and Schmittlein (1984). I have added a question on the uniqueness of employees’ skills. Managers compare their employees to employees of other companies who work on similar activities. All three are reflective indicators of human asset specificity, which I measure with their average score. The items load on a single factor and their Cronbach’s alpha is marginally acceptable (0.597). Table 4.16 Human asset specificity (Q49/50/52) a difficulty a new employee experiences in learning the ins and outs number of weeks of training a new employee needs who has experience in b a similar profession. c the uniqueness of unit employees’ skills Source: items a and b: Anderson and Schmittlein, 1984; modified Note: for item b the natural log is used throughout 70 = 0.597 Operationalizing the framework Thirdly, I measure interdependencies between organizational units as a formative indicator of asset specificity. I use a measure by Keating (1997) that consists of two parts. First of all, the impact a respondent’s unit has on work carried out in other organizational units is assessed (Q35). The next item relates to the impact the respondent’s unit experiences from other units’ activities (Q36). Summing the items returns a score on interdependencies. Together physical and human asset specificity, and interdependencies constitute the formative indicators of asset specificity, defining the construct. To complete the MIMIC model a pair of reflective indicators is needed that represent direct consequences of the construct under study. The existence of asset specificity makes it difficult to compare activities to the market. Two questions pick up on this (Q39 and Q40). The first asks about the possibilities to obtain the organizational unit’s products/services from another company without compromising price or quality (‘obtain elsewhere’). A high score implies difficulty as a consequence of asset specificity. Question 40 indicates whether outsourcing the unit’s activities would imply a major strategic change. In presence of asset specificity outsourcing would indeed imply such a change. A high score on this question thus underlines the importance of insourcing, hence the name of this variable ‘insourcing’. Table 4.17 gives descriptive statistics on all variables related to asset specificity. Table 4.17 Descriptive statistics for all indicators of asset specificity 1 physical asset specificity questions Q29, Q30** N 251 min 2 max 10 mean 4.86 std. deviation 1.90 indicator type formative 2 human asset specificity Q49, Q50,Q52* 257 0.44 4.42 2.68 0.73 formative 3 interdependencies Q35, Q36** 253 2 10 7.37 1.63 formative 4 obtain elsewhere RQ39 254 1 4 3.10 0.74 reflective 5 insourcing Q40 253 1 5 4.04 Note: the abbreviation RQ implies the reverse score for this question is taken. *the average is taken; the natural logarithm of Q50 is used **the sum is taken 1.40 reflective Unfortunately, the items cannot be combined into a MIMIC model. Trying to estimate the model returns insignificant coefficients for both physical asset specificity and interdependencies (two out of three formative indicators). Removing insignificant variables from the model does not provide a solution because leaving out formative indicators alters the definition of the construct; in my case the definition of asset specificity. Therefore, Diamantopoulos and Winklhofer (2001) warn us not just to remove indicators from the model. At the same time, the estimated MIMIC model defeats its own object because it does not enable me to create an index. Literature does not provide a clear-cut solution to this 71 Chapter four dilemma and Diamantopoulos and Winklhofer conclude: ”how to balance these considerations is a question that has not yet been fully resolved” (2001, p272). A likely explanation for the problems experienced with the MIMIC model for asset specificity, is the particular distribution of the reflective indicators. Especially Q40 on the importance of insourcing the activity of the organizational unit is problematic. 64% of the respondents indicate that insourcing is vital. This variable does not vary enough and underscores the difficulties in measuring asset specificity with reflective indicators. These variables, however, play a significant role in index construction and the model depends strongly on them. This is a limitation of the MIMIC model because one likely chooses to build a formative indicator construct in cases in which it is difficult to come up with overall reflective measures of the construct. To obtain a measure for asset specificity after all, I decide simply to add up the scores on the formative indicators as shown by equation 4.2. Note that both physical asset specificity and interdependencies are already combinations of two variables. Thus their scores range from 2 to 10. Scores on human asset specificity range from 1 to 5. To give all three indicators the same weight, human asset specificity gets a weight of two in the formula. Together these items define asset specificity. This approach fits the idea of scale construction with formative indicators in that I stick to the definition and do not take in items that highly correlate only. However, it is an inferior solution, because it is no different from an ordinary summated scale. As a consequence the measurement of this variable is less precise. This might have an impact on the results of the analyses. As a final step the scores on asset specificity are rescaled to a range of 1 to 5 to simplify comparison with the other (contingency) variables. Table 4.18 shows descriptive statistics for asset specificity. Equation 4.2 Calculating asset specificity asset specificity = physical asset specificity + 2* human asset specificity + interdependencies Table 4.18 Descriptive statistics for asset specificity scale 1-5 questions N min max mean std. deviation asset specificity Q29; Q30, Q49; Q50; Q52; Q35; Q36 245 1.58 4.37 2.93 0.53 72 Operationalizing the framework 4.3.3 Ex-post information asymmetry To measure ex-post information asymmetry, I modify a measurement instrument created by Dunk (1993) and used regularly in other studies (see for instance, Abernethy, Bouwens, and Van Lent (2004), and Bouwens and Van Lent, 2004). The manager indicates his information advantage over his supervisor on several items (table 4.19). Table 4.19 Items for ex-post information asymmetry Cronbach’s = 0.842 component loadings a the type of activities undertaken in the subunit 0.717 b the type of input output relations inherent in the internal operations 0.751 0.735 c realization of the performance potential of the subunit d the technical aspects of the work 0.686 e the impact of internal factors on the managers activities 0.679 f the impact of external factors on the managers activities 0.689 g understanding the achievements of the subunit 0.789 Source: all items except item f: based on Dunk, 1993; modified Dunk’s original instrument measures information asymmetries in general and I am particularly interested in ex-post information asymmetry; differences in information between the supervisor and the manager that persist to exist at the end of a period, i.e. when evaluation takes place. Adding words to stress the ‘end of a period’ and using different answer categories creates the appropriate focus. Factor analysis indicates that all seven items load on a single construct and I combine these into a summated scale (Cronbach’s alpha = 0.842). One case has a missing value on one of the items and as before I use the remaining items to calculate information asymmetry for this case. Table 4.20 shows descriptive statistics for all contingency variables. Table 4.20 Descriptive statistics for all contingency variables uncertainty N 254 minimum 1.28 maximum 4.05 mean 2.39 std. deviation 0.57 measurability of outputs 258 asset specificity 245 1 5 3.72 1.09 1.58 4.37 2.93 ex-post information asymmetry 255 1 0.53 5 3.37 0.83 4.4 Measuring misfit Ultimately, the main explanatory variable in my analyses is misfit between an MCS and its relevant archetype. I hypothesize that misfit is negatively related to MCS effectiveness. A total of 20 variables together map the MCS. The qualitative definition of each archetype given 73 Chapter four by the theory (see table 4.1) can be translated into operational terms using these 20 variables (thereby extending the definition and obtaining quantitative scores). A deviation from such a score for an MCS implies deviation from the archetype and thus adds to misfit. To take this methodological step, firstly working definitions of the archetypes have to be set up, and then misfit has to be calculated. 4.4.1 Defining the archetypes When testing configurational theories, descriptions of the ideal types must be transformed into ideal profiles containing measurable items. The archetypes and their empirical profiles should be matched. This is an important step because the outcomes of the research project directly depend on the measurement of misfit, which in turn depends on the operationalization of the archetypes. Researchers choose from two main approaches to operationalize archetypes (cf. Doty et al., 1993; Doty and Glick, 1994): empirical specification or theoretical specification. Using empirics, the researcher should select those cases from the sample whose MCSs most closely resemble the ideal types (Doty and Glick, 1994). Their scores then define the archetypes and function as the benchmark for the other cases. However, taking this approach, the definition of the archetypes becomes sample-specific. Another sample might have shown cases with an even closer resemblance to the theoretical descriptions of the archetypes, and, –if the theory is correct-, these cases inhibit higher levels of effectiveness. Moreover, archetypes are theoretical constructs and they represent organizational forms that might exist (Doty and Glick, 1994) rather than forms that will actually occur. Starting from empirics could thus narrow down the definition of the constructs. Alternatively, for empirical specification one might think of selecting those cases with the highest effectiveness and use their MCSs as benchmarks. This, however, would render tests of the theory invalid: those cases that resemble the archetypes would be effective by definition, which makes it impossible to test the theoretical ideas. The theoretical approach has advantages over the empirical approach: there are no restrictions because of sample and a closer fit with theory can be established. Purely based on the interpretation of the theory, expert raters or the theorist himself, set values for all variables that describe the archetypes. Also in my study, during a range of discussions with the theorist himself every archetype obtained scores on all 20 variables that map the MCSs. Thus, taking on this approach, the theoretical and the empirical profiles are matched. 74 Operationalizing the framework Table 4.21 shows the values assigned to each archetype: arm’s length control (ALC), result oriented machine control (MR), action oriented machine control (MA), exploratory control (EC), and boundary control (BC) respectively. Table 4.21 Operationalizing the archetypes – defining fit 1 work autonomy ALC 4 MR 3 MA 3 EC 3 BC 4 2 amount of discussion <3 2 or 3 2 or 3 >2 <3 3 >3 >3 <4 <4 <4 irrelevant irrelevant irrelevant <4 <4 5 use of financial targets use of non-financial targets benchmarking >3 irrelevant irrelevant irrelevant irrelevant 6 boundaries irrelevant irrelevant irrelevant irrelevant 3 7 role of budget irrelevant 1 1 1 0 or 3 8 irrelevant 0 0 1 irrelevant 1 1 1 1 1 10 emergent targets percentage variable reward importance in performance evaluation: target achievement 4 4 3 3 3 11 compliance with rules 3 irrelevant 4 3 3 12 long term achievements 4 4 3 4 3 13 subjective judgements <4 <4 <4 >3 >2 4 9 importance in rewarding: 14 target achievement >3 >3 <4 <4 <4 15 budget versus actuals redundant >3 irrelevant <4 irrelevant 16 compliance with rules 3 3 4 3 3 importance for career: 17 target achievement >3 >3 <4 <4 <4 18 budget versus actuals redundant >2 <4 <4 <4 19 compliance with rules 3 irrelevant 4 3 3 20 professional skills irrelevant irrelevant irrelevant >3 >3 Note: the grey boxes indicate extreme misfits: a deviation regarding the particular variable implies a substantial deviation from the archetype. Therefore, if a misfit occurs for this value, its score will be multiplied by three when calculating misfit. Note 2: ‘equal to or larger/smaller than’ signs are used for summated variables. The definition of the archetypes is based directly on an interpretation of the theory, but the choices made are largely consistent as well with findings from contingency studies within the field. On the one hand, this link is established through the MCtce itself that builds from prior research and integrates existing knowledge. On the other hand, the choices made in the operationalization reinforce this link. The variables that together map an MCS were not chosen arbitrarily, but relate for a large part to previous studies (see paragraph 4.1). The same holds true for many of the underlying hypothesised relationships between the individual elements of control and the contingency variables. Consider for instance, a summary of findings from contingency studies on management control system design by Chenhall (2003). 75 Chapter four Regarding the role of task uncertainty, he writes that high task uncertainty is associated with less reliance on standard operating procedures, behaviour controls, and accounting performance measures. Task uncertainty is also associated with more personal controls, higher participation in budgeting, and usefulness of broad scope MCS (Chenhall, 2003, p141). The operationalization of the archetypes is consistent with these findings. As opposed to the other archetypes, exploratory control and boundary control are both related to activities with high uncertainty27. From the definitions it become clear that these archetypes rely less on target achievement for performance evaluation and more on subjective judgements. Moreover, the primary role of the budget is not to act as a performance target, but to give guidance or set limits. The insights from contingency studies on MCS design stem mainly from empirical work that focuses on a single contingency variable and a single control variable (Fisher, 1995; Chenhall, 2003). Therefore it is impossible to justify all values set for individual elements of the definitions by referring directly to prior work. The discussions were a necessary tool to operationalize the archetypes and both integrate existing knowledge and expand it to a setting with multiple contingency variables studying packages of interrelated control instruments. For most variables, we did not define a single round number, but a range of values (see table 4.21). For instance, the archetype arm’s length control has a value of four or five on the variable work autonomy. Recall that semantic differential scales were used for measurement of most variables. The meaning attached to each value of these scales provided guidance for the decisions on the exact values. For instance, to measure work autonomy managers indicate the amount of influence they have in several decisions. In the scaling the value four implies ‘quite a bit influence’, whereas the value three relates to ‘some influence’. The archetypes arm’s length control and boundary control are both characterised by high levels of work autonomy and thus obtain scores of at least four in the definition. Another aspect of the operationalization is the assignment of weights to some of the variables (in table 4.21 this is indicated by the grey boxes). There are two reasons for assigning weights. First of all, some variables are extremely important in characterising the archetype. A misfit on one of these variables is considered to be an extreme misfit, a true deviation from the essence of the archetype. Every archetype has its own extreme misfits: not using market 27 It is difficult to compare results for uncertainty directly across studies. Different definitions of the construct are used and its measurement is as diverse. Here the comparison is also just a general one because my measure of uncertainty encompasses much more than task uncertainty. In terms of their effects both can be compared however. 76 Operationalizing the framework related benchmarks in an arm’s length control structure is an extreme misfit because it truly differs from the essence of the archetype. Similarly, a budget should be used primarily as a target to be achieved when thinking about result oriented machine control. For action oriented machine control pre-set targets are extremely important (as opposed to emergent targets), whereas emergent targets characterise exploratory control. The essence of the latter archetype is also grasped by a focus on long term achievements, work related discussions, and use of subjective judgements. Finally, boundaries are extremely important for boundary control, but do not play a distinguishing role in the other archetypes. The second reason for applying weights is that some variables represent one of the dimensions of control by themselves, whereas other dimensions are represented by multiple variables. Without taking those differences into account, an imbalance would be introduced leading to the domination of some aspects of the management control system in misfit calculation. This is the case for work autonomy, which is measured by one composite variable and therefore gets extra weight in the misfit calculations. The next paragraph exactly describes the calculation of misfit. 4.4.2 Calculating misfit The observed scores for the MCSs and the theoretical scores for the archetypes can now be compared one on one. Since I am interested in overall dissimilarity between both profiles, though, I want to measure the overall distances. For this purpose, I calculate Euclidian distances, a generally accepted measure28 both within contingency and configurational research (Doty et al., 1993; Doty and Glick, 1994; Donaldson, 2001). The concept is easy to understand: distance between two points is measured as a direct line connecting them. Moreover, measurement of Euclidian distances can easily be extended to a multi-dimensional space (Hair, Anderson, Tatham, and Black, 1998) and hence enables the comparison of profiles. To obtain Euclidian distances, firstly I calculate the deviation between the archetype and an MCS’s score for each of the 20 variables 29. Thus, a simple subtraction suffices. Consider the following example: a case obtains a score of 4 on the variable ‘amount of discussion’. Fit with the archetype arm’s length control is defined as a score of either 1 or 2. The deviation 28 Apart from Euclidian distances, researchers work with Squared Euclidian distances. This saves one step in the calculations. Note that this has implications for the hypothesized relationship with effectiveness, however. 29 Definitions of the archetypes do not cover all 20 variables, because some of these are irrelevant or redundant. This differs per archetype (see table 4.21). 77 Chapter four between the case and the archetype arm’s length control is then given by 4-2 = 2. Next, the deviations just calculated are squared. These values are summed. Finally, misfit can be calculated by taking the square root of the sum. Table 4.22 below illustrates the process for some of the variables. Table 4.22 Calculating misfit archetype: arm’s length control observed MCS: case 21 (Xalc – Xmcs) work autonomy 4 3 1 amount of discussion <3 2 0 4 variable use of financial targets >3 2 irrelevant 4 - benchmarking >3 3 1 (… more variables…) … … … use of non-financial targets 2 1/2 MISFIT = (3*1 + 0 + 4 + 3*1 +...) Note: grey boxes indicate extreme misfits. The squared deviation of these values is multiplied by three before entering the formula. Details follow below. This general approach, however, needs a few additional steps to handle nominal variables (1), routed variables (2), and weighting of variables (3). Next, I will explain all three issues and their solutions. Nominal variables In my study I use both ordinal and nominal variables. For nominal variables the distance between scores 1 and 2 should be the same as between scores 1 and 3, because these only indicate membership to a certain group and no degree of a variable. This can be easily taken into account when calculating the distances for each variable separately. However, nominal variables generate a maximum deviation of 1 point, whereas ordinal variables can generate 2 or 3 points. To make them comparable and give them equal weights, I rescaled the absolute deviations to a range of 0-3 (where zero implies a fit and three the largest possible misfit). Routing Due to routing in the questionnaire, scores on three variables related to bonuses depend on whether there is a bonus or not. When scores on variables depend on one another and as a group provide information about the MCS, their misfit scores should also be treated in combination. This is the case for the variables in my study that assess whether managers obtain a variable financial reward and when they receive this reward30. Firstly, the variable 30 The relevant questions have numbers 19, 21, and 22 in the questionnaire. The exact questions are to be found in appendix B. 78 Operationalizing the framework ‘percentage variable reward’ measures how large the variable financial reward is (ranging from zero to more than 15%). The next three variables indicate the importance of targets (1), attaining the budget (2), and compliance with behavioural guidelines (3) for being granted this financial reward. Simply combining the deviation scores for each of these variables, could be misleading. Therefore I take them as a group and combine their scores as follows. If not having a bonus implies fit, cases that indeed do not have a bonus automatically score a perfect fit on all three variables. Cases that do have a bonus, obtain regular deviation scores. This way it all adds up: a large bonus adds more to misfit, than a small bonus. A bonus based on target achievement instead of compliance also brings an extra increase to misfit. Basically, a small bonus based on compliance with behavioural guidelines indicates a smaller misfit, than a large bonus based on performance targets. Correspondence to the definition of the archetypes is thus established. If having a bonus implies fit, not having one leads to misfit. However, cases that do have a bonus can still obtain misfit scores if the bonus is based on compliance instead of on achievement of performance targets. This way, having a bonus based on the ‘wrong’ things is worse than not having a bonus at all. Setting weights In the misfit calculation some deviations are multiplied by three. As already mentioned, there are two reasons for applying weights when calculating misfit: the first is the existence of extreme misfits, the second is imbalance in the number of variables that represent each of the dimensions of control. Misfit scores that obtain extra weight in the misfit calculation are multiplied by three before being summed. This weight of three is decided upon because in cases where multiple variables represent one dimension of control, together these could create an approximately three times larger misfit than other variables by themselves. Taking all the above into account, I calculate misfit between each case and all four archetypes, not just the relevant archetype. Standard procedures for calculating Euclidian distances (in SPSS for instance) cannot help out and I decided to do all calculations ‘by hand’ building syntaxes in SPSS. To make misfit scores on different archetypes comparable, I rescale them to a range of 1-5. Note that several combinations of misfit on single items can lead to the same amount of overall misfit. The next paragraph provides details on the misfit variables. 79 Chapter four 4.4.3 The misfit variables Figure 4.6 shows the misfit scores on the Y-axes grouped per archetype: arm’s length control, result oriented machine control, action oriented machine control, exploratory control, and boundary control respectively. Figure 4.6 Misfit scores compared 5 4 3 2 1 arm’s length machine result machine action exploratory boundary A closer look at the misfit variables reveals that none of the cases in my sample closely resembles the archetypes exploratory control or boundary control. A number of cases resemble arm’s length control; one case even shows a perfect fit. The same holds true for result oriented machine control. For this archetype four cases in the sample show a perfect fit. A quick scan of type of organization, number of employees, and some other descriptive variables for these perfect fit cases, does not disclose any unique characteristics. Table 4.23 shows the descriptive statistics for the misfit variables. Table 4.23 Descriptive statistics for the misfit variables scale 1-5 misfit arm’s length control N 248 min 1.00 max 3.79 mean 2.30 std. deviation 0.49 misfit result oriented machine control 243 1.00 3.62 2.49 0.65 misfit action oriented machine control 243 1.51 4.28 3.14 0.49 misfit exploratory control 243 1.70 4.07 3.01 0.47 misfit boundary control 244 1.88 4.07 3.19 0.42 80 Operationalizing the framework 4.5 Control variables Ultimately, the effectiveness of the management control system is the dependent variable in this study. I am primarily interested in the effect of misfit on the effectiveness, but other variables might have an impact as well or influence the relationship. Therefore a number of control variables are taken in. Expectations about the role or influence of these control variables will be discussed in chapter five. This paragraph describes their measurement. First of all, I incorporate the effectiveness of the organizational unit into the analyses. To build a measure of the effectiveness of an organizational unit, I combine ideas from different studies. Hitt and Middlemist (1979) and Hitt, Ireland, Keats, and Vianna (1983) developed a method to establish the criteria managers use for the assessment of their unit’s effectiveness. Their method is extensive, and cannot be replicated in this study, but part of their results is useful. During the first stage of their research, the authors conduct interviews with unit managers in which they identify a range of criteria used to assess the effectiveness of organizational units. These criteria are all related to achievement of organizational goals. From this list with criteria, I select those that seem generally applicable. I also use some items from a measure by Van de Ven and Ferry (1980). They measure perceived unit performance. I combine the items from the before-mentioned studies and add a few additional ones. Table 4.24 shows all items I use to measure effectiveness of the organizational unit. Table 4.24 Items for the effectiveness of the organizational unit (Q84) Cronbach’s = 0.845 component loadings a achievement of subunit goals b cooperation with other subunits 0.726 0.540 c satisfaction of internal customers of your subunit 0.692 d satisfaction of external customers of your subunit 0.723 e employee satisfaction 0.616 f compliance with standards and behavioural guidelines 0.526 g problem-solving ability of the subunit 0.540 h the quantity or amount of work produced 0.599 i the quality or accuracy of work produced 0.693 j overall subunit performance 0.859 Source: items a, h, i: Van de Ven and Ferry, 1980, Q93 (p455); items a, b, e, f, g: based on Hitt and Middlemist, 1979 and Hitt et al. 1983 Factor analysis shows all items load on a single construct. Cronbach’s alpha reveals the reliability of the scale. Its value is high (0.845). Based on these results, the ten items are summed and averaged to obtain a summated scale. In cases where managers replied either ‘not applicable’ or when a value is missing, the average of the items left is taken as a score for unit effectiveness. The smallest remaining number of items is seven (two cases). Managers 81 Chapter four indicated both good and bad performance with a mean score of 5.06 and sufficient variance (see table 4.25). Table 4.25 Descriptive statistics for unit effectiveness scale 1-7 unit effectiveness questions Q84 N 258 min 2.80 max 6.50 mean 5.02 std. deviation 0.69 Cronbach’s alpha 0.845 Apart from unit effectiveness, other aspects of the unit should be considered. The following variables act as control variables as well in the analyses: the number of employees who work on the main activity (a proxy for size), the number of steps up the hierarchy, and a dummy variable indicating profit versus not-for-profit companies. In the overall sample the share of not-for-profit companies is 20%. Descriptive statistics on these variables are to be found in chapter three (3.6), and repeated here for ease of reference. Table 4.26 Descriptive Statistics variable nr of steps to CEO N 258 minimum 0 maximum 10 mean 2.22 std. deviation 1.39 nr of main activity employees 258 2 3400 88 323 4.6 Closing remarks In this chapter I have operationalized most steps of the methodological framework. Of central concern was establishing a match between the theoretical constructs and their measures, and assessing the validity and reliability. Apart from describing the measurement of the variables and introducing new measurement instruments with a general applicability, some issues of broader importance were addressed. As discussed in previous chapters, empirical work that studies management control systems in their entirety and measures the effectiveness of control are extremely scarce, yet needed to learn about the workings of control. Therefore, my study contributes to research within the field of management control by exemplifying one approach to conduct this type of study. This chapter in particular presented methodological steps that enable the study of MCS design and effectiveness. Another topic that extends beyond the scope of this particular project, concerns scale construction. I have suggested ways to improve general measurement of two main TCE variables: uncertainty and asset specificity. To better grasp the essence of these constructs, I suggest working with formative indicator models and could built one myself for uncertainty. The next chapter deals with the final steps of the methodological framework: determining the relevant archetype and relating misfit to effectiveness. It reports on and discusses the analyses and results of this study. 82 Chapter five - Analyses and results The goal of this research project is to gain insights into management control system design and effectiveness. Therefore, five hypotheses were formulated based on the claims of the Transaction Cost Theory of Management Control (MCtce). The previous chapters dealt with several methodological steps that enable the testing of these hypotheses. Below, the methodological framework reappears highlighting the final steps (figure 5.1). This chapter reports on those steps and describes hypotheses testing. Apart from these direct tests of the theoretical ideas, other subjects are addressed. The first paragraph discusses the identification of the relevant archetype. This is the final preparatory step before we can relate misfit to effectiveness. To determine the relevant archetype, several alternatives are available and a discussion of the choices involved follows. The determination of the relevant archetype results in the allocation of each case to one of the archetypes. In this way five subsamples are created and paragraph 5.2 will compare these on general characteristics. Next, before testing the hypotheses, in paragraph 5.3 I explore the occurrence of fit and misfit and the predictive power of the theory. Then, paragraph 5.4 reports on hypotheses testing. This paragraph firstly describes the general model used for the analyses. Then, it deals with each of the hypotheses separately and reveals whether misfit does influence effectiveness or not. As will become clear though, I cannot test all five hypotheses. In the last part of this chapter, two more topics are covered. Paragraph 5.5 studies the relationship between the contingency variables and misfit, and focuses particularly on the complex role of uncertainty. Finally, additional tests provide insights in the relation between MCS effectiveness and the effectiveness of the organizational unit. This chapter ends with a summary of the findings. Figure 5.1 Methodological framework define define activity determine relevant archetype architype compare MCS with relevant relevant archetype archetype map MCS misfitt to link misfi effectiveness effectiveness measure measure MCS MCS effectiveness effectiveness 83 Chapter five 5.1 The relevant archetype Recall that the theory submits that the more a management control system resembles its relevant archetype, the more effective it will be. The relevance of an archetype depends on the activity of the organizational unit. Three variables (uncertainty, asset specificity, and ex-post information asymmetry) characterize the activities, and identify those situations each archetype can handle most effectively. For instance, machine control excels when low uncertainty and high asset specificity combine. However, two types of machine control exist: the action oriented type and the result oriented type. Previously (see 2.2.4), I explained my choice for an additional contingency variable to distinguish between the two: the measurability of outputs. Taking in this extra variable leads to a refinement of the original figure from Speklé (2001b) that illustrates the effective matches between activities and archetypes, as shown below (figure 5.2). Distinguishing groups of cases that share the same relevant archetype is a prerequisite for the testing of the theoretical claims. The claims on effectiveness concern contingent effectiveness. The circumstances determine which management control system will perform relatively the best, whereas the archetypes represent these effective control systems. The theory specifies the effects of misfit regarding the relevant archetype only. RESULT machine control Moderate Low High Low ACTION machine control ARM’S LENGTH CONTROL Low High Uncertainty EXPLORATORY CONTROL Ex-Post Information Asymmetry BOUNDARY CONTROL Measurability of outputs High Figure 5.2 Archetypes of control and their habitat High Asset Specificity Note: in case of high measurability of outputs, uncertainty is lower than in case of low measurability of outputs. Source: Speklé, 2001b, p76, modified The MCtce indicates the levels of the contingency variables in qualitative terms of high, moderate, and low. Therefore, in the operationalization of the theory, I have to set the boundaries: what is high and what is low? For this purpose several alternatives are available, however, each of them involves making arbitrary choices. Starting from empirics, the mean 84 Analyses and results and median seem to provide natural boundaries and indeed samples are usually split in half (see for instance, Gresov, 1989). However, this choice assumes that about half the sample experiences low levels of the contingency variables, whereas the other half faces high levels. Although the dataset contains much variety in terms of activities and therefore covers a broad range of values for the contingency variables, claims as to why a division into high and low would be equal within the sample or the population, cannot be justified. Also from a theoretical point of view, there is no reason to expect situations of, for instance, high information asymmetry to be as common as situations of low information asymmetry. Several alternatives exist to set the boundaries though. One of them is to determine the boundary based on the meaning attached to values of a semantic scale. This works fine for information asymmetry: any case with values four or higher bears the mark ‘high’. In the scaling that runs from 1 to 5 this value implies a ‘significant advantage’. Respondents who thought they have a significant information advantage on a certain item circled the number four. Measurability of outputs receives the same treatment. Its boundary is set at the score of three, which implies that outputs are ‘somewhat’ measurable. Any case with values three or lower bears the mark ‘low’. The alternative chosen for information asymmetry and measurability of outputs is not feasible for uncertainty and asset specificity, because the semantic meaning of different values in these scales was lost during their composition. For instance, I cannot claim that the value four in these scales represents ‘quite a bit uncertainty’. However, it is possible to determine the highest and lowest scores respondents could have obtained given the data and the measurement scales ensued. Even though not all of these scores actually show up in the sample, they identify the range of all possible scores. Following this line of reasoning, I decide to use the midpoints of the scales for uncertainty and asset specificity as boundaries. Note that in case of asset specificity, the MCtce only pertains to moderate and high levels, whereas I distinguish between low and high levels in the operationalization. Like standard TCE, the theory assigns activities with low asset specificity to the domain of the market (Speklé, 2001a). Thus, in case of effective alignment only those activities that inhibit at least moderate levels of asset specificity are present within the hierarchy. However, as discussed in chapter two (2.3), within the hierarchy sometimes misalignments (misfits) will occur between the activities and mode of governance. The same holds true for alignment of activities with either markets or hierarchies. Misalignments will occur at this level as well. Assuming that only cases with moderate and higher levels of asset specificity show up in my dataset is thus 85 Chapter five inappropriate. Therefore, I stick to the simple division into low and high, without trying to restrict the scale of asset specificity to moderate levels. Table 5.1 shows descriptive statistics for the contingency variables and compares the boundaries chosen to the empirical mean and median. A t-test shows that for all four variables the boundary significantly differs from both the mean and the median. For uncertainty, asset specificity, and ex-post information asymmetry the boundary exceeds both mean and median. The boundary of measurability of outputs is significantly lower. The implications are the same: working with these boundaries instead of with the means or medians, assumes that less than half the organizational units in my sample face circumstances more difficult to control, with high levels of uncertainty, asset specificity, and information asymmetry, and less possibilities to measure the outputs. Table 5.1 Contingency variables and their boundaries N scale min max mean median boundary uncertainty 254 1-5 1.28 4.05 2.39 2.33 3.00 measurability of outputs 258 1-5 1.00 5.00 3.72 4.00 3.00 asset specificity ex-post information asymmetry 245 1-5 1.58 4.37 2.93 2.90 3.00 255 1-5 1.00 5.00 3.37 3.43 4.00 Together these four contingency variables and their boundaries identify the relevant archetype for each case. In determining the relevant archetype, firstly, cases are split up into groups with low ( 3.00) and high (> 3.00) uncertainty and then, cases with low uncertainty are further separated into groups with low ( 3.00) and high (> 3.00) asset specificity. This second split identifies the groups arm’s length control and machine control respectively. As a further step, the distinction between low ( 3.00) and high (> 3.00) measurability of outputs is made within the group of low uncertainty and high asset specificity. This distinguishes the groups for action oriented and result oriented machine control. Note that (the reverse of) measurability of outputs is a form of uncertainty and as such already a part of that construct. As a consequence, activities that belong to action oriented machine control inhibit more uncertainty than activities that belong to the result oriented type. Overall though, uncertainty is still low for both groups. Finally, the cases with high uncertainty fall into separate groups characterized by low (< 4.00) and high ( 4.00) ex-post information asymmetry, which returns the groups exploratory and boundary control. This process leads to the allocation of all cases over the archetypes as shown in table 5.2. The table reveals that most activities in the sample combine low uncertainty and low asset 86 Analyses and results specificity. Thus, cases that have arm’s length control as their relevant archetype form the largest subgroup. Table 5.2 Relevant archetypes archetype frequency percentage arm’s length control 120 49.2 result machine control 66 27.0 action machine control 19 7.8 exploratory control 24 9.8 boundary control 15 6.1 total 244 100.0 missing values 14 To check whether the allocation indeed returns groups that significantly differ in mean levels of uncertainty, asset specificity, and ex-post information asymmetry, I run a number of tests. If variation in the contingency variables is limited or lots of cases are on or close to the boundary, the groups might not actually differ. To compare the groups nonparametric tests are called for because sample size differs largely among the groups, and some are small. Nonparametric tests are valid when working with small samples (Newbold, 1995). They are insensitive to outliers and distribution free: no normality assumptions are required. Firstly, a Kruskal-Wallis test for equality of means 31 (this is the nonparametric alternative to a one-way ANOVA) indicates that the groups differ significantly. For pair wise comparisons I use Mann-Whitney tests (the nonparametric alternative for a t-test). Significant differences between groups show up as desired. The mean score on uncertainty of both exploratory and boundary control, for instance, significantly exceeds that of arm’s length control and machine control. Moreover, action oriented and result oriented machine control differ significantly on measurability of outputs. Figure 5.3 shows how the cases spread out when compared on scores for the contingency variables. Panel A compares the scores on uncertainty and asset specificity, whereas panel B confronts uncertainty and information asymmetry. The scatters show how the boundaries determine the allocation of cases over the archetypes. Although a number of cases lie very close to or on the boundaries, most cases are spread out. The choices made to determine the relevant archetype apparently return distinct groups that can be used for hypotheses testing. 31 These nonparametric tests do not actually test for differences in means, but rather test for differences in central location using rank orders. In this case, both are pretty much the same and I use the word means because it is easy to interpret. 87 Chapter five Figure 5.3 Boundaries that determine the relevant archetypes Panel A uncertainty 4,00 3,00 2,00 1,00 1,50 2,00 2,50 3,00 3,50 4,00 4,50 asset specificity Panel B uncertainty 4,00 3,00 2,00 1,00 1,00 2,00 3,00 information asymmetry 88 4,00 5,00 Analyses and results The choices I make to set the boundaries are motivated, but arbitrary. Alternative choices would have rendered different allocations of the cases over the archetypes and that might impact the results of this study. When testing the hypotheses, therefore, I perform robustness checks by shifting the boundaries and assessing the impact. Before testing the claims of the MCtce directly, however, I compare the groups just created and explore the differences between them. 5.2 A comparison of the subsamples In the previous paragraph determination of the relevant archetype resulted in the allocation of all cases over five groups or subsamples. These groups differ in terms of their activities’ scores on the contingency variables. Because the sample holds a large variety in organizational units it is interesting to compare the groups on general characteristics as well. In this paragraph, I compare the groups on types of organizational units represented, share of profit versus not-for-profit organizations, hierarchical level, and the number of unit employees. Like before, nonparametric tests are appropriate32. First of all, table 5.3 compares the types of organizational units represented in each of the subsamples and the overall sample. The table shows the absolute number of units. The category ‘other’ is not further explored because of the large diversity in unit types within this group. 32 Since the goal is basically profile analysis, use of discriminant analysis might spring to mind. However, large difference in sample size among the groups likely renders unreliable results (Hair, et al., 1998). Therefore, I confine myself to pair wise comparisons and nonparametric tests. 89 Chapter five Table 5.3 Division of cases of types of units within each subsample and overall Arm's Length 26 Result Machine 13 Action Machine 3 Exploratory Control 3 Boundary Control 1 total sample 48 Production Finance and Accounting Division 15 13 1 2 0 32 16 6 3 5 1 31 13 4 0 0 1 19 ICT 5 3 3 5 2 19 Marketing or sales 9 7 0 0 1 17 Project unit 12 4 0 0 0 17 Human resources Research and Development Company as a whole 7 2 2 1 2 14 3 1 2 2 2 11 2 1 0 1 0 8 Facility management 1 2 1 0 1 5 Treasury 3 1 0 0 0 4 Legal department 0 0 1 1 1 3 Other 8 9 3 4 3 20 Total 120 66 19 24 15 258 type of unit Business unit The division over types of organizational units within the group arm’s length control differs slightly from the overall sample: there are relatively more business units, divisions, and project units, whereas units ICT and research and development are represented less. Business units form the largest group. The result oriented type of machine control shows a different picture. Compared to the overall sample, production departments and units marketing and sales are relatively more represented, whereas units finance and accounting as well as units research and development only have small shares within the subsample. Business units and production units are the dominant types. The share of production units within the subsample is larger than in the other groups. Within the group action oriented machine control, I find that divisions, units marketing and sales, and project units are not represented at all, and the percentage of production units is much smaller compared to the overall sample. There is no dominant type, but business units, units finance and accounting, and ICT units form the largest groups and have equal shares. The subsample exploratory control shows largely the same picture as action oriented machine control. Within the subsample, divisions, units marketing and sales, and project units are not represented at all, whereas the percentage of units ICT and finance and accounting are the largest within the subsample and much larger than in the overall sample. 90 Analyses and results Finally, boundary control cases show a relatively large number of human resources units and units research and development. Production units and project units are not present within this subsample. The subsamples can also be compared over other general characteristics. Table 5.4 shows the percentages of profit and not-for-profit organizations for each group and for the overall sample. Table 5.4 Profit versus not-for-profit organizations within each subsample percentage Arm's Length Result Machine 86 Action Machine 79 Exploratory Control 62 Boundary Control 60 total sample 80 profit 83 not-for-profit 17 14 21 38 40 20 total (%) 100 100 100 100 100 100 The share of not-for-profit companies within the groups arm’s length control and action oriented machine control does not differ much from the overall sample (about 20%). Within the groups exploratory and boundary control, however, the relative share of not-for-profit organizations is much larger. Result oriented machine control has the smallest share (14%). Next, I use Mann-Whitney tests for pair wise comparisons between the subsamples to explore differences in terms of hierarchical level and the number of unit employees. No clear profiles for the groups appear but some differences occur. Significant results (p-values of 0.10 or less) are summarized next. Regarding hierarchical level, I find that units from the group result oriented machine control are present at lower levels within the hierarchy than units arm’s length control and action oriented machine control. Regarding the size of the unit, I test for differences in number of unit employees. Nonparametric tests assess differences in the median rather than the mean. Therefore these tests are robust to the influence of outliers. When studying the variable unit employees this is particularly relevant because within the overall sample this number varies from 4 to 10,800. Some differences among the subsamples are significant: organizational units within the group result oriented machine control are significantly bigger than all other units. The smallest units belong to the subsample boundary control. The other groups do not differ significantly from one another. The subsamples are compared on a number of characteristics and table 5.5 summarizes the main findings. 91 Chapter five Table 5.5 Distinguishing features of the subsamples related to the archetypes Arm's Length Result Machine Action Machine Exploratory Control Boundary Control business units are largest group; units at higher hierarchical levels relatively large shares production units and business units; lower hierarchical levels; relatively large number of employees relatively large shares for finance and accounting units and ICT; no divisions, marketing or sales, or project units relatively large shares of finance and accounting units and ICT; no divisions, marketing or sales, or project units relatively large share human resources and R&D; relatively small In sum, regarding unit types some differences among the subsamples exist. The cases are allocated according to the characteristics of their activities and therefore some of these differences might be expected to exist. Generally speaking, it is not surprising to find a relatively large number of production units within the sample of result oriented machine control, which are found at lower hierarchical levels and employ a large number of people. The activities conducted within production departments will usually be more programmable and thus exhibit lower levels of uncertainty. In the same vane, from the outset I would expect units research and development to be present within the groups of exploratory or boundary control. These activities are difficult to program and might be characterised by high levels of information asymmetry. However, the divisions of organizational unit types are not definite. Units human resources, for instance, are represented in all subgroups and the groups action oriented machine control and exploratory control largely exhibit similar characteristics. In chapter three I stressed the importance of distinguishing activities from organizational units because units generally have multiple activities that might differ in terms of their levels of uncertainty, asset specificity, and ex-post information asymmetry. Indeed, only 12% of the managers in the overall sample indicated that their unit has only one activity. Therefore it is not surprising to see human resources units turn up in each subsample. There are two explanations: human resource units have different functions within different organizations, or within these units multiple activities take place, which differ in terms of uncertainty, asset specificity, and information asymmetry. Going back to the raw data, I find some evidence to support this view. In the questionnaire managers were asked to specify up to five activities of their units and choose one to focus on throughout. Comparing these descriptions for the units human resources, at least two types of activities can be distinguished: within the groups exploratory and boundary control activities are related to HR policy making and internal HR consulting. For arm’s length control and machine control managers chose activities like recruitment and administrative tasks. Thinking again in terms of programmability of tasks and measurability of outputs, -or about uncertainty 92 Analyses and results more general-, these differences fit the theoretical ideas. Activities that inhibit lower uncertainty end up in the group arm’s length control. I also compare the activities between the ICT units from action oriented machine control and exploratory control. For the group exploratory control activities chosen are mostly related to IT projects, whereas the activities within the other group relate to policy making and strategic decisions. I find no clear differences in activities among these groups for the finance and accounting units. From the analysis above, it appears that testing for differences between the subsamples does not lead to big surprises. Since the samples are generated based on the characteristics of their activities and compared on other characteristics, this provides some evidence for the quality of the data. Apparently, the underlying characteristics of the activities are picked up well. It also stresses the importance of my approach to specifically focus on activities instead of on organizational units as such. This assures the best correspondence between the theoretical and empirical levels of analysis. In this paragraph I have compared the subsamples on their general characteristics. The next step will be to compare them in terms of their misfit scores. This way I will explore the predictive power of the theory. 5.3 The occurrence of fit and misfit In this paragraph, the subsamples will be compared once more, but focus will be on the misfit variables. Recall that for each case a misfit score regarding each archetype has been calculated (misfit regarding arm’s length control, misfit regarding result oriented machine control, et cetera). In chapter two (2.3) expectations as to the occurrence of fit as well as misfit were discussed. To summarize the arguments: on the one hand, cases are expected to resemble their relevant archetype because that is effective. On the other hand, states of misfit will occur routinely because of complexities and during times of change. To research the issue I can compare misfit scores on all archetypes within each subsample, and I can compare misfit scores for each archetype between the subsamples. Firstly, I will compare the average misfit regarding each of the archetypes among the subsamples. The cases within each subsample are expected on average to resemble their relevant archetype more closely than all other cases resemble the same archetype. If organizations strive for effectiveness, and thus for resemblance with the relevant archetype, 93 Chapter five but mistakes are made, then average resemblance might be larger within the sample. As before, Mann-Whitney tests are the appropriate tests to make these comparisons. Table 5.6 summarized the results. Table 5.6 Differences in the average misfit: subsamples against all other cases misfit variable misfit arm’s length control sample sizes of groups compared 118 120 average misfit score sig. lower 0.093 misfit result machine control 63 170 no difference 0.554 misfit action machine control 18 215 (lower) 0.199 misfit exploratory control 22 211 lower 0.002 misfit boundary control 14 220 (lower) 0.178 Note: results are based on the Mann-Whitney U statistic for pair wise comparisons. This nonparametric test is based on rank orders rather than actual means. When I compare the cases within the subsample arm’s length control to all other cases, I find that within the subgroup, units score on average significantly lower on misfit regarding arm’s length control. For misfit regarding exploratory control this result also appears. There is no difference in average misfit regarding result oriented machine control between the subsample and all other cases. Thus, the cases in this subsample do not distinguish themselves from the other cases in their average level of misfit regarding result oriented machine control. Finally, for the groups action oriented machine control and boundary control, the outcome is less convincing: the average misfit scores within the groups seems to be lower, but this result is only significant at the 20% level. Considering the small sample sizes in these tests, however, only strong effects are likely to be picked up at all, therefore the result is worth mentioning. Moreover, one-directional testing is justified because of theoretical expectations (Newbold, 1995). Above I have compared the average misfit regarding each of the archetypes among the subsamples. From the outset I expected the average misfit regarding the archetypes to be lower within their subsample than within the rest of the sample. This is indeed the case for two of the subsamples. For two other samples the effect is also present but weak, and in only one subsample there is no effect. I find no results contrary to expectations, however. Additional insights might be gained by comparing misfit scores within each archetype group. Table 5.7 shows how many cases resemble their predicted archetype more closely than they resemble any of the other archetypes. Recall that for each case a misfit score with each archetype has been calculated. These misfit scores are compared and the lowest determines a new group ‘minimal misfit’. This comparison sheds light on the predictive power of the 94 Analyses and results theory. If the theory can help predict which MCS will occur under specific circumstances this is directly relevant to the study of control structure variety. Table 5.7 Fit between an MCS and its relevant archetype minimal misfit score Action Exploratory Machine Control 9 9 Arm's Length 44 Result Machine 50 Result Machine Control 25 27 3 8 0 63 Action Machine Control 6 6 2 2 1 17 Exploratory Control 5 6 4 6 1 22 Boundary Control 3 4 1 5 1 14 Total 83 93 19 30 7 232 relevant archetype Arm's Length Boundary Control 4 total 116 For 44 cases, the MCtce would predict the archetype arm’s length control to prevail and the case’s MCS indeed resembles that archetype more closely than any of the others. Overall, a total of 80 cases are predicted correctly, making up 34.5%33. Although there is no significant linear association between the rows and the columns, the distribution of cases is not uniform either. The biggest mistakes in predicting minimum misfit occur between arm’s length control and result oriented machine control. A large number of cases that mostly resemble arm’s length control fall into the group result oriented machine control (25 cases). The opposite also occurs (50 cases). This triggers the question whether I am able to distinguish the two groups well enough. The two misfit variables are significantly positively correlated. However, the correlation is moderate (r = 0.391; p = 0.000). Cases do obtain different misfit scores. The minimum of these misfit scores determines the minimal misfit group. Differences between the scores might be very small or even zero. From the 75 cases, however, only 8 have very small difference scores (less than 0.10 on a five point scale). Taking these possible disturbances into account will not alter the findings in terms of correctly predicted cases. It is important to realize that this minimal misfit can be relatively small, but still substantial in absolute terms. Previously, chapter four showed that none of the cases in my sample closely resembles the archetypes action oriented machine control, exploratory control or boundary control. A number of cases closely resemble arm’s length control and result oriented machine 33 Studying relationships within a cross-tabulation, one could run a Likelihood ratio test to compare the predictive power to a random prediction. However, this is not helpful here, because for the random prediction all possible cells are taken into account and I am only interested in the diagonal. 95 Chapter five control. The 7 cases in table 5.7 that mostly resemble boundary control thus have large misfits whereas the 83 cases for arm’s length control can have both large and small misfit scores. It is also possible to compare the maximum misfit scores within each archetype group. Table 5.8 shows this comparison. 16 cases resemble their relevant archetype least of all (7%). Again, there is no clear association, but the distribution is not uniform. Table 5.8 Maximum misfit between an MCS and its relevant archetype relevant archetype Arm's Length Arm's Length 3 maximum misfit score Result Action Exploratory Machine Machine Control 2 7 36 Boundary Control 68 total 116 Result Machine Control 0 3 6 18 36 63 Action Machine Control 1 3 0 5 8 17 Exploratory Control 0 7 0 5 10 22 Boundary Control 0 2 1 6 5 14 Total 4 17 14 70 127 232 All in all, the analyses in this paragraph return the following tentative results: within four subsamples cases on average resemble the relevant archetype more under the predicted circumstances than under other circumstances. Moreover, almost 35% of cases can be correctly predicted by the theory in terms of the archetype they resemble most. Only 7% of cases show less resemblance to their relevant archetype than to any of the other archetypes. However, my approach here is highly exploratory and provides tentative results. It is impossible to draw conclusions about the theory without making reference to the effectiveness of the control systems. I might not find that the MCSs match their predicted archetype more than any of the others, but without an assessment of the effectiveness this is little informative. To make claims about the theory therefore, it is necessary to incorporate effectiveness of control in the analyses. This will be done in the next paragraph. 5.4 Testing the MCtce In this paragraph I focus on testing the five hypotheses that stem directly from the MCtce. This involves the final step in the methodological framework: relating misfit to effectiveness. In chapter four, all cases were compared to each of the archetypes and related misfit scores were calculated. Each case thus received a score on five different misfit variables (misfit regarding arm’s length control, misfit regarding action oriented machine control, et cetera). Only one misfit score is relevant for each case and this has been determined in paragraph 5.1. However, I cannot combine these scores and test the link between misfit and effectiveness for the 96 Analyses and results complete sample. As explained in chapter two (2.3), this is inappropriate because of possible differences in maximum achievable effectiveness among the archetypes. Equifinality among the archetypes is not assumed. Therefore, one can only test the theoretical claims for subsamples and compare those cases that share the same relevant archetype34. I estimate linear regression models to test the hypotheses. Subparagraph 5.4.1 introduces the general model. Next, for each archetype the analyses and results are presented. Unfortunately, only the groups for arm’s length control and result oriented machine control are large enough to accommodate regression analysis. However, bivariate analysis and descriptive statistics shed some light on the relationships of interest within the other groups. 5.4.1 The general model The MCtce claims that resemblance of a management control system with its relevant archetype leads to effectiveness of that system. To study the relationships, I focus on lack of resemblance and measure misfit. I use a linear regression model to test the hypotheses for each of the archetypes. Equation 5.1 shows the general form. Equation 5.1 The general model effectiveness MCS = + 1*misfit + 2*effectiveness unit + 3*uncertainty + i*control variables + (i = 4, 5, ..., n) The effectiveness of the management control system is the dependent variable. Misfit regarding the relevant archetype is the primary variable of interest and expected to be negatively associated with MCS effectiveness. Apart from misfit, I take on effectiveness of the organizational unit as well as uncertainty. Effectiveness of the organizational unit functions as a control variable. I expect the two measures of effectiveness to be positively correlated. Little is known about the relationship between the two measures of effectiveness, mainly because researchers generally use organizational effectiveness as a proxy for effectiveness of control. They do not measure effectiveness of control directly, nor provide evidence for the relationship between the two variables (cf. Chenhall, 2003). The assumption that the two are positively correlated will not meet much opposition. However, making causal claims is problematic because causality 34 An alternative would be to correct for differences in effectiveness when testing the complete sample. However, differences in the relationships between misfit and effectiveness might also exist among the groups. The simplest solution therefore is to work with subsamples. It is also the most theory-consistent solution because apart from assumptions that stem directly from the theory no additional ones are made. 97 Chapter five seems to run both ways. Since the MCS is supposed to influence employee behaviour in order to accomplish organizational goals, a failing control system will likely have a negative influence on the organizational unit. At the same time, a highly effective organizational unit can invest more in its MCS to improve it. Moreover, there might be a psychological effect in rating the perceived performance of the MCS. A manager of a successful organizational unit might show overall optimism in rating the performance of both the unit and the management control system. Similarly, when the organizational unit underperforms, a bad feel about overall performance might impact the assessment of the MCS negatively. Thus, halo error might be present to some degree, which is the tendency of raters to make global assessments when they are unable to discriminate between different constructs (see Brownell, 1995, p45). Note that both effectiveness measures are influenced by many other factors, so that the relations are probabilistic rather than deterministic. For now, the effectiveness of the unit is taken in as a control variable. Paragraph 5.6 examines the relationship between the two effectiveness variables separately. Uncertainty also enters the equation. Although I have allocated cases according to their levels of the contingency variables, within each group variation still occurs. Only for uncertainty I expect a direct effect on MCS effectiveness, apart from its indirect effect through the determination of the relevant archetype. I argue that a psychological effect of uncertainty might influence the manager’s assessment of control effectiveness. This is relevant because I measure perceived effectiveness. Consider a manager of an organizational unit with activities that are highly certain, he/she will have an overview of what should be done within the unit, and can assess the outcomes of the work properly. The manager likely experiences a sense of being in control of the situation. If the MCS functions well, the manager will also be positive in his/her assessment. To the contrary, in case of high uncertainty, it might be difficult or impossible to get an overview of the work and to assess the outcomes. Therefore, even if the control system would be the best we can do (a perfect match according to the theory), the manager might not experience it to be the most effective system. He/she is always more comfortable with a control system that enables precise planning and measurement. The system in place might be the best in absolute terms, but relatively poor. Following this line of reasoning, I expect the perception of control effectiveness to be negatively influenced by uncertainty. The role of uncertainty is studied in more detail in paragraph 5.5. Finally, a number of other control variables appear in the analyses. Which ones and how many depends on the archetype being tested and will be explicated in each subparagraph. Before and during the analyses, assumptions underlying ordinary least squares regression 98 Analyses and results estimation (e.g. normality, multicollinearity, heteroskedasticity, influential observations) are checked and violations are dealt with accordingly. 5.4.2 Arm’s length control Firstly I will test the general model for those cases that have arm’s length control as their relevant archetype. Recall the first hypothesis: H1: In case of low uncertainty and moderate asset specificity, resemblance of an MCS with the archetype arm’s length control is positively associated with MCS effectiveness. The largest number of cases falls into this group: 120 organizational units have activities combining low uncertainty and low asset specificity. As a consequence several control variables can enter the analysis. Firstly, since my sample includes organizational units at different hierarchical levels, I take on the number of steps to the CEO to indicate this level. It might be that managers at low levels within the hierarchy perceive the effectiveness of the MCS differently from managers higher up the hierarchy. Moreover, differences in the relationships studied might exist because of the number and diversity of activities within units, which is probably larger at higher hierarchical levels. The number of steps up the hierarchy rather than the number of steps down is chosen because the latter only informs us about structure within the unit. Secondly, the number of employees working on the main activity enters the equation. In my model this variable represents size, which is known to influence organizational structures (Chenhall, 2003; Donaldson, 2001) and might therefore impact misfit or MCS effectiveness. I choose this variable rather than the number of unit employees for consistency. The level of analysis is the activity rather than the unit, and both the MCS and the effectiveness thereof are measured surrounding the activity. The relevant indicator of size therefore, is also the number of employees working on the activity35. Finally, I take on a dummy indicating whether the unit is part of a profit rather than a not-forprofit organization. Not-for-profit organizations make up 17% of the cases in this subsample. The theory does not pertain specifically to profit companies or not-for-profit companies, but differences in the composition of misfit might exist that could influence the relationships 35 For some cases the number of employees who work on the main activity is very small. Rerunning the analyses without these cases does not alter the results. 99 Chapter five studied. Moreover, differences in the perception of effectiveness might exist. Table 5.9 shows the descriptive statistics for the subsample. Table 5.9 Descriptive statistics: subsample arm’s length control 1 MCS effectiveness N 120 min 2.18 max 6.55 mean 4.62 std deviation 0.71 2 misfit arm’s length control 118 1.00 3.39 2.24 0.49 3 unit effectiveness 120 2.80 6.22 4.99 0.67 4 uncertainty 120 1.28 3.00 2.24 0.43 5 nr of main activity employees 120 2 3400 87 337 6 nr of steps to CEO 120 0 10 2.21 1.47 An examination of the variables leads to the following conclusions and adjustments. Uncertainty shows enough variation to be incorporated into the model, even though the sample is restricted to cases with low uncertainty. The distribution of two variables deviates substantially from the normal distribution (high skewness and kurtosis). These variables are transformed before the regression analysis is performed. I take the natural logarithm of the number of employees. Moreover, I turn the variable number of steps to the CEO into a variable with scores zero to five. Each implies the exact number of steps, except the highest score, which pertains to all cases with five or more steps to the CEO. After these changes, the assumption of normal distribution for all variables is no longer violated. When assessing univariate outliers, one case shows up with an extremely low score on MCS effectiveness. An examination of the original questionnaire and the raw data reveals neither mistakes, nor curiosities. To assess its influence, the regression analysis is run both with and without this outlier. Table 5.10 shows the correlations among the variables as they will be entered in the regression model. Uncertainty is negatively associated with both effectiveness of the unit and the number of employees who work on the main activity. The strength of these relationships is moderate. Overall the correlations between the independent variables are not strong enough to prompt any concerns for serious multicollinearity. During the regression analysis, however, I will specifically test for multicollinearity. 100 Analyses and results Table 5.10 Bivariate correlations within the group arm’s length control 1 Pearson correlations MCS effectiveness 1 1 2 3 2 misfit arm’s length control 3 unit effectiveness 0.424(**) 0.036 4 uncertainty -0.248(**) -0.055 -0.209(*) 5 nr of main activity employees 0.002 -0.147 -0.017 4 5 -0.096 -0.217(*) 6 nr of steps to CEO -0.128 0.016 0.020 -0.022 0.520 Note: N ranges from 118 to 120, * = significant at the 5% level (2-tailed), ** = significant at the 1% level (2-tailed). The correlation table also reveals significant associations between MCS effectiveness and effectiveness of the unit, and between MCS effectiveness and uncertainty, both showing the expected sign. In this bivariate analysis, misfit is unrelated to MCS effectiveness. The next step involves multivariate analysis. Regression analysis is used to test the general model. Ordinary least squares estimation returns the results as shown in table 5.11, panel A. The initial model for arm’s length control is significant and explains 25.3% of the variance in MCS effectiveness (adjusted R2 = 21.1%, N = 115). There are no indications of multicollinearity or heteroskedasticity in this model. Two cases with extremely large residual values are identified as outliers. Table 5.11 Regression models for arm’s length control Panel A – initial model B std. error T sig. 5.019 .000 -.135 -1.578 .118 1.055 .406 4.746 .000 1.057 .145 -.173 -1.990 .049 1.097 -.046 .053 -.075 -.870 .386 1.077 nr of steps to CEO* -.092 .049 -.159 -1.881 .063 1.031 dummy profit companies .146 .163 .076 .895 .373 1.051 (intercept) 3.781 .753 misfit with arm’s length control -.196 .124 effectiveness of the unit .430 .091 uncertainty -.288 nr of main activity employees** Beta 2 VIF 2 R = 25.3%, adjusted R = 21.1%, N = 115 Panel B – base model B std. error T sig. 6.769 .000 -.146 -1.824 .071 1.005 .384 4.625 .000 1.078 .125 -.270 -3.254 .002 1.078 .042 -.099 -1.229 .222 1.003 (intercept) 4.180 .618 misfit with arm’s length control -.193 .106 .389 .084 uncertainty -.407 nr of steps to CEO* -.052 effectiveness of the unit Beta 2 2 VIF R = 29.5%, adjusted R = 27%, N = 115 Note: the number of cases is equal for both models. However, these are not all the same cases. * this variable has been transformed. Its scaling runs from zero to five and indicates the number of steps to the CEO, except for the highest score obtained by cases with 5 steps or more. ** the natural log of this number is used 101 Chapter five As expected, the effectiveness of the organizational unit positively correlates with the effectiveness of the MCS. This variable accounts for most of the variance explained. Moreover, misfit shows a negative effect on MCS effectiveness, which is significant at the 12% level (p = 0.118)36. Furthermore, uncertainty shows a significant negative coefficient. Regarding the control variables only the number of steps to the CEO seems to be relevant. A negative effect of this variable on MCS effectiveness emerges from the analysis, which implies that managers at lower hierarchical levels indicate a poorer perceived performance of their MCSs. An explanation might be that they have less influence in the design of the MCS than managers in positions higher up the hierarchy. Higher involvement might lead to a higher appreciation. The other control variables do not have an impact on MCS effectiveness. As a second step, I run a trimmed model leaving out insignificant control variables. Moreover, the two cases that are identified as outliers (because of their extreme residual values) are removed from the analysis as well. One of them is the same case already identified as an outlier in the univariate analysis. This case turned out to be extremely influential in the regression model, especially in the relationship between misfit and MCS effectiveness. Combining trimming and outlier deletion results in the base model as shown in table 5.11, panel B. The model shows an overall improvement to the initial model. The model better fits the data and the variance explained in MCS effectiveness increases by about 4% (R2 = 29.5%). Again there are no indications of multicollinearity or heteroskedasticity. For effectiveness of the unit and for uncertainty, results remain the same and show lower levels of significance. The result for the misfit variable improves and becomes significant at the 7% level of confidence (p = 0.071). The direction and the strength of the effect remain the same. The effect of the number of steps to the CEO disappears and becomes insignificant in the base model. Together the findings from the initial and the base model support the first hypothesis and also return the expected results for the variables effectiveness of the unit and uncertainty. As discussed in the previous paragraph, when allocating cases according to their relevant archetype, I use arbitrary boundaries. Some cases are close to the boundary and would fall into different categories when the boundary changed only a little. Therefore, it is relevant to test whether choosing different boundaries would render different findings. I perform robustness checks on the results for arm’s length control adjusting the boundaries of respectively asset specificity and uncertainty. After each adjustment, cases are again allocated 36 It would have been appropriate here to perform one-tailed tests. The p-value for misfit then drops to 0.06. Testing one-tailed is appropriate when strong indications about the direction of the effect exist (Newbold, 1995). 102 Analyses and results according to their relevant archetype. Next, the analyses are run again for the new samples of cases that belong to arm’s length control. For the robustness checks I use the base model without outliers and insignificant control variables37. Firstly, two models with respectively a higher (3.2) and a lower (2.8) boundary for asset specificity are estimated. These values are chosen based on visual inspection of the scatter plot shown in figure 5.3 panel A. An adjustment should change the sample to incorporate cases that are close to the boundary, but should not involve too large a shift of that boundary. I do not want to test a different strategy in boundary setting, but merely test whether the choices I have made are sensitive to small changes. The number of cases for samples with the higher and lower boundaries of asset specificity is 140 and 89 respectively. Using these samples again in the regression analysis does not render different results. Next, I repeat the exercise adjusting the boundary for uncertainty up (3.2) and down (2.8). The number of cases in these models is 127 and 104 respectively. Based on these extra analyses, the overall conclusions do not change and therefore I conclude that the previous findings are robust to changes in boundary decisions. In these models only the effect for the variable ‘number of steps to the CEO’ changes. In some cases the effect is significant and negative, in others it is not significant. Another relevant question to address is whether or not resemblance with the archetype arm’s length control is always beneficial, regardless of the levels of the contingency variables. In that case the theory would not be supported (or only partly), because the effects of resemblance with arm’s length control would not be contingent on the characteristics of the activities. To answer this question, I test the relationships between misfit with arm’s length control and MCS effectiveness for the group of cases experiencing high uncertainty (N = 39). In this subsample misfit is not related to the effectiveness variable (a null result). Also for the group of cases that experience high levels of asset specificity, misfit is unrelated to MCS effectiveness. Thus the effect of misfit regarding arm’s length control pertains only to those observations for which theory predicts that it would. Under the specific circumstances of low uncertainty and low asset specificity only, MCSs that resemble arm’s length control are more effective than other governance structures. 37 When I use the initial model for the robustness checks, in two cases the result for misfit becomes less significant. The worst case scenario returns a p-value of 0.17, two-tailed. The two outliers identified earlier are responsible for this decline. However, most analyses return results that are significant at lower confidence levels and the sign of the effect is negative in all cases. Therefore, I conclude that overall the results are robust. 103 Chapter five In sum, I find evidence that misfit regarding the archetype arm’s length control has a negative impact on the perceived effectiveness of the control system. This effect is confined to the group of cases that experience low levels of both uncertainty and asset specificity. The results are robust to shifts of the boundaries and hypothesis H1 is being supported. 5.4.3 Result Oriented Machine Control The second hypothesis pertains to machine control of the result oriented type: H2: In case of low uncertainty, high asset specificity, and high measurability of outputs, resemblance of an MCS with the archetype result oriented machine control is positively associated with MCS effectiveness. This group holds 66 observations, which is enough to accommodate regression analysis. Because of the limited number of cases, not all control variables can enter the model. Table 5.12 shows the descriptive statistics. The share of not-for-profit organizations within this subsample is much smaller than in the overall sample (14% versus 20%). Table 5.12 Descriptive statistics: subsample result oriented machine control 1 MCS effectiveness N 66 min 3.00 max 6.09 mean 4.60 std deviation 0.81 2 misfit result oriented machine control 63 1.00 3.62 2.46 0.66 3 unit effectiveness 66 3.50 6.50 5.10 0.70 4 uncertainty 66 1.29 2.79 2.09 0.36 5 nr of main activity employees 66 4 3200 137 439 6 nr of step to the CEO 66 1 8 2.58 1.38 An examination of the variables leads to the following conclusions: uncertainty shows enough variation to be incorporated into the model, even though the sample is restricted to cases with low uncertainty. Overall the scores on misfit regarding result oriented machine control are relatively low. The maximum value of this variable is 3.62 on a five point scale. However, previous tests (see 5.3) showed that there is no significant difference between the average misfit score of this subsample and all other cases. The distribution of variables does not deviate substantially from the normal distribution (skewness and kurtosis values do not exceed the generally accepted levels). Finally, no univariate outliers are identified. Table 5.13 shows the correlations among the variables as they will be entered in the regression model. Uncertainty is negatively associated with effectiveness of the unit. The 104 Analyses and results correlation between both variables is quite strong (r = -0.503, p = 0.000). Furthermore, uncertainty is positively associated with misfit. The strength of this relationship is moderate (r = 0.376, p = 0.002). Misfit is not related to any of the effectiveness measures. Overall, some of the correlations between the independent variables are quite strong and multicollinearity might disturb the regression analysis. Table 5.13 Bivariate correlations within the group result oriented machine control 1 Pearson correlations MCS effectiveness 2 misfit result machine control 3 unit effectiveness 1 1 2 3 -0.075 0.532(**) -0.177 4 uncertainty -0.295(*) 0.376(**) -0.503(**) Note: N ranges from 63 to 66, * = significant at the 5% level (2-tailed), ** = significant at the 1% level (2tailed). Using the general model I test the relationship between misfit regarding result oriented machine control and MCS effectiveness. Apart from MCS effectiveness and misfit, I take on effectiveness of the unit, uncertainty, and the number of employees who work on the main activity38. Table 5.14 shows the results. The model is significant and explains 28.8% of the variance in MCS effectiveness (adjusted R2 = 23.9%, N = 63). Table 5.14 Model for result oriented machine control B 1.443 std. error 1.258 Beta misfit with result machine control .046 .152 effectiveness of the unit . 606 .150 uncertainty -.062 nr of main activity employees* .023 (intercept) T 1.147 sig. .256 VIF .037 .302 .764 1.203 .523 4.049 .000 1.359 .319 -.027 -.193 .848 1.538 .065 .041 .359 .721 1.057 2 2 R = 28.8%, adjusted R = 23.9%, N = 63 * the natural log of this number is used The results show that effectiveness of the organizational unit has a positive influence on MCS effectiveness. Moreover, neither misfit nor uncertainty has an effect on MCS effectiveness. Hypotheses H2 is not being supported. Mild multicollinearity seems to distort the findings for uncertainty. The variance inflation factor (VIF) remains below the threshold level of two, but the condition index indicates possible collinearity problems between effectiveness of the unit and uncertainty (ci = 36.557). 38 As before, some cases have a very small number of employees who work on the main activity. Rerunning the analyses without these cases does not alter the results. 105 Chapter five When exploring correlations among the variables (table 5.13), I had already found a fairly large negative correlation between effectiveness of the unit and uncertainty. The table also shows that uncertainty and MCS effectiveness are negatively correlated. In a bivariate test I find a negative effect of uncertainty on MCS effectiveness as expected, but I cannot test for it in the multivariate model. The regression model for result oriented machine control also suffers from mild heteroskedasticity, an indication that another variable that explains MCS effectiveness is missing from the model. Again, I perform robustness checks on these results. Adjusting the boundary for uncertainty downward (level = 2.8) does not alter the original sample. Adjusting the boundary upward to 3.2 adds one case to the sample. This is a result of the boundary put on measurability of outputs. Both types of machine control have activities that exhibit low uncertainty and high asset specificity. To distinguish between the two types, I chose measurability of outputs as a fourth contingency variable. This variable represents a form of uncertainty and is as such also a part of that construct. Within the group of low uncertainty, the variable measurability of outputs only gets three scores: 3, 4, or 5 (on a five point scale). Those cases with scores 4 and 5 are characterized by highly measurable outputs and thus a lower uncertainty. Those cases belong to the initial sample for result oriented machine control. For the robustness check, I can adjust the boundary for measurability of outputs downwards (level > 2). This returns a sample of 85 cases, which contains all cases for the two types of machine control combined. Rerunning the regression analysis with this subsample does not alter the conclusions. The coefficient of misfit is insignificant once more. The next paragraph (5.4.4) will look into the differences between action and result oriented machine control in more detail. Adjusting the boundary for asset specificity returns samples with 49 cases (higher boundary of 3.2) and 86 cases (lower boundary of 2.8) respectively. The results of the regression analysis reported above are also robust to these changes. The effect of misfit regarding result oriented machine control is not significant. Other results do not change either. All in all, I find no evidence of a negative effect of misfit regarding result oriented machine control for MCS effectiveness. This null result is robust to changes in the boundaries that determine the relevant archetype. Hypothesis H2 is not supported. Up to this point the analysis focused on identifying a linear relationship between misfit and effectiveness for which no evidence has been found. As discussed in chapter two, however, the form of the relationship between misfit and effectiveness of control remains unclear from the theory. Therefore, other types of relationships might exist between the two variables. In 106 Analyses and results this final part of this paragraph I will explore the issue for the subsample result oriented machine control. Instead of assuming that each increase in misfit has an effect on MCS performance, it might be the case that only large deviations have an effect on effectiveness, whereas small deviations do not impact performance directly. This relates to the complementarities between control instruments that together constitute the packages of control (see 2.1.2). In case of strong interrelationships minor misfits might impact control effectiveness directly. However, if the complementarities are less strong, minor misfits might not cause control problems. This would imply some sort of threshold value of misfit. To explore whether I can find some evidence for this idea of a threshold misfit level, I compare two groups of cases within the subsample result oriented machine control. The first group contains cases that resemble the archetype the most. The misfit value of these cases lies below the mean minus one standard deviation. The other group contains cases that are furthest away from the archetype. Their misfit value lies above the mean plus one standard deviation. I would expect the average effectiveness of the control system to differ between the groups, and be larger within the first group that resembles the archetype. However, when comparing the average MCS effectiveness between the two groups I find no significant difference. Once more, I conclude that misfit does not impact MCS effectiveness for the cases that belong to result oriented machine control. This initial analysis does not support the idea of a threshold level of misfit. However, the threshold level might not be reached within this sample. The descriptive statistics (table 5.12) show that the maximum misfit score is 3.62 on a five point scale. There are no cases within the sample that show very large deviations from the archetype. Larger samples would provide more possibilities further to research the issue in future research projects. 5.4.4 Action Oriented Machine Control To test the third hypothesis, I study cases that belong to the group action oriented machine control. Only 19 cases fall into this group. Unfortunately, this number is too small to accommodate regression analysis39 and I will stick to descriptive statistics and nonparametric bivariate analyses. The third hypothesis states: H3: In case of low uncertainty, high asset specificity, and low measurability of outputs, resemblance of an MCS with the archetype action oriented machine control is positively associated with MCS effectiveness. 39 As a threshold level of number of observations per independent variable, Hair et al. (1998; p166) state the ratio 5 to 1. For each independent variable one should have at least five observations. However, small samples (less than 20 observations) are only appropriate for simple regression analysis (p164). 107 Chapter five To learn about this subgroup, firstly I look at descriptive statistics. Next, correlations provide a feel for the interrelationships among the variables of interest. Table 5.15 shows descriptive statistics for the subsample. Table 5.15 Descriptive statistics: subsample action oriented machine control 1 MCS effectiveness N 19 min 2.64 max 6.00 mean 4.36 std deviation 0.85 2 misfit action oriented machine control 18 2.48 3.73 3.03 0.39 3 unit effectiveness 19 2.90 6.40 5.13 0.80 4 uncertainty 19 1.70 2.15 1.95 0.13 5 nr of main activity employees 19 4 500 39 113 6 nr of steps to CEO 19 0 4 1.79 1.03 The ratio of profit versus not-for-profit organizations is similar to that in the overall sample (21% not-for-profit in the subsample versus 20% overall). All variables show enough variation in scores to enable bivariate nonparametric tests. When comparing the subsamples previously (5.3), tests showed that the average misfit score regarding action oriented machine control does not differ between this subgroup and all other cases. However, those cases that resemble the archetype the most closely are no part of this subsample. In the overall sample the minimal misfit score is 1.51, within this subsample it is 2.48 (recall that the scale runs from 1 to 5). This might render less power to the test for effects of misfit on effectiveness because I cannot compare these cases to cases that show a relatively large fit. To study the relationships of interest, I calculate correlations between MCS effectiveness, misfit regarding action oriented machine control, effectiveness of the unit, and uncertainty. Because of the small sample size, nonparametric tests are appropriate. These tests are better suited to handle smaller samples, because they are insensitive to outliers (as opposed to Pearson correlations, for instance) and valid for all population distributions. No underlying assumptions about the population distribution are made. As a general rule, Newbold (1995) states that in bivariate analysis samples that holds less than 30 observations are denoted small. Here I use Spearman’s Rho to provide evidence on the associations among the variables. This measure is valid for samples of 5 – 30 observations (Newbold, 1995). Its interpretation is similar to the Pearson correlation coefficient. Table 5.16 shows the results. 108 Analyses and results Table 5.16 Bivariate correlations within the group action oriented machine control Spearman’s Rho 1 misfit action oriented machine control unit effectiveness uncertainty 0.243 0.601(**) -0.384 (^) MCS effectiveness N ranges from 18 to 19, ^ = significant at the 10% level (2-tailed), ** = significant at the 1% level (2tailed). Note: other bivariate correlations among the four variables are not significant (not reported) The relationships among MCS effectiveness, unit effectiveness, and uncertainty are as expected. MCS effectiveness and effectiveness of the organizational unit correlate positively (p = 0.007), whereas MCS effectiveness and uncertainty are negatively correlated. This result is significant at the 10% confidence level. There is no relationship between misfit and effectiveness. However, within small samples, only strong effects among variables will be picked up. Testing for other possible bivariate correlations between the four variables returns no significant results. To test whether these results are robust to changes in the boundaries that determine the relevant archetype, I rerun the analysis twice with adjusted boundaries for uncertainty and asset specificity respectively. When determining which cases belong to action oriented machine control I set the boundary for uncertainty at 3.2 (N = 28). Next, I set the boundary for asset specificity at 2.8 (N = 25). Since the sample is already small, I can only adjust the boundaries such that the sample expands. For both situations, the results for the relationships between MCS effectiveness, effectiveness of the unit, and uncertainty are stable. However, the correlation between misfit and MCS effectiveness becomes positive (r = 0.345; p = 0.078 (higher uncertainty boundary); r = 0.366; p = 0.079 (lower asset specificity boundary). The first result is thus unstable. In a larger sample, misfit regarding action oriented machine control positively affects MCS performance. This is contrary to the expectations and contrary to hypothesis H3. Finally, I adjust the boundary for measurability of outputs upwards to incorporate values 4 or less. This has a large impact on sample size (N = 57). In this sample I once more find a significant positive effect of misfit on MCS effectiveness (r = 0.404, p = 0.003). Within the group of cases that have action oriented machine control as their relevant archetype, I find evidence for a positive relationship between misfit and effectiveness. The less the MCS resembles the archetype, the more effective it will be. This contradicts hypothesis H3. The result appears in all samples studied as part of the robustness check. Only in the initial sample I found no significant result, but that sample is very small and therefore less likely to pick up any relationships. 109 Chapter five An interlude: action oriented versus result oriented machine control The MCtce indicates that both types of machine control are effective in circumstances of low uncertainty and high asset specificity. To distinguish between the two types, I chose measurability of outputs as an extra contingency variable. This variable represents a form of uncertainty and is as such also part of that construct. Within the group of low uncertainty, the variable measurability of outputs only gets three discrete scores: 3, 4, or 5 (on a five point scale). Therefore, the division of cases into two groups is rather crude. I might not be able to distinguish between the two types in a way that best fits the theoretical ideas. Using other alternatives might have rendered different results. To explore the impact of this decision I run extra analyses: I combine the groups action oriented machine control and result oriented machine control and test for the effect of a misfit variable that takes on the score of the lowest of both misfit variables for each case. I thus compare for each case the scores on both misfit regarding action oriented and result oriented machine control, and take the lowest. This variable will be referred to as misfit regarding machine control. This fits the theoretical idea that the choice between the two types of machine control is a forced choice (Speklé, 2004) because the situation will enable the one or the other form. The most suitable form will occur, which is reflected in a lower misfit. Table 5.17 shows how the two types of machine control make up this new variable and also shows the initial group for each case. About 75% (N = 61) of the cases shows most resemblance with the result oriented type of machine control. The other cases resemble the action oriented type more closely (N = 20). The table also reveals that out of these 61 cases that resemble the result oriented type more than the action oriented type, 51 indeed belong to the initial result oriented machine control group. Focusing on the minimal misfit thus renders a division of the machine control cases that differs from my division based on measurability of outputs. Table 5.17 The initial groups versus the groups based on minimum misfit misfit regarding machine control: action oriented result oriented total archetype: action oriented 8 10 result oriented 12 51 total 20 61 18 63 81 Using the new misfit variable, I estimate the general regression model once more. The results show that the misfit variable is unrelated to MCS effectiveness (p = 0.923). This different approach does not render new insights regarding the effects of misfit on effectiveness. 110 Analyses and results Finally, one analysis is left that might provide a better test of the relationships under study. There is agreement between the two approaches for 51 cases regarding machine control. Therefore I can also test the regression model for this group. This renders the same results as before for result oriented machine control: there is no relationship between misfit and effectiveness. In sum, the results I have found for both types of machine control are robust to decisions regarding the contingency variables. 5.4.5 Exploratory control The number of cases that falls into the group exploratory control is 24. Once more, the number is too small to accommodate regression analysis. Hypothesis H4 relates to this group: H4: In case of high uncertainty and low ex-post information asymmetry, resemblance of an MCS with the archetype exploratory control is positively associated with MCS effectiveness. To learn about this subgroup, I look at descriptive statistics and calculate correlations among the variables of interest. Table 5.18 shows descriptive statistics for the subsample. Table 5.18 Descriptive statistics: subsample exploratory control 1 MCS effectiveness N 24 min 2.64 max 5.18 mean 3.78 std deviation 0.78 2 misfit exploratory control 22 1.84 3.41 2.70 0.47 3 unit effectiveness 24 2.88 6.33 4.69 0.83 4 uncertainty 24 3.03 3.94 3.29 0.28 5 nr of main activity employees 24 2 500 39 99 6 nr of steps to CEO 24 0 5 2.17 1.09 The share of not-for-profit organizations exceeds that of the overall sample (38% not-forprofit in the subsample versus 20% overall). Previously when comparing this subsample to all other cases (5.3), I found that within the subgroup, units score on average significantly lower on misfit regarding exploratory control. To get insights into the bivariate relationships, I calculate correlation coefficients. As before, because of the sample size, I rely on nonparametric tests. Table 5.19 shows various significant relationships. Firstly, a positive relationship between MCS effectiveness and the effectiveness 111 Chapter five of the unit exists. This effect is significant at the 1% level. Within the other subsamples this relationship also emerged. Contrary to the theoretical expectations however, a significant positive relationship exists between misfit and MCS effectiveness (p = 0.10). Within this subsample, cases that resemble the archetype exploratory control more closely are less effective than those that do not. This contradicts hypothesis H4. Table 5.19 Bivariate correlations within the group exploratory control 1 Spearman’s Rho MCS effectiveness 1 1 2 misfit exploratory control 0.354 (^) 3 unit effectiveness 0.549(**) 2 3 0.367 (^) 4 uncertainty 0.141 0.406 (^) 0.278 Note: N ranges from 22 to 24, ^ = significant at the 10% level (2-tailed), ** = significant at the 1% level (2-tailed). Table 5.19 also reveals that there is no relationship between MCS effectiveness and uncertainty, nor is uncertainty related to unit effectiveness. However, the variable is positively associated with misfit regarding exploratory control (p = 0.06). This result will be discussed in paragraph 5.5, which specifically studies the relationships between the misfit variables and the contingency variables. Robustness checks show that the results are stable for the relationships among MCS effectiveness, effectiveness of the unit, and uncertainty. The result for misfit, however, is unstable. There are two possibilities for adjusting the boundaries: lower values of uncertainty are taken in (level = 2.8) or higher values of ex-post information asymmetry (level = 4.2). Adjusting the boundaries in the opposite directions returns either no differences in sample, or too small a sample. Lowering the boundary for uncertainty returns a sample with 40 cases. The positive effect of misfit on MCS effectiveness shows up again (r = 0.305; p = 0.062). When I adjust the boundary of information asymmetry upwards, however, I find no relationship between misfit and MCS effectiveness (N = 31). Within the group of cases that have exploratory control as their relevant archetype, I find some evidence for a positive relationship between misfit and effectiveness and some evidence that suggests there is no relationship between the variables. The effect is sensitive to decisions regarding the level of ex-post information asymmetry. When I incorporate cases with higher levels of information asymmetry, I find no influence of misfit on MCS effectiveness. However, overall I have found no evidence that meets the theoretical expectations. Hypothesis H4 is not supported. 112 Analyses and results 5.4.6 Boundary control The smallest number of cases belongs to the group boundary control. These cases experience high levels of uncertainty as well as high levels of ex-post information asymmetry. Hypothesis H5 was formulated for this group: H5: In case of high uncertainty and high ex-post information asymmetry, resemblance of an MCS with the archetype boundary control is positively associated with MCS effectiveness. Again, I confine myself to review the descriptive statistics and the correlations between the variables of interest. Table 5.20 shows descriptive statistics. Table 5.20 Descriptive statistics: subsample boundary control 1 MCS effectiveness N 15 min 2.73 max 6.18 mean 4.35 std deviation 0.88 2 misfit boundary control 14 2.35 3.90 3.06 0.42 3 unit effectiveness 15 4.20 6.20 5.18 0.50 4 uncertainty 15 3.04 4.05 3.42 0.36 5 nr of main activity employees 15 2 350 37 88 6 nr of steps to CEO 15 0 5 2.00 1.25 When comparing the share of not-for-profit organizations within the subsample to the overall sample (5.2), I found their share is much larger (40% versus 20%). Comparing the average misfit regarding boundary control of cases within this subsample to all other cases, I found no significant differences. As was the case for action oriented machine control, this sample does not contain cases with small misfits either. Table 5.21 reports the correlation coefficients among the variables MCS effectiveness, effectiveness of the organizational unit, uncertainty, and misfit regarding the archetype boundary control. Within the group boundary control, there are no significant relationships between MCS effectiveness and effectiveness of the unit, nor between MCS effectiveness and uncertainty. Misfit is also unrelated to MCS effectiveness. Note however, the same reservations made previously for drawing conclusions from small samples are still valid. Only large effects will be picked up within such a small sample. Of all other possible bivariate correlations only the association between misfit and effectiveness of the unit is significant. Both variables are positively correlated (p = 0.01). The theory does not inform us about this relationship. 113 Chapter five Table 5.21 Bivariate correlations within the group boundary control 1 Spearman’s Rho MCS effectiveness 1 1 2 misfit boundary control 0.303 3 unit effectiveness -0.050 4 uncertainty 0.007 Note: N ranges from 14 to 15, * = significant at the 5% level (2-tailed) 2 3 0.657(*) -0.121 -0.020 Once more, I perform robustness checks on these results. Testing the relationships for a sample with a lower uncertainty boundary (level = 2.05) and a lower boundary for information asymmetry (level = 3.5) confirms the results. Misfit and MCS effectiveness are unrelated. Sample sizes remain very small, however (N = 20 and N = 23 respectively). All in all, I find no evidence for a negative correlation between misfit regarding boundary control and MCS effectiveness. Hypothesis H5 is not being supported. 5.5 Misfit and the contingency variables Apart from the results that relate directly to the hypotheses, other results deserve our attention. An issue left unexplored so far, is the relationship between the contingency variables and misfit. Studying this relationship is beneficial for several reasons. It sheds light on the underlying assumptions part of the configurational approach, and helps to interpret the previous findings. Moreover, it can again provide insights into the predictive power of the theory. As discussed in chapter two (2.3), the contingency approach and the configurational approach have different expectations about the relationship between contingency variables and structure variables like the MCSs. In this paragraph I take on an exploratory approach to study this relationship. Figure 5.4 shows that two steps exist in the theory: contingency variables determine the relevant archetype, and misfit regarding the relevant archetype relates to MCS effectiveness. However, the contingency variables might also affect misfit. Figure 5.4 A two-step analysis contingency variables (UNC, AS, IA) I relevant archetype and misfit misfit 114 II MCS effectiveness Analyses and results As an initial step in this exploration of the relationships between contingency variables and misfit variables, I calculate correlations between both types of variables for the complete sample. Based on the MCtce it is expected that in case of rising asset specificity, for instance, cases would resemble the two types of machine control more closely, than they would resemble arm’s length control. Therefore, a positive correlation between misfit regarding arm’s length control and asset specificity might exist. Correlation tables reveal that both asset specificity and information asymmetry do not correlate with any of the misfit variables. Uncertainty, however, correlates with all of them (see table 5.22). Moreover, the direction of the correlations are as the theory would predict: in case of rising uncertainty MCSs move away from arm’s length control and result oriented machine control, and towards exploratory and boundary control. Table 5.22 Correlations between uncertainty and the misfit variables Pearson correlations 1 uncertainty misfit ALC misfit MR misfit MA misfit EC misfit BC 0.170(**) 0.244(**) -0.234(**) -0.344(**) -0.387(**) N ranges from 241 to 246, ** = significant at the 1% level (2-tailed). Note: there are no significant relationships between asset specificity and the misfit variables or ex-post information asymmetry and the misfit variables. Of course, this is a very crude way of analysing the situation because I do not take into account the interactions between the contingency variables. Nevertheless, some insights are gained. Apart from studying whether changes in misfit occur with changes in the contingency variables, it is interesting to assess how the changes take place. As discussed in chapter two (2.3) different views exist. The first view is that a change in the contingency variables leads to an adjustment of the MCS and a general tendency to move towards an effective situation. Gradual adjustments are possible and a fit-line exists (instead a discrete number of fit states). This is the general way of thinking within contingency theory (Donaldson, 2001). In this view fit will be re-established through gradual change. An opposed view stems from configurational thinking and is that the MCS does not change gradually to fit the changing circumstances, but change occurs as a quantum jump (Gerdin and Greve, 2004). Within configurational thinking only few states of fit are possible, represented by the archetypes. Organizational units cannot adjust their MCS gradually because of the complementarities between control instruments. A small change will disrupt the complete system, and therefore change is only beneficial if it involves the system as a whole. A one-on-one relationship between the contingency variables and misfit is thus not assumed. 115 Chapter five The correlation coefficients calculated above cannot shed light on this issue, because significant correlations could show up in both cases. To further explore the interrelationships between the contingency variables and misfit, I run a different analysis. I regress misfit on uncertainty, asset specificity, and their interaction for the two largest subsamples of arm’s length control and result oriented machine control. The interaction term enters the model because the theory pertains to joint effects of the contingency variables, rather than their individual effects, like standard Transaction Cost Economics (Boerner and Macher, 2001; David and Han, 2004)40. The product of uncertainty and asset specificity forms the interaction term. To prevent problems of multicollinearity, I centred (mean adjusted) the scores on uncertainty and asset specificity before calculating the multiplicative term (cf. Aiken and West, 1991). For arm’s length control, the model is not significant (N = 118). There is no relationship between the contingency variables and misfit regarding arm’s length control. An exploration of scatter plots that confronts misfit and asset specificity as well as misfit and uncertainty, does not reveal any other (non-linear) relationships between the variables. Combining this result with the results found earlier for the relationship between uncertainty and MCS effectiveness, paints a more complete picture. Recall that within this subsample a direct negative effect of uncertainty on MCS effectiveness exists (see 5.4.2). Thus an increase in uncertainty – within the group of cases with low uncertainty- does not influence misfit, but does have a negative effect on the perceived effectiveness of control. This result indirectly supports configurational thinking: with rising uncertainty organizational units experience a decrease in perceived effectiveness of the control system, but apparently they do not change their control systems accordingly. Adjustments might only be possible to a certain extent. Adjusting gradually is impossible and further change would involve a quantum jump. Since quantum jumps involve large changes to all parts of the control system simultaneously, these will not occur until the contingencies have changed enough to warrant such a large investment. This would explain why uncertainty does have a negative impact on perceived effectiveness of the control system, but not on misfit. However, I have also argued (5.4.1) that a psychological effect of uncertainty might exist that negatively influences the perceived performance directly. This is an alternative explanation for the negative effect of uncertainty on MCS effectiveness as shown in the regression. Unfortunately, I am not able to distinguish between these two effects of uncertainty. 40 Note that in the analyses in paragraph 5.4 in which the hypotheses are tested, the interaction between the contingency variables are automatically part of the analysis. The combination of the contingency variables determines the relevant archetype. 116 Analyses and results Therefore the total effect of uncertainty might represent just one explanation or combine both. At this point, too little is known about these complex relationships to enable rigorous testing. I repeat the exercise for the subsample result oriented machine control. A model explaining misfit from uncertainty, asset specificity, and their interaction is significant and explains 14.5% of the variance in the misfit variable (adj. R2 = 0.10, N = 63). Table 5.23 shows the results. Again before running the regression analysis variables were checked for normality, which was acceptable, and for univariate outliers of which none were found. Moreover, there are no indications of multicollinearity or heteroskedasticity for this model. Table 5.23 Model for misfit result oriented machine control B .438 std. error 1.057 Beta (intercept) T .415 sig. .680 uncertainty .689 .229 .369 3.006 .004 asset specificity .167 .274 .074 .608 .545 interaction uncertainty/asset specificity -.058 .076 -.094 -.766 .447 2 2 R = 14.5%, adjusted R = 10%, N = 63 The evidence from the analysis of this subsample reveals a completely different picture. Asset specificity does not have an influence on misfit, nor does the interaction term. Uncertainty, however, is positively associated with misfit regarding result oriented machine control. An increase in uncertainty leads to an increased misfit. Again I will combine this result with the results from the regression model tested earlier (5.4.3). In that model I found no effect of misfit on MCS effectiveness. This result seems to support contingency thinking: gradual change might occur in response to changing circumstances and therefore misfit does not affect effectiveness. If this were true, the misfit that I measure (misfit regarding the archetype) is not a true misfit, but represents a fit situation because the level of the contingency variable and the MCS match. The contingency approach assumes that multiple states of fit exist and that there is a fit-line along which companies can move. The direct impact of uncertainty on performance could not be assessed because of multicollinearity, but both variables are significantly negatively correlated (see 5.4.3). Again, it is impossible to distinguish between the different effects of uncertainty on performance. The evidence suggests that in some cases the contingency view of fit is appropriate whereas in other situations configurational forms of fit exist. In terms of arm’s length control versus result oriented machine control, this implies that complementarities among the control instruments might be stronger within arm’s length control. Working with a control structure 117 Chapter five that resembles result oriented machine control might offer more possibilities to adjust without disrupting the complete system. This exploration of the effects of the contingency variables on misfit, highlights a few interesting points. First of all, uncertainty seems to be the most influential contingency variable in my study and its overall role is complex. For the complete sample the effect of uncertainty accords with theoretical expectations from the MCtce: when uncertainty increases organizational units move away from certain archetypes towards others as predicted. A second point of interest is that the other contingency variables do not play a role in any of the analyses. Finally, my exploration reveals that the relationship between uncertainty and misfit differs between the subsample arm’s length control and machine control. This indicates a possible difference in strength in complementarities between both archetypes. 5.6 Effectiveness of control and organizational effectiveness Hitherto I have studied several related topics regarding management control system design and effectiveness, apart from testing specific hypotheses related to the MCtce. This paragraph also covers a related theme. In this study I have measured perceived effectiveness of both the management control system and the organizational unit. I have also concluded earlier that researchers generally measure organizational effectiveness and use it as a proxy for effectiveness of control rather than measuring control effectiveness directly (cf. Chenhall, 2003). The proxy might be rather crude, because of the many other possible influences on organizational performance apart from the functioning of the control system. The relationship between both is implied rather than demonstrated empirically. Consequently, little is known about the effects of the workings of the MCS on the organization. In this paragraph the relationship between the two variables is explored. I have argued that the two variables are likely positively correlated and that causal relationships might exist between them, the direction of which is unclear (see 5.4.1). In the overall sample, the effectiveness of the unit positively correlates with MCS effectiveness (r = 0.472, p = 0.01). The strength of this relationship varies among the different subsamples related to the archetypes. Table 5.24 shows the correlation between both effectiveness measures for the overall sample and for each of the subsamples. 118 Analyses and results Table 5.24 Correlation coefficients: MCS effectiveness and unit effectiveness 1 complete sample N 258 correlation 0.472 sig. 0.01 2 subsample arm’s length control 3 subsample action oriented machine control 120 0.424 0.00 19 0.601* 4 0.00 subsample result oriented machine control 66 0.532 5 0.00 subsample exploratory control 24 0.549* 0.00 6 subsample boundary control * nonparametric test; Spearman’s Rho 15 -0.050* 0.86 In all but one sample MCS effectiveness and effectiveness of the organizational unit are positively correlated. Their relationship is quite strong with correlation coefficients ranging from 0.42 to 0.60. Within the group of cases that belong to boundary control, the variables are unrelated. There are two possible explanations for the apparent differences in strength among the groups. One explanation is that other variables are involved that have an influence on one of the effectiveness variables, but not the other within the subsamples. Another possibility would be that the MCS measured only pertains to a small part of the unit. Multiple activities are executed within one unit and I only measure the MCS of one of them. If differences among activities exist regarding the control system, the direct link between the MCS mapped and the effectiveness of the unit might be smaller. To the contrary, if the control system pertains to a large part of the organizational unit or even to the complete unit, the relationship might be stronger. Apart from these general ideas about the differences in the relationship between MCS effectiveness and effectiveness of the organizational unit among the subsamples, another possible explanation exists for the boundary control cases. As mentioned before, only very strong effects would be picked up in small samples and the subsample boundary control only contains 15 cases. However, in all the other samples the effects are very strong. Therefore, I suggest another explanation that relates to the specific essence of this control structure. The MCtce submits that the archetypes are each effective given the circumstances, but they also represent the best we can do and boundary control is characterized as “a control structure of last resort” (Speklé, 2001a). If managers indeed perceive this control structure as such to be less effective than other forms of control, this might explain that lack of correlation between perceived MCS effectiveness and perceived unit effectiveness. Whether the performance of the unit is either good or bad, the performance of the MCS is always perceived to be relatively poor. As a consequence no direct link between both is observed. 119 Chapter five From these findings I conclude that using organizational effectiveness as a proxy for control effectiveness is rather crude, and leaves out a lot of information. Focusing on effectiveness of control directly will render more insights into the workings of control systems. It will be interesting to study the causal relationship between the two effectiveness variables more extensively in future research. 5.7 Summary of findings This chapter discussed the final steps of the methodological framework and showed how I have determined the relevant archetype for each observation in the sample. This returned five subsamples, which I firstly compared on general characteristics. Next, I described the testing of the hypotheses. Two out of five hypotheses could be tested by means of regression analysis. The others could not be tested this way, but some insights were gained on the relationships between the variables of interest through bivariate analyses. Finally, three topics were addressed in an exploratory way: the occurrence of (mis)fit, the relationships between the contingency variables and misfit, and the relationship between MCS effectiveness and effectiveness of the organizational unit. In this paragraph I will summarize the results. The next chapter presents the overall conclusions of this research project, reflections on the findings, and a discussion. The allocation of cases to each of the five subsamples returned distinct groups that differ in terms of the characteristics of their activities. This is important to assure correspondence between the theoretical and empirical levels of analysis. Because of the variety in activities within the overall sample, I started by comparing these groups on general characteristics to learn more about them. The comparison highlighted some differences but did not return distinct profiles. However, the findings were not counterintuitive either and stressed the importance of focusing on activities instead of on unit types. As a second step in exploring differences among the subgroups, I have compared the scores of the misfit variables. My approach is highly exploratory and provided tentative results. Firstly, I have compared the average score on misfit regarding each of the archetypes between each of the subgroups and all other cases. For four subsamples cases on average resemble the relevant archetype more under the predicted circumstances than under other circumstances. Next, I compared scores on the misfit variables within each sample to determine which archetype cases resemble most and least. Almost 35% of cases can be correctly predicted by the theory in terms of the archetype they resemble most. Only 7% of cases show less resemblance to their relevant archetype than to any of the other archetypes. The theory helps to explain (part 120 Analyses and results of) these results. Thus, the theory takes us one step ahead in understanding which MCSs prevail under what circumstances. However, without reference to MCS effectiveness the theoretical ideas cannot be tested. To test the main theoretical claims directly, five hypotheses were set up, two of which could be tested by means of regression analysis. Based on the regression model for arm’s length control, I find evidence that misfit regarding this archetype has a negative impact on the perceived effectiveness of the control system. This effect is confined to the group of cases that experience low levels of both uncertainty and asset specificity. The results are robust to shifts of the boundaries and hypothesis H1 is being supported. To test the second hypothesis, which relates to result oriented machine control again regression analysis is used. I find no evidence of a negative effect of misfit regarding result oriented machine control for MCS effectiveness. This null result is robust to changes in the boundaries that determine the relevant archetype. Hypothesis H2 is not supported. The other three hypotheses could not be tested by means of regression analysis because of the small sizes of the subsamples. Therefore, correlations among the variables of interest were calculated. Within the group of cases that have action oriented machine control as their relevant archetype, I find evidence for a positive relationship between misfit and effectiveness. The less the MCS resembles the archetype, the more effective it will be. This contradicts hypothesis H3. Within the group of cases that have exploratory control as their relevant archetype, I find some evidence for a positive relationship between misfit and effectiveness and some evidence that suggests there is no relationship between the variables. The effect is sensitive to decisions regarding the level of ex-post information asymmetry. When I incorporate cases with higher levels of information asymmetry, I find no influence of misfit on MCS effectiveness. However, overall I have found no evidence that meets the theoretical expectations. Hypothesis H4 is not supported. Finally, within the group boundary control, I find no evidence for a negative correlation between misfit regarding boundary control and MCS effectiveness. Therefore hypothesis H5 is not supported either. These analyses do not provide rigorous tests of the hypotheses but rather tentative results, because of the small sample sizes and the restriction to bivariate analysis as a consequence thereof. Apart from the results obtained by direct tests of the MCtce, additional results were obtained. The exploration of the effects of the contingency variables on misfit revealed that the role of uncertainty in my study is both influential and complex. For the complete sample the effect of uncertainty accords with theoretical expectations from the MCtce: when uncertainty increases 121 Chapter five organizational units move away from certain archetypes towards others as predicted. The relationship between uncertainty and misfit differs between the groups arm’s length control and result oriented machine control, which could be an indication of differences in strength of complementarities of each archetype. As a final related topic, I studied the relationship between MCS effectiveness and the effectiveness of the organizational unit. Results indicate that using organizational effectiveness as a proxy for control effectiveness is rather crude, and leaves out a lot of information. Focusing on effectiveness of control directly will render more insights into the workings of control systems. All results will be discussed in the next chapter. 122 Chapter six - Conclusion and discussion With this research project I contribute to the existing knowledge base by providing an empirical study on management control systems in their entirety and the effectiveness thereof. It is well recognized that studying control systems as packages of interrelated control elements is necessary for understanding the workings of control as well as its effects on performance (Otley, 1980; Flamholtz, 1983; Holmstrom and Milgrom, 1994; Abernethy and Chua, 1996). Empirical work that addresses these issues, however, is scarce. The goal of my research project is to learn about management control system design and effectiveness. My central research question is: which control systems are effective given the circumstances? This broad question indicates the nature of this study, which is theory driven yet exploratory in many respects. Providing an answer to the research question is not the sole contribution of my research project. Another contribution lies in the specific approach taken on enabling the study of MCS design and effectiveness as such. My study takes us a step closer to finding answers to the general problem of design of management control systems and exemplifies the type of study that can take research within the field forward. It also reveals the complexities surrounding this research problem. Studying the research question has led to a discussion of many topics, both theoretical and methodological. As a point of departure, I build directly from the work by Speklé (2001a, 2001b, 2004) whose Transaction Cost Theory of Management Control gives a theoretical answer to the research question. By putting the theory up to testing, I have gained insights into the empirical answer as well. Moreover, operationalizing the theoretical ideas involved theory refinement. Along the way, several issues of broader importance were addressed regarding MCS design, the survey method, and scale construction. In this chapter I will summarise and discuss the findings. Firstly, paragraph 6.1 addresses the research question. It provides a summary of the main findings and discusses the limitations of my study. Next, I focus on methodological issues that were central to this project and are of broader importance (6.2). Finally, in paragraph 6.3 I will discuss ideas for future research based on several findings that relate to the research topic more broadly and that have been addressed in an exploratory way. 6.1 The effectiveness of control systems This paragraph presents the aggregation of findings from this doctoral thesis and formulates an answer to the research question. I will summarise the results and discuss the limitations of my study. 123 Chapter six 6.1.1 Summary of findings To research the central question of this project – which MCSs are effective – I have tested hypotheses based on the Transaction Cost Theory of Management Control (MCtce; Speklé, 2001a, 2001b, 2004). This theory provides a theoretical answer to the research question. It submits that there are five effective forms of management control systems referred to as archetypes of control: arm’s length control, result oriented machine control, action oriented machine control, exploratory control, and boundary control. Each is effective only for the control of certain activities, but not for others. In this study I have looked at organizational units with various activities. Examples are production, human resources, research and development, and sales. These activities are characterised over three dimensions: the associated level of uncertainty, asset specificity, and ex-post information asymmetry. The effectiveness of the archetypes is contingent on these characteristics. This makes each archetype relevant for the control of certain activities. For instance, arm’s length control is the most effective control structure for activities with low uncertainty and moderate asset specificity. The main claim of the theory is: the closer an observed MCS resembles its relevant archetype, the more effective it will be. To obtain an empirical answer to the research question, I set out to test this main claim for each of the archetypes. Data collection by means of a survey returned 258 observations. For every observation misfit scores regarding each of the archetypes have been calculated. To enable hypotheses testing the overall sample was split up into five subsamples, one for every archetype. Before testing the theory’s main claim, I have compared the misfit variables and the subsamples to explore the predictive power of the theory. Firstly, the results of this initial analysis will be discussed. As argued in chapter two (2.3.2), on average cases are expected to resemble the archetypes because that is effective-, yet states of misfit will occur as well. Cases that do show misfits will be less effective. A first look at the data returned some general findings. First of all, within the overall sample I have found a move away from arm’s length control and result oriented machine control towards the other archetypes in case of a rise in the level of uncertainty. Thus, the average case resembles the archetypes exploratory or boundary control more in case of high uncertainty. This is as theory would predict. Testing for similar effects of asset specificity and ex-post information asymmetry returned insignificant results. I have also compared the average misfit scores among the subsamples. For four out of five subsamples the average misfit regarding the relevant archetype within the group, is lower 124 Conclusion and discussion compared to the average misfit of all other cases. Misfit regarding result oriented machine control forms the exception. For instance, within the subsample arm’s length control, the average misfit regarding this archetype is significantly lower than the average misfit regarding arm’s length control among all other cases. This finding once more indicates a link between the contingency variables and the misfit variables. The circumstances related to misfit seem to matter in ways that can be expected based on the theory. Finally, I have compared scores on the misfit variables within each subsample and determined which archetype cases resemble most and least. The results indicated that almost 35% of cases can be correctly predicted by the theory in terms of the archetype they resemble the most. Only 7% of cases show less resemblance to their relevant archetype than to any of the other archetypes. Combined these initial findings show a general tendency of management control systems to resemble their relevant archetype more closely under the specified circumstances. The theory has some predictive power and takes us one step forward in understanding which MCSs prevail under what circumstances. This relates directly to the problem of understanding control structure variety. However, without reference to differences in effectiveness between cases that do and cases that do not resemble the archetypes, the main claim of the theory cannot be tested. Thus, to assess the effects of misfit on performance, I have tested hypotheses directly related to the theory’s main claim. The subsamples for arm’s length control and result oriented machine control were large enough to test the impact of misfit on MCS effectiveness by means of regression analysis. The hypotheses related to these groups are repeated here for ease of reference: H1: In case of low uncertainty and moderate asset specificity, resemblance of an MCS with the archetype arm’s length control is positively associated with MCS effectiveness. H2: In case of low uncertainty, high asset specificity, and high measurability of outputs, resemblance of an MCS with the archetype result oriented machine control is positively associated with MCS effectiveness. The results of the analyses support the first hypothesis. Under the specified circumstances, an observed MCS that resembles the archetype arm’s length control more closely is relatively more effective. I have found no support for the second hypothesis. Robustness checks showed 125 Chapter six that both results are robust against decisions regarding the boundaries set on contingency variables to determine the relevant archetype for each case. A first conclusion is that in circumstances of low uncertainty and low asset specificity, resemblance with the archetype arm’s length control has a positive effect on MCS effectiveness. Arm’s length control is thus effective for the control of activities that show the following characteristics: activities are programmable, outputs can be measured, and performance can be compared to the market. An effective MCS is characterized by use of market benchmarks and relatively high work autonomy. There is also little discussion related to the coordination of work. This result pertains to half the cases in my sample. The cases studied stem from various industries and companies and therefore this conclusion seems to be valid for a large variety of units as long as their activities have the characteristics as specified. Another conclusion that follows straight from the analyses is that resemblance with result oriented machine control does not impact MCS effectiveness. This does not support the theoretical expectations. Apart from the possibility that the theory might not hold in this situation, other explanations can be given for this result. One explanation for the null result I have found for result oriented machine control might lie in the underlying assumptions of the configurational approach. Recall that the configurational approach assumes that only few states of fit exist. These are represented by the archetypes. Complementarities among the elements of control exist within the MCSs and therefore changing circumstances will not lead to gradual adjustments of the control structure, but a quantum jump will occur (see 2.3.2). An explanation for the null result would be that the complementarities within the result oriented machine control structure are rather weak. If this is true, gradual adjustments are possible up to a certain extent. Put differently, a threshold level of misfit might exist that only when reached will lead to a decrease in MCS effectiveness. Some tentative results illustrate this idea. I have tested for a linear relationship between uncertainty and misfit within the subsample result oriented machine control and found a negative relationship between both variables. I also examined a scatter plot of uncertainty and misfit to assess the appropriateness of assuming a linear relationship. The evidence suggests that gradual adjustments to the MCSs within this subsample are possible without having negative consequences for the perceived effectiveness of the MCS. It might be that the complementarities within the control structure result oriented machine control are indeed relatively weak. This result is limited to the range of misfit scores that occurs within my sample. I did not find evidence for a threshold level of misfit, but since my sample did not hold cases that show extreme misfits regarding the archetype the threshold level might not be 126 Conclusion and discussion reached. This way of thinking has implications for future research. These are discussed in the third paragraph (6.3). There is another explanation for the lack of results for misfit regarding result oriented machine control. It seems that multiple more of less equally effective control systems exist within the subsample. However, these might differ in terms of their costs. In a situation of low uncertainty and high asset specificity, managers potentially rely on a diverse set of control instruments ranging from rules and regulations to specific output targets. With many potential combinations of control instruments, the same level of control might be established in several ways, but at different costs. A test of the relationship between misfit and efficiency of control (instead of effectiveness of control) might therefore have rendered different findings. It could be that several combinations of control instruments are effective, but only one - the archetype - is the most efficient. Not measuring the costs of control systems therefore is a limitation of my study as will be discussed in the next subparagraph (6.1.2). Apart from arm’s length control and result oriented machine control, I have studied the other archetypes. Hypotheses H3, H4, and H5 could not be tested by means of regression analysis because the subsamples are too small. I could only assess bivariate relationships between misfit and MCS effectiveness within each of the subsamples. Those hypotheses are also repeated here: H3: In case of low uncertainty, high asset specificity, and low measurability of outputs, resemblance of an MCS with the archetype action oriented machine control is positively associated with MCS effectiveness. H4: In case of high uncertainty and low ex-post information asymmetry, resemblance of an MCS with the archetype exploratory control is positively associated with MCS effectiveness. H5: In case of high uncertainty and high ex-post information asymmetry, resemblance of an MCS with the archetype boundary control is positively associated with MCS effectiveness. To learn about these archetypes, I firstly studied the misfit variables as such. I have found that there are no cases in my sample that closely resemble the archetypes action oriented machine control, exploratory control, or boundary control. The subgroups for these archetypes were only small, so I might have found cases that resemble the archetypes in larger samples. 127 Chapter six Bivariate analyses returned the following results. Some of the evidence does not support the theory and some even contradicts the theory. Within the group of cases that have action oriented machine control as their relevant archetype, the evidence for the relationship between misfit and effectiveness contradicts hypothesis three. For the group of cases that have exploratory control as their relevant archetype, results are unstable. In the initial sample, a positive effect of misfit on MCS effectiveness showed up, which contradicts hypothesis H4. Robustness checks, however, showed that this result disappears and the effect becomes insignificant when I adjust the boundary to incorporate cases with higher levels of ex-post information asymmetry. The smallest number of cases falls into the group boundary control (15 cases). I find no evidence for a negative correlation between misfit regarding boundary control and MCS effectiveness. Caution is warranted when interpreting these results. The tests of the hypotheses H1 and H2 are convincing in the sense that the samples are relatively large, results are robust to boundary decisions, and no problems occurred within the models that could disrupt the relationship between misfit and effectiveness. The results for the other three hypotheses, however, are based on bivariate analyses and small samples. Recall that the misfit variables represent profiles with approximately 20 underlying variables. Studying 15 – 25 cases therefore cannot provide more than tentative results. Moreover, only bivariate relationships between misfit and MCS effectiveness have been investigated, whereas I expect the interrelationships among the variables to be more complex. Within the various subsamples multiple correlations between the variables as specified in the general model exist. MCS effectiveness and effectiveness of the organizational unit correlate positively, and uncertainty correlates with multiple variables. Conducting tests for the complete model, takes into account the impact of the relevant variables simultaneously and therefore provides a more rigorous test of the hypotheses. Having said that, the evidence for hypotheses H3, H4, and H5 does not support the theoretical ideas. Regarding my research question, it cannot be concluded that MCSs that resemble the archetypes action oriented machine control, exploratory control, or boundary control are more effective than other MCSs in circumstances of high uncertainty or high asset specificity. All in all, some findings support the theoretical ideas: the overall effect of uncertainty on misfit regarding the different archetypes, the general tendency to resemble the relevant archetype more closely than other archetypes (which can be related to the specific circumstances), and support for the hypothesis related to arm’s length control. Other findings do not support the theory: resemblance with result oriented machine control does not impact control effectiveness. The bivariate analyses for action oriented machine control, exploratory 128 Conclusion and discussion control and boundary control do not support the theory either. However, the effects of resembling any of the latter archetypes remain unclear due to small sample testing. Regarding the research question -which MCSs are effective given the circumstances-, I thus find that management control systems that resemble the archetype arm’s length control are effective for the control of activities with low uncertainty and low asset specificity. During this research project, the tests of the theoretical ideas were preceded by theory refinement. Keating (1995) distinguishes several forms of theory refinement in light of case study research. One of these forms (theory specification case) describes the process I went through in my research project: “(to) specify the theory by adding greater precision to theoretical constructs and propositions and rendering the theory into a refutable form” (Keating, 1995, p70). Here I refer to theory refinement in a broad sense and address some points of interest. One element of fine-tuning relates to the distinction between the two types of machine control. I suggest using the measurability of outputs for differentiating the action and result oriented type. Moreover, in the process of operationalizing the ideas some of the theory’s assumptions have been made explicit. For instance, the view that equifinality among the archetypes is not assumed to exist, which has direct consequences for the way the theory can be tested. Finally, during the operationalization the substance of the archetypes has been defined more precisely. All archetypes are defined over the same set of variables. Surely, this is not the only possible way to define the archetypes (also see 6.1.2), but it forms a basis for future research and further development of the specific contents of the archetypes. 6.1.2 Limitations When interpreting the results of this study, a number of limitations should be taken into account. Some limitations are inherent to survey research, others specifically relate to this research project. Both will be discussed in this subparagraph. First of all, I provide no tests of causal relationships. The data are cross-sectional and therefore both the design of the management control system and its effects (in terms of effectiveness of control) are measured at one point in time. Longitudinal case studies provide better means to investigate causal relationships and to study the development of control systems and the effects of these systems over time. Within survey research it is difficult to take a time lag into account because it is unknown how long the time lag should be. This might even differ among the large variety of organizational units within my dataset. 129 Chapter six Secondly, only two out of five hypotheses could be tested by means of regression analysis because of small subsamples. Although I tried to select cases according to their activities and aimed for as much variety as possible, the number of cases for the subsamples action oriented machine control, exploratory control, and boundary control were limited. More data would enable a more rigorous test of these hypotheses. Another limitation relates to the measurement of MCS effectiveness. My objective was to develop a measure that is generally applicable. Therefore, first of all, to measure effectiveness I make use of perceptions instead of ‘hard data’. In my case, it is impossible to work with objective measures, since I am comparing such a large variety of organizational units throughout the hierarchy. The only way to compare effectiveness of control among them is to ask the manager directly for an assessment. In chapter three I have argued that this approach fits my research design, since I study the relative effectiveness of various MCSs. However, a more fundamental problem exists for the measurement of MCS effectiveness. I have discussed the notion of equifinality (see 2.3.1) and argued that equifinality cannot be assumed to exist among the archetypes. The archetypes each represent the most effective structure given the specific activities, but this does not imply they are equally effective. It might even be that effectiveness should therefore be measured differently within each of the subsamples related to the archetypes. Recall that in measuring MCS effectiveness I ask about the achievement of underlying control goals, such as guiding manager’s behaviour or supporting managers in their decision making. Some of these might be more relevant for one of the archetypes, than for another. For instance, the main goal of a boundary control system could be to set boundaries and not to motivate employees. Because of the small subsamples I was not able to research such differences. I have two suggestions to improve the measure for future use: the first one is to attach weights to the goals. Apart from asking for an assessment of each item, an indication of its importance to the organizational unit might provide more precise measurement. That complicates the measure a lot though. Another way to go would be to tailor the measure to a specific situation (when studying a homogenous group of activities, for instance) or to determine from theory which items to combine for different situations. As already mentioned when discussing the findings for result oriented machine control, I have assessed the effectiveness of the control system, not its efficiency. This is also a limitation of my study. Throughout this research project I have focused on effectiveness of control systems as my central concern. However, also taking the costs of different control systems into account could render different and additional insights. For instance, when a manager uses more control instruments or the same ones more intensively than necessary to establish an effective control system two things might happen: the system could be distorted and people 130 Conclusion and discussion might start to obstruct it (1). The effectiveness would decrease accordingly. It could also return a better effectiveness up to the point of perfect control (2). Both situations will render excess costs while only one shows a decrease in effectiveness. These aspects could be picked up by assessing the efficiency of the MCSs. Thus, the costs of MCSs should be taken into account somehow. An example of an aspect used as a proxy for the costs of control can be found in Widener (in press). In her study of the levers of control framework, she uses four items to measure management attention as a proxy of the costs of a control system. This is only one aspect of the costs of a management control system, but could be a starting point for the future development of a measure of MCS efficiency. Another issue to be kept in mind when interpreting the results of this study, is that it could have been conducted differently. As the author of the MCtce himself put it: “… the application of this theory is bound to command considerable interpretative efforts from the researcher to deal with the inevitable shades of grey” (Speklé, 2001, p 439). These shades of grey relate to the definition of the archetypes, the measurement of misfit, and the assumed linkages between misfit and effectiveness. The definition of the archetypes could also have been set up empirically, or through an iterative process confronting theory and empirics. Another example of choices made concerns the measurement of misfit. In the calculation of misfit, weights were attached to separate parts of the control structure. Another interpretation of the theory might have resulted in a different weighting. Also, I have assumed a linear relationship to exist between misfit and MCS effectiveness whereas other relationships might exist. Although my choices are consistent with the theory, other possible operationalizations could also have been consistent with the theory. A final limitation concerns the measurement of some of the variables 41. Although considerable effort has been put into designing and testing a survey instrument that delivers data of high quality, with hindsight I conclude that some variables could have been measured better. First of all, the measurement of asset specificity can be improved. I was unable to measure all aspects of this construct and combine them into a MIMIC model that represents best the underlying structure of the concept. As a consequence I was left with the second best solution and thus with less precise measurement of asset specificity. I capture only a part of the construct instead of the whole and therefore the effects of the variable might be bigger than could be assessed in the analyses. However, it is difficult to assess the true impact on the findings of this study, because I cannot compare the first and second best solutions. Also 41 Appendix B gives an overview of the questions used in this study. I have added a note to those questions that might be improved. 131 Chapter six important is the role of benchmarks. In the questionnaire I only picked up on general use of benchmarks whereas it would have been better to ask explicitly about their role in performance evaluation and rewarding as well. This has implications for the distinction between arm’s length control and result oriented machine control, and could have separated the two types better. Another aspect that I would have liked to assess in more detail is the percentage bonus. There was little variety in the answers for this variable and for future use of the question I suggest adjusting the answer categories to differentiate better between smaller and larger bonuses. 6.2 Results related to methodology Throughout this doctoral thesis, several methodological themes have been discussed that are not only important to this particular study, but to research within management accounting and control generally. This paragraph presents a summary of the main concerns and implications for future research. Respectively I will discuss (1) the survey method, (2) measurement of MCS effectiveness, and (3) measurement of uncertainty and asset specificity or, more generally, constructs with formative indicators. The first theme concerns the survey method. Several authors concluded that the survey method is valuable and valid, but more concern must be given to its correct application (see for instance Ittner and Larcker, 2001; Zimmerman, 2001). Improper use of the method results in research that is based on data of poor quality. In chapter three I have showed how the critique on survey studies within accounting can be dealt with successfully by (1) establishing as close a match as possible between the theory and operationalization and by (2) putting my questionnaire through an extensive pre-test. Apart from making use of available methods for pre-testing, I have introduced and used validity-feedback interviews. This new technique proves a powerful method to obtain feedback on the content validity of questionnaires. To improve overall survey research within our field, researchers should make use of the available techniques for questionnaire design and pre-testing, and not underestimate the intricacies of the method. A second recurrent theme has been the measurement of MCS effectiveness. I have developed a measurement instrument to assess the perceived effectiveness of the management control system. Starting from the idea that it should be able to pick up the achievement of underlying goals of control, multiple items have been identified that in combination measure effectiveness (limitations of this instrument have been discussed in 6.1.2). For two reasons direct measurement of MCS effectiveness is preferred over measuring the effectiveness of the 132 Conclusion and discussion organizational unit and using that as a proxy for control effectiveness, which most studies do. First of all, the proxy is rather crude as illustrated by the correlation coefficients between both effectiveness measures. The values did not exceed 0.60 (for the exact numbers see table 5.24). A second reason is that the relationship between MCS effectiveness and effectiveness of the organizational unit as such deserves our attention. If we believe that the management control system affects the performance of organizations through its influence on employee behaviour, empirical evidence helps to prove and understand this relationship. However, little is known about it and thus several opportunities for future research exist. In the next paragraph I will discuss some of these opportunities. The final methodological theme that arose from my research project concerns the construction of measurement models with formative indicators. Several authors have stressed the importance of distinguishing formative from reflective indicator models and showed that wrong conclusions can result from misspecification (see for instance MacKenzie et al., 2005; Diamantopoulos and Siguaw, 2006; Bisbe et al. (in press)). I suggest a formative approach in measuring two key TCE variables. For both uncertainty and asset specificity I set up a general MIMIC model (Jöreskog and Goldberger, 1975). For uncertainty I was able to estimate the model and use it for scale construction. The construction of a MIMIC model for asset specificity failed, but improved measurement, especially of the reflective indicators, might lead to better results. The formative approach fits the essence of both constructs that each have multiple guises. Within TCE based studies, researchers mostly focus on single aspects of uncertainty and asset specificity, for instance human asset specificity (Rindfleisch and Heide, 1997), or they take in multiple aspects separately. The measurement of uncertainty has been very diverse and Boerner and Macher conclude that “empirical findings that relate uncertainty to organizational form are mixed, partly because of the multitude of uncertainty types examined” (2001, p6). The measurement of uncertainty has been criticized and researchers were called upon to pay extra attention to it (Boerner and Macher, 2001; David and Han, 2004). More precise measurement of both asset specificity and uncertainty is called for and I suggest a formative approach to enable this. A MIMIC model facilitates the study of different aspects simultaneously and can therefore provide us with a more comprehensive measure of these constructs. 6.3 Future research - another point of departure During the search for an answer to the research question several themes emerged that broadly relate to the study of MCS design and effectiveness. In this paragraph I discuss the main points of interest and make suggestions for future research. In four subparagraphs I will 133 Chapter six elaborate on configurational fit and complementarities, the role of uncertainty, the relationship between MCS effectiveness and effectiveness of the organizational unit, and finally, the design and effectiveness of MCSs. These topics will be presented in separate paragraphs, but they are very much intertwined. 6.3.1 Configurational fit and complementarities Central to this research project is configurational thinking. I study multiple contingency variables and multiple structure variables (combined in the archetypes), and only a discrete number of fit states are thought to exist. According to the configurational view, change of management control systems will occur through quantum jumps rather than gradually (Gerdin and Greve, 2004). These quantum jumps involve large changes to all parts of the control system simultaneously, and will not occur until the contingencies have changed enough to warrant the considerable investments involved. Thus, an MCS will not be adjusted with every change in the contingencies, which leads to a loss in performance. An opposed view is provided by the contingency approach. According to this approach change in organizational structure does accord with changes in contingency variables and will occur gradually. In chapter five, I have studied the relationship between the contingency variables and the misfit variables. This analysis returned different results for the largest subsamples: arm’s length control and result oriented machine control. Within the subsample arm’s length control I found what I expected adhering to the configurational view: there is no direct relationship between the contingency variables and misfit, but there is a negative influence of misfit on performance. To the contrary, for result oriented machine control I have found no effect of misfit on performance, but a linear relationship between uncertainty and misfit. This would fit the contingency view better. Both results seem to be explained best by different views. However, these two views are positioned as opposing views that cannot be combined (Donaldson, 2001; Gerdin and Greve, 2004). The discussion of these results has led to the idea of differences in strength of complementarities between MCSs and the possible existence of a threshold level of misfit. This might be a third view towards fit and effectiveness. If a threshold level of misfit exists, gradual adjustments are possible up to a certain extent, but at some point the threshold level is reached and fundamental changes are needed in order to stay effective. My exploratory analysis did not provide evidence for a threshold level of misfit within the subsample result oriented machine control, but the level might not have been reached. 134 Conclusion and discussion Incorporating this idea of differences in strength of complementarities and thus a threshold level of misfit, provides a slightly different view on fit within the configurational approach. Figure 6.1 illustrates configurational fit as usually described (panel A; Gerdin and Greve, 2004) and configurational fit with the possibility of differences in strength in complementarities between different archetypes (panel B). The circles represent states of fit and the size of the circles indicates the flexibility of each system or rather the weakness of the complementarities within each system. The stronger the complementarities among the control instruments are, the smaller the circle is. Within the circles gradual adjustments are possible. Positions outside the circles always imply states of misfit. Figure 6.1 Configurational fit: different strengths in complementarities Panel A Panel B S T R U C T U R E S T R U C T U R E CONTINGENCY CONTINGENCY Note: the size of the circles indicates the strength of the complementarities: the larger the circle, the weaker the complementarities within the archetype. Points within the circles represent states of fit; points outside the circles represent states of misfit. Source panel A: Gerdin and Greve, 2004, p306. This idea of a threshold level of misfit as a third view to states of fit, misfit, and effectiveness, might help us to understand the design of organizational structures and to explain empirical observations. There is little empirical work that focuses on these different views of fit and studies which approach provides a better explanation for the workings and effectiveness of organizational structures. Future research might try to compare these different views in a single study and shed light on the issue. 6.3.2 Uncertainty and the perception of control In my research project, the role of uncertainty turned out to be both influential and complex. Apart from its role as a contingency variable used to characterise the activities of organizational units, the variable seems to influence the perceived effectiveness of the control system directly. In three out of five subsamples uncertainty correlates negatively with MCS 135 Chapter six effectiveness. Within the subsamples exploratory control and boundary control I have found no significant relationship between these variables, but these subsamples are small and only strong effects would be picked up. There are two potential explanations for this negative effect of uncertainty on performance: the first relates to changes in the management control system and states of misfit, the second relates to a psychological effect. Both will be discussed next. The first potential explanation stems directly from configurational thinking. As discussed in the previous paragraph, adhering to the configurational approach an MCS is expected not to change gradually with changes in contingency variables. Thus, a rise in uncertainty would not be accompanied by a change in the control structure (as long as the change is not large enough to justify a quantum jump). However, this change in the level of uncertainty would cause the MCS no longer to fit the new situation, which in turn leads to a decrease in performance. In that case a direct negative effect of uncertainty on MCS effectiveness occurs. As a second potential explanation I have argued that a psychological effect of uncertainty might exist. This could influence the manager’s assessment of control effectiveness, which is relevant because I measure perceived effectiveness. The general idea is that in case of high uncertainty managers feel less comfortable with their management control system even it is the most effective system given the circumstances. Situations of high uncertainty do not come with a wide range of options in terms of suitable control instruments. Consider, for instance, a control structure like boundary control, which hinges on prohibitive behavioural guidelines. Instead of being able to rely on precise work instructions or pre-set performance targets, managers cannot but tell their employees what not to do or what to avoid. This likely leaves them dissatisfied with the effectiveness of the control system even if it is the most effective given their situation. On the contrary, in a situation of low uncertainty managers could rely on rules, regulations, pre-set performance targets and the like, and might feel more ‘in control’ than managers that cannot. The evidence I have found within the subsample result oriented machine control supports this view. Within that subsample I found an effect of uncertainty on both misfit and MCS effectiveness, but no effect of misfit on MCS effectiveness. Apparently, the effect of uncertainty does not follow from a state of misfit -like described above-, but might be a result of the manager’s dissatisfaction with the MCS as such. Future studies could research this psychological effect. This might enhance our understanding of the effects of control systems as such. Moreover, if the psychological effect of uncertainty occurs it should be taken into account when comparing the perceived effectiveness of different management control systems. 136 Conclusion and discussion 6.3.3 Effectiveness of control and effectiveness of the organization When discussing the results related to methodology, I have already argued that measuring both effectiveness of control and effectiveness of the organization or organizational unit is beneficial. Apart from merely assuming that a positive relationship between the two variables exists, empirical evidence should be gathered to shed light on this relationship. Two issues relevant for future research emerged from my study. Firstly, it is unclear what causal relationship exists between both constructs of effectiveness. Secondly, their relationship might differ across different situations. Regarding the first point, I have argued that causality might run both ways. Since the MCS is supposed to influence employee behaviour in order to accomplish organizational goals, an effective MCS will likely have a positive influence on the organizational unit. Moreover, a highly effective organizational unit probably has more resources to spend on its MCS to improve it. Another effect that might influence the relationship between effectiveness of the unit and effectiveness of the MCS is again a psychological one. When assessing the perceived effectiveness of control, a manager of a successful organizational unit might assess the functioning of his/her control system more positively. This could be due to overall optimism or to the believe that if one is doing well, the control system must also function well. The opposite is also true: the MCS will be valued less when the unit is not functioning well. Note that both effectiveness measures are influenced by many other factors, so that the relations are probabilistic rather than deterministic. My analyses showed that MCS effectiveness and effectiveness of the organizational unit are positively correlated in all but one sample. Their relationship is quite strong with correlation coefficients ranging from 0.42 to 0.60. Only within the subsample boundary control I did not find a significant relationship. As a possible explanation I have referred to the essence of boundary control as a management control structure. In the MCtce the archetype is described as “a control structure of last resort” (Speklé, 2001a). If managers indeed perceive this control structure to be less effective than other forms of control, this might account for the lack of correlation between perceived MCS effectiveness and perceived unit effectiveness within the subsample boundary control. This triggered the second issue: the relationship between both effectiveness measures might differ in strength across situations. Studies that focus specifically on the relationship between the effectiveness of the control system and unit effectiveness will be highly relevant. They can provide information on the 137 Chapter six effectiveness of control systems as such as well as on the effects of control systems on the organization or organizational unit. 6.3.4 MCS design and effectiveness Several research opportunities have already been discussed, but two issues remain that relate to MCS design and effectiveness. These will be presented in this final subparagraph. Research opportunities remain concerning the Transaction Cost Theory of Management Control. In my study the MCtce provided important insights, but more empirical studies should test the ideas to enable an assessment of the full impact of this theory. There are several possibilities. Collecting additional data for the subsamples action oriented machine control, exploratory control, and boundary control would enable hypotheses testing for these archetypes. Moreover, the definitions of the archetypes can be compared to empirical profiles and may be fine-tuned. Apart from a quantitative approach, other methods like case studies could helpfully be applied to study the archetypes and the activities they control. Multiple possibilities to contribute exist in the study of management control systems in their entirety. It is unclear how different control instruments combine to establish control as a set. Which combinations represent fit states and will thus be effective, and which ones cannot survive? Issues that deserve our attention are relationships among the elements of control that make up the system. The exact combinations that are used should be studied. Also, do the control instruments function as complements or substitutes, and are some systems characterised by strong complementarities whereas others are not? Would this depend on the circumstances, or on the structure itself? More empirical work might show how the separate control instruments within an MCS operate together. All in all, a lot remains to be learned about management control system design and effectiveness, leaving us with many exiting opportunities for future research. More studies that explicitly take on a broad view towards control and relate MCS design to effectiveness will teach us about this topic that lies at the heart of research within the field. 138 References Abernethy, M.A., J. Bouwens and L. Van Lent (2004). Determinants of control system design in divisionalized firms. The Accounting Review, 79 (3), pp545-570. Abernethy, M.A. and W.F. Chua (1996). A field study of control system ‘Redesign’: the impact of institutional processes on strategic choice. Contemporary Accounting Research, 13 (2), pp569-606. Aiken, L.S. and S.G. West (1991). Multiple regression: testing and interpreting interactions. Sage Publications, Newbury Park. Anderson, E. (1985). The salesperson as outside agent or employee: a transaction cost analysis. Marketing Science, 4 (3), pp234-254. Anderson, E. (1988). Transaction costs as determinants of opportunism in integrated and independent sales forces. Journal of Economic Behavior and Organization, 9, pp247-264 Anderson, E. and D.C. Schmittlein (1984). Integration of the sales force: an empirical examination. RAND Journal of Economics, 15, pp385–395. Beatty, P. (2004). Paradigms of cognitive interviewing practice, and their implications for developing standards of best practice. ZUMA-Nachrichten Spezial Band 9, Questionnaire Evaluation Standards, Centre for Surveys, Methods, and Analyses (ZUMA), Mannheim. Bedford, D. (2005). Management Control Systems Configurations in Practice. University of Technology, Sydney (working paper). Bisbe, J., J. Batista-Foguet and R. Chenhall (2006). Defining management accounting constructs: a methodological note on the risks of conceptual misspecification. Accounting, Organizations and Society, in press. doi:10.1016/j.aos.2006.09.010. Boerner, C.S. and J.T. Macher (2001). Transaction cost economics: an assessment of empirical research in the social sciences. McDonough School of Business (working paper). Bollen, K. and R. Lennox (1991). Conventional wisdom on measurement: a structural equation perspective. Psychological Bulletin, 110 (2), pp305-314. Bouwens, J. and L. van Lent (2004). Assessing the performance of profit center managers. University of Tilburg (working paper). Brownell, P. (1995). Research methods in Management Accounting. Coopers & Lybrand, Melbourne. Burton, R. M., J. Lauridsen and B. Obel (2002). Return on assets loss from situational and contingency misfits. Management Science, 48 (11), pp1461-1485. Chenhall, R.H. (2003). Management control systems design within its organizational context: findings from contingency-based research and directions for the future. Accounting, Organizations and Society, 28 (issue 2-3), pp127-168. 139 References Coles, J.W. and W.S. Hesterly (1998). Transaction costs, quality, and economies of scale: examining contracting choices in the hospital industry. Journal of Corporate Finance, 4, pp321–345. David, R.J. and S.-K. Han (2004). A systematic assessment of the empirical support for transaction cost economics. Strategic Management Journal, 25, pp39-58. Diamantopoulos, A. and J.A. Siguaw (2006). Formative versus reflective indicators in organizational measure development: a comparison and empirical illustration. British Journal of Management, 17, pp 263-282. Diamantopoulos, A. and H.M. Winklhofer (2001). Index construction with formative indicators: an alternative to scale development. Journal of Marketing Research, 38, pp269277. Dillman, D.A. (2000). Mail and internet surveys. The tailored design method. Second Edition, John Wiley and Sons, Inc., New York. Donaldson, L. (2001). The contingency theory of organizations. Sage Publications, Thousand Oaks. Doty, D. H. and W. H. Glick (1994). Typologies as a unique form of theory building: towards improved understanding and modeling. The Academy of Management review 19 (2), pp230251. Doty, D. H., W.H. Glick and G.P. Huber (1993). Fit, equifinality, and organizational effectiveness: a test of two configurational theories. Academy of Management Journal, 36 (6), pp1196-1250. Drazin, R. and A. H. v. d. Ven. (1985). Alternative forms of fit in contingency theory. Administrative Science Quarterly 30, pp514-539. Dunk, A. (1993). The effects of budget emphasis and information asymmetry on the relation between budgetary participation and slack. The Accounting Review, 68 (2), pp400-410. Ferreira, A. and D. Otley (2005). The consequences of management control systems design and use (working paper). Fisher, J. (1995). Contingency-based research on management control systems: categorization by level of complexity. Journal of Accounting Literature, 14, pp24-48. Flamholtz, E.G. (1983). Accounting, budgeting and control systems in their organizational context: theoretical and empirical perspectives, Accounting, Organizations and Society, 8 (2/3), pp153-169. Gerdin, J. (2005a). The impact of departmental interdependencies and management accounting system use on subunit performance. European Accounting Review, 14 (2), pp297327 Gerdin, J. (2005b). The impact of departmental interdependencies and management accounting system use on subunit performance: a second look. European Accounting Review, 14 (2), pp335-340 Gerdin, J. and J. Greve (2004). Forms of contingency fit in management accounting research a critical review. Accounting, Organizations and Society, 29, pp303-326. 140 References Gresov, C. (1989). Exploring fit and misfit with multiple contingencies. Administrative Science Quarterly, 34 (3), pp431-453. Hair, J.F.(Jr), R.E. Anderson, R.L.Tatham and W.C. Black (1998). Multivariate data analysis. Fifth Edition, Prentice-Hall, Inc., New Jersey. Hak, T., K. van der Veer and H. Jansen (2004). The Three-Step Test-Interview (TSTI): an observational instrument for pretesting self-completion questionnaires. ERIM Report Series Research in Management. Hartmann, F.G.H. (2005). The impact of departmental interdependencies and management accounting system use on subunit performance: a comment. European Accounting Review, 14 (2), pp329-334 Hitt, M.A., R.D. Ireland, B.W. Keats and A. Vianna (1983). Measuring subunit effectiveness. Decision Sciences, 14 (1), pp87-102. Hitt, M.A. and R.D. Middlemist (1979). A methodology to develop the criteria and criteria weightings for assessing subunit effectiveness in organizations. The Academy of Management Journal, 22 (2), pp356-374. Hofstede, G. (1981). Management control of public and non-for-profit activities. Accounting, Organizations and Society, 6 (3), pp193-211. Holmstrom, B. and P. Milgrom (1994). The firm as an incentive system. The American Economic Review, 84 (4), pp972-991. Ittner, C.D. and D.F. Larcker (2001). Assessing empirical research in managerial accounting: a value-based management perspective. Journal of Accounting and Economics, 32 (1-3), pp349-410. Jansen, H. and T. Hak (2005). The productivity of the three-step test-interview (TSTI) compared to an expert review of a self-administered questionnaire on alcohol consumption. Journal of Official Statistics, 21 (1), pp103-120. Jarvis, C.B., S.B. MacKenzie and P.M. Podsakoff (2003). A critical review of construct indicators and measurement model misspecification in marketing and consumer research. Journal of Consumer Research, 30 (2), pp199-218. Jensen, M.C. and W.H. Meckling (1992). Specific and general knowledge, and organizational structure. In: L. Werin and H. Wijkander (eds.), Contract economics, pp251-274, Basil Blackwell, Oxford. Jöreskog, K.G. and A.S. Goldberger (1975). Estimation of a model with multiple causes of a single latent variable. Journal of the American Statistical Association, 10, pp631-639. Keating, A.S. (1997). Determinants of divisional performance evaluation practices. Journal of Accounting and Economics, 24, pp243-273. Keating, P.J. (1995). A framework for classifying and evaluating the theoretical contributions of case research in management accounting. Journal of Management Accounting Research, 7 (Fall), pp66-86. King, G., R.O. Keohane and S. Verba (1994). Designing Social Inquiry. Scientific inference in qualitative research. Princeton University Press, New Jersey. 141 References Langfield-Smith, K. and D. Smith (2003). Management control systems and trust in outsourcing relationships. Management Accounting Research, 14, pp281-307. Lattin, J., J.D. Carroll and P.E. Green (2003). Analyzing multivariable data. Thomson Learning, Inc., Pacific Grove. Luft, J. and M.D. Shields (2003). Mapping management accounting: graphics and guidelines for theory-consistent empirical research. Accounting, Organizations and Society, 28, pp169249. MacKenzie, S.B., P.M. Podsakoff and C.B. Jarvis (2005). The problem of measurement misspecification in behavioral and organizational research and some recommended solutions. Journal of applied psychology, 90 (4), pp710-730. Merchant, K.A. (1982). The control function of management. Sloan Management Review, summer, pp43-55 Milgrom, P. and J. Roberts (1990). The economics of modern manufacturing: technology, strategy, and organization. The American Economic Review, 80 (3), pp511-528. Milgrom, P. and J. Roberts (1995). Complementarities and fit strategy, structure, and organizational change in manufacturing. Journal of Accounting & Economics, 19, pp179-208. Mintzberg, H. (1983). Structures in fives: designing effective organizations, Prentice Hall, Englewood Cliffs. Newbold, P. (1995). Statistics for business & economics. Fourth edition, Prentice-Hall, Inc., New Jersey. Nicholson, B., J. Jones and S. Espenlaub (2006). Transaction costs and control of outsourced accounting: case evidence from India. Management Accounting Research, 17, pp238-258. Otley, D.T. (1980). The contingency theory of management accounting: achievement and prognosis. Accounting, Organizations and Society, 5 (4), pp413-428. Otley, D., J. Broadbent and A. Berry (1995). Research in management control: an overview of its development. British Journal of Management, 6 (special issue), S31-S44. Ouchi, W.G. (1979). A conceptual framework for the design of organizational control mechanisms. Management Science, 25 (9), pp833-848. Ouchi, W.G. (1980). Markets, bureaucracies and clans. Administrative Science Quarterly, 25 (1), pp129-141. Price, J.L. (1997). Handbook of organizational measurement. International Journal of Manpower, 18 (4/5/6), pp305-558. Rainey, H.G. (1983). Public agencies and private firms; incentive structures, goals, and individual roles. Administration and Society, 15, pp207-242. Rindfleisch, A. and J. Heide (1997). Transaction cost analysis: past, present and future applications, Journal of Marketing, 61 (October), pp30-54. Shelanski, H.A. and P.G. Klein (1995). Empirical Research in Transaction Cost Economics: a Review and Assessment. The Journal of Law, Economics, & Organization, 11, pp335-361. 142 References Simon, H.A. (1957). Models of man, social and rational: mathematical essays on rational human behavior in a social setting. John Wiley & Sons Inc., New York. Simons, R. (1995). Levers of control: How managers use innovative control systems to drive strategic renewal. Harvard Business School Press, Boston. Smith, A. (1776). The wealth of nations, edited by Edwin Cannan, 1904. Reprint: New York, Modern Library, 1973. Snell, S.A. (1992). Control theory in strategic human resource management: the mediating effect of administrative information. The Academy of Management Journal, 35 (2), pp292327. Speklé, R.F. (2001a). Explaining management control structure variety: a transaction cost economics perspective. Accounting, Organizations and Society, 26, pp419-441. Speklé, R.F. (2001b). Beyond Generics: a closer look at hybrid and hierarchical governance. ERIM Ph.D. series Research in Management 8, Erasmus Research Institute of Management, Erasmus University Rotterdam. Speklé, R.F. (2004). Configurations of control: a transaction cost approach (working paper). Speklé, R.F., H.J. Van Elten and A. Kruis (2007). Sourcing of internal auditing: an empirical study. Management Accounting Research, 18 (1), pp102-124. Van de Ven, A.H. and D.L. Ferry (1980). Measuring and assessing organizations. John Wiley & Sons, New York. WWW Reprint 2000, University of Minnesota. http://umn.edu/~avandeve Van den Bogaard, M.A. and R.F. Speklé (2003). Reinventing the hierarchy: strategy and control in the Shell Chemicals carve-out. Management Accounting Research, 14, pp79-93. Van der Stede, W.A., S.M. Young and C.X. Chen (2005). Assessing the quality of evidence in empirical management accounting research: The case of survey studies. Accounting, Organizations and Society, 30, pp655-684. Verbeeten, F.H.M. (2005). Performance management in government and non-profit organizations: a goal setting theory perspective. Erasmus University, Rotterdam (working paper). Widener, S.K. (2004). An empirical investigation of the relation between the use of strategic human capital and the design of the management control system. Accounting, Organizations and Society, 29, pp377-399. Widener, S.K. (2007). An empirical analysis of the levers of control framework. Accounting, Organizations and Society, in press. doi:10.1016/j.aos.2007.01.001. Williamson, O.E. (1975). Markets and hierarchies: analysis and antitrust implications. Free Press, New York. Williamson, O.E. (1985). The Economic Institutions of Capitalism. Free Press, New York. Williamson, O.E. (1991). Comparative Economic Organization: the analysis of discrete structural alternatives. Administrative Science Quarterly, 36 (2), pp269-296. 143 References Williamson, O.E. (1996). The mechanisms of governance, Oxford University Press, New York. Willis, G.B. (2005). Cognitive interviewing: a tool for improving questionnaire design. Chapter 13, Sage Publications, Thousand Oaks. Zimmerman, J.L. (2000). Accounting for Decision Making and Control. Third edition, MacGraw-Hill Companies, Inc., Singapore. Zimmerman, J.L. (2001). Conjectures regarding empirical managerial accounting research. Journal of accounting and economics, 32 (1-3), pp411-427. 144 Appendix A - Description of a management control system In this appendix I describe the management control system of an organizational unit of a recruitment agency. The agency searches job applicants for their customers, who are companies in need of personnel with a higher education. This description was used for a validity-feedback interview (see 3.3) as part of the pre-test of the questionnaire. I asked the manager to fill out the questionnaire. Next, solely based on his answers, I described the MCS of the unit and its effectiveness as well as characteristics of the unit’s main activity. Numerical scores on items were translated back into sentences. This description is based on the final version of the questionnaire and therefore illustrates the type of information as well as the amount of detail the questionnaire generates. In the questionnaire, I refer to organizational units as subunits and therefore this terminology is used below. Appendix B gives an overview of the exact questions posed. I Appendix A Description: management control system of a recruitment unit March 2006 Part one – the main activity of the subunit The main activity of this subunit is ‘recruitment’. This activity is programmable to a large extent. It is very easy to identify an understandable sequence of steps to be followed in carrying out the work. It is very easy as well to get an overview of actions to take at the start of a period. Moreover many of the tasks of employees are the same from day-to-day. It is quite difficult, however, to think of one best way of doing things. The activity is also somewhat predictable. Unpredictable variety in the workload occurs frequently, but exceptions in the work occur rarely. Day-to-day situations are somewhat different. It is quite easy to analyse the activities: the manager is sure almost all the time (>90%) about what the outcomes of work efforts are going to be. It is quite easy for him to assess the work of employees. Also for the supervisor it is quite easy to assess the manager’s as well as the employees’ work and to define specific goals for the subunit at the beginning of period. The outputs of the subunit can be expressed in numbers, but it is quite difficult to reach agreement on their quality. There is no goal ambiguity within the unit: unit goals are very clear and specific. Moreover, goals are very clear to everyone who works in the unit and very easy to explain to outsiders. It is neither easy nor difficult for a new employee to learn the ins and outs. Tools and equipment used are unique to a little extent compared to those used by other companies for similar activities, but the systems used are quite unique. Compared to employees of other companies that work on similar activities, skills and knowledge of subunit employees are unique to a little extent. Quite a bit coordination is needed between employees. This unit and other units within the company depend somewhat on one another. II Description of a management control system Part two – management control within the unit The control system described below pertains to 22 people. 2.1 Organizational structure (as regards unit employees) Within the unit coordination of work mainly takes place through standard operating procedures/work instructions and proposals made by employees. Sometimes general assignments and planning are used, whereas specific assignments are used little. Work-related discussions between the manager and the employees occur a couple of times a month, whereas discussions among employees take place more than once a day. Employees have quite a bit of freedom in deciding on how to do their job, and how to handle work exceptions. They also have a say over what happens on their job and take part in making decisions that affect them. However, employees have little freedom in deciding what they do on their job. Overall it can be said that employees have quite a bit work autonomy. 2.2 Use of standards for unit employees Performance targets are most important in guiding employees in their work (rated ‘very important’). Standard operating procedures/work instructions and behavioural guidelines are somewhat important in this respect. The budget also acts as a performance target. Prohibitive guidelines that explicitly specify what is prohibited or what should be avoided are not in use in this unit. The only performance targets used for employees’ jobs are revenue targets and these are used very much. There are no targets related to social or professional skills. Normally, targets are set at the beginning of a period and do not change much. Whereas market benchmarks are not used for employees, benchmarks related to internal peer group performance or to the performance of other units are used quite a bit for employees’ jobs. 2.3 Performance evaluation of unit employees The unit manager uses subjective judgements somewhat to assess employee performance. Employees’ contribution to long term subunit and long term company performance is considered to be quite important in performance evaluation. Employees have some information advantage compared to the unit manager (score 2.4 on a five point scale). They are significantly better informed about the impact of both internal and external factors on their activities. The have quite some information advantage as regards the type of activities they III Appendix A have undertaken and their (potential) performance. They have no advantage regarding internal processes or technical aspects of the work. In order for employees to obtain a positive evaluation achievement of performance targets is very important. Showing good social and professional skills is quite important. Compliance with work instructions or behavioural guidelines is less relevant in this respect. The same holds true for being evaluated negatively. 2.4 Rewards and incentives for unit employees Employees receive bonuses based on both their individual performance and on the performance of their subunit. Bonuses can make up 16–30% of their total financial reward. The only criterion for being granted a bonus (rated ‘very important’) is the achievement of performance targets. When performing well, increased responsibilities and recognition from both peers and superiors are important non-financial rewards. Apart from this, employees receive some increased autonomy. Achievement of performance targets and professional skills are very important for getting a non-financial reward. Showing good social skills is quite important, but compliance with instructions/behavioural guidelines not at all. For employees’ career prospects, again achievement of performance targets is most important. Skills are quite important and compliance with work instructions and other behavioural guidelines is of little importance for employees’ careers. There is little focus on the group within the unit (or much focus on the individual). Employees always compete over performance targets. Sometimes they make someone an outsider whose work is below that of the others or encourage each other. Part three - control of the unit manager 3.1 Organizational structure (as regards the manager) Of all time spent on managerial activities, the subunit manager spends most time on staffing, directing and motivating employees, as well as on interacting with other managers. Least time is spent on establishing strategies for the unit. Work related discussions with other managers or supervisors occur a couple of times a month. IV Description of a management control system Overall the manager has quite some work autonomy (average score: 4.4 on a 5 point scale). He has very much influence in investment, marketing, production, and human resource decisions as well as in setting the agenda for the unit. He has also very much influence in deciding what tasks are to be performed within the unit. He has quite a bit influence in setting targets for subunit performance and designing the reward system. However, he has no influence in deciding upon criteria for employee performance appraisal. 3.2 Use of standards for the unit manager Whereas for employees’ jobs one type of performance targets was in use, for the manager different types occur. The most important ones are profit, revenue, and cost targets. Apart from these, non-financial quantitative targets are used quite a bit. There are no qualitative targets or targets related to skills. Also for the manager the budget serves as a performance target. Normally targets are set at the beginning of a period and do not change much. Market benchmarks as well as benchmarks related to internal peer group performance or to the performance of other units are somewhat in use. Behavioural guidelines that specify explicitly what should be avoided or what is prohibited do not exist for the manager either. 3.3 Manager’s performance evaluation The unit supervisor uses subjective judgements quite a bit to assess the manager’s performance. The supervisor considers the manager’s contribution to both long term subunit and long term company performance to be quite important in performance evaluation. At the end of a period the manager has quite some information advantage compared to the supervisor (score 3.1 on a five item scale), especially regarding the impact of internal and external factors on his activities, and the type of activities undertaken in the unit. He is also better informed about the internal processes and about what has been achieved in the unit. He has no advantage regarding technical aspects of the work. In order for the manager to obtain a positive evaluation, achievement of performance targets is considered to be very important. Showing good skills is little important and compliance with behavioural guidelines and work instructions not at all. For a negative evaluation the picture is somewhat different: achievement of performance targets remains the most important aspect, but lack of professional skills and violation of rules/behavioural guidelines are quite important as well. Lack of social skills is somewhat important. V Appendix A 3.4 Reward structure for the manager The unit manager receives stocks/options/profit shares. These amount to 31-45% of his total financial reward. The only thing that is important to get rewarded is achievement of performance targets. For career prospects, however, professional and social skills are important too. Compliance with work instructions/behavioural guidelines is somewhat important for the manager’s career. Part four - effectiveness Overall the effectiveness of the subunit is more than satisfactory (average score 5.5 on a 1-7 scale). The quantity of work produced gets the highest possible score and is thus ‘very good’. Other aspects, for example satisfaction of external customers, employee satisfaction, and the problem-solving ability of the unit, are graded more than satisfactory or good. Satisfaction of internal customers and compliance with instructions/guidelines get the lowest scores and are rated satisfactory. The effectiveness of the management control system (MCS) of the unit is also assessed. The MCS is the whole of all aspects of organizational structure, use of standards, performance evaluation, and reward structure as described above. When assessing different aspects of the MCS we see that about half the aspects are rated more than satisfactory whereas the other half scores less than satisfactory. Accordingly, the overall score is satisfactory (4.3 on a 7 point scale). Items that get high scores are motivating employees and managers, perceived fairness of the system, amount of work autonomy employees and managers have, and use of financial rewards. Items that are rated less than satisfactory are the coordination of work, guiding employees and managers, use of non-financial rewards and behavioural guidelines, and monitoring. The contribution of the MCS to overall subunit performance is also perceived to be poor. This judgement is underlined when confronting strategic goals of the unit with the support of the MCS to achieve these. It can be concluded that although being flexible and efficient are both quite important, the MCS does not facilitate these aspects at all. The same holds true for being innovative and improving internal business processes. Both are somewhat important, but not supported by the MCS. VI Appendix B - Overview of survey questions This appendix holds an overview of all questions that underlie the analyses in this study. The questions pertain to management control systems and their effectiveness as well as to the main activity and effectiveness of organizational units. Questions are identified by their original number and ordered by construct/variable. For each I indicate the scale, the items, and the original source. When no source is stated, the question has been newly developed for this study. Occasionally, a remark on use is added as well. The questions are part of a larger questionnaire that addresses management control system design and effectiveness both at the level of the organizational unit and within organizational units. A copy of the complete questionnaire is to be found at http://www.nyenrode.nl/faculty/faculty.cfm?faculty_id=201. VII Appendix B General information 1) A subunit is defined as a manager and all employees reporting directly or indirectly to him/her. What is the name of your subunit? 2) Please state the industry that your subunit is in: 3) What is the appropriate description of your subunit? items: Division, Business unit, Production department, Marketing or sales, Research and development, Treasury, Finance & accounting, ICT, Human resources, Project unit/ team, Other support, Other 4) Please indicate the number of steps up the hierarchy from your position to the CEO. Example: in the picture this would be 3. note: a copy of the picture referred to is to be found in chapter three (3.2.2) 5) Please indicate the number of steps down the hierarchy of your unit from your position to the bottom level. Example: in the picture this would be 2. note: idem question 4 46) Including yourself, how many employees work for this subunit? Please recall that the subunit consists of you and all employees that report directly or indirectly to you. You may include people hired from other companies if they report to you. 47) Question 25 asked you to choose one activity as the main activity of your subunit. How many employees within your subunit work on the main activity? Activities of the organizational unit 24) The following questions are about what employees in your subunit actually do. We refer to this as the activities of your subunit. Examples of activities are R&D, human resources, or production of a specific good or service. Recall that your unit comprises you as a manager and all employees that report directly or indirectly to you. Your subunit might clearly have one main activity. It might also have more activities that are equally important. Please name the main activities of your subunit: 25) Which activity is the most important or the most characteristic one for your unit? Please choose one activity: VIII Overview of survey questions Management Control Systems 7) Are the following performance targets used for your job? scale: not at all, little, somewhat, quite a bit, very much items: Stock price related targets, Return targets (examples: ROI, EVA, shareholder value added), Profit targets (examples: net income, profit margin, operating profit), Revenue targets, Cost targets, Non-financial quantitative targets (examples: market share, output quota), Qualitative targets (examples: customer satisfaction, service quality) source: types of performance targets based on Bouwens and Van Lent (2004, p15) 8) Please indicate whether the following types of performance benchmarks are used for your job. scale: not at all, little, somewhat, quite a bit, very much items: a. Market benchmarks or benchmarks based on performance of external peer groups b. Benchmarks based on performance of internal peer groups or the performance of other subunits note: to learn about use of benchmarks more broadly this question does not suffice. It informs us only about whether or not benchmarks are being used as such. No information on how these are being used is obtained nor on their role in, for instance, performance evaluation. 9) Which of the following statements describes best how your performance targets come about? Please check one box. items: a. Performance targets are set at the beginning of a period and will not change much under normal circumstances b. Performance targets come about steadily over time/ evolve during a period 11) The subunit’s budget can play several roles. How important are the following roles of the budget? Please assign a number 1 to 3, where 1 implies most important. items: a. The budget serves as a performance target to be met b. The budget gives guidance about the way to go c. The budget sets limits to what can be done 12) The next two questions (12 & 13) are about the things your supervisor takes into consideration when evaluating your performance. Some things might be crucial for either a positive or a negative evaluation, other things might be less important. How important are the following things for you to get a positive evaluation? scale: not at all important, little important, somewhat important, quite important, very important items: Social skills, Professional skills, Achievement of performance targets, Budget versus actuals, Compliance with standard operating procedures/ work instructions, Compliance with other behavioural guidelines source: inspired by Bouwens and Van Lent (2004), Verbeeten (2005), and Van de Ven and Ferry (1980) IX Appendix B 13) How important are the following things for you to get a negative evaluation? scale: not at all important, little important, somewhat important, quite important, very important items: Lack of social skills, Lack of professional skills, Not achieving performance targets, Budget versus actuals, Violating standard operating procedures/ work instructions, Violating other behavioural guidelines source: idem question 12 14) How much does your supervisor use subjective judgements to evaluate your performance? scale: not at all, very little, somewhat, quite a bit, very much 15) When evaluating your performance, does your supervisor take into consideration your contribution to long term subunit and long term company performance? scale: not at all, little, somewhat, quite a bit, very much items: Long term subunit performance, Long term company performance 16) Two types of behavioural guidelines might be in use for your job. Guidelines that specify what should be done, and guidelines that specify what should be avoided or what is prohibited. We refer to the latter as prohibitive guidelines. Do such prohibitive guidelines apply to you? items: no (skip to 19), yes 17) (If yes) General prohibitive guidelines hold for all employees of the company. Examples are general ethical norms, or rules about private use of the Internet. Subunitspecific prohibitive guidelines are relevant to the performance of your subunit. Examples are authorisation levels and procedures, or policy restrictions. Do subunitspecific prohibitive guidelines apply to you? items: no (skip to 19), yes 18) (If yes) Please indicate whether you agree or disagree with the following statement: “Violating subunit-specific prohibitive guidelines will always have serious consequences for my career.” scale: disagree strongly, disagree somewhat, neutral, agree somewhat, agree strongly 19) Do you receive any financial rewards apart from a fixed salary? items: no (skip to 23), yes 21) What percentage of your financial reward is variable? items: 1 – 15%, 16 – 30%, 31 – 45%, 46 – 60%, more than 60% note: in my sample variation in the answers to this question were limited. For future use of this question, I recommend adjusting the answer categories to incorporate smaller ranges. X Overview of survey questions 22) When do you receive a variable financial reward? Please indicate the importance of the following things for this to happen. scale: not at all important, little important, somewhat important, quite important, very important items: Social skills, Professional skills, Achievement of performance targets, Budget versus actuals, Compliance with standard operating procedures/ work instructions, Compliance with other behavioural guidelines source: idem question 12 23) Please indicate the importance of the following things for your career prospects. scale: not at all important, little important, somewhat important, quite important, very important items: Social skills, Professional skills, Achievement of performance targets, Budget versus actuals, Compliance with standard operating procedures/ work instructions, Compliance with other behavioural guidelines source: idem question 12 42) Please also indicate for the following decisions how much say or influence you have. scale: none, little, some, quite a bit, very much items: a. Deciding what work or tasks are to be performed in your unit b. Setting standards/ targets for subunit performance c. Deciding upon criteria for performance appraisal of employees d. Designing the reward system e. Deciding upon standard operating procedures/ work instructions f. Deciding upon other behavioural guidelines source: items a-d; Van de Ven and Ferry, 1980, Q25 (p435), Q57 (p445), Q63 (p446), Q79 (p450); modified 56) During the past 3 months, how often did discussions related to the coordination of work (face-to-face, by telephone or e-mail) occur on a one-to-one basis? scale: once a month or less, couple of times a month, about once a week, every day, more than once a day item: C (one item used): between you and other managers or supervisors source: Van de Ven and Ferry, 1980, Q86 (p453); scale modified ASSET SPECIFICITY Physical asset specificity 29) How much of the equipment in your subunit is especially designed or unique compared to equipment used for similar activities in other companies? Examples of equipment are machinery, tools, or warehousing. scale: none, some, about half, most, all source: Coles and Hesterly, 1998; modified 30) How unique are the systems used in your subunit compared to systems used by other companies for similar activities? Examples are communication systems or software. scale: not at all unique, unique to a little extent, somewhat unique, quite unique, very unique XI Appendix B Interdependencies 35) To what extent do your subunit’s actions impact on work carried out in other subunits of your firm? scale: to no extent, little extent, some extent, great extent, very great extent source: Keating, 1997 36) To what extent do actions of other subunits of the firm impact on work carried out in your particular subunit? scale: to no extent, little extent, some extent, great extent, very great extent source: Keating, 1997 Asset specificity in general 39) Please respond to the following statement by ticking one box. “My company could go to other companies to buy the products/ services my subunit delivers.” (reverse scored) items: a. It is very difficult/ impossible to obtain these products or services from other companies b. Possible, but price and quality conditions would be worse (quality would be less and/ or the price higher) c. It would be no problem to obtain the goods/ services elsewhere at about the same price and quality d. Possible, price and quality conditions would be better (at a better quality and/ or lower price) 40) Please indicate whether you agree or disagree with the following statement. “Outsourcing the activities of this subunit would imply a major strategic change for the company.” scale: disagree strongly, disagree somewhat, neutral, agree somewhat, agree strongly note: in my sample this question is unable to discriminate between respondents: there is a lack of variety in the answers Human asset specificity 49) How easy is it for a new subunit employee to learn the ins and outs he/ she needs to know to be effective? scale: very easy, quite easy, neither easy nor difficult, quite difficult, very difficult source: Anderson and Schmittlein, 1984; modified 50) Consider a new employee who has experience in a similar job/ profession. How much extra training would that person need? Please estimate the number of weeks of training, including training on-the-job. source: Anderson and Schmittlein, 1984; modified 52) How unique are the skills and knowledge of subunit employees compared to skills and knowledge of employees of other companies who work on similar activities? scale: not at all unique, unique to a little extent, somewhat unique, quite unique, very unique XII Overview of survey questions UNCERTAINTY Programmability 26) At the start of a period, how easy is it to get an overview of actions to take in order to carry out successfully the subunit’s main activity? scale: very easy, quite easy, neither easy nor difficult, quite difficult, very difficult Goal ambiguity 31) How clearly defined are the goals of this subunit? (reverse scored) scale: very unclear, quite unclear, somewhat clear, quite clear, very clear source: Rainy, 1983 32) How specific are the goals of this subunit? scale: very general, mostly general, somewhat specific, quite specific, very specific 33) How easy is it to explain the goals of this subunit to outsiders? (reverse scored) scale: very easy, quite easy, neither easy nor difficult, quite difficult, very difficult source: Rainy, 1983 34) Please indicate whether you agree or disagree with the following statement. “The goals of this subunit are clear to almost everyone who works in the subunit.” (reverse scored) scale: disagree strongly, disagree somewhat, neutral, agree somewhat, agree strongly source: Rainy, 1983 Unpredictability 37) How often do the following things happen? scale: never, rarely, sometimes, frequently, always items: a. Unpredictably, the amount of work varies greatly over time. (example: peak demands) b. Exceptions arise in your subunit that require substantially different methods or procedures source: item B; Van de Ven and Ferry, 1980, Q6 (p433) 38) How much the same are day-to-day situations, problems or issues you encounter in your subunit? (reverse scored) scale: completely different, quite a bit different, somewhat different, mostly the same, very much the same source: Van de Ven and Ferry, 1980, Q3 (p432) XIII Appendix B Uncertainty in general 44) How easy is it for your supervisor to know whether you and your employees do the work correctly? scale: very easy, quite easy, neither easy nor difficult, quite difficult, very difficult source: Van de Ven and Ferry, 1980, Q4 (p433); modified 83) How easy is it to reach agreement with superiors when judging/ appraising the quality of subunit performance? scale: very easy, quite easy, neither easy nor difficult, quite difficult, very difficult Measurability of outputs 82) Can the outputs of the subunit be measured objectively and expressed in a number? Please keep in mind to focus on the main activity. scale: not at all, very little, somewhat, quite a bit, very much note: this measure as such picks up ‘measurability of outputs’; reverse scored is it a part of uncertainty Information asymmetry 45) Please compare the amount of information you have relative to your supervisor at the end of a period. If you have no advantage or a disadvantage, tick “no advantage”. scale: no advantage, some, quite some, significant, very significant advantage items: a. How much better informed are you about the type of activities that have been undertaken in your subunit? b. How much more familiar are you with the type of input output relations inherent in the internal operations of your subunit? c. How much more certain are you about whether the performance potential of your subunit has been realized? d. How much more familiar are you with the technical aspects of the work in your subunit? e. How much better are you able to assess the impact of internal factors on your activities? f. How much better are you able to assess the impact of external factors on your activities? g. How much better do you understand what has been achieved in your subunit? source: all items except item f: based on Dunk, 1993; modified XIV Overview of survey questions Effectiveness of the organizational unit 84) How would you rate the performance of your subunit on the following aspects? Please feel free to write “don’t know” or “not applicable” after an item when appropriate. scale: 1 (very poor), 2, 3, 4 (satisfactory), 5, 6, 7 (very good) items: a. Achievement of subunit goals b. Cooperation with other subunits c. Satisfaction of internal customers of your subunit d. Satisfaction of external customers of your subunit e. Employee satisfaction f. Compliance with standards and behavioural guidelines g. Problem-solving ability of the subunit h. The quantity or amount of work produced i. The quality or accuracy of work produced j. Overall subunit performance source: items a, h, i: Van de Ven and Ferry, 1980, Q93 (p455); items a, b, e, f, g: based on Hitt and Middlemist, 1979 and Hitt et al. 1983 Effectiveness of the management control system Note: see 6.1.2. This measure does not take into account differences between control structures regarding the importance of control goals. I have two suggestions to improve it: the first one is to attach weights to the goals. Apart from asking for an assessment of each item, an indication of its importance to the organizational unit might provide more precise measurement. That complicates the measure a lot though. Another suggestion is to tailor the measure to a specific situation (when studying a homogenous group of cases, for instance) or to determine from theory which items/goals are relevant for different situations. 85) The organizational structure, use of performance targets or behavioural guidelines, and the reward and performance evaluation system combined form the management control system. How would you rate the performance of the management control system of your subunit on the following aspects? scale: 1 (very poor), 2, 3, 4 (satisfactory), 5, 6, 7 (very good) items: a. Motivating employees b. Motivating managers c. Supporting managers in decision making d. Guiding employees’ behaviour e. Guiding managers’ behaviour f. Coordination of work g. Perceived fairness h. Contribution to overall subunit performance source: item h: Ferreira and Otley, 2005; modified 86) How would you rate the following aspects of your subunit? scale: 1 (very poor), 2, 3, 4 (satisfactory), 5, 6, 7 (very good) items: a. Amount of work autonomy employees have b. Amount of work autonomy managers have c. Use of financial rewards d. Use of non-financial rewards e. Performance evaluation f. Use of standards / (behavioural) guidelines g. Monitoring XV Samenvatting Dit proefschrift beschrijft de opzet, uitvoering en bevindingen van een empirische studie naar het ontwerp en de effectiviteit van management control systemen. Management control systemen bestaan uit een pakket van control instrumenten die gebruikt worden om het werk te coördineren en prikkels te geven door middel van belonen en straffen. Voorbeelden van control instrumenten zijn regels en procedures, budgetten, prestatiemaatstaven en bonussen. Om control systemen op te zetten, kunnen organisaties kiezen uit een variëteit aan instrumenten. Niet iedere mogelijke combinatie van control instrumenten zal echter effectief zijn. De werking van een bepaald instrument hangt namelijk af van de combinatie met de andere instrumenten. Er is onderlinge samenhang. Vele onderzoekers hebben zich gericht op individuele instrumenten van control, maar er is weinig empirisch werk dat control systemen in hun totaliteit bestudeert als pakket van control instrumenten. Om die reden is er nog weinig bekend over de verschijningsvormen van control systemen en nog minder over de effectiviteit ervan. Met deze studie draag ik bij aan de wetenschappelijke kennis door mij specifiek te richten op management control systemen in hun totaliteit en op hun relatieve effectiviteit. Mijn centrale onderzoeksvraag is: welke management control systemen zijn -gegeven de omstandigheden- effectief? De Transactiekosten Theorie van Management Control (Speklé, 2001a, 2001b, 2004) verschaft inzichten in deze materie en vormt de theoretische basis van dit proefschrift. De theorie claimt dat er vijf effectieve control vormen (ofwel archetypen) bestaan: arm’s length control, result oriented machine control, action oriented machine control, exploratory control, en boundary control. Elk van deze archetypen biedt een effectieve control oplossing voor bepaalde activiteiten, maar niet voor andere. Voorbeelden van activiteiten zijn de productie van een goed of dienst, human resources, of research en development. Activiteiten worden gekenmerkt door hun mate van onzekerheid, de specificiteit van gedane investeringen, en ex post informatie asymmetrie. De theorie specificeert bijvoorbeeld dat het archetype arm’s length control het meest effectief is voor de control van activiteiten met een lage onzekerheid en een lage mate van specificiteit. De theorie claimt dat een management control systeem van een organisatieonderdeel dat lijkt op zijn relevante archetype, effectiever zal zijn in de control van zijn activiteiten dan andere control systemen. Welk archetype relevant is, hangt af van de activiteiten van dat organisatieonderdeel. Ik toets deze claims empirisch voor 258 organisatieonderdelen waarvoor met behulp van een vragenlijst gegevens zijn verzameld. XVII Om de hypothesen te kunnen toetsen, moet ik voor elke waarneming eerst bepalen welk archetype relevant is. Daarom worden alle waarnemingen aan een van vijf subgroepen (een voor elk archetype) toegewezen op basis van de karakteristieken van hun activiteiten. De resultaten van een regressieanalyse tonen aan dat gelijkenis van het management control systeem met het archetype arm’s length control, de effectiviteit van dat control systeem positief beïnvloedt. Zoals door de theorie voorspeld, treedt dit effect alleen op bij activiteiten met een lage mate van onzekerheid en een lage mate van specificiteit. Voor de waarnemingen die in de groep result oriented machine control vallen, vind ik een nul resultaat: er is geen verband tussen de gelijkenis van een management control systeem met het archetype en de effectiviteit van het control systeem. Voor action oriented machine control, exploratory control en boundary control zijn de subgroepen te klein om een regressieanalyse uit te kunnen voeren. De bivariate analyses die ik in plaats daarvan heb uitgevoerd, leveren slechts indicatieve resultaten op die de hypothesen niet bevestigen. Andere resultaten zijn meer algemeen aan het onderwerp gerelateerd. In de eerste plaats vind ik een positief verband tussen de effectiviteit van het management control systeem en de effectiviteit van het organisatieonderdeel. De sterkte van deze relatie verschilt voor de subgroepen die ieder een van de archetypen representeren. Daarnaast vind ik bewijs voor een direct negatief effect van onzekerheid op de gepercipieerde effectiviteit van het control systeem. Het totale effect van onzekerheid is complex omdat de variabele het ontwerp van control systemen ook direct beïnvloedt. Met mijn studie lever ik ook een methodologische bijdrage. In dit proefschrift beschrijf ik alle stappen die nodig zijn voor het operationaliseren van de theorie en voor het in kaart brengen van management control systemen. Daarmee geef ik een voorbeeld van hoe een studie naar management control systemen in hun totaliteit en hun effectiviteit opgezet kan worden. Ik introduceer ook ‘validity-feedback’ interviews als een krachtig instrument om de kwaliteit van vragenlijsten te testen. Het instrument is algemeen bruikbaar en levert extra informatie over de kwaliteit van vragenlijsten boven informatie die andere pre-test methoden opleveren. Ten slotte, om het meten van de variabele onzekerheid te verbeteren heb ik een algemeen model opgezet met formatieve indicatoren (in plaats van reflectieve). Dit heeft een positief effect op de validiteit. Deze aanpak kan ook de toekomstige meting van de variabele ‘specificiteit van gedane investeringen’ verbeteren. XVIII Curriculum Vitae After graduating from the Stedelijk Gymnasium in Leiden, Anne-Marie Kruis (1977) studied both general and business economics at Erasmus University Rotterdam. She specialised in international economics and business. Anne-Marie has spent one semester studying in Münster, Germany during which time she grew attached to the people, the language, and the place, which she still visits regularly. After graduating in 2002, she decided to apply for a PhD position at Erasmus University where she started working on her PhD project one year later. She enjoyed her time at Erasmus very much, but decided after two and a half years to switch to Nyenrode Business University where she continued working on her thesis. She currently holds a postdoc position at Nyenrode Business University and works on several research projects. One of them involves an international cooperation with a colleague from Rice University in Houston, Texas. Anne-Marie’s research interests are in the field of management control systems and organizational design. She also has great interest in methodological problems. Although most her time is dedicated to scientific research, she enthusiastically teaches courses on management accounting and control as well as on statistics. XIX