Introduction
  Policy Problem
  Content of the instrument
  Costs
  Impact
  Comparison, conclusions 
  Resources for policy makers
 

 

Subject Assessments for Academic Quality in Denmark: A review of purposes, processes and outcomes

Bjørn Stensaker

NIFU 

 

Executive summary

With a dual purpose related to accountability and improvement, Denmark has had a system of external quality monitoring since 1992. Organised through an independent agency (EVA), systematic evaluation of study programs has been the dominant method for quality assurance for a number of years. By the use of data and observer triangulation, stability in procedures and through extensive dialogue, the evaluations have resulted in noticeable changes in teaching and learning, study program objectives, and triggered dialogue and reflection both within higher education institutions as well as between higher education and its stakeholders. In 1999, the agency responsible for the study program evaluations was made permanent by an act in parliament, and has since then expanded into evaluating primary and secondary education in addition to higher education. At present, the greatest challenge for the Danish national evaluation system is to adjust a well-functioning domestic system of study program evaluation to the emerging international trend related to accreditation and convergence in how quality should be assured in the international marketplace. 

Introduction 

Higher education in Denmark is mainly public and consists of about 110 institutions including traditional universities, vocationally oriented colleges, and more specialised higher education institutions (in art, agriculture etc.). The Ministry of Education approves all public higher education institutions. Private institutions may operate without governmental approval, but then run the risk that their students will not be eligible for the state student grant. The higher education system can be divided into a university sector and a college sector (a binary system). The university sector consists of 12 institutions, and the remaining institutions all belong to the college sector. The substantial number of (small) colleges triggered the government in the late 1990s to stimulate voluntary amalgamations in this sector (Gornitzka et al 2001: 16). This process is still ongoing and the first amalgamated “Centres for higher education” (CVU) have been established. 

The degree system has three levels: bachelor studies (3 years), master degree studies (5 years), and the PhD-degree (additional 3 years). However, within the college sector one can find study programs that deviate from this structure, and that rather could be described as “short cycle” (1-3 years), “medium-cycle” (3-4 years), and “long-cycle” programs (5-6 years) (Thune 2001: 3). As such, the degree system is rather complex with limitations related to the possibility of transferring credit points within the system, especially between the college and the university sector. Denmark has a system of external examiners which partly comprises teachers/professors from other institutions, and partly labour market representatives. The role of the external examiners is to assure that students are treated fairly and to assure an equivalent national level of assessment across schools and institutions (Kristoffersen 2003: 26). 

As in other OECD-countries, Denmark has during the last ten to fifteen years experienced a rapid increase in student numbers. In the recent years, the gross intake to higher education has been between fifty and sixty percent of the relevant age group (Thune 2001: 3). Most of these students enrol on long-cycle higher education programs. Higher education institutions are responsible for admissions, but admission requirements are set by the Ministry of Education. In some programs, for example in Medicine, the Ministry still sets the admission number. In general, student numbers in study programs vary according to student preferences and choice. There are no tuition fees in the public sector. 

The steering and funding of Danish higher education have changed considerably during the 1990s. The trend has been to delegate more responsibility from the Ministry of Education to higher education institutions. One may claim that the changes in the steering of the sector have stimulated the autonomy of the institutions, even though the power and autonomy of Danish universities have been historically quite strong. However, strategic behaviour and strong institutional leadership have not been a central characteristic of Danish universities. Hence, in 2000 the Ministry of Education launched what may be termed as “development contracts” between the Ministry and the individual institution. The purpose is to agree on more long-term objectives and targets (four year periods) and to enable the institutions to market themselves better. It is voluntary for the institutions to join in the contract arrangements, and so far no sanctions or rewards have been linked to these instruments. 

The changes in the steering of higher education have been followed by a change in the funding of higher education with more emphasis on lump-sum allocations and output measures (Gornitzka et al 2001: 19). This means that the higher education institutions can decide on how to allocate resources internally. The most important output measure (the “taximeter-system”) is a combination of different indicators related to student numbers, the cost of studies in different disciplines and subject fields, and the number of credit points and exams taken. Research is funded separately. Four streams of money comprise most of the research funding: a lump-sum from the Ministry, allocations from different domestic research councils, applied research programs, and some funds from the Danish fund for basic research (DGF). 

The described changes in the steering and funding of higher education in Denmark in the last ten to fifteen years have also had implications for how academic quality assurance is conducted. Traditionally the country had a decentralised system of quality assurance, which left quality assurance up to the individual institution, with the external examiner system as the key component. In 1992 the Ministry of Education established the Danish Centre for Quality Assurance and Evaluation of Higher Education (EVA ), and instructed the centre to conduct systematic evaluation of all study programs offered in higher education within a seven-year period. Hence this centre can be interpreted as a more centralised and independent actor in the field of academic quality assurance. Why the political authorities at that time perceived a need for systematic evaluation at the national level is discussed below. 

The policy problem 

In the spring of 1992 a majority of the parties in the Danish Parliament arrived at a number of compromises on higher education, which led in the following year to a reform of the entire educational system. The stated objectives of the reform were to ensure (Thune et al 1996: 21): 

•  a higher degree of institutional freedom and autonomy combined with a tightening of each institution's management structure, 

•  a better balance between supply of and demand for study places, 

•  the quality of the study programs according to international standards. 

The reform implied a new study structure (the bachelor/master/PhD-system), a new Act on universities, which reorganized the political and managerial governance of the institutions (reducing the number of democratically elected governing bodies and introducing external representation in the academic senate and in faculty boards), an introduction of the taximeter-principle (an output-based funding system), and the establishment of a national system for the evaluation of higher education (conducted by a newly established agency for conducting such evaluations [EVA]). 

When looking at the stated objectives of the reform, the background for the reform also comes to the fore. First, a huge increase in the number of students that applied for higher education. Second, Denmark faced at that time constraints on public spending, which triggered a focus on the efficiency and the effectiveness of higher education. Third, worries that an expansion of higher education could lead to a lowering of the academic quality. Fourth, the international commitments that Denmark had in relation to the European Union and their student exchange system (Erasmus). 

The establishment of a national system for evaluation and an independent agency for carrying out such evaluations are in various ways related to the drivers behind the reform. The establishment of the EVA agency could be described along a number of different perspectives: 

•  The creation of the system of study program evaluations could be interpreted as a governmental response to perceived needs for more efficiency and output-orientation in Danish higher education. The share of resources spent on higher education, due to the increasing number of students among other reasons, triggered a need to check how resources are spent and to identify “organisational slack” inside higher education institutions. The systematic evaluation of all study programs offered in Danish higher education can be seen as an indicator of such an orientation. 

•  At the same time, study program evaluations could also be seen as an attempt to balance the centralisation-decentralisation dilemma in Danish higher education. While major parts of the 1992 reform intended to give institutions more autonomy, the establishment of a national evaluation system could be interpreted as a form of centralisation attempt when it comes to the quality assurance. Thus, the evaluations in this perspective represent the need to maintain control even in a more decentralised system. 

•  Since the evaluations were established with a double purpose of accountability and improvement, it is also possible to see the establishment of the evaluation system in a more developmental perspective. The decentralisation of authority and responsibility to institutions meant that the institutional leadership had to take on a stronger and more strategic role. However, this is a role that breaks with the traditions of institutional leadership in Danish universities. The traditional power structure in higher education centred round the departments and disciplines (Gulddahl Rasmussen 1997, Foss-Hansen 1997). The national evaluation system could in this perspective be interpreted as being a “support structure” for the institutional leadership (see also Stensaker 1999: 257-258). 

•  The notions “knowledge society” and “knowledge economy” and the role of higher education and research in these developments have had a powerful influence in the political debate on higher education in the last two decades. One of the important elements of the knowledge society is that higher education needs to establish better links with the world of work (Rasmussen 1997). In the new evaluation system, these links are very visible. Not only are members of industry and society part of the review panels, but graduated students are also, after a few years at work, asked about the relevance of their study program in relation to their current job. In this perspective, the study program evaluations could be seen as an instrument for increasing the relevance of higher education for the society and the world of work. 

•  Finally, one could also interpret the establishment of study program evaluations as a form of political accountability. Not only higher education, but also those responsible for higher education at the political level need to be accountable to the larger society. Hence, the creation of a national system for evaluation, and an agency responsible for carrying out such tasks could be interpreted as being an important symbolic action, by which politicians can show the public that something “is done to assure quality”. One indication of this is that how evaluations should be followed up was almost a non-issue in Denmark in the first few years of the 1990s (Askling et al 1998: 9). Not the outcomes, but the fact that evaluations were conducted seemed, in other words, to be the important thing. 

To pinpoint the policy problem in accurate terms is, in other words, somewhat problematic. However, the five perspectives mentioned above cover most of the arguments related to the establishment of the national evaluations in 1992, and can be said to have represented a formidable challenge for the leadership of EVA to balance these various needs and expectations.

Content of the policy instrument

The mandate for EVA, provided by the Ministry, instructed the centre that future evaluations had to focus on the study program level, that both control (accountability) and institutional improvement had to be a part of any procedures launched, and that evaluations were not a voluntary activity for the institutions. However, the results of the evaluations were not linked to funding (Evalueringscenteret 1998: 16-17). The evaluation system was not created on a permanent basis, but was set up for an initial period of seven years, and on the condition that the system and EVA itself should be subject to an evaluation when deciding whether evaluations should become a permanent activity. The political focus on study programs can probably be related to the huge number of small higher education institutions in Denmark, and the fear that the institutions could not be trusted as assurors of quality (see also Thune et al 1996). Also, the systematisation meant that all study programs were treated equally – a particular feature in the Scandinavian culture (Smeby 1996). EVA was created as an independent body. (See Box 1 for EVA's legal and organizational framework.) The Ministry of Education was not to instruct the centre, but the National Educational Councils (NEC) (in humanities, science, social sciences etc.) were given the right to decide the chronological order of the evaluations, and thus could be seen as the bodies responsible for the initiation of a given evaluation. 

EVA's formal mandate was (Thune 2001: 7): 

•  to initiate systematic evaluation of all study programs in higher education in Denmark including the university as well as the non-university sector (within a six year period), 

•  to develop appropriate methods of the recognition of study programs making them eligible for governments funding and student loans, 

•  to inspire and guide the institutions of higher education in aspects concerning evaluation and quality, 

•  to compile national and international experience on evaluation of the educational system and quality development.

Given the many interests surrounding the establishment of the study program evaluations, it is perhaps not surprising that active dialogue and consultation with institutional and departmental leadership was and is chosen by EVA as a strategy for organising the individual evaluations more in detail. This process is “real” in the sense that EVA often adjusts the organisation and design of a given evaluation after this round of consultation. 

The system that was established in 1992 had the following procedures, which basically have been kept unchanged ever since (Thune 2001: 7-8). 

•  EVA conducts a preliminary study with the purpose of identifying relevant study programs to be included in a given evaluation (due to difficulties in knowing the content and profile of some study programs), and to establish possible criteria/objectives to be used as a mandate for the evaluation. The final selection of study programs in a given assessment is decided after the preliminary study. 

Based on the preliminary study and internal guidelines, EVA prepares the terms of reference. The terms of reference is a formal basis for an evaluation and they have to be approved by the Board before the evaluation process can be started. The terms of reference include the background and purpose of the evaluation, time schedule, the list of higher education unites involved in the evaluation, items to be included in the evaluation, the division of responsibilities between the evaluation group and EVA, the general framework for the evaluation, and the methods to be applied (EVA 2002). 

Traditionally, a fitness-for-purpose approach has been used in Denmark , emphasising the objectives of a given study program (EVA 2002: 14). This approach has been balanced by including national policy objectives in the mandate when relevant. Hence, each evaluation conducted has a specific mandate. In previously conducted programme evaluations the following items were included: 

      • the objectives of the programme, 
      • management, organisation and resources, 
      • structure of the programme, 
      • content of the programme, 
      • practical learning, 
      • methods of teaching and training, 
      • lecturers/professors, including pedagogical competencies, 
      • exams and evaluation of students, 
      • students entry levels and progression, 
      • internationalisation, 
      • relations to other institutions and society, 
      • quality assurance. 
The physical environment and learning resources have also been regularly included. (EVA 2002:14) 

When an institution is selected for a review, it is informed of the evaluation process in writing. It is mandatory that EVA informs the institution on the legal basis for the evaluation, including the rights and obligations of the institution, the purpose of the evaluation, the terms of reference, the members of the evaluation group and expectations to the institution's own contribution to the process (EVA 2002).

•  The unit responsible for offering a given study program (usually a department) then writes a self-evaluation report based on a rather detailed protocol provided by EVA. This protocol usually instructs the department to describe the objectives of the program, the management and organisation surrounding the program, content, methods of teaching and learning, and to include quantitative information concerning the number of academic staff, applications, drop-outs, completion rates, etc. for the last three years. The purpose of the self-evaluation is both to provide the external expert committee with background information and to stimulate ´development´ in the department. The department is free to choose the organisation of the self-evaluation process, but is advised to include academic staff, administration and students in the process.

•  In parallel to the self-evaluation, a comprehensive survey on the quality of the programs is often conducted among various users, i.e. students, graduates, and employers (and sometimes external examiners within the subject field). These surveys are outsourced by EVA to private consultancies, poll firms, etc. The purpose of the survey is to provide alternative views and perspectives on the subject field and on each particular study program, and to indicate the relevance of the study program to important stakeholders. Due to the fact that these surveys are somewhat expensive, only one group of users are focused in each evaluation. The selection is made by EVA. Due to the fact that these surveys are conducted in parallel with the self-evaluation, a given department cannot use the results in their self-evaluation. The timing of the process is usually such that the self-evaluation and the surveys are finished simultaneously for the use of the external expert committee.

•  The next phase in the evaluation process is the visit to the department by the external expert committee . Before the visit, the committee meets to discuss the information available, to find areas for investigation in addition to, or supplementing, the check-list used for the self-evaluation, and to plan the visit. 

The committee is selected and chosen by EVA, and usually consists of three to five persons with extensive knowledge of the subject field and/or expertise in university governance and management. Committee members must be independent of the institutions involved in the evaluation, which is ensured by a formal statement from potential experts. The potential expert must state whether he/she has been employed, been invited to give lectures or in any other way associated with the programme under review, as well as whether his/her child, spouse or near friend has studied or been employed by the program (EVA 2002). Due to the fact that Denmark is a small country, where it is difficult to find “independent” experts, and that people from Denmark , Sweden and Norway can understand each other's language, experts are often recruited from the latter two countries. Even if these experts could be regarded as “peers”, a committee usually also includes a member (sometimes two) from outside of higher education. Typically, this representative is from business, industry or a public organisation. The expert committee must be approved by the EVA's board. 

A visit to a department usually takes place over a two-day period where representatives of the leadership, academic staff, administration and students are interviewed. Usually, but dependent on the number of study programs evaluated, the same committee undertakes all the visits. After the site visit, the committee meets two or three times in order to discuss and finalise the evaluation report. 

EVA provides the committee with a secretary with the responsibility to organise the evaluation and to write the draft version of the external report. For each evaluation, EVA appoints an internal project team of two evaluation officers and one assistant, which provides secretarial help to the external committee and ensures that the evaluation is conducted as specified in both the terms of reference and EVA's formal regulation. The project team is involved throughout the process: from conducting the preliminary study, and preparing the program and interview guides for the site visit, up to drafting the report to be discussed and approved by the evaluation group (EVA 2002).

•  After the visit, and after several meetings in the external expert committee, an external report is written presenting an overall analysis of the quality of the program field at the national level as well as individual analyses of all study programs included in the evaluation. A draft version of the report is then sent to all departments/institutions involved in a given evaluation, and a closed conference is held where only representatives of EVA, the external expert committee and the departments/institutions participate. The purpose of the conference is both to prepare the departments for the coming conclusions, but also to provide an opportunity to adjust the report for any misperceptions or errors. After the conference, the final report is printed and sent to every participating department/institution, the Ministry of Education, and the relevant NEC. The self-evaluation report, the survey results and the external report are open and accessible to the public. The NEC has the responsibility to follow-up the report, for example, by checking how departments/institutions took actions based on the recommendations given in the report. It is important to note that the departments/institutions are not mandated to follow the recommendations, but will be asked questions related to what actions have been taken on the basis of the report.

The typical evaluation conducted by EVA includes up to ten study programs (occasionally even more) within a given subject field, with a self-evaluation report made for each study program/department, and with an external visit to every study program. After the release of the final report, Danish newspapers have a tradition for taking an interest in the results, sometimes creating a sparkling public debate. 

Implementation

The implementation process in the first years was not without some turbulence. When the study program evaluations and EVA were proposed by the government, the Rectors conference, the umbrella body of Danish universities, was invited to participate in the development and design of the new centre. The Rectors conference, however, turned down the invitation from the Ministry. The argument used was that the Rectors conference preferred external evaluations, if necessary, to be carried out by the Ministry itself, and not by an independent body (Evalueringscenteret 1998: 16). This scepticism from the universities' side can be related to both fear of loosening up established ties between the Ministry and the institutions (the established power structure), but also to protest since the institutions had to cover part of the expenses related to the evaluations themselves (the self-evaluation). 

Due to the initial scepticism towards what the evaluations would bring, and the consequences of the evaluations, the procedures surrounding the evaluations were delicately designed (Askling et al 1998: 11). Because of the fear of being seen as just a cover for ministerial and political agendas, or to be perceived as “soft” towards the institutions, the procedures surrounding the evaluations had a focus on methods, systematisation and standardisation (almost unprecedented for quality assurance agencies) as the way to gain legitimacy and respect (Askling et al 1998: 12). Using EVA's own words: “the method is developed with the aim of uncovering the quality of a study program through a concrete, transparent and trustworthy process” (Evalueringscenteret 1998: 25, authors translation). The fact that the mandate for any given evaluation is developed after a pre-study of the subject field, that rather detailed instructions are provided for the self-evaluation process (the protocol), that data is collected from current and graduated students and employers, and the arrangement of a conference with the involved parties before publication checking for any errors or potential overlooked problems, are all indications that the evaluation process is designed to be as robust and solid as possible. Since private consultancy/poll firms were hired to collect and analyse the data from graduated students and employers, not only “methodological” triangulation was obtained (the use of different methods to shed light on a phenomenon), but also “observer” triangulation (the use of different actors to observe the phenomenon). Not surprisingly, all this information sometimes created very lengthy evaluation reports (up to 200 pages, see Thune 2001: 8). 

Over the years, the evaluation procedure has been incrementally changed and developed. Some of the elements of the evaluation procedure have also evoked debate which sometimes has triggered adjustments. Issues that have been raised are, first, related to the protocol for the self-evaluation. In the first years, this protocol was very detailed with the potential effect that it was perceived as less relevant for initiating more developmental processes at the department level (Askling et al 1998: 16). Thus, over the years the protocol has been revised with the purpose of providing a broader framework for the self-evaluation instead of being a “questionnaire” to be answered. More open, reflective questions to be answered have been included in the protocol (especially related to how departments have established routines and systems for quality assurance). 

Second, a debate has focused on the ability of a given evaluation to relate to the needs of the individual study program when the evaluation covers up to and sometimes even more than ten programs at the time. Stensaker (1999: 259) has, for example, documented how the departmental/institutional perceived benefit of an evaluation drops when more than ten study programs are evaluated at the same time. This study shows that the perceived benefit related to the self-evaluation process is constant independent of the number of study program participating in an evaluation, but the benefit related to the external panel visit and external report is perceived as smaller the more study programs are included. An explanation is probably related to a capacity problem in the external expert committee. A huge number of participating study programs results in more general recommendations from the committee, and the individual study program is not considered in the same way as when a given evaluation only covers a few study programs. 

Third, the user surveys have also been criticised over the years. One line of criticism has been directed at the (lack of) competence of private consultancies for designing and analysing useful surveys (Evalueringscenteret 1998: 37). Another argument has been related to the timing of the user surveys in the overall evaluation process. Stensaker (1999: 263) has, for example, argued that due to the fact that self-evaluation and user surveys are conducted simultaneously, the departments cannot use the information from the surveys in their own self-evaluation. As such, it could be argued that these surveys have been more related to external (the external expert committee) than internal (departmental) needs. Since it is EVA in cooperation with the given consultancy/poll firm that decides the content of the survey, departments/institutions also miss a chance to put “their” issues on the agenda. 

A fourth issue is related to the fact that a number of departments/institutions have been somewhat dissatisfied with how the closed conference works, and how the results of the evaluations have been commented upon in newspapers after the launching of the final report. The time for debate within the conference is limited and therefore the problems/potentials of each study program have not been fully addressed. In addition, many of the participating departments are also potential “competitors” in the Danish higher education market and have a more reserved attitude towards openly discussing problems and solutions (Evalueringscenteret 1998: 44, Askling et al 1998: 21). The fact that Danish newspapers, after the launching of the final report, tend to focus upon findings that give departments bad publicity, has also been raised as an issue, but the departments have not suggested shielding the external reports from public scrutiny. 

Despite some debates and criticisms, the systematic evaluations were continued according to the plan. By 1999, EVA had fulfilled its mission and produced 62 evaluation reports, conducted way over a hundred user surveys of graduated students, employers, etc., and involved approx. 200 experts in the evaluation processes (Evalueringscenteret 1998: 30). The question to be asked is, of course, related to the impact of all this. 

Impact

When the initial period was over, the Ministry of Education initiated an external evaluation of EVA. This process was conducted much in the same way as an ordinary study program evaluation. Hence, not only did EVA have to write a self-evaluation report describing and analysing the previous years, but also an external review of the methods, procedures and roles was conducted by experts from other Scandinavian countries (Askling et al 1998). A user-survey was also conducted, where a private consultancy firm asked representatives of the students, the higher education institutions (rectors, deans, etc.) and other stakeholders about their views on EVA's activities (PLS-Consult 1998). This section is based on this evaluation process, on a separate article where some of the data collected by PLS-Consult was re-analysed (Stensaker 1999), but also on observations from an independent study conducted by an American researcher in the field (Massy 1999). 

Concerning methodology, trustworthiness and relevance of the study program assessments, Askling et al (1998: 4-6, 26) stated that the conducted evaluations had gained legitimacy, and that accountability and improvement actually were balanced, even if the improvement dimension could have been highlighted more. The rectors of a number of universities have reached similar conclusions (PLS-Consult 1998: 14). The high degree of systematisation, the stringent routines associated with each evaluation, and the various sources of data used to evaluate study programs were mentioned as important factors leading to this conclusion. The fact that the evaluation system created in Denmark did not integrate any performance indicator system into the evaluation is probably a factor leading to a positive attitude from the higher education sector. 

The critique from Askling et al (1998: 25) was, therefore, more directed at what they saw was the weak point – the follow-up of the evaluations. Due to the “arms-length” steering principle, the Ministry only checked whether institutions had launched any actions after completion, and EVA had no responsibility for what happened after the publication of the evaluation report (Smeby & Stensaker 1999: 6). Thus, follow-up was a responsibility of the institutions themselves, but the external reinforcement for making sure actions were implemented was not great. 

On this background, one could expect that the impact of the evaluations at the higher education institutions was limited. A survey of deans and department heads/other leaders at the department level, and a smaller number of interviews with rectors of various higher education institutions gives a rather different picture. Not least, a general attitude was that the evaluation processes had been a positive experience, and had created much discussion and dialogue inside the institutions (PLS-Consult 1998, Stensaker 1999, Massy 1999). 

When asked to pinpoint the most beneficial element in the evaluation process, the self-evaluation process was undoubtedly most often mentioned (Stensaker 1999: 259-260). Many respondents also had a very positive view of the visit from the expert panel (PLS-Consult 1998: 15). The perceived benefits of the user-surveys were somewhat mixed, where feedback from students in general was seen as most beneficial (Stensaker 1999: 261). A negative comment mentioned by all respondents was that the external evaluations were time-consuming and rather expensive processes (PLS-Consult 1998: 14). 

Concerning follow-up, slightly over 60 percent of department heads and other leaders at the department level claimed that recommendations had been followed up to a great extent, and that a huge majority of the respondents saw the recommendations given as good advices for improvement of the study programs (PLS-Consult 1998: 16). If one studies the areas where changes were most visible, the curriculum structure, examination, teaching methods, and the objectives of the study programs were the most often mentioned areas. Many respondents also claimed that the evaluations in general had triggered decisions and speeded up existing change processes in the evaluated departments (PLS-Consult 1998: 18). 

In a broader, institutional, perspective, the assessments of study programs have had less effect. For example, for a number of years the protocol for departmental self-evaluation did not contain questions related to whether departmental systems for assuring and improving quality had been established. Hence, the assessments of the study programs contributed in this period little to the establishment of institutional routines for the systematic maintenance and development of the quality of teaching and learning (see, for example, PLS-Consult 1998: 17). Furthermore, since the protocol for the self-evaluations were fully developed by EVA, the departments participating in the evaluation often experienced less ownership, and hence less motivation for going into the evaluation process with an improvement orientation (Evalueringscenteret 1998: 34). A last point to be made is that study program evaluations tend to de-couple the institutional leadership from the evaluation process. Even if the rectors often are drawn into the evaluation process, for example, by being interviewed by the external expert committee, one can detect a feeling that the institutional leadership perceive the study program evaluations as less relevant for them. For example, a majority of the rectors seemed to prefer an evaluation model where research and education were integrated in an evaluation (PLS-Consult 1998: 15). Another example is related to the fact that many rectors perceived that the costs associated with the evaluations outweighed the benefits (PLS-Consult 1998: 15). This perception changes, however, when those closer to the study program were asked about the perceived benefit. Almost 80 percent of the study program managers answered that they perceived the evaluation as relevant, providing them with valuable recommendations. In other words, greater distance to the evaluation creates less perceived benefit. 

It is perhaps due to this de-coupling of the institutional leadership from the evaluations that Massy (1999: 30), analysing the national evaluation systems in Sweden and Denmark , stated that such systems “need not be an exercise in power and control”. (One should, however, also bear in mind that the traditional weak role of the institutional leadership in the Scandinavian countries could impact substantially on the conclusions drawn by Massy.) He maintains that one of the success criteria for these systems is that issues of power and control are not allowed to dominate the agenda. Perhaps as an effect of that, Massy (1998: 31) acknowledges that the national evaluation system has not transformed higher education. But, they have managed to “start universities and departments on the road to becoming learning organizations (…), to become self-conscious about the processes of teaching and learning and how to improve them”. 

Costs

Costs have – on the national level - traditionally been a non-issue related to the evaluation system in Denmark . For example, the question of whether the country got “value-for-money” when a new system was designed in 1999 was totally ignored. The necessity to maintain a national system of evaluation and to expand it to all levels was dominating the agenda. However, as mentioned earlier, at the institutional/department level, complaints have been launched that participating in the study program evaluations is time consuming and economically burdensome since departments have to pay for the self-evaluation themselves. Finding resources for freeing staff to be included in the self-evaluation process has been perceived as somewhat difficult. 

The costs related to funding and running EVA is decided by the Ministry of Education. EVA's latest annual budget (2004) sums up to 43.6 mill. DKK (approx. 6 mill. US dollars). This includes the costs for evaluations in the primary school sector (both the primary and secondary level of education), but so far only a few pilot-projects have been launched in this area resulting in very limited expenditures. 

The majority of the budget is covered by an appropriation from the Ministry of Education (approx. 40 mill. DKK). The rest of the budget, about 4 mill. DKK, stems from other external sources. This budget covers the salary of 48 full-time employees at EVA, and ten to twelve student assistants working part-time. EVA estimates that the costs of a typical study program evaluation (dependent on the number of study programs included) vary between 250.000 and 500.000 DKK (approx. 33.000 – 66.000 US dollars). Most of these resources are tied up in travel expenses and honoraria for the external expert committee. A typical honorarium for an external expert, depending on the number of study programs included in the evaluation, varies between 12.000 and 25.000 DKK (1.400 – 2.800 US dollars). In addition, the costs of user surveys vary between 50.000 and 200.000 DKK (approx. 6.500 – 27.000 US dollars). As mentioned before, private consultancies are often the contractors and conductors of these surveys. In the first years, some evaluations became more expensive than budgeted and some were even substantially delayed according to their time schedule, but this problem has in later years been solved with more rigorous steering of each evaluation project (Evalueringscenteret 1998: 24). 

Comparisons, conclusions and future action

After the evaluation of EVA and the evaluation system in 1998, the Ministry of Education argued that systematic study program evaluation on the national level should continue, and that EVA had to be a permanent centre (UFK 1998: 33-36). However, the Ministry suggested that activities should be expanded, both concerning scope and practices. Hence, compared to the past, several new elements are visible in the current evaluation system in Denmark . 

  • Not only study programs in higher education, but education given at all levels in the Danish system are at present object for evaluation. The background for this expansion in scope was that Denmark experienced some (relatively) low scores in an international survey testing the knowledge level at certain ages in primary and secondary school (the PISA-survey), and that, amongst other things, more systematic evaluations were needed as a means to improve quality (Kristoffersen 2003: 26). 
  • Concerning methods, systematic evaluations of study programs in higher education are continued, but new forms of evaluations, including thematic evaluations, evaluations of institutions, system evaluations, and audits (evaluation of quality assurance systems) are also to be conducted (at all levels) (Kristoffersen 2003: 27). 
The study program evaluations carried out over the previous years are still recognisable in the new version. This has probably a lot to do with the purpose of the evaluations, which still maintain the duality of accountability and improvement, and that the established procedures for conducting study program evaluations still are used. In a recent impact-study of some of the evaluations conducted by the new body, findings suggest that the study program evaluations continue to have an impact on Danish higher education, even in “round two” (Stensaker 2004: 38). The respondents do still have a positive view of the evaluations, approx. 60 percent of the recommendations are followed up, and a typical effect is that dialogue, discussions and reflections at the institutions increase afterwards (EVA 2004). The result is interesting in that counter to the predictions of researchers in the field, there are no signs of a diminishing impact of the national evaluation system as the institutions learn the “tricks of the trade” the second time around (Jeliazkova & Westerheijden 2001: 1). A possible explanation is that evaluations that stimulate dialogue, discussions and reflection may not be so vulnerable for that sort of institutional “strategic behaviour”. As neatly said by a US-observer: “The strength of the Scandinavian evaluation philosophy lies in its strategy of creating discourse rather that laying down regulations” (Massy 1999: 29). 

However, the type of discourse that Scandinavian evaluations create, varies somewhat between the countries. In Sweden, where institutional audits (evaluations of institutional work to secure and improve quality) for a number of years were the dominant evaluation type on the national level, the discourse (measured in the number of recommendations made) has traditionally centred around management and strategy issues (Stensaker 2000). Those experiencing the effects of these audits perceived the problem that the audit did not address quality issues on the “shop floor”. Whether the quality of a given study program or department is good or bad, in other words, can not be detected by using this method. Contrary to this, the Danish evaluations address quality issues related to the teaching and learning process more directly. Most recommendations in the study program evaluations in Denmark have addressed issues related to teaching, curriculum and pedagogy, and organisation, management and strategy issues relating to quality have been less emphasised (Evalueringscenteret 1998: 26). 

That being said, it is important to remember that even if the focus of the Danish evaluation system is on study programs, it is still a “fitness-for-purpose” process and not directed at securing academic standards per se. The study programs in Denmark are not, as has been the case in the UK , checked against predefined academic criteria developed for every subject area, and neither have the same disciplinary orientation as the program assessments that have been carried out by the VSNU in the Netherlands (Brennan and Shah 2000: 64). A reason is probably the existence of the external examiner system in Denmark . In this way, one may argue that Denmark actually has two parallel systems for academic quality assurance on the national level. That these two systems in the past have been poorly linked has surprised external observers (see Dill 2002). 

Dill (2002: 28) has also noted that the Danish study program evaluations have only to a limited extent focussed on assuring the quality of new study programs established in Danish higher education. As such, the Danish study program evaluations have been more oriented towards ´general´ accountability in a system characterised by a relatively high degree of stability than targeted at quality assurance in a fast moving, more competitive and open higher education market. For example, the study program evaluations have been difficult to utilise as ´consumer guidance´ for students. It can also be questioned whether they actually address political needs for information about higher education (Rasmussen 1997b: 257). The ´accountability´ generated by the evaluations have, in other words, been directed more to ´the environment´ rather than to a specific interest group. 

On this background, one could ask whether the absence of “revolutionary change” in the Scandinavian higher education systems during the last ten to fifteen years ( Finland excepted) may be one of the factors for the “success” of the Danish (and Scandinavian) evaluation approach. According to Massy, it is the “soft pedalling of the accountability agenda” that has provided the Scandinavian evaluations with high degree of legitimacy (Massy 1999: 33). Is it the absence of the need for radical political change of the higher education system that has provided a basic condition for the “soft pedalling”? Along the same line, it should also be noted that the Ministry of Education in Denmark has managed to follow up its own strategy and intention concerning the evaluation system, and that the legitimacy of the evaluation system probably also can be related to the fact that the Ministry have not interfered in the evaluation process. In other words, the arms-length strategy has been allowed to work according to the intentions. 

Here, a contrast can be made to the Netherlands where the Inspectorate of Higher Education was to take action if a program experienced a poor assessment by the VSNU. This inspectorate, which reported directly to the Dutch Ministry of Education, could then instigate its own investigation potentially resulting in a deletion of the program from the list of recognised (funded) programs. Even if programs were not closed as a result of this procedure during the 1990s (Brennan and Shah 2000: 63-64), it is an indication of a policy design more oriented towards control than dialogue. 

A comparison can also be made to the UK subject assessment system where the design of the subject assessments in the 1990s was more oriented towards grading different programs (accountability/control) than initiating improvement processes inside the institutions. According to Brennan and Shah, this relates directly to the “assessment methodology, in particular the summative judgement on a three-point scale” (Brennan & Shah 2000: 92). Typical effects of this methodology was the assignment and redistribution of status and reputation of programs, while a more typical effect of the Danish assessment methodology is that of changing institutional cultures and creating positive attitudes towards teaching and learning. Even if there were national funding schemes available in the UK for starting up quality development projects in teaching and learning at the institutions, the access to these resources was again dependent on a good assessment result (Brennan & Shah 2000: 94). Hence, to become better you should be rather good in the first place. 

In retrospect, one may argue that the many policy problems highlighted in section two continue to exist in the new system in Denmark . Worries about efficiency, centralisation, control, development, relevance and how to deal with these problems are as present in Danish higher education today as ten years ago. The main difference is that it seems easier to articulate views on these issues today, and that the different actors trust more the motives and engagement of each other. The unfortunate accountability-improvement debate that in many countries, like the UK , dominated the policy agenda and consequently lowered the trust in the evaluation system was, after the first years, resolved through practice in Denmark (Askling et al 1998) and has not appeared since. Thus, Denmark is a good case for illustrating that external evaluation can balance accountability and improvement (see also Stensaker 2003: 157). 

Seen in relation to the development in other countries, there has recently been an increased focus on international comparisons and on international developments within the field of quality assurance in Denmark . Accreditation is, for example, one of the new elements introduced in Europe in recent years (Faber & Huisman 2003), and EVA is currently working along a number of ways to “internationalise” their activities even further. Experimenting with audits of entire universities and colleges is one option currently pursued, yet another is to establish a procedure for mutual recognition of national quality assurance agencies (EVA 2002, Lindeberg & Kristoffersen 2002, Dill 2002). At present, there is in Europe a political drive towards more convergence of the systems of higher education (due to the Bologna-declaration), and consequently, more convergence of quality assurance systems as well. For the current Danish focus on study program evaluation, this seems to be the greatest challenge for the future. The problem is according to Thune (2001: 17), how a well-functioning (domestic) and established system of (“fitness-for-purpose”) study program evaluations can be translated or reinterpreted in the light of the (international) trend towards accreditation. How Denmark solves this problem remains to be seen. 

Resources for policymakers

a) Live Internet Links 

Information about the assessment of study programs in Denmar or the national evaluation system is available on the web-pages of the Danish Ministry of Education or the Danish Evaluation Institute: 

•  Ministry of Education: http://www.uvm.dk/ 

•  Danish Evaluation Institute: http://www.eva.dk/ 

•  The Danish educational system: http://www.eurydice.org/Eurybase/Application/frameset.asp?country=DK&language=EN 

•  The official window to Denmark http://www.denmark.dk/servlet/page?_pageid=85&_dad=portal30&_schema=PORTAL30&_fsiteid=175&_fid=12424 
 
 

b) Technical documents 

•  Evaluation report in Agricultural Sciences (includes details about the methodology, style and process) ( pdf – 2,221KB

•  International Comparative Evaluation: Methodological experiences and future perspectives. ( pdf – 39KB

•  The EVA 2004 Action plan ( pdf – 302KB

•  Mutual Recognition: The Danish report ( pdf – 88KB

•  Educational Evaluation Around the World. An international anthology . See the chapter “ Denmark ” by D. Kristoffersen for a good overview of the current procedures, objectives and values related to EVA's activities. ( pdf – 903 KB

•  Quality Assurance in the Nordic Higher Education: accreditation-like practices. ENQUA occasional paper 2 ( pdf – 454KB

•  Follow-up on the Bologna-declaration – the establishment of a European Quality Assurance System (describes among other things the ENQA-network, a collaboration of European Quality Assurance Agencies) www.eva.dk -> publications 

•  The Ministry of Education action plan “Better Education” ( pdf– 443KB )

c) Studies 

Several of the studies in the reference list provide more information on the Danish evaluation system, see for example: 

- Smeby & Stensaker (1999) 

- Stensaker (1999) 

- Massy (1999) 
 
 

d) The new Danish evaluation system 

The current white paper from 1999 (“Uddannelse og Fællesskab”) describing the background for and the elements in the new Danish evaluation system can be obtained from: http://pub.uvm.dk/1999/uof/index.html (in Danish) 
 
 

References

Askling, B., Nordskov Nielsen, L., & Stensaker, B. (1998) Mellom fag og politikk. En granskning av Evalueringscenteret og det danske evalueringssystemet for høyere utdanning . Undervisningsministeriet, København.

Dill, D.D. (2002) Through the looking glass – Comments on the Nordic Pilot Project on Mutual Recognition. In Lindeberg, T. & Kristoffersen, D. (eds.) A Method for Mutual Recognition. Experiences with a method for mutual recognition of quality assurance agencies . ENQA Occasional Papers 4. Helsinki .

EVA (2002) Mutual Recognition. The Danish report . The Danish Evaluation Institute. Copenhagen .

EVA (2003) Educational evaluation around the world. An international anthology . The Danish Evaluation Institute. Copenhagen .

EVA (2004) Effektundersøgelse. Redegørelse . Danmarks evalueringsinstitut. København.

Evalueringscenteret (1998) Selvevalueringsrapport . København.

Faber, M. & Huisman, J. (2003) Same voyage, different routes? The course of the Netherlands and Denmark to a “European model” of quality assurance. Quality in Higher Education , 9(3), pp. 231-241.

Foss-Hansen, H. (1997) Universitetssektoren på vej mod år 2000. Den splittede omverden og behovet for helhed. I Rasmussen, P. (1997) Universitetstradisjonen i krise. I Rasmussen, P. & Jakobsen, A. (red.) Universiteter i dag. Politikk – kultur – ledelse . Samfundslitteratur. Fredriksberg.

Gornitzka, Å., Maassen, P. & Dahl Norgård, J. (2001) Nasjonal prioritering og arbeidsdeling i høyere utdanning – internasjonale erfaringer . NIFU skriftserie 9/2001. Oslo .

Gulddahl Rasmussen, J. (1997) Styring av forskning og undervisning på universiteter. Forandringer i de valgte mellomledernes funktioner og placering. I Rasmussen, P. (1997) Universitetstradisjonen i krise. I Rasmussen, P. & Jakobsen, A. (red.) Universiteter i dag. Politikk – kultur – ledelse . Samfundslitteratur. Fredriksberg.

Jeliazkova, M. & Westerheijden, D.F. (2001) A next generation of quality assurance models. On phases, levels and circles in policy development . Paper given at the CHER-conference in Dijon , 2-4 September. 

Kristoffersen, D. (2003) Denmark . In Educational Evaluation Around the World. An international anthology . The Danish Evaluation Institute. Copenhagen . 

Lindeberg, T. & Riemann, S. (2002) Systematic evaluation in an international context: A small country perspective . Paper presented at the 24 th EAIR-Forum, Prague .

Lindeberg, T. & Kristoffersen, D. (eds.) (2002) A Method for Mutual Recognition. Experiences with a method for mutual recognition of quality assurance agencies . ENQA Occasional Papers 4. Helsinki .

Maassen, P. A. M. & Stensaker, B. (2003) Interpretations of self-regulations. The changing state-higher education relationship in Europe . In Begg, R. (ed) The dialogue between higher education research and practice. Kluwer academic publishers. Dorndrecht.

Massy, W.F. (1999) Energizing Quality Work: Higher Education Quality Evaluation in Sweden and Denmark . National Center for postsecondary Improvement, Stanford University , Stanford.

PLS-Consult (1998) Undersøkelse af effekterne af uddan nelsesevalueringer . København.

Rasmussen, P. (1997) Universitetstradisjonen i krise. I Rasmussen, P. & Jakobsen, A. (red.) Universiteter i dag. Politikk – kultur – ledelse . Samfundslitteratur. Fredriksberg.

Rasmussen, P. (1997b) A Danish approach to quality in higher education. In Brennan, J., de Vries, P., & Williams, R. (eds.) (1997) Standards and quality in higher education . Jessica Kingsley Publishers. London .

Smeby, J.C. & Stensaker, B. (1999) National quality assessments systems in the Nordic countries: developing a balance between external and internal needs? Higher Education Policy , 12(1), pp. 3-14.

Stensaker, B. (1999) User surveys in external assessments: problems and prospects. Quality in Higher Education , 5(3), pp. 255-264.

Stensaker, B. (2000) Quality as discourse: an analysis of external audit reports in Sweden 1995-1998. Tertiary Education and Management , 6, pp. 305-317.

Stensaker, B. (2003) Trance, transparency and transformation: the impact of external quality monitoring on higher education. Quality in Higher Education , 9(2), pp. 151-159.

Stensaker, B. (2004) Epilog: effekter af evaluering – en relativisering. In Effektundersøgelse. Redegørelse . Danmarks evalueringsinstitut. København.

Thune, C., Jacobsen, H. & Kristoffersen, D. (1996) Establishing an Evaluation System in Denmark . In Smeby, J. C. (ed.) Evalu ation of Higher Education in the Nordic Countries . Nordic Council of Ministers. Nord 1996: 6, Copenhagen .

Thune, C. (1996) The alliance of accountability and improvement: the Danish experience. Quality in Higher Education . 2(1), pp. 21-32.

Thune, C. (2001) Quality Assurance of Higher Education in Denmark . Global perspectives on quality in higher educat ion. In Dunkerly, D. & Wong, W.S. (eds.) Ashgate Publishers. Aldershot .

UFK (1998) Kvalitet og ansvar. Fremtidens kvalitetsoppgaver inden for de videregående uddannelser . Undervisningsministeriet, København.

The abbreviation EVA is used throughout the document even though the organisation “Centre for Quality Assurance and Evaluation” (Evalueringscenteret - EVC) changed its name and formal status in 1999 to the “ Danish Evaluation Institute” (Danmarks Evalueringsinstitut - EVA).