February 27, 2005

EVI-2 Report

This report is for the EVI-2 administration in the spring of 2003.  There were 37 pre-tests and 42 post-tests administered and completed.  A total of 34 students completed both the pre- and post-test.  Repeated measures t-tests were conducted on the item and total scores in order to assess if there was a significant increase in the scores after participation in the program. Table 1 provides descriptive statistics for each item and the total score at both pre- and post-test for the 34 students who completed both tests.  The difference between these means (calculated as post-test mean minus pre-test mean) is also presented, along with the results of the t-tests.  There was a significant increase in the total score from the pre-test to post-test.

Table 1 reveals that, for some items, student performance at pre-test is actually better than performance at post-test.  Although not determined to be statistically significant, more than 5% of the students who answered the following items correctly at pre-test subsequently answered them incorrectly at post-test:  3, 7, 19, 38, and 44.  However, several items show the opposite and desired effect; i.e., more students answer correctly at post-test than at pre-test.  Significant differences were found for the following 15 items for which more than 15% of students who answered incorrectly at pre-test subsequently answered correctly at post-test:  6, 13, 18, 20-23, 25, 26, 29-31, 33, 35, and 36.

Note that the percentage of students answering each item correctly is the same as the mean for each item. 

An examination of Table 1 reveals that there are several items that a majority of students answer correctly at the pre-test, indicating that they enter the program with this knowledge.  If 70% or more of the students answered an item correctly, it was considered that they had prior knowledge of this item.  The following 32 items (68% of the items on the test) show this effect:  1-5, 7-12, 14, 15, 17, 24, 26-28, 31, 32, 34-37, 39-44, 46, and 47. This indicates that students already know the majority of the concepts assessed by the test before completing the program.

A reliability estimate (Cronbachs coefficient alpha) was computed for the total scores from both pre- and post-testing occasions. The result of this analysis is presented in Table 3.  Reliability is the degree to which the items representing a construct are answered consistently.  The more reliable a test, the more confidence we can have that the scores obtained from the administration of the test are essentially the same scores that would be obtained if the test were readministered.  Reliability is expressed numerically, usually as a coefficient that ranges from 0 to 1.00.  A high coefficient indicates high reliability or consistency.  If the test scores were perfectly reliable, the coefficient would be 1.00.  However, test scores are seldom perfectly reliable.  Scores can be affected by errors in measurement or characteristics of the test itself (e. g., ambiguous items, items with no variance).  In general, reliabilities above .70 are considered adequate for program evaluation or research.  For evaluation of individual students, reliabilities closer to .80 are desirable.  Examining the results in Table 3, we can see that scores from the total test can be considered reliable. 

Appendix A contains the frequency distributions of the raw data (i.e., prior to scoring) both before and after the program. Examining these tables will reveal which distracter options were chosen most often at each time period (the option with the * next to it was the correct answer). This can help identify misconceptions students bring to the program and those with which they leave.  Specifically, these results indicate that students may be unclear about the definition of ethics (items 13, 19, and 38) and may not understand the concept of fidelity (item 18) even after completion of the program.

Finally, Table 4 displays the results for the demographic items (48-51).  For this administration of the EVI-2, more males than females participated, over 50% of the participants were freshmen, and most students had not completed the By the Numbers or the Calling the Shots programs.


Table 1

EVI-2 Item and Total Descriptive Statistics, Mean Differences, and t-test Results

Item Number

Pre-Test Mean
(SD)

Post-Test Mean
(SD)

Mean Difference (Post-Test -
Pre-Test )

t

p

1

0.97

(0.171)

0.94

(0.239)

-0.03

-0.57

0.571

2

0.88

(0.327)

0.88

(0.327)

0.00

0.00

1.000

3

0.97

(0.171)

0.88

(0.327)

-0.09

-1.36

0.184

4

0.88

(0.327)

0.91

(0.288)

0.03

0.44

0.661

5

0.82

(0.387)

0.91

(0.288)

0.09

1.14

0.263

6

0.68

(0.475)

0.94

(0.239)

0.26

2.72

0.010

7

0.91

(0.288)

0.85

(0.359)

-0.06

-1.00

0.325

8

0.91

(0.288)

0.94

(0.239)

0.03

0.44

0.661

9

0.82

(0.387)

0.88

(0.327)

0.06

0.70

0.488

10

0.94

(0.239)

0.94

(0.239)

0.00

0.00

1.000

11

0.85

(0.359)

0.88

(0.327)

0.03

0.33

0.744

12

0.91

(0.288)

0.97

(0.171)

0.06

1.44

0.160

13

0.26

(0.448)

0.65

(0.485)

0.38

4.04

0.000

14

0.97

(0.171)

0.94

(0.239)

-0.03

-1.00

0.325

15

0.71

(0.462)

0.74

(0.448)

0.03

0.30

0.768

16

0.41

(0.500)

0.62

(0.493)

0.21

1.56

0.128

17

0.88

(0.327)

0.88

(0.327)

0.00

0.00

1.000

18

0.29

(0.462)

0.68

(0.475)

0.38

3.42

0.002

19

0.59

(0.500)

0.53

(0.507)

-0.06

-0.63

0.535

20

0.35

(0.485)

0.76

(0.431)

0.41

3.42

0.002

21

0.50

(0.508)

0.74

(0.448)

0.24

2.77

0.009

22

0.29

(0.462)

0.82

(0.387)

0.53

5.02

0.000

23

0.65

(0.485)

0.85

(0.359)

0.21

2.93

0.006

24

0.91

(0.288)

0.88

(0.327)

-0.03

-0.37

0.711

25

0.50

(0.508)

0.74

(0.448)

0.24

2.48

0.019

26

0.79

(0.410)

0.97

(0.171)

0.18

2.66

0.012

27

0.88

(0.327)

1.00

(0.000)

0.12

2.10

0.044

28

0.76

(0.431)

0.74

(0.448)

-0.03

-0.33

0.744

29

0.15

(0.359)

0.88

(0.327)

0.74

9.57

0.000

30

0.47

(0.507)

0.88

(0.327)

0.41

4.31

0.000

31

0.85

(0.359)

1.00

(0.000)

0.15

2.39

0.023

32

0.79

(0.410)

0.82

(0.387)

0.03

0.33

0.744

33

0.38

(0.493)

0.79

(0.410)

0.41

3.66

0.001

34

0.91

(0.288)

0.88

(0.327)

-0.03

-0.37

0.711

35

0.82

(0.387)

0.97

(0.171)

0.15

2.39

0.023

36

0.74

(0.448)

0.97

(0.171)

0.24

2.77

0.009

37

0.88

(0.327)

0.94

(0.239)

0.06

0.81

0.422

38

0.47

(0.507)

0.38

(0.493)

-0.09

-0.77

0.447

39

0.85

(0.359)

0.94

(0.239)

0.09

1.14

0.263

40

0.82

(0.387)

0.88

(0.327)

0.06

0.81

0.422

41

0.94

(0.239)

0.94

(0.239)

0.00

0.00

1.000

42

0.94

(0.239)

0.97

(0.171)

0.03

0.57

0.571

43

0.79

(0.410)

0.85

(0.359)

0.06

0.81

0.422

44

0.88

(0.327)

0.82

(0.387)

-0.06

-0.70

0.488

45

0.65

(0.485)

0.71

(0.462)

0.06

0.53

0.600

46

0.79

(0.410)

0.91

(0.288)

0.12

1.68

0.103

47

0.74

(0.448)

0.76

(0.431)

0.03

0.37

0.711

Total

34.21

(5.221)

39.79

(6.004)

5.59

5.35

0.000

Table 3

Reliability of EVI-2

Pre-Test

Post Test

a=.7403

a=.8874


Table 4

Results of Demographics Items 48-51

The breakdown of the total number of students who responded to these demographic items at pre-test and post-test is reported.  The Combined columns are the results for those who responded at both pre- test and post-test.  Some students did not answer these or used a response that is not included in the choice of responses.

 

Pre-Test

Post-Test

Combined

Demographics

N

%

N

%

N

%

Gender:

           

Male

25

67.6

29

69.0

23

67.6

Female

12

32.4

13

31.0

11

32.4

Total

37

100.0

42

100.0

34

100.0

Class Level:

           

Freshman

19

51.4

22

52.4

18

52.9

Sophomore

6

16.2

10

23.8

6

17.6

Junior

7

18.9

4

9.5

5

14.7

Senior

5

13.5

6

14.3

5

14.7

Total

37

100.0

42

100.0

34

100.0

By the Numbers:

           

Yes

10

27.8

11

26.8

9

27.3

No

26

72.2

30

73.2

24

72.7

Total

36

100.0

41

100.0

33

100.0

Calling the Shots:

           

Yes

0

0.0

4

10.0

0

0.0

No

35

100.0

36

90.0

32

100.0

Total

35

100.0

40

100.0

32

100.0

Appendix A

Frequency Tables for Items 1-47

The letter P in the item number indicates that this is a post-test item.  The asterisk indicates the correct response. Examine the valid percent column as it does not include missing data.