M.
Figueiredo
a,
L.
Esteves
b,
J.
Neves
c and
H.
Vicente
*d
aDepartamento de Química, Centro de Investigação em Educação e Psicologia, Escola de Ciências e Tecnologia, Universidade de Évora, Évora, Portugal. E-mail: mtf@uevora.pt
bDepartamento de Química, Escola de Ciências e Tecnologia, Universidade de Évora, Évora, Portugal
cAlgoritmi, Universidade do Minho, Braga, Portugal. E-mail: jneves@di.uminho.pt
dDepartamento de Química, Centro de Química de Évora, Escola de Ciências e Tecnologia, Universidade de Évora, Évora, Portugal. E-mail: hvicente@uevora.pt
First published on 30th November 2015
This study reports the use of data mining tools in order to examine the influence of the methodology used in chemistry lab classes, on the weight attributed by the students to the lab work on learning and own motivation. The answer frequency analysis was unable to discriminate the opinions expressed by the respondents according to the type of the teaching methodology used in the lab classes. Conversely, the data mining approach using k-means clustering models, allowed a deeper analysis of the results, i.e., enabled one to identify the methodology to teach chemistry that, in students' opinion, is important for learning chemistry and increasing their motivation. The sample comprised 3447 students of Portuguese Secondary Schools (1736 in the 10th grade; 1711 in the 11th grade). The k-means Clustering Method was used, with k values ranging between 2 and 4. The main strengths of this study are the methodological approach for data analysis and the fact that the sample was formed by students with different school careers that enables the use of the individual as the unit of analysis.
According to Bopegedera (2011) the majority of the chemistry students are interested in other courses like medicine, allied health fields, engineering or other sciences where chemistry is only a requirement to pursue their studies. However, to some students chemistry is still perceived as a challenging subject to study. The report for SCORE – Science Community Supporting Education (Coe et al., 2008) presents results from a large number of studies conducted over a long period, using many different methods and datasets. These studies showed that chemistry is one of the most difficult subjects, hence, students need to be well motivated and to have good knowledge prior to commencing post compulsory study.
The practical work is widely and frequently used in the teaching of chemistry in secondary schools as a methodology of teaching and motivation (Millar, 2002). However, the term “practical work,” is commonly used in the literature as an overarching term that refers to any type of science teaching and learning activity in which students, working either individually or in small groups, are involved in manipulating and/or observing real objects and materials (Millar, 2010). From this point of view, practical work is a broad category that includes for example, “laboratory work” (or “lab work”).
A second type of methodology consists in carrying out the lab work by students according to recipes executed step-by-step. Students focus their thoughts on finishing one step after another and many times they do not develop a deeper understanding of the experiments. For many students lab work means just working, handling laboratory equipment, not including, in many cases, the development and the understanding of scientific thinking (Hofstein and Lunetta, 2004).
In a third type of methodology, the students that conducted experiments based on a receipt made by themselves, under teacher guidance, were frequently more motivated for the subject. Laboratory works developed based on constructivism had great role for increasing students' learning achievements and developing students' positive attitudes towards chemistry laboratory, in contrast to traditional teacher centred approach (Tarhana and Sesen, 2010). From this perspective it makes sense to introduce a problem and guide the students in finding solutions. In fact, this methodology implies fundamental steps in the teaching/learning process of tentative skills, like collecting information or doing planning, promoting the acquisition of key abilities. Some studies show that learning experiences based on concrete situations are authentic, meaningful, challenging, and based on the choice and on the students' work, not only increasing the intrinsic motivation of the students to learn Science (Yair, 2000; Koballa and Glynn, 2007), but also improving their attitudes towards Science and Learning (Sherz and Oren, 2006). Furthermore, it is not relevant to continue to use lab work only as a mere illustration of theories or as a means to train manipulative abilities, like measuring volumes or masses, although attaining accurate and precise results has been always quite desirable (Bennett and O'Neale, 1998).
Recent studies confirm that laboratory based learning quality is increased as students have an active role in the process of gaining knowledge (Cheung, 2007; Bennett et al., 2010; Kind et al., 2011). There are several methods that allow to explore this type of learning like class research seminars, problem based learning, case studies, project-based learning, role playing, cooperative and cooperation learning, group debate, development of mind maps, experience based learning. However, Bopegedera (2011) points out the importance of the connections between theory (presented in the textbook and lectures) and practice (in the laboratory and problem-solving workshops) to provide a holistic learning experience. Indeed, students need a good balance between teacher guidance and independent thought.
The studies about motivation for learning usually distinguish between intrinsic and extrinsic motivation (Stipek, 1996, 2014). The former is understood to be a personal interest in pursuing a goal without any palpable reward, i.e., the goal is considered to be an own wish and not required by external agents. Extrinsic motivation, in pursuit of a task, is required or directed by external factors not on the basis of the own wishes. Learning is more likely to be significant when it increases the degree of intrinsic motivation that leads to personal fulfilment. Indeed, some studies point out the many advantages for students who enjoy learning compared with those who do that because they feel they must achieve extrinsic rewards or avoid punishment (Stipek, 2014). According to this author, students who enjoy learning for their own sake seem to learn at more conceptual levels, seek intellectual challenges more frequently, and persist longer during difficult tasks than students who focus on external rewards and punishments. Considering the fact that in schools learning is something that is imposed to students, is promoted primarily by factors of extrinsic motivation, like educational attainment, progression to the next level, among others. These factors are often overvalued both in school and in the family context (Heyman and Dweck, 1992; Stipek, 2014). Nevertheless, the role of extrinsic motivation cannot be undervalued in the educational context. Indeed, through appropriate stimuli (i.e., teaching strategies) the teacher can help the student to redefine goals, attributions, interests and/or self-concepts. The relations between extrinsic stimuli and motivation are neither linear nor uncomplicated, since these stimuli can trigger/influence in different ways the students' motivation. Some studies about students' motivation refer five criteria to distinguish between intrinsic and extrinsic motivation (Harter, 1981; Shachar and Fischer, 2004), namely preference for challenge; curiosity and interest; independent mastery; independent judgment and internal criteria. These criteria are linked, respectively, to the questions “Does the student like hard challenging work as opposed to easier assignments?”; “Does the student work to satisfy his/her own interest and curiosity rather than to satisfy the teacher?”; “Does the student prefer to acquire his own skills of logical thinking instead of relying on the teacher for help and guidance?”; “Does the student prefer self-directed learning instead of learning directed by the teacher?”; and “Does the student know when he/she has succeeded or failed on school assignments instead of being dependent on external evaluation only?”.
Recent studies have shown that the use of diversified teaching strategies can significantly increase the intrinsic motivation of students. Baeten et al. (2012) shows the importance of gradually introducing students to case-based learning, in terms of their autonomous motivation and achievement. A study conducted by Changeiywo et al. (2011) highlights that students exposed to mastery learning approach have significantly higher motivation than those taught through regular methods.
In this context the completion of lab work emerges as a factor in the students you want to unleash the mechanisms of intrinsic motivation for learning science, especially chemistry.
The classical KDD application areas include, among others, marketing, finance, fraud detection, manufacturing, telecommunications, internet or medicine (Han et al., 2011; Witten et al., 2011). Recent studies show the applicability of KDD to other areas like production of water to human consumption (Pinto et al., 2009; Couto et al., 2012) or prediction of the availability of nitrogen in soils (Nunes et al., 2012). Regarding education research, data mining is still considered a new paradigm and a promising challenge. Indeed, educational data mining can be considered an emerging theme, concerned with developing methods for exploring the various types of data that come from the educational context. A few studies that illustrate the applicability of these tools to different problems in educational field can be found in literature.
A specific application of data mining tools in learning management systems was presented by Romero et al. (2008). The main objective of this study was to classify students into different groups with equal final marks depending on the activities carried out in Moodle. The C4.5 algorithm was used to induce decision trees and a set of interesting rules were obtained by the authors. For instance, students with a low number of quizzes passed were classified as fail. Students with a high number of passed quizzes are directly classified as excellent. Finally, students with a medium number of passed quizzes are classified as fail, pass or good depending on other variables like total time of assignments, number of quizzes, number of quizzes failed, or number of assignments. The knowledge discovered can be used by the instructor in different ways. On the one hand, to classifying new students in order to detect early students with learning problems and, on the other hand, to decide about the use of some types of activities that conduce to higher marks, or on the contrary, to decide to eliminate some activities related to low marks. Also in the scope of distance education platforms, Sevindik and Cömert (2010) compare different data mining algorithms like k-means, Apriori, C4.5, Support Vector Machines, k-Nearest Neighbours and Naive Bayes. According to the authors the algorithm C4.5, used to induce decision trees, shows to be the more effective one in classifying students' characteristics and academic success and can be used by the teacher to anticipate possible scenarios and avoid academic failure.
Şen and Uçar (2012) developed a process of knowledge discovery from databases, using artificial neural networks and decision trees, in order to study the students' achievements. The input variables were gender, age, type of high school graduation, education type (i.e., distance/regular) and lesson type, while the output variable was the students' scores. Both classification methods exhibited values of overall accuracy higher than 94%. The results show that the students' success rate has inverse ratio with students' age and the success score decreases with increasing age. Another interesting feature is related with type of education. The students with best scores (ranging between 80 and 100) are studying in the formal education while the students with scores varying between 65 and 80 are studying in the distance education. The study also shows that the scores less than 60 were obtained mainly by students in the distance education.
Şen et al. (2012) developed models to predict secondary education placement test results using C5 decision tree algorithm, vector machines, artificial neural networks and logistic regression. The overall accuracy of models ranges between 82% (logistic regression model) and 95% (decision tree model). The authors used 24 input variables that include, among other, gender, marital status or the scores obtained by the students in various subjects like mathematics, science and technology or foreign language. An import aspect of this study, that should be noted, was related with the sensitivity analysis performed on the models in order to determine the importance of the input variables. The sensitivity analysis showed that previous test experience, whether a student has a scholarship, number of siblings, previous years' grade point average are among the most important predictors of the placement test scores. Undeniably, knowing the factors that more directly or indirectly affect achievement is valuable to all actors involved in the educational process (i.e., students, parents, teachers, administrators) in order to maximize success.
Neves et al. (2015) and Figueiredo et al. (2014) present the development of decision support systems to evaluate the quality of learning and to evaluate potential situations of school dropout, respectively. These systems were built under a formal framework based on Logic Programming, in terms of its knowledge representation and reasoning procedures, complemented with an approach to computing grounded on Artificial Neural Networks. This approach not only allows to obtain the evaluation of quality of learning (or school dropout risk) but it also permits the estimation of the confidence that one has on the model prediction.
All these studies exemplify the use of different data mining algorithms (e.g. cluster analysis, decision trees, association rules, support vector machines, artificial neural networks) and illustrate the potential and the central role that such tools could play in the educational context.
The major contribution of this work is related with the methodological approach and the use of data mining tools for data analysis. In other words, one of the strengths of this study lays in the fact that the sample was formed by students with different school careers, and consequently exposed to different teaching methodologies in lab classes. Through the use of data mining tools it was possible to correlate the importance attributed by the students to the lab work on their learning and motivation with the teaching methodologies followed in their lab classes, considering their integral school careers. Indeed, in the studies present in literature that address and discuss the problem of the influence of the methodologies followed in lab classes on learning and/or motivation of students, the sample is usually formed by a specific group of students submitted to the same methodology, i.e., homogeneous samples, in the sense that all students have the same educational experience where the unit of analysis is the group (not the individual).
10th grade | 11th grade | |||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Age | Gender | Age | Gender | |||||||||||
District | <15 | 15 | 16 | 17 | >17 | F | M | <16 | 16 | 17 | 18 | >18 | F | M |
a Portugal regions – north. b Portugal regions – centre. c Portugal regions – south. d Portugal regions – interior. e Portugal regions – coastal. | ||||||||||||||
Bejac,d | 0 | 71 | 52 | 17 | 10 | 89 | 61 | 2 | 7 | 37 | 13 | 64 | 72 | 51 |
Bragançaa,d | 0 | 43 | 68 | 21 | 7 | 75 | 64 | 0 | 0 | 43 | 25 | 37 | 43 | 62 |
Castelo Brancob,d | 0 | 62 | 46 | 11 | 3 | 64 | 58 | 1 | 3 | 48 | 17 | 59 | 67 | 61 |
Évorac,d | 1 | 52 | 43 | 15 | 9 | 66 | 54 | 0 | 0 | 47 | 16 | 79 | 85 | 57 |
Faroc,e | 0 | 89 | 58 | 8 | 16 | 92 | 79 | 2 | 5 | 57 | 27 | 80 | 98 | 73 |
Lisbonb,e | 1 | 292 | 241 | 48 | 31 | 341 | 272 | 1 | 4 | 226 | 77 | 316 | 326 | 298 |
Oportoa,e | 0 | 204 | 165 | 32 | 20 | 219 | 202 | 1 | 3 | 149 | 28 | 237 | 219 | 199 |
Concerning the 11th grade sample, a glance to Table 1 shows that 46.8% of students are male and 53.2% are female. Regarding the students age, only 35.4% of them do not exceed 17 years old, which insinuates high rate of school failure. The geographical location seems not to influence the results since there are no significant differences between the percentages of students under 17 (seventeen) years old (lies among 33.1% in the district of Évora and 37.4% in the district of Faro).
Some implementations of k-means only allow numerical values for attributes, (i.e., it may be necessary to convert categorical attributes). However, this is not necessary for clustering in WEKA since the WEKA Simple k-means algorithm automatically handles a mixture of categorical and numerical attributes. In addition, the algorithm also normalizes the numerical attributes automatically, when computes the Euclidean distance. A more detailed description of the WEKA Simple k-means algorithm can be found in Witten et al. (2011); Sharma(Sachdeva) et al. (2012).
The validation of the questionnaire respects the practices recommended by Bell (2010). Subsequently, the questionnaire was evaluated by a group of experts that suggested some amendments. As soon as these revisions where done, the questionnaire was applied to a small group of students of both grades, not included in the sample, to evaluate the validity of the questionnaire and identify possible difficulties in the interpretation of the questions. The questionnaire was sent by mail to the schools that indicated their willingness to participate in this move. To ensure that the answers reflect the whole school career and were not influenced by the work developed in the present academic year, the questionnaires were applied in the beginning of academic year. Thus, in this study only the responses received until 31 October were considered, i.e., five weeks after the beginning of the school year. The return rate for the samples related to the 10th and 11th grades were, respectively, 39.2% (1776 questionnaires received in 4530 sent), and 34.5% (1754 questionnaires received in 5085 sent).
The basic idea in the k-means clustering method is to try to discover k clusters, according to the requirements:
• Each cluster must contain at least one object; and
• Each object must belong to exactly one cluster.
The k-means algorithm input parameters stand for the number of clusters, k, and a data set, D, with n objects. As soon as the algorithm is enforced, it selects, randomly, k points that denote the initial centres of the clusters, being the objects assigned to the cluster to which they are akin done according to the Euclidean distance between the objects and the cluster midpoint (Bradley and Fayyad, 1998). Next, the algorithm computes the new centre for each cluster. These processes iterate until further refinement may no longer improve the model or the number of iterations exceed a specified limit (Fig. 2). In this study k varied from 2 to 7 and the iterative process was stopped whenever the additional refinement does not improve the model.
Sometimes, it is useful to build a rule-based classifier by extracting IF-THEN rules from the DTs. The rule is created at each path, from the root (the first node of DT) to a leaf. Each splitting criterion along a given path is logically ANDed to form the rule antecedent (the IF part). The leaf node holds the class prediction, forming the rule consequent (the THEN part).
Early systems for generating DTs include CART (Breiman et al., 1984) and ID3 (Quinlan, 1986), the latter being followed by the version C4.5 and C5.0. The C4.5 version was an improvement of the ID3 algorithm that allows the use of continuous values, support omitted values and tree pruning (Quinlan, 1993). The DT algorithm used in this study was the J48 as implemented in WEKA (Hall, et al., 2009). This J48 implements the 8th revision of the C4.5 algorithm. A description of the J48 algorithm can be found in Witten et al. (2011).
Fig. 4 Frequencies of the answers given to each question by the respondents of the 10th and the 11th grades. |
• A small percentage of respondents claim that the lab work is done exclusively by the teacher;
• The answers related with the organization of the students in lab classes are not conclusive;
• About 80% of the students declare that the lab work is based on experimental guidelines; and
• About 75% of the respondents state that the post-lab work consists on the elaboration of written reports.
The analysis of Fig. 4 also denotes, for both grades, that in the opinion of students the importance of lab work for learning chemistry and to increase their own motivation is very high or high. Only a few respondents answer moderate, low or very low. However, the percentages of the most positive responses were higher in the 10th grade, which may be attributed to two factors. The former one is linked to the fact that in the 11th grade the quantitative treatment and the discussion of the results obtained in lab work are deeper, and require the knowledge and the skills already acquired in subjects like chemistry, physics and mathematics. The latest is linked to the fact that, as previously noted, some of these students may be repeating the attendance on the subject, and that lab work does not constitute novelty and does not influence, to the same extent, either the importance attributed by the students to the lab work in learning or the motivation to study chemistry.
Since the overwhelming majority of the respondents declares that the importance of lab work for learning chemistry and to increase their own motivation is very high or high, it is impossible to conclude about the influence of the methodology used in chemistry lab classes, on the weight attributed by the students to the lab work on learning and own motivation. In order to overcome these limitations a methodology of data analysis based on cluster analysis and decision trees was carried out. The k-means clustering method is one of the most efficient data mining algorithms that seek to identify groups of similar objects (i.e., respondents) in complex samples. Decision trees were used to understand how the clusters were formed. To ensure that the clusters are formed based on the methodology followed in chemistry lab classes the input variables used are exclusively the answers to the questions related with the methodology (issues Q1, Q2, Q3 and Q4).
k = 2 | k = 3 | k = 4 | |||||||
---|---|---|---|---|---|---|---|---|---|
Cluster 1 | Cluster 2 | Cluster 1 | Cluster 2 | Cluster 3 | Cluster 1 | Cluster 2 | Cluster 3 | Cluster 4 | |
Who does the lab work? | |||||||||
Students | 821 | 0 | 156 | 0 | 665 | 156 | 0 | 665 | 0 |
Students and teacher | 0 | 797 | 117 | 680 | 0 | 117 | 255 | 0 | 425 |
Teacher | 0 | 118 | 27 | 91 | 0 | 27 | 91 | 0 | 0 |
How are the students organized in the lab classes? | |||||||||
Groups of 3 students | 232 | 195 | 78 | 167 | 182 | 78 | 75 | 182 | 92 |
Groups of 4 students | 409 | 442 | 132 | 382 | 337 | 132 | 171 | 337 | 211 |
Groups with another number of students | 180 | 278 | 90 | 222 | 146 | 90 | 100 | 146 | 122 |
Which is the basis of the lab work? | |||||||||
Experimental guidelines | 665 | 771 | 0 | 771 | 665 | 0 | 346 | 665 | 425 |
Experimental problems | 156 | 144 | 300 | 0 | 0 | 300 | 0 | 0 | 0 |
What type of post-lab work is done? | |||||||||
Worksheets | 80 | 119 | 42 | 96 | 61 | 42 | 12 | 61 | 84 |
Written reports | 651 | 685 | 197 | 604 | 535 | 197 | 325 | 535 | 279 |
Worksheets and written reports | 90 | 111 | 61 | 71 | 69 | 61 | 9 | 69 | 62 |
Regarding the responses obtained in the questionnaire related with the 11th grade, an examination of Table 3 reveals that the k = 2 and the k = 3 clustering models are fairly analogous. The main difference is the separation of cluster 2 from the k = 2 clustering model into cluster 2 (with 835 objects) and cluster 3 (with 83 objects), into the k = 3 clustering one. For both models, cluster 1 was formed by the students that declare that the lab classes are done always by the students, while the division of cluster 2 (model of two clusters) into two clusters (model of three clusters) enabled to differentiate the students that reported that the lab classes are done always by the teacher (cluster 3), of those that state that the classes are done sometimes by them and sometimes by the teacher (cluster 2). The k = 3 and the k = 4 clustering models are quite similar too. In this case, cluster 4, which comprises 713 objects, was formed from cluster 1 (lost 480 cases) and from cluster 2 (lost 233 objects) of the three clusters model.
k = 2 | k = 3 | k = 4 | |||||||
---|---|---|---|---|---|---|---|---|---|
Cluster 1 | Cluster 2 | Cluster 1 | Cluster 2 | Cluster 3 | Cluster 1 | Cluster 2 | Cluster 3 | Cluster 4 | |
Who does the lab work? | |||||||||
Students | 793 | 0 | 793 | 0 | 0 | 313 | 0 | 0 | 480 |
Students and teacher | 0 | 835 | 0 | 835 | 0 | 0 | 602 | 0 | 233 |
Teacher | 0 | 83 | 0 | 0 | 83 | 0 | 0 | 83 | |
How are the students organized in the lab classes? | |||||||||
Groups of 3 students | 278 | 260 | 278 | 240 | 20 | 278 | 240 | 20 | 0 |
Groups of 4 students | 247 | 303 | 247 | 268 | 35 | 0 | 35 | 35 | 480 |
Groups with another number of students | 268 | 355 | 268 | 327 | 28 | 35 | 327 | 28 | 233 |
Which is the basis of the lab work? | |||||||||
Experimental guidelines | 689 | 769 | 689 | 696 | 73 | 226 | 463 | 73 | 696 |
Experimental problems | 104 | 149 | 104 | 139 | 10 | 87 | 139 | 10 | 17 |
What type of post-lab work is done? | |||||||||
Worksheets | 17 | 108 | 17 | 84 | 24 | 4 | 35 | 24 | 62 |
Written reports | 630 | 619 | 630 | 570 | 49 | 63 | 400 | 49 | 581 |
Worksheets and written reports | 146 | 191 | 146 | 181 | 10 | 26 | 167 | 10 | 70 |
Once presented the various models of segmentation and set the main differences among them, it is necessary to define criteria to evaluate them. Since there is no theoretical reason to judge the models' performance their value is determined by the models' ability to provide useful descriptions of the data, taking into account the goals set. Having in mind that the study object is to investigate the influence of the methodology used in chemistry lab classes, on the weight attributed by the students to the lab work on learning and own motivation, it is intended that the clusters obtained should be as homogeneous as possible in terms of methodology used in chemistry lab classes. Thus, for both grades, the models of three clusters were selected, since the models of two clusters contain, in cluster 2, two distinct answers to the question Who does the lab work? Conversely, the model of four clusters seems to bring no improvement, once the introducing of a new cluster does not result in a gain of homogeneity.
To ensure statistical significance of the attained results, 20 (twenty) runs were applied in all tests. In each simulation, the available data is randomly divided into two mutually exclusive partitions, i.e., the training set, with two-thirds of the available data and used to construct the models, and the test set, with the remaining of the examples being used after training, in order to compute the accuracy values (Souza et al., 2002). The DTs obtained are shown in Fig. 5 and 6, respectively for the 10th and 11th grades. Concerning the 10th grade the rule regarding the cluster 1 is “the basis of lab work is Experimental Problems”. With respect to cluster 2 there are two rules. The former is “the basis of lab work is Experimental Guidelines and the lab work is done by Teacher”, while the second is “the basis of lab work is Experimental Guidelines and the lab work is done sometimes by the Students and sometimes by the Teacher”. These rules can be merged into “the basis of lab work is Experimental Guidelines and the lab work is not done exclusively by the Students”. Finally, the rule concerning the cluster 3 is “the basis of lab work is Experimental Guidelines and the lab work is done by the Students”.
Regarding the 11th grade, the rules concerning the clusters 1, 2 and 3 are respectively “the lab work is done by the Students”; “the lab work is done sometimes by the Students and sometimes by the Teacher”, and “the lab work is done by the Teacher”.
A common tool to evaluate the results presented by the DTs models is the coincidence matrix, a matrix of size L × L, where L denotes the number of possible classes. This matrix is created by matching the values predicted by the model (rows) with the target values (columns). The coincidence matrixes, presented in Table 4, denote the average of 20 (twenty) experiments and reveal that the accuracy of the DTs displayed in Fig. 5 and 6 are 100% for both training and test sets.
10th grade | 11th grade | |||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Training set | Test set | Training set | Test set | |||||||||
Cluster 1 | Cluster 2 | Cluster 3 | Cluster 1 | Cluster 2 | Cluster 3 | Cluster 1 | Cluster 2 | Cluster 3 | Cluster 1 | Cluster 2 | Cluster 3 | |
a The values displayed denote the average of 20 experiments. | ||||||||||||
Cluster 1 | 230 | 0 | 0 | 70 | 0 | 0 | 529 | 0 | 0 | 264 | 0 | 0 |
Cluster 2 | 0 | 537 | 0 | 0 | 234 | 0 | 0 | 567 | 0 | 0 | 268 | 0 |
Cluster 3 | 0 | 0 | 421 | 0 | 0 | 244 | 0 | 0 | 63 | 0 | 0 | 20 |
This result may be related with the development of higher level skills associated with the inquiry and the planning of the lab work, which are not present in the lab classes based on experimental guidelines. According Hofstein (2004) the appropriate laboratory activities can be effective in promoting cognitive skills, metacognitive skills, practical skills, and attitude and interest towards chemistry, learning chemistry, and practical work in the framework of chemistry learning.
Another feature to be exploited in the present work has to do with the influence of the methodology followed in lab classes to increase the student's motivation to learn chemistry. Fig. 7b shows the strength of the relationships (given in percentages) between the clusters to which the respondents are assigned and the replies to the question Does your motivation to study chemistry increase when you execute lab work? The analysis of Fig. 7b shows that regardless of the cluster, the majority of partakers consider that the motivation to study chemistry increases “Very high” or “High” with the execution of lab work. Nevertheless, an examination of Fig. 7b shows that the percentages of positive answers given by the respondents assigned to cluster 1 are similar and lower than 50.0%. Furthermore, a non-negligible percentage of respondents (≥6.0% ∧ <9.0%) claim that the increase of motivation to study chemistry is “Moderate”. Regarding cluster 2, the percentage of answers “High” is greater than the percentage of answers “Very high”. The percentage of negative answers is higher than in the other clusters. Concerning cluster 3, the percentage of answers “Very High” is greater than the percentage of answers “High”. None of the respondents assigned to this cluster indicated the answer “Very low”. Only a small percentage (≤0.5%) answered “Moderate” or “Low”.
The results presented above seem to indicate, on the one hand, the role played by the lab classes to increase the students' motivation to study chemistry and, on the other hand, also seem to show that the scholars reveal some resistance in executing the research and planning work required to perform open lab classes, based on tentative situations. According to Logar and Savec (2011) these results may be linked to the use of experimental skills of the learners (e.g. what to do, how and when), i.e., working with laboratory equipment and materials, use of laboratory manuals, use of theoretical basics of experimental work, terms, symbols, representations, working with classmates in groups.
Concerning cluster 2, moulded by the respondents that assert that the lab work is not done exclusively by the students, the percentage of answers “High” is greater than the percentage of answers “Very high”. None of the respondents assigned to this cluster indicated the answer “Low”, and only a small percentage (<1%) answered “Moderate” or “Very low”. Regarding cluster 3, that includes the respondents to which lab work is done exclusively by the teachers, the percentage of answers “High” is greater than the one of answers “Very high”. However, the overall percentage of the positive answers decreases, ranging between 56% and 67%. This result tells one that at least one third of the respondents included in this cluster have a negative opinion about the weight of lab work in chemistry learning.
These results show that the weight attributed by the students to lab work in chemistry learning is strongly dependent on their involvement. When lab classes are demonstrative (i.e. the lab work is carried out exclusively by the teacher), the weight of lab work in chemistry learning drops. Other researches (Cheung, 2007; Bennett et al., 2010) confirm that the quality of learning based on lab classes increases when students have an active role in the process of adding knowledge. Hofstein and Lunetta (2004) emphasize that the laboratory experiences raise the interest and the students' motivation, and also provide the development of practical skills and the capability of solving problems that may contribute to understand the nature of Science.
Regarding the influence of the methodology followed in lab classes to upturn the students' motivation of the 11th grade to study chemistry, the graph showed in Fig. 8b was conceived. The positive answers overcome the negative ones in all clusters, although in some cases there is a relatively high percentage of negative answers (up to 28%). However, a glance to Fig. 8b shows that the percentage of answers “Very High” given by the respondents assigned to cluster 1 is greater than the percentage of answers “High”. Only a small percentage (≤5%) answered “Moderate”, “Low” or “Very low”. Concerning cluster 2 the results are similar to those presented for cluster 1. Regarding cluster 3, the percentage of answers “Very high” and “High” is quite similar. The overall percentage of positive answers ranges between 43% and 56%. This result reveals that about half of the respondents in this cluster have a negative opinion with respect to see lab work as the mean to increase the students' motivation to study chemistry. Keeping in mind that cluster 3 includes the respondents that state that the lab work is done exclusively by the teachers, these results suggest that the increase of the students' motivation to study chemistry is not relevant when the lab classes are demonstrative. Conversely, when lab work is carried out by the students the results obtained with this sample suggest that their motivation to study chemistry increases. These results are in agreement with those obtained by Hofstein (2004). This author refers that the appropriate laboratory activities providing students with authentic and practical learning experiences has the potential to adjust the classroom learning environment and thus to enhance students' motivation to study chemistry.
Conversely, the data mining approach using k-means clustering models presented in this study, enabled one to identify the methodology to teach chemistry that, in the students' opinion, is important for learning chemistry and increasing their motivation. The results obtained with the data mining approach, based on students' opinion, showed that the methodology used in lab classes that most contributes for students' own motivation and for learning of chemistry is one that is based on the work of the students.
The results obtained in this study, based on students' opinions, could be important for teachers. Indeed, the results show that the type of methodology that should be adopted in lab classes must involve the students' work. The 135 minutes session, planned in the curricula for the realization of practical work, may be determinant to engage students to proceed studies in the scientific area of chemistry and, in the future, to choose careers related with this science. To achieve such goal the practical work must be mainly lab work, in which the students must conduct by themselves all the stages of the lab work development (planning, execution and interpretation), i.e., the students should be involved in the process of gaining knowledge.
The encouraging results obtained in this work show that data mining approach can be very useful to identify the methodologies followed in lab classes that most contribute to increase the weight attributed by the students to the lab work on learning and their own motivation. However, it should be highlighted that this study is based on students' opinion. It is impossible to be assertive about the methodology that works best, since the study design did not consider the collection of data related with the learning assessment.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c5rp00144g |
This journal is © The Royal Society of Chemistry 2016 |