Development and Validation of a Critical Thinking Skills Instrument for Grade 11 Biology Students Based on Ennis’s Framework
DOI:
https://doi.org/10.57092/ijetz.v5i1.546Abstract
The ability to think critically is a key competency required of students in the 21st century, particularly in science education. This study aimed to develop and empirically evaluate a critical thinking skills assessment instrument for Grade 11 senior high school students in biology learning. The instrument was developed using the ADDIE development model, encompassing the analysis, design, development, implementation, and evaluation stages. Empirical testing was conducted during the implementation stage with Grade 11 senior high school students in Jambi City. Item analysis was performed to examine content validity, empirical validity, reliability, item difficulty, and discrimination indices. Empirical data were analyzed using ANATES version 4.0.9. The results indicated that 22 out of 56 items met the predefined validation criteria, demonstrating acceptable levels of item difficulty and discrimination power. The reliability coefficient of the finalized instrument was 0.68, which is considered acceptable for an exploratory educational assessment instrument. These findings suggest that the developed instrument possesses adequate psychometric properties for measuring students’ critical thinking skills in biology learning and can support future instructional evaluation and research within similar educational contexts.
Downloads
References
Akbar, S. (2013). Instrumen perangkat pembelajaran. Bandung: PT Remaja Rosdakarya.
Almunawarah, R., Halim, A., & Elisa, E. (2023). Analysis of high school students' critical thinking skills using FRISCO indicators. International Journal of Research and Review, 10(10), 276-283. https://doi.org/10.21275/SR231002095937
Alvionita, D., Prayitno, B. A., & Sugiyarto. (2020). Problem-based learning with iSpring assisted inquiry learning to improve students' critical thinking skills. Journal of Physics: Conference Series, 1567(4), 042044. https://doi.org/10.1088/1742-6596/1567/4/042044
American Educational Research Association, American Psychological Association, & National Council on Measurement in Education. (2014). Standards for educational and psychological testing. Washington, DC: American Educational Research Association.
Arifin, Z., Setiawan, A., & Widodo, A. (2025). Developing critical thinking through inquiry-based learning in biology education: A meta-analysis. Thinking Skills and Creativity, 55, 101678. https://doi.org/10.1016/j.tsc.2024.101678
Arikunto, S. (2015). Dasar-dasar evaluasi pendidikan (2nd ed.). Jakarta: Bumi Aksara.
Ariyanti, F., & Rahayu, W. P. (2025). Development of Nearpod-based learning evaluation to measure students' critical thinking skills in retail business management subjects. In Proceedings of the 8th International Research Conference on Economic and Business (IRCEB 2024) (pp. 62-80). Atlantis Press. https://doi.org/10.2991/978-94-6463-722-9_7
A'yun, Q., Khasanah, U., Fatmaryanti, S. D., & Sulisworo, D. (2022). Development of higher order thinking skill (HOTS) test on magnetic field concepts to improve students' critical thinking skills. Biosfer: Jurnal Tadris Biologi, 13(2), 183-192. https://doi.org/10.24042/biosfer.v13i2.13658
Badrujaman, A. (2020). Teori dan aplikasi evaluasi program bimbingan dan konseling. Jakarta: Prenadamedia Group.
Branch, R. M. (2009). Instructional design: The ADDIE approach. Springer. https://doi.org/10.1007/978-0-387-09506-6
Braun, V., & Clarke, V. (2021). Thematic analysis: A practical guide. SAGE Publications. https://doi.org/10.1002/9781118901731.iecrm0249
Cohen, L., Manion, L., & Morrison, K. (2018). Research methods in education (8th ed.). Routledge. https://doi.org/10.4324/9781315456539
Creswell, J. W., & Creswell, J. D. (2018). Research design: Qualitative, quantitative, and mixed methods approaches (5th ed.). SAGE Publications.
Darling-Hammond, L., Flook, L., Cook-Harvey, C., Barron, B., & Osher, D. (2020). Implications for educational practice of the science of learning and development. Applied Developmental Science, 24(2), 97-140. https://doi.org/10.1080/10888691.2018.1537791
DeVellis, R. F. (2017). Scale development: Theory and applications (4th ed.). SAGE Publications. https://doi.org/10.4135/9781506335194
Dini, N. A. I., & Kuswanto, H. (2025). Integrating local wisdom: Innovative assessment instrument of critical thinking skills in science learning. Jurnal Eduscience, 12(3). https://doi.org/10.36987/jes.v12i3.6849
Elangovan, N., & Sundaravel, E. (2021). Method of preparing a document for survey instrument validation by experts. Journal of Health and Allied Sciences, 11(3), 147-154. https://doi.org/10.1055/s-0041-1728878
Ennis, R. H. (1991). Critical thinking: A streamlined conception. Teaching Philosophy, 14(1), 5-24. https://doi.org/10.5840/inquiryctnews201126215
Ennis, R. H. (2018). Critical thinking across the curriculum: A vision. Topoi, 37(1), 165-184. https://doi.org/10.1007/s11245-016-9401-4
Erfan, M., Maulyda, M. A., Hidayati, V. R., Astria, F. P., & Ratu, T. (2020). Analisis kualitas soal kemampuan membedakan rangkaian seri dan paralel melalui tes berbasis online. Jurnal Penelitian Pendidikan IPA, 6(2), 191-195. https://doi.org/10.29303/jppipa.v6i2.420
Facione, P. A. (2020). Critical thinking: What it is and why it counts. Insight Assessment. Retrieved from https://www.insightassessment.com
Fadlilah, A. N., & Indana, S. (2025). Development of E-LKPD based on science literacy to train critical thinking skills in Merdeka Curriculum. Berkala Ilmiah Pendidikan Biologi (BioEdu), 14(3), 553-562. https://doi.org/10.26740/bioedu.v14n3.p553-562
Fajar, N., & Suryani, R. D. (2023). Biology learning evaluation module development based on higher order thinking skills and local wisdom value. JPBIO (Jurnal Pendidikan Biologi), 8(1), 142-152. https://doi.org/10.31932/jpbio.v8i1.2307
Fajaryati, N., Budiyono, B., & Akhyar, M. (2021). Developing an instrument for assessing the feasibility of vocational high school students' entrepreneurial intentions. Jurnal Pendidikan Vokasi, 11(1), 45-56. https://doi.org/10.21831/jpv.v11i1.36789
Fereday, J., & Muir-Cochrane, E. (2006). Demonstrating rigor using thematic analysis: A hybrid approach of inductive and deductive coding and theme development. International Journal of Qualitative Methods, 5(1), 80-92. https://doi.org/10.1177/160940690600500107
Fraenkel, J. R., Wallen, N. E., & Hyun, H. H. (2012). How to design and evaluate research in education (8th ed.). McGraw-Hill.
Grohs, J. R., Kirk, G. R., Soledad, M. M., & Knight, D. B. (2018). Assessing systems thinking: A tool to measure complex reasoning through ill-structured problems. Thinking Skills and Creativity, 28, 110-123. https://doi.org/10.1016/j.tsc.2018.03.003
Halpern, D. F. (2014). Thought and knowledge: An introduction to critical thinking (5th ed.). Psychology Press. https://doi.org/10.4324/9781315885279
Haynes, S. N., Richard, D. C., & Kubany, E. S. (1995). Content validity in psychological assessment: A functional approach to concepts and methods. Psychological Assessment, 7(3), 238-247. https://doi.org/10.1037/1040-3590.7.3.238
Karnengsih, K., Harahap, R. D., & Siregar, I. H. (2021). Analisis butir soal ujian sekolah mata pelajaran IPA menggunakan program ANATES. Jurnal Basicedu, 5(5), 4061-4070. https://doi.org/10.31004/basicedu.v5i5.1489
Kemendikbudristek. (2022a). *Salinan Keputusan Kepala Badan Standar, Kurikulum, dan Asesmen Pendidikan Nomor 033/H/KR/2022 tentang Capaian Pembelajaran*. Jakarta: Kementerian Pendidikan, Kebudayaan, Riset, dan Teknologi. Retrieved from https://kurikulum.kemdikbud.go.id
Kemendikbudristek. (2022b). Panduan pembelajaran dan asesmen pendidikan anak usia dini, pendidikan dasar, dan pendidikan menengah. Jakarta: Kementerian Pendidikan, Kebudayaan, Riset, dan Teknologi. Retrieved from https://kurikulum.kemdikbud.go.id
Khoeriyah, Z., Novitasari, A., & Paratama, A. O. S. (2025). The impact of the science-technology-society model on the enhancement of student's HOTS: A systematic literature review. Journal of Innovative Science Education, 14(3). https://doi.org/10.15294/jise.v14i3.29625
Krathwohl, D. R. (2002). A revision of Bloom's taxonomy: An overview. Theory Into Practice, 41(4), 212-218. https://doi.org/10.1207/s15430421tip4104_2
Manassero-Mas, M. A., Moreno-Salvo, A., & Vázquez-Alonso, Á. (2022). Development of an instrument to assess critical thinking in science and technology. Education Sciences, 12(3), 201. https://doi.org/10.3390/educsci12030201
McHugh, M. L. (2012). Interrater reliability: The kappa statistic. Biochemia Medica, 22(3), 276-282. https://doi.org/10.11613/BM.2012.031
Messick, S. (1995). Validity of psychological assessment: Validation of inferences from persons' responses and performances as scientific inquiry into score meaning. American Psychologist, 50(9), 741-749. https://doi.org/10.1037/0003-066X.50.9.741
Nunnally, J. C., & Bernstein, I. H. (1994). Psychometric theory (3rd ed.). McGraw-Hill.
O'Connor, C., & Joffe, H. (2020). Intercoder reliability in qualitative research: Debates and practical guidelines. International Journal of Qualitative Methods, 19, 1-13. https://doi.org/10.1177/1744987120927206
Odukoya, J. A., Adekeye, O., & Okonkwo, E. (2018). Assessing the effectiveness of mobile learning devices in tertiary institutions: The experience of undergraduates in a private university in Nigeria. Cogent Education, 5(1), 1540918. https://doi.org/10.1080/2331186X.2018.1540918
Pamelia, S. S., & Hariani, D. (2021). Analysis of critical thinking ability of students class X SMA Negeri 1 Sampang on environmental pollution material. Berkala Ilmiah Pendidikan Biologi (BioEdu), 11(1), 107-115. https://doi.org/10.26740/bioedu.v11n1.p107-115
Partnership for 21st Century Learning. (2019). Framework for 21st century learning definitions. Battelle for Kids. Retrieved from https://www.battelleforkids.org
Paul, R. (2018). Critical thinking and the critical person. In Thinking: The second international conference (pp. 373-404). Routledge. https://doi.org/10.4324/9781315802015-27
Rahmi, Y. L., Miatidini, N. A., Alberida, H., Darussyamsyu, R., Ichsan, I. Z., Sigit, D. V., Titin, T., Koc, I., & Sison, M. H. (2021). HOTS assessment of biology cell: Validity, practicality and reliability. Jurnal Penelitian Pendidikan IPA, 7(3), 481-487. https://doi.org/10.29303/jppipa.v7i3.742
Ratna, R., Suryanda, E., & Rusdi, R. (2025). Development of critical thinking skills assessment instrument based on Ennis framework on environmental change material. Biosfer: Jurnal Pendidikan Biologi, 18(1), 1-12. https://doi.org/10.21009/biosferjpb.54197
Retnawati, H. (2016). Validitas, reliabilitas, dan karakteristik butir: Panduan untuk peneliti, mahasiswa, dan psikometrian. Yogyakarta: Parama Publishing. Retrieved from https://staffnew.uny.ac.id/upload/132255129/pengabdian/buku-validitas-reliabilitas.pdf
Santosa, T. A., Lufri, L., & Andromeda, A. (2023). Development of higher order thinking skills (HOTS) instruments in biology learning: A systematic review. Jurnal Mangifera Edu, 8(1), 1-14. https://doi.org/10.31943/mangiferaedu.v8i1.166
Sari, D. N., Sunarno, W., & Prayitno, B. A. (2024). Diagnostic assessment profile of learning styles and critical thinking skills in biology learning based on the Merdeka Curriculum. JPBI (Jurnal Pendidikan Biologi Indonesia), 10(3), 876-887. https://doi.org/10.22219/jpbi.v10i3.36557
Schwab, K. (2017). The fourth industrial revolution. Currency.
Septiany, L. D., Puspitawati, R. P., Susantini, E., Budiyanto, M., Purnomo, T., & Hariyono, E. (2024). Analysis of high school students' critical thinking skills profile according to Ennis indicators. IJORER: International Journal of Recent Educational Research, 5(1), 157-167. https://doi.org/10.46245/ijorer.v5i1.544
Shaw, S., & Crisp, V. (2011). Tracing the evolution of validity in educational measurement. Assessment in Education: Principles, Policy & Practice, 18(4), 365-382. https://doi.org/10.1080/0969594X.2011.607444
Sintia, D. N., & Yuliani, Y. (2024). Analysis of students' critical thinking skills in biology learning at senior high school. Berkala Ilmiah Pendidikan Biologi (BioEdu), 13(2), 334-343. https://doi.org/10.26740/bioedu.v13n2.p334-343
Sugiyono. (2019). Metode penelitian pendidikan: Pendekatan kuantitatif, kualitatif, dan R&D. Bandung: Alfabeta.
Suraida, S., Aslamiah, A., & Suriansyah, A. (2025). Development of assessment instruments to measure students' critical thinking skills. International Journal of Social Science and Human Research, 8(2), 1123-1132. https://doi.org/10.47191/ijsshr/v8-i2-48
Suryanto, S., & Taseman, T. (2022). Analisis kualitas butir soal ujian akhir semester genap mata pelajaran matematika menggunakan program ANATES. Jurnal Basicedu, 6(4), 7310-7320. https://doi.org/10.31004/basicedu.v6i4.3321
Syafril, S., Aini, N. R., Pahrudin, A., & Yaumas, N. E. (2021). Developing instrument for students' critical thinking ability on mathematics. European Journal of Educational Research, 10(1), 337-349. https://doi.org/10.12973/eu-jer.10.1.337
Ulfa, M., & Kuswanti, N. (2020). Development of assessment instrument based on higher order thinking skills of respiratory system of grade XI of senior high school. Berkala Ilmiah Pendidikan Biologi (BioEdu), 9(3), 431-437. Retrieved from https://ejournal.unesa.ac.id/index.php/bioedu/article/view/35726
Wahyuningtyas, A., Triwahyuni, E., & Kustiyowati. (2025). Learning media based on Google Sites to improve critical thinking in senior high school students. Jurnal Pedagogi dan Pembelajaran, 8(2), 289-301. https://doi.org/10.23887/jp2.v8i2.99322
Willingham, D. T. (2019). Why don't students like school?: A cognitive scientist answers questions about how the mind works and what it means for the classroom (2nd ed.). Jossey-Bass. Retrieved from https://www.wiley.com
World Economic Forum. (2020). Schools of the future: Defining new models of education for the fourth industrial revolution. WEF. Retrieved from https://www.weforum.org
Yokhebed, Y., Karmadi, R. M. D., & Nastiti, L. R. (2025). Validity and reliability analysis of a socioscientific issues-based critical thinking self-assessment instrument using the Rasch model. JPBI (Jurnal Pendidikan Biologi Indonesia), 11(1), 73-82. https://doi.org/10.22219/jpbi.v11i1.38902
Yu, P. L. H., & Zin, Z. M. (2023). The development of higher order thinking skills (HOTS) assessment instrument for biology education: A systematic literature review. Asian Journal of University Education, 19(4), 789-802. https://doi.org/10.24191/ajue.v19i4.24567
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Try Susanti, Dwi Gusfarenie, Nanda Gusriani, Diandara Oryza, Nining Nuraida

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
























