Analysis of Multiple Choice Questions on Impulse Momentum Material to See the Level of Difficulty of the Questions

Authors

  • Nur Ainayah, Universitas Negeri Yogyakarta
  • Rida Siti Nur'aini Mahmudah, Universitas Negeri Yogyakarta

DOI:

https://doi.org/10.37891/kpej.v9i1.1039

Abstract

This study analyzes the model fit, difficulty level, and reliability of an impulse-momentum test instrument for physics using the Rasch model. It is motivated by the need to ensure that assessment instruments used in education have adequate validity and reliability. Using a descriptive method with a quantitative approach, data were collected from 275 students at three schools in Bengkulu Province. The impulse-momentum test instrument was developed and validated with the help of physics teachers from each school, and the research was conducted over 5 days. The data were analyzed with the QUEST application to assess the fit of the items to the Rasch model, the item difficulty levels, and the instrument's reliability. The results show that the majority of items are of moderate difficulty and that the instrument's reliability is 0.97, indicating very good reliability. However, two items had outfit values exceeding 2, indicating misfit with the Rasch model. In conclusion, the impulse-momentum test instrument has good reliability, but some items need revision to better fit the analytical model used. These findings underline the importance of understanding item parameters and checking consistency with the analytical model in order to ensure the validity and reliability of assessment instruments in education.
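As an illustration of the quantities named in the abstract, the following is a minimal Python sketch of the dichotomous Rasch model and an item-level outfit mean-square statistic. It is not the QUEST implementation and not the authors' procedure. The function names and sample values are hypothetical, and the abstract does not state whether the reported outfit values are mean squares or standardized t values, so the mean-square form is shown here only as an assumption.

```python
import math


def rasch_probability(ability, difficulty):
    """Probability of a correct response under the dichotomous Rasch model.

    Both arguments are in logits; the probability depends only on their difference.
    """
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))


def outfit_mean_square(responses, abilities, difficulty):
    """Outfit mean square for one item (hypothetical helper).

    Computed as the unweighted mean of squared standardized residuals
    over all persons; values near 1 are expected when the data fit the model.
    """
    total = 0.0
    for score, ability in zip(responses, abilities):
        expected = rasch_probability(ability, difficulty)
        variance = expected * (1.0 - expected)
        total += (score - expected) ** 2 / variance
    return total / len(responses)


# Illustrative values only (not data from the study).
abilities = [-1.0, 0.0, 1.0, 2.0]
responses = [0, 1, 1, 1]
print(rasch_probability(0.0, 0.0))  # 0.5 when ability equals difficulty
print(outfit_mean_square(responses, abilities, difficulty=0.0))
```

Because outfit is an unweighted average, a few unexpected responses from persons far from the item's difficulty can inflate it sharply, which is one reason an item may be flagged as misfitting even when most responses follow the model.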

Published

10-06-2026

How to Cite

Ainayah, N., & Siti Nur'aini Mahmudah, R. (2026). Analysis of Multiple Choice Questions on Impulse Momentum Material to See the Level of Difficulty of the Questions. Kasuari: Physics Education Journal (KPEJ), 9(1), 1–10. https://doi.org/10.37891/kpej.v9i1.1039

Issue

Section

Articles