Dataset of Student Solutions to Algorithm and Data Structure Programming Assignments.

International Conference on Language Resources and Evaluation (LREC) (2022)

We present a dataset of source code solutions to algorithmic programming exercises, written by hundreds of Bachelor-level students at the Universität Hamburg. The solutions were collected during the winter semesters 2019/2020, 2020/2021 and 2021/2022. The dataset comprises over 1500 individual solutions to a total of 21 tasks, written in Java and Python. All solutions were submitted through Moodle and the CodeRunner plugin and passed a number of test cases (including randomized tests), so they can be considered to work correctly. All students whose solutions are included in the dataset consented to their publication. The solutions are pseudonymized with a random solution ID. This paper includes a short analysis of the dataset that presents statistical data and highlights a few anomalies (e.g., the number of solutions per task decreases for the last few tasks due to grading rules). We plan to extend the dataset with tasks and solutions from upcoming courses.
Keywords
dataset, programming, algorithm, sourcecode, java, python