This project provides R scripts that downloads the latest PISA data (SAS
files from the OECD website), reads it into R using the {haven}
package, and filters the data specifically for Brunei. The processed CSV
file is also provided for researchers interested in conducting analyses
such as IRT and latent trait regression.
If you would like to replicate the data processing steps, you can follow the instructions below. Prerequisites:
- R (version 4.0 or higher recommended)
- R packages:
haven
,tidyverse
(install them usinginstall.packages("haven")
andinstall.packages("tidyverse")
) - A stable internet connection!
Then, follow these steps:
-
Clone this repository:
git clone https://github.com/yourusername/pisabn.git
-
Open the project in RStudio or your preferred R environment.
-
Run the scripts in the
R/
directory in order, to initiate the download and processing of the PISA data.
The PISA data is divided into several files, each containing different types of data. The data files are as follows:
CY08MSP_CRT_COG
(Creative Thinking data)CY08MSP_FLT_COG
(Financial Literacy data)CY08MSP_FLT_QQQ
(Financial Literacy questionnaire data)CY08MSP_FLT_TIM
(Financial Literacy timing data)CY08MSP_SCH_QQQ
(School questionnaire data)CY08MSP_STU_COG
(Student cognitive item data)CY08MSP_STU_QQQ
(Student questionnaire data)CY08MSP_STU_TIM
(Student questionnaire timing data)CY08MSP_TCH_QQQ
(Teacher questionnaire data)
To learn more about the data, consult the downloaded codebook
CY08MSP_CODEBOOK_27thJune24.xlsx
.
The data is sourced from the OECD. Please visit their website for more information on the PISA data: https://www.oecd.org/en/data/datasets/pisa-2022-database.html
Please be aware that the use of OECD PISA data is subject to the OECD’s terms and conditions. For complete details regarding licensing, restrictions, and proper attribution requirements, please consult the OECD Terms and Conditions page. By using this repository, you agree to adhere to the citation and usage guidelines provided by the OECD.
Use the following citation to reference the data:
OECD. (2024). PISA 2022 database. URL: https://www.oecd.org/en/data/datasets/pisa-2022-database.html
The code and scripts provided in this repository are offered “as is” without any warranty, either express or implied. This includes, but is not limited to, warranties of merchantability, fitness for a particular purpose, or non-infringement. Use this code at your own risk, and please refer to the OECD website for any data-specific inquiries or concerns.