This repository contains the analysis code for the Ghent Semi-spontaneous Speech Paradigm (GSSP). The GSSP is a picture description task that is used to capture (near) spontaneous speech in a controlled setting.
- The data is collected via a web application and can be found on kaggle
- The paradigm is described in detail in this preprint manuscript.
- The supplementals can be found here
- The notebooks README contains a thorough description of the speech parsing and analysis notebooks.
In a nutshell the r-scripts folder performs a thorough statistical analysis of the arousal & valence scores for the audio files. The outcome can be observed in a shiny html file. All speech data transformation and analysis is performed in the notebooks folder.
The utilized python packages are listed in the pyproject.toml file and the utilized R packages are listed in the scripts/r_packages.txt file.
├── docs │ └── cgn <-- CGN related documentation ├── GSSP_utils <-- Python functions shared across notebooks (and CGN parsing) ├── loc_data <-- Local data shared across notebooks ├── notebooks <-- the analysis Jupyter notebooks ├── reports <-- Generated figures from the notebooks └── scripts <-- R scripts for statistical analysis & shiny app- A preprint manuscript is available on psyArxiv.
@misc{van_der_donckt_2023, title={Ecologically Valid Speech Collection in Behavioral Research: The Ghent Semi-spontaneous Speech Paradigm (GSSP)}, url={psyarxiv.com/e2qxw}, DOI={10.31234/osf.io/e2qxw}, publisher={PsyArXiv}, author={Van Der Donckt, Jonas and Kappen, Mitchel and Degraeve, Vic and Demuynck, Kris and Vanderhasselt, Marie Anne and Van Hoecke, Sofie}, year={2023}, month={Mar} }👤 Jonas Van Der Donckt, Mitchel Kappen