IRISA-KIHT-S Dataset

IRISA-KIHT-S Dataset

Links

Digital devices can help pupils and teachers in the learning process by promoting active learning techniques and providing immediate feedbacks. The e-learning literature shows that computer-based analysis of handwriting can be really accurate, sensitive, and reliable to produce relevant and consistent feedbacks for correction or guidance.

This dataset was presented in [1] for a task of handwritting reconstruction from the sensor data. The sensor data come from a digital pen called the STABILO Digipen. Noted that these data can also be used for classification purposes.

This dataset is available free of charge for research community and for research purposes only. For publications using this database, please quote the reference below. [1]

Dataset description

This dataset is composed of 30 recordings and it is writer balanced as there is one recording per writer. Table  presents the IRISA-KIHT-S statistical description.

Every 34-sample recording session generates files from the data acquisition mobile app. The sensor signals file has 15 columns and N rows, where N is the number of IMU signals, timestamps, and sensor values.

The table has 13 columns: milliseconds, accelerometer front (x, y, z), accelerometer rear  (x, y, z), gyroscope (x, y, z), magnetometer  (x, y, z), and force signals. Tablet signal files contain milliseconds, position coordinates (x, y, z), and pressure force signals.

The transcription (labels) file contains labels and the start and stop time-stamps for every sample. Additional files concerning the sensor calibration and recording meta data are provided.

Data Acquisition

The recording process starts by selecting one set of predefined scripts that will be written on the tablet surface using the Digipen. One set consists of 34 samples that have to be written one at a time during a single recording session.

It is composed of five groups: 15 characters, 10 words, 5 equations, 2 shapes and 2 word groups. While recording, a user holds the pen’s on/off switch up, which is a natural way to take the Digipen due to grips designed on the pen to naturally position the fingers properly.

Sensors

Each Digipen is equipped with five sensors.

  • Front accelerometer (STM LSM6DSL)
  • Gyroscope (STM LSM6DSL)
  • Rear accelerometer (Freescale MMA8451Q)
  • Magnetometer (ALPS HSCDTD008A)
  • Force sensor (ALPS HSFPAR003A)

Sensor Data

The sensors’ raw data stream is provided in the files called sensor_data.csv. Each file consists of 15 columns:

  • Millis: The timestamp when the data were processed on the tablet computer that the pen was connected to during recording
  • Acc1 X, Acc1 Y, Acc1 Z: The values of the front accelerometer in three dimensions
  • Acc2 X, Acc2 Y, Acc2 Z: The values of the rear accelerometer in three dimensions
  • Gyro X, Gyro Y, Gyro Z: The gyroscope values in three dimensions
  • Mag X, Mag Y, Mag Z: The magnetometer values in three dimensions
  • Force: The force with which the pen tip touches the surface
  • Time: A sample counter

Citation

If you use the IRISA-KIHT-S dataset, please cite:

[1] Swaileh, W., Imbert, F., Soullard, Y. et al. Online handwriting trajectory reconstruction from kinematic sensors using temporal convolutional network. IJDAR (2023).

BibTex:

@article{swaileh:hal-04076399,
  TITLE = {{Online Handwriting Trajectory Reconstruction from Kinematic Sensors using Temporal Convolutional Network}},
  AUTHOR = {Swaileh, Wassim and Imbert, Florent and Soullard, Yann and Tavenard, Romain and Anquetil, Eric},
  URL = {https://inria.hal.science/hal-04076399},
  JOURNAL = {{International Journal on Document Analysis and Recognition}},
  PUBLISHER = {{Springer Verlag}},
  YEAR = {2023},
  KEYWORDS = {Online Handwriting ; Trajectory Reconstruction ; Digital Pen ; Temporal Convolutional Neural Network ; Inertial Measurement Units},
  PDF = {https://inria.hal.science/hal-04076399v2/file/IJDAR_ICDAR_2023%20%286%29.pdf},
  HAL_ID = {hal-04076399},
  HAL_VERSION = {v2},
}