The Competition on Visually Rich Document Intelligence and Understanding (VRD-IU)

The 2024 Competition on Visually Rich Document Intelligence and Understanding (VRD-IU) will be held in conjunction with the 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024) in Jeju Island, Korea.

  • Competition date & time: 03.08.24 - 09.08.24 TBA
  • Competition physical location: Jeju, South Korea
  • Hybrid mode: TBA

Overview

The VRD-IU (Visually Rich Document Intelligence and Understanding) competition addresses the challenges posed by form-like documents, which are diverse and complex, frequently involve multiple stakeholders, and contain essential information that is difficult to extract. The competition is based on the Form-NLU dataset, which features digital, printed, and handwritten forms, and offers two tracks suited to different skill levels: extracting key information (Track A) and localising it within documents (Track B). Beyond advancing visually rich document understanding technology, the competition aims to draw more interest to the field and gives participants an opportunity to contribute to more efficient information extraction and analysis methods.

Track A - Form Key Information Extraction

Participants must develop a deep learning-based retriever that extracts the target form component for a given key query. We provide the human-annotated bounding box coordinates of the semantic entities in each input form document; participants are required to locate the entity that matches the input query. The evaluation metric is the F1 score, following Task B of Form-NLU.

Competition Link: https://www.kaggle.com/competitions/vrd-iu2024-tracka
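
For orientation, the following is a minimal sketch of how an F1 score could be computed for Track A style predictions. It is not the official scoring code: the exact averaging used on the competition page is not stated here, so the sketch assumes a micro-averaged F1 in which each query is a (document id, key) pair answered with one entity id, and an unanswered query is marked with None. The identifiers, key names, and the f1_score helper are all hypothetical.

```python
from typing import Dict, Optional, Tuple

# (document id, key name); both are hypothetical identifiers for illustration.
Query = Tuple[str, str]


def f1_score(gold: Dict[Query, int], pred: Dict[Query, Optional[int]]) -> float:
    """Micro-averaged F1 over queries (an assumed scoring scheme).

    gold maps each query to the id of the correct entity.
    pred maps each query to the predicted entity id, or None if unanswered.
    """
    answered = {q: e for q, e in pred.items() if e is not None}
    correct = sum(1 for q, e in answered.items() if gold.get(q) == e)
    precision = correct / len(answered) if answered else 0.0
    recall = correct / len(gold) if gold else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Toy data: three queries, one answered correctly, one wrongly, one skipped.
gold = {("doc1", "company_name"): 3, ("doc1", "date"): 7, ("doc2", "company_name"): 1}
pred = {("doc1", "company_name"): 3, ("doc1", "date"): 5, ("doc2", "company_name"): None}
score = f1_score(gold, pred)
```

In this toy example precision is 1/2 and recall is 1/3, so the F1 score is 0.4. Participants should rely on the Kaggle competition page for the authoritative metric definition and submission format.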

Track B - Form Key Information Localisation

Participants are encouraged to develop an end-to-end framework that predicts the bounding box coordinates of the target entity from the input document image and the input key. In Track B, no ground-truth bounding boxes of form semantic entities are provided; the only inputs are form images and key queries. The evaluation metric is the Mean Average Precision (mAP) of the predicted bounding boxes.

Competition Link: https://www.kaggle.com/competitions/vrd-iu2024-trackb
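
As a rough illustration of the Track B metric, the sketch below computes intersection over union (IoU) between two boxes and a non-interpolated average precision per key, then averages over keys. Several details are assumptions rather than the official protocol: boxes are taken to be (x_min, y_min, x_max, y_max), each query has exactly one ground-truth box, a prediction counts as correct when IoU is at least 0.5, and every prediction carries a confidence score. The function names are hypothetical.

```python
from typing import Dict, List, Tuple

# Assumed box format: (x_min, y_min, x_max, y_max).
Box = Tuple[float, float, float, float]
# One entry per query: (confidence, predicted box, ground-truth box).
Scored = Tuple[float, Box, Box]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def average_precision(preds: List[Scored], threshold: float = 0.5) -> float:
    """Non-interpolated AP for one key, with one ground-truth box per query."""
    ranked = sorted(preds, key=lambda p: p[0], reverse=True)
    hits = 0
    precision_sum = 0.0
    for rank, (_, pred_box, gold_box) in enumerate(ranked, start=1):
        if iou(pred_box, gold_box) >= threshold:
            hits += 1
            precision_sum += hits / rank  # precision at this recall point
    return precision_sum / len(ranked) if ranked else 0.0


def mean_average_precision(per_key: Dict[str, List[Scored]]) -> float:
    """Mean of the per-key AP values."""
    if not per_key:
        return 0.0
    return sum(average_precision(p) for p in per_key.values()) / len(per_key)


# Toy data for a single hypothetical key: an exact match, a miss, and IoU = 0.5.
toy = [
    (0.9, (0, 0, 10, 10), (0, 0, 10, 10)),
    (0.8, (0, 0, 10, 10), (20, 20, 30, 30)),
    (0.6, (0, 0, 10, 10), (0, 0, 10, 5)),
]
ap = average_precision(toy)
m_ap = mean_average_precision({"company_name": toy})
```

The official evaluation may use a different IoU threshold, several thresholds, or interpolated precision, so the Kaggle competition page remains the reference for the exact definition.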

Important Dates

  • Data, baseline paper & code available: 29 April, 2024
  • Track A Challenge Due: 10 July, 2024
  • Track B Challenge Due: 15 July, 2024
  • Announcement of Winners: 20 July, 2024
  • Paper Submission Due: 27 July, 2024
  • Competition: 05 August, 2024
  • Note: All deadlines are in Anywhere on Earth (AoE, UTC-12) time.

Organising Committee

TBA

For any queries, send an email to caren.han@unimelb.edu.au