FUNSD

GPTKB entity

Statements (58)
Predicate Object
gptkbp:instance_of dataset
gptkbp:bfsLayer 5
gptkbp:bfsParent gptkb:Layout_LM
gptkbp:analysis available tools
gptkbp:application document analysis
gptkbp:class gptkb:11
gptkbp:collaborations various universities
gptkbp:collection crowdsourcing
gptkbp:community_support active
gptkbp:contains financial documents
gptkbp:created_by Guillaume Jaume, Hazim Kemal Ekenel, Jean-Philippe Thiran
gptkbp:data_type gptkb:JSON
v1.0
structured data
comprehensive
manual and automated
gptkbp:data_usage gptkb:academic_research
gptkbp:explores supported
gptkbp:format gptkb:PDF
gptkbp:has_community active discussions
gptkbp:historical_source real-world documents
https://www.w3.org/2000/01/rdf-schema#label FUNSD
gptkbp:impact high
gptkbp:is_available_on gptkb:archive
gptkbp:is_cited_in Cite as: FUNSD.
gptkbp:is_described_as A dataset for form understanding in noisy scanned documents.
gptkbp:is_divided_into train, test
gptkbp:is_documented_in segmentation
gptkbp:is_evaluated_by F1 score
held-out set
gptkbp:is_tasked_with information extraction
gptkbp:is_used_for training machine learning models
gptkbp:label question, answer, header, other
gptkbp:language English
gptkbp:license Apache License 2.0
gptkbp:primary_source form extraction
gptkbp:products high
gptkbp:project 2 years
SROIE
CORD-19
DocBank
gptkbp:provides_information_on open access
1.5 GB
permissive
synthetic and real data
gptkbp:publishes 199
gptkbp:receives_funding_from government grants
gptkbp:related_to OCR
NER
gptkbp:release_year gptkb:2019
gptkbp:requires gptkb:theorem
gptkbp:security_features compliant with regulations
gptkbp:target_audience gptkb:physicist
gptkb:software
gptkbp:tutorials provided
gptkbp:updates periodic
gptkbp:user_manual gptkb:Label_Studio
available online
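
The gptkbp:is_evaluated_by statement above names the F1 score without saying how it is computed. The following is a minimal sketch of a micro-averaged F1 over labeled entities, assuming each entity is represented as an (entity_id, label) pair. The function name f1_score, the pair representation, and the label strings are illustrative assumptions, not part of FUNSD or of any official evaluation script.

```python
from collections import Counter


def f1_score(gold, predicted):
    """Micro-averaged F1 over (entity_id, label) pairs (hypothetical helper)."""
    gold_counts = Counter(gold)
    pred_counts = Counter(predicted)
    # A prediction counts as a true positive only if the same
    # (entity_id, label) pair also appears in the gold annotations.
    true_pos = sum((gold_counts & pred_counts).values())
    if true_pos == 0:
        return 0.0
    precision = true_pos / sum(pred_counts.values())
    recall = true_pos / sum(gold_counts.values())
    # F1 is the harmonic mean of precision and recall.
    return 2 * precision * recall / (precision + recall)


# Illustrative data: four entities, one of which is mislabeled.
gold = [(0, "header"), (1, "question"), (2, "answer"), (3, "answer")]
predicted = [(0, "header"), (1, "question"), (2, "question"), (3, "answer")]
score = f1_score(gold, predicted)
```

Here three of the four predictions match the gold labels, so precision and recall are both 3/4. Published results may instead use span-level or word-level matching, so scores from this sketch are not directly comparable to them.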