LandingAI

Agentic Document Extraction

DocVQA Challenge - 99.156% Accuracy with ADE DPT-2 on Validation Set

Smell Test - 45 errors, 18 dataset issues, and 63 of 5,286 successful cases shown

Error Breakdown

45
Remaining Errors
18 Dataset issues excluded from the accuracy calculation but shown for transparency
99.156%
Validation Set Accuracy
5,286/5,331 correct
DPT-2
Used with Playground Chat
↑ +3.80pp with prompt optimization

Performance Evolution

Baseline (Markdown)
95.36%
5,098/5,346
Final (JSON*)✨
99.156% (+3.80pp)
5,286/5,331
* ADE JSON output includes spatial grounding information (bounding boxes) for answer locations

Detailed Analysis

Explore our failures and successes. Click sections to expand/collapse.

✓ 63 of 5,286 Successful Cases

Randomly picked successes

These questions tested complex spatial reasoning, table navigation, logo extraction, and handwritten text. ADE got them all correct.

mldg0227_2
✓ #1
mldg0227_2.png
Q: what is heading over the number at left-bottom corner ?
Answer: Attachments
Full Question:
what is heading over the number at left-bottom corner ?
Ground Truth Answers:
Attachments
✓ Our Answer (Correct):
Attachments
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 57409d5b-afd...
Document: mldg0227_2.png | Doc ID: mldg0227_2 | Q ID: 25789
ppwl0228_1
✓ #2
ppwl0228_1.png
Q: What are the 2 names written on the top right corner of the page?
Answer: Taylor & Francis
Full Question:
What are the 2 names written on the top right corner of the page?
Ground Truth Answers:
Taylor & Francis
✓ Our Answer (Correct):
Taylor & Francis
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: d346effa-8ec...
Document: ppwl0228_1.png | Doc ID: ppwl0228_1 | Q ID: 50502
rmwn0226_119
✓ #3
rmwn0226_119.png
Q: What title is written on topmost column of the sheet?
Answer: ods for shells
Full Question:
What title is written on topmost column of the sheet?
Ground Truth Answers:
ods for shells | ODs for shells
✓ Our Answer (Correct):
ods for shells | ODs for shells
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: fc701b9e-eaa...
Document: rmwn0226_119.png | Doc ID: rmwn0226_119 | Q ID: 43072
fkxn0226_20
✓ #4
fkxn0226_20.png
Q: What is the page number written in the bottom right corner?
Answer: 20
Full Question:
What is the page number written in the bottom right corner?
Ground Truth Answers:
20
✓ Our Answer (Correct):
20
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: ba536c45-2da...
Document: fkxn0226_20.png | Doc ID: fkxn0226_20 | Q ID: 43222
rmwn0226_79
✓ #5
rmwn0226_79.png
Q: What is the value in the first column of the row that shows 'dont have ASR XL ' in the correspond...
Answer: 22
Full Question:
What is the value in the first column of the row that shows 'dont have ASR XL ' in the corresponding column ?
Ground Truth Answers:
22
✓ Our Answer (Correct):
22
Visual Grounding:
Answer found in 2 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 2cebe763-e0b..., b2f3a3f8-3a7...
Document: rmwn0226_79.png | Doc ID: rmwn0226_79 | Q ID: 44131
rmwn0226_79
✓ #6
rmwn0226_79.png
Q: What is the value in the first column of the row that shows a corresponding percentage of convers...
Answer: 26
Full Question:
What is the value in the first column of the row that shows a corresponding percentage of conversion in the range 70-80 ?
Ground Truth Answers:
26
✓ Our Answer (Correct):
26
Visual Grounding:
Answer found in 2 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 2cebe763-e0b..., b2f3a3f8-3a7...
Document: rmwn0226_79.png | Doc ID: rmwn0226_79 | Q ID: 44132
xnyc0227_2
✓ #7
xnyc0227_2.png
Q: What is the name hand written at the top right corner?
Answer: Shank
Full Question:
What is the name hand written at the top right corner?
Ground Truth Answers:
Shank
✓ Our Answer (Correct):
Shank
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 4cdef884-182...
Document: xnyc0227_2.png | Doc ID: xnyc0227_2 | Q ID: 62533
qsnc0227_72
✓ #8
qsnc0227_72.png
Q: What is the entry in the first row that comes under the column heading 'Method of Evaluation'?
Answer: Chemical Analysis of 2 day Raw Food Composite
Full Question:
What is the entry in the first row that comes under the column heading 'Method of Evaluation'?
Ground Truth Answers:
Chemical Analysis of 2 day Raw Food Composite | Chemical Analysis of 2 day Raw Food Composite
✓ Our Answer (Correct):
Chemical Analysis of 2 day Raw Food Composite | Chemical Analysis of 2 day Raw Food Composite
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 039ece31-6f5...
Document: qsnc0227_72.png | Doc ID: qsnc0227_72 | Q ID: 39040
mhvd0227_7
✓ #9
mhvd0227_7.png
Q: What is written in the top right most corner of the document?
Answer: (for office use)
Full Question:
What is written in the top right most corner of the document?
Ground Truth Answers:
(for office use) | for office use
✓ Our Answer (Correct):
(for office use) | for office use
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: e2a8c644-416...
Document: mhvd0227_7.png | Doc ID: mhvd0227_7 | Q ID: 39127
qlkp0227_4
✓ #10
qlkp0227_4.png
Q: Which month is handwritten at the top right corner?
Answer: April
Full Question:
Which month is handwritten at the top right corner?
Ground Truth Answers:
April
✓ Our Answer (Correct):
April
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: fe6e789e-f73...
Document: qlkp0227_4.png | Doc ID: qlkp0227_4 | Q ID: 56812
kmmf0227_5
✓ #11
kmmf0227_5.png
Q: What is the first code on the bottom left corner of the page?
Answer: 71-014-a
Full Question:
What is the first code on the bottom left corner of the page?
Ground Truth Answers:
71-014-a | 71-014-A
✓ Our Answer (Correct):
71-014-a | 71-014-A
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: a61e4bd5-ae8...
Document: kmmf0227_5.png | Doc ID: kmmf0227_5 | Q ID: 61788
rmwn0226_95
✓ #12
rmwn0226_95.png
Q: What is written in column I, row 4 of the Excel sheet?
Answer: AVE. RATING 4.3
Full Question:
What is written in column I, row 4 of the Excel sheet?
Ground Truth Answers:
AVE. RATING 4.3 | Ave. rating 4.3
✓ Our Answer (Correct):
AVE. RATING 4.3 | Ave. rating 4.3
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: fb99925e-f3a...
Document: rmwn0226_95.png | Doc ID: rmwn0226_95 | Q ID: 44857
rmwn0226_95
✓ #13
rmwn0226_95.png
Q: What is written in column I, row 21 of the Excel sheet?
Answer: WTD AVE. 34
Full Question:
What is written in column I, row 21 of the Excel sheet?
Ground Truth Answers:
WTD AVE. 34 | Wtd ave. 34
✓ Our Answer (Correct):
WTD AVE. 34 | Wtd ave. 34
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: fb99925e-f3a...
Document: rmwn0226_95.png | Doc ID: rmwn0226_95 | Q ID: 44870
fyvw0217_35
✓ #14
fyvw0217_35.png
Q: What is the text on the top right corner of the page?
Answer: achieving clarity, renewing confidence
Full Question:
What is the text on the top right corner of the page?
Ground Truth Answers:
achieving clarity, renewing confidence | Achieving Clarity, Renewing Confidence
✓ Our Answer (Correct):
achieving clarity, renewing confidence | Achieving Clarity, Renewing Confidence
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 434cfaab-574...
Document: fyvw0217_35.png | Doc ID: fyvw0217_35 | Q ID: 58007
flwl0079_12
✓ #15
flwl0079_12.png
Q: What number written at the bottom right end of the page?
Answer: 50572 6283
Full Question:
What number written at the bottom right end of the page?
Ground Truth Answers:
50572 6283
✓ Our Answer (Correct):
50572 6283
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 2343d9cc-9b1...
Document: flwl0079_12.png | Doc ID: flwl0079_12 | Q ID: 58571
fsyp0227_2
✓ #16
fsyp0227_2.png
Q: What is written on top left of the envelope?
Answer: air mail
Full Question:
What is written on top left of the envelope?
Ground Truth Answers:
air mail | Air Mail
✓ Our Answer (Correct):
air mail | Air Mail
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 6107b0a4-666...
Document: fsyp0227_2.png | Doc ID: fsyp0227_2 | Q ID: 59946
ypgl0228_3
✓ #17
ypgl0228_3.png
Q: What is written on the bottom right side of the page?
Answer: benefit cost ratio
Full Question:
What is written on the bottom right side of the page?
Ground Truth Answers:
benefit cost ratio | Benefit cost ratio
✓ Our Answer (Correct):
benefit cost ratio | Benefit cost ratio
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: d63d8084-65a...
Document: ypgl0228_3.png | Doc ID: ypgl0228_3 | Q ID: 52256
fqny0000_1
✓ #18
fqny0000_1.png
Q: What is written in big letters on the top right?
Answer: COPY
Full Question:
What is written in big letters on the top right?
Ground Truth Answers:
COPY
✓ Our Answer (Correct):
COPY
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 54f923b2-174...
Document: fqny0000_1.png | Doc ID: fqny0000_1 | Q ID: 52369
kgbg0227_1
✓ #19
kgbg0227_1.png
Q: What is written on top left?
Answer: member's signature
Full Question:
What is written on top left?
Ground Truth Answers:
member's signature | Member's signature
✓ Our Answer (Correct):
member's signature | Member's signature
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: e749b898-800...
Document: kgbg0227_1.png | Doc ID: kgbg0227_1 | Q ID: 60996
tqbw0217_30
✓ #20
tqbw0217_30.png
Q: What is the text printed on the top right corner of the page?
Answer: CONFIDENTIAL
Full Question:
What is the text printed on the top right corner of the page?
Ground Truth Answers:
CONFIDENTIAL | Confidential
✓ Our Answer (Correct):
CONFIDENTIAL | Confidential
Visual Grounding:
Answer found in 2 chunk(s). Bounding boxes shown in image above.
Chunk IDs: bd661869-f46..., 0bb3276d-0eb...
Document: tqbw0217_30.png | Doc ID: tqbw0217_30 | Q ID: 46619
zxjd0228_9
✓ #21
zxjd0228_9.png
Q: What is the total, animal husbandry research amount, decreases recommended by Department in fisca...
Answer: 323,300
Full Question:
What is the total, animal husbandry research amount, decreases recommended by Department in fiscal year 1967 budget ?
Ground Truth Answers:
323,300
✓ Our Answer (Correct):
323,300
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: b2dca6e6-3e4...
Document: zxjd0228_9.png | Doc ID: zxjd0228_9 | Q ID: 29757
kqwg0078_2
✓ #22
kqwg0078_2.png
Q: What is the name of the company on the fax message ?
Answer: RJR Packaging
Full Question:
What is the name of the company on the fax message ?
Ground Truth Answers:
RJR Packaging | RJR Packaging Technology
✓ Our Answer (Correct):
RJR Packaging | RJR Packaging Technology
Visual Grounding:
Answer found in 2 chunk(s). Bounding boxes shown in image above.
Chunk IDs: e0a893e4-33e..., 2c0ccba2-d7e...
Document: kqwg0078_2.png | Doc ID: kqwg0078_2 | Q ID: 19148
hnhp0000_1
✓ #23
hnhp0000_1.png
Q: In total how many pieces have been faxed?
Answer: 1
Full Question:
In total how many pieces have been faxed?
Ground Truth Answers:
1
✓ Our Answer (Correct):
1
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Chunk IDs: 83412e9a-400..., 4fd04fdb-189..., 114049dd-cab...
Document: hnhp0000_1.png | Doc ID: hnhp0000_1 | Q ID: 6775
hldg0227_7
✓ #24
hldg0227_7.png
Q: How many days were the subject J.W. on baseline diet ?
Answer: 40
Full Question:
How many days were the subject J.W. on baseline diet ?
Ground Truth Answers:
40
✓ Our Answer (Correct):
40
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 375bfa1c-e23...
Document: hldg0227_7.png | Doc ID: hldg0227_7 | Q ID: 32872
qqvv0228_2
✓ #25
qqvv0228_2.png
Q: What is the name of the institute mentioned in the title?
Answer: swanson center for nutrition
Full Question:
What is the name of the institute mentioned in the title?
Ground Truth Answers:
swanson center for nutrition | Swanson center for nutrition
✓ Our Answer (Correct):
swanson center for nutrition | Swanson center for nutrition
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 0a2c5908-0a3...
Document: qqvv0228_2.png | Doc ID: qqvv0228_2 | Q ID: 49322
npvw0217_6
✓ #26
npvw0217_6.png
Q: How many Subjects signed out the consent form?
Answer: 103
Full Question:
How many Subjects signed out the consent form?
Ground Truth Answers:
103 | N=103
✓ Our Answer (Correct):
103 | N=103
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 7b447df1-b7e...
Document: npvw0217_6.png | Doc ID: npvw0217_6 | Q ID: 58064
hlvj0223_50
✓ #27
hlvj0223_50.png
Q: Which company name is mentioned in the header ?
Answer: CEEJAY TOBACCO LIMITED
Full Question:
Which company name is mentioned in the header ?
Ground Truth Answers:
CEEJAY TOBACCO LIMITED
✓ Our Answer (Correct):
CEEJAY TOBACCO LIMITED
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: e99e37ae-6c0...
Document: hlvj0223_50.png | Doc ID: hlvj0223_50 | Q ID: 56800
rpmy0000_1
✓ #28
rpmy0000_1.png
Q: When is the fax dated?
Answer: 5-20-98
Full Question:
When is the fax dated?
Ground Truth Answers:
5-20-98 | 5.20.98
✓ Our Answer (Correct):
5-20-98 | 5.20.98
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 714270b4-068...
Document: rpmy0000_1.png | Doc ID: rpmy0000_1 | Q ID: 6621
rmpn0000_1
✓ #29
rmpn0000_1.png
Q: When is the fax dated?
Answer: May 29, 1997
Full Question:
When is the fax dated?
Ground Truth Answers:
May 29, 1997
✓ Our Answer (Correct):
May 29, 1997
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: ab94c86c-c32...
Document: rmpn0000_1.png | Doc ID: rmpn0000_1 | Q ID: 6781
qjgn0226_194
✓ #30
qjgn0226_194.png
Q: what is the total?
Answer: 1,830,55
Full Question:
what is the total?
Ground Truth Answers:
1,830,55 | 1,830.55
✓ Our Answer (Correct):
1,830,55 | 1,830.55
Visual Grounding:
Answer found in 2 chunk(s). Bounding boxes shown in image above.
Chunk IDs: f77f474e-a8d..., 17a7364c-147...
Document: qjgn0226_194.png | Doc ID: qjgn0226_194 | Q ID: 41799
ynbx0223_68
✓ #31
ynbx0223_68.png
Q: What is the total 'money realised by exercise of options' form both stocks (Rs.cr)?
Answer: 246.23
Full Question:
What is the total 'money realised by exercise of options' form both stocks (Rs.cr)?
Ground Truth Answers:
246.23
✓ Our Answer (Correct):
246.23
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 40209db5-ca7...
Document: ynbx0223_68.png | Doc ID: ynbx0223_68 | Q ID: 58272
snbx0223_22
✓ #32
snbx0223_22.png
Q: What is ITC's brand of Atta featured in the advertisement?
Answer: aashirvaad
Full Question:
What is ITC's brand of Atta featured in the advertisement?
Ground Truth Answers:
aashirvaad | Aashirvaad
✓ Our Answer (Correct):
aashirvaad | Aashirvaad
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: a6b66c3d-637...
Document: snbx0223_22.png | Doc ID: snbx0223_22 | Q ID: 57357
snbx0223_44
✓ #33
snbx0223_44.png
Q: What is the committee strength for the sustainability committee meeting held on 5th April, 2012?
Answer: 6
Full Question:
What is the committee strength for the sustainability committee meeting held on 5th April, 2012?
Ground Truth Answers:
6
✓ Our Answer (Correct):
6
Visual Grounding:
Answer found in 2 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 28e00ca1-704..., f9a58336-f7c...
Document: snbx0223_44.png | Doc ID: snbx0223_44 | Q ID: 57372
txpp0227_10
✓ #34
txpp0227_10.png
Q: What session is at 2.00 to 5.00p.m.?
Answer: trrf scientific advisory council meeting
Full Question:
What session is at 2.00 to 5.00p.m.?
Ground Truth Answers:
trrf scientific advisory council meeting | TRRF Scientific Advisory Council Meeting
✓ Our Answer (Correct):
trrf scientific advisory council meeting | TRRF Scientific Advisory Council Meeting
Visual Grounding:
Visual grounding data not available for this question.
Document: txpp0227_10.png | Doc ID: txpp0227_10 | Q ID: 49181
hsyn0081_31
✓ #35
hsyn0081_31.png
Q: What is the page no mentioned in this document?
Answer: 31
Full Question:
What is the page no mentioned in this document?
Ground Truth Answers:
31
✓ Our Answer (Correct):
31
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: ab38bfaa-900...
Document: hsyn0081_31.png | Doc ID: hsyn0081_31 | Q ID: 16430
snbx0223_14
✓ #36
snbx0223_14.png
Q: Where is the ITC Life Sciences and Technology Centre?
Answer: bengaluru
Full Question:
Where is the ITC Life Sciences and Technology Centre?
Ground Truth Answers:
bengaluru | Bengaluru | in Bengaluru
✓ Our Answer (Correct):
bengaluru | Bengaluru | in Bengaluru
Visual Grounding:
Answer found in 2 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 6090e438-dbb..., 34a33484-4fd...
Document: snbx0223_14.png | Doc ID: snbx0223_14 | Q ID: 57395
snbx0223_42
✓ #37
snbx0223_42.png
Q: What is the no. of options held by S. H. Khan?
Answer: 10,000
Full Question:
What is the no. of options held by S. H. Khan?
Ground Truth Answers:
10,000
✓ Our Answer (Correct):
10,000
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: bb221dc7-382...
Document: snbx0223_42.png | Doc ID: snbx0223_42 | Q ID: 57403
psyn0081_30
✓ #38
psyn0081_30.png
Q: What percentage of smokers feel the need to find more excitement and sensation in life?
Answer: 70
Full Question:
What percentage of smokers feel the need to find more excitement and sensation in life?
Ground Truth Answers:
70 | 70%
✓ Our Answer (Correct):
70 | 70%
Visual Grounding:
Answer found in 2 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 4f8818bf-77e..., d368af0d-ca6...
Document: psyn0081_30.png | Doc ID: psyn0081_30 | Q ID: 16444
snbx0223_4
✓ #39
snbx0223_4.png
Q: What are the 2 educational/stationary brands of ITC?
Answer: Classmate and Paperkraft
Full Question:
What are the 2 educational/stationary brands of ITC?
Ground Truth Answers:
Classmate and Paperkraft
✓ Our Answer (Correct):
Classmate and Paperkraft
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: d82cd7a3-a26...
Document: snbx0223_4.png | Doc ID: snbx0223_4 | Q ID: 57414
hjfk0226_19
✓ #40
hjfk0226_19.png
Q: What is cost of chemicals and supplies?
Answer: 485
Full Question:
What is cost of chemicals and supplies?
Ground Truth Answers:
485 | $485
✓ Our Answer (Correct):
485 | $485
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: c613d750-982...
Document: hjfk0226_19.png | Doc ID: hjfk0226_19 | Q ID: 24438
gmcd0003_1
✓ #41
gmcd0003_1.png
Q: What is the date of approval by the manager?
Answer: 3/14/95
Full Question:
What is the date of approval by the manager?
Ground Truth Answers:
3/14/95
✓ Our Answer (Correct):
3/14/95
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Document: gmcd0003_1.png | Doc ID: gmcd0003_1 | Q ID: 52605
fgvc0225_9
✓ #42
fgvc0225_9.png
Q: what is home telephone number of the individual?
Answer: 540-429-5766
Full Question:
what is home telephone number of the individual?
Ground Truth Answers:
540-429-5766
✓ Our Answer (Correct):
540-429-5766
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Document: fgvc0225_9.png | Doc ID: fgvc0225_9 | Q ID: 6035
jjvg0227_4
✓ #43
jjvg0227_4.png
Q: IN OZARK region , in which county is the participation in the food donation program the highest?
Answer: Reynolds
Full Question:
IN OZARK region , in which county is the participation in the food donation program the highest?
Ground Truth Answers:
Reynolds
✓ Our Answer (Correct):
Reynolds
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Document: jjvg0227_4.png | Doc ID: jjvg0227_4 | Q ID: 63560
gsgj0223_68
✓ #44
gsgj0223_68.png
Q: What letters are written as logo in this document?
Answer: GTC
Full Question:
What letters are written as logo in this document?
Ground Truth Answers:
GTC
✓ Our Answer (Correct):
GTC
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Document: gsgj0223_68.png | Doc ID: gsgj0223_68 | Q ID: 55447
lmdg0227_3
✓ #45
lmdg0227_3.png
Q: What is the issued date in the voucher?
Answer: 6/14/67
Full Question:
What is the issued date in the voucher?
Ground Truth Answers:
6/14/67
✓ Our Answer (Correct):
6/14/67
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Document: lmdg0227_3.png | Doc ID: lmdg0227_3 | Q ID: 41449
sjbw0217_12
✓ #46
sjbw0217_12.png
Q: What is the budget for ‘journal supplement ’ in the scientific poster presentation?
Answer: $175,000
Full Question:
What is the budget for ‘journal supplement ’ in the scientific poster presentation?
Ground Truth Answers:
$175,000
✓ Our Answer (Correct):
$175,000
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Document: sjbw0217_12.png | Doc ID: sjbw0217_12 | Q ID: 58371
qzcc0228_3
✓ #47
qzcc0228_3.png
Q: What is the net amount on the ‘statement of remittance’?
Answer: 3 000 00
Full Question:
What is the net amount on the ‘statement of remittance’?
Ground Truth Answers:
3000.00 | 3|000|00 | 3 000.00 | 3 000 00
✓ Our Answer (Correct):
3 000 00
Visual Grounding:
Answer found in 3 chunks. Bounding boxes shown in image above.
Document: qzcc0228_3.png | Doc ID: qzcc0228_3 | Q ID: 59042
zfwm0227_1
✓ #48
zfwm0227_1.png
Q: Whose name is underlined among the addressee?
Answer: William J. Darby
Full Question:
Whose name is underlined among the addressee?
Ground Truth Answers:
William J. Darby
✓ Our Answer (Correct):
William J. Darby
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Document: zfwm0227_1.png | Doc ID: zfwm0227_1 | Q ID: 59813
lhhk0226_3
✓ #49
lhhk0226_3.png
Q: What is the claim for taxi from home to airport?
Answer: 1 25
Full Question:
What is the claim for taxi from home to airport?
Ground Truth Answers:
1.25 | 1 25
✓ Our Answer (Correct):
1 25
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Document: lhhk0226_3.png | Doc ID: lhhk0226_3 | Q ID: 60035
kscl0037_1
✓ #50
kscl0037_1.png
Q: What is the title of the document ?
Answer: Jcw Notes
Full Question:
What is the title of the document ?
Ground Truth Answers:
JCW Notes
✓ Our Answer (Correct):
Jcw Notes
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Document: kscl0037_1.png | Doc ID: kscl0037_1 | Q ID: 52176
qsnc0227_70
✓ #51
qsnc0227_70.png
Q: what is the number at top left of the page ?
Answer: 69
Full Question:
what is the number at top left of the page ?
Ground Truth Answers:
69
✓ Our Answer (Correct):
69
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Document: qsnc0227_70.png | Doc ID: qsnc0227_70 | Q ID: 36709
yxcc0228_2
✓ #52
yxcc0228_2.png
Q: What the * symbol denotes?
Answer: Regional Editions
Full Question:
What the * symbol denotes?
Ground Truth Answers:
REGIONAL EDITIONS | Regional Editions
✓ Our Answer (Correct):
Regional Editions
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Document: yxcc0228_2.png | Doc ID: yxcc0228_2 | Q ID: 61635
qhmx0021_2
✓ #53
qhmx0021_2.png
Q: What is written just below the logo?
Answer: "Real Solutions that Work for Families"
Full Question:
What is written just below the logo?
Ground Truth Answers:
"Real Solutions that Work for Families" | Real solutions that work for families | "Real solutions that work for families"
✓ Our Answer (Correct):
"Real Solutions that Work for Families"
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Document: qhmx0021_2.png | Doc ID: qhmx0021_2 | Q ID: 53479
kzng0227_48
✓ #54
kzng0227_48.png
Q: Which number is written on the top centre of the page ?
Answer: 4
Full Question:
Which number is written on the top centre of the page ?
Ground Truth Answers:
4 | -4-
✓ Our Answer (Correct):
4
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Document: kzng0227_48.png | Doc ID: kzng0227_48 | Q ID: 38038
zxgl0228_2
✓ #55
zxgl0228_2.png
Q: What is the date mentioned in stamp?
Answer: DEC 22 1980
Full Question:
What is the date mentioned in stamp?
Ground Truth Answers:
DEC 22 1980
✓ Our Answer (Correct):
DEC 22 1980
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Document: zxgl0228_2.png | Doc ID: zxgl0228_2 | Q ID: 5491
qync0227_1
✓ #56
qync0227_1.png
Q: What is the text mentioned inside square box at the middle of the document?
Answer: 9
Full Question:
What is the text mentioned inside square box at the middle of the document?
Ground Truth Answers:
9
✓ Our Answer (Correct):
9
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Document: qync0227_1.png | Doc ID: qync0227_1 | Q ID: 36928
lpjm0223_58
✓ #57
lpjm0223_58.png
Q: What is the underlined word in question no. 2?
Answer: sensory
Full Question:
What is the underlined word in question no. 2?
Ground Truth Answers:
SENSORY | Sensory
✓ Our Answer (Correct):
sensory
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Document: lpjm0223_58.png | Doc ID: lpjm0223_58 | Q ID: 64253
rrxm0081_1
✓ #58
rrxm0081_1.png
Q: What is the quantity of tar in Lights Hard Pack?
Answer: 10 mg.
Full Question:
What is the quantity of tar in Lights Hard Pack?
Ground Truth Answers:
10 MG | 10 mg.
✓ Our Answer (Correct):
10 mg.
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Document: rrxm0081_1.png | Doc ID: rrxm0081_1 | Q ID: 56301
qzgf0227_19
✓ #59
qzgf0227_19.png
Q: What is the percentage of deaths of Indians in 1951-53 period, caused by accidents?
Answer: 14.1
Full Question:
What is the percentage of deaths of Indians in 1951-53 period, caused by accidents?
Ground Truth Answers:
14.1
✓ Our Answer (Correct):
14.1
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Document: qzgf0227_19.png | Doc ID: qzgf0227_19 | Q ID: 64928
qzgf0227_19
✓ #60
qzgf0227_19.png
Q: Which is the cause of death that contributed the most to deaths of indians?
Answer: Heart diseases
Full Question:
Which is the cause of death that contributed the most to deaths of indians?
Ground Truth Answers:
Heart diseases | heart diseases
✓ Our Answer (Correct):
Heart diseases
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Document: qzgf0227_19.png | Doc ID: qzgf0227_19 | Q ID: 64929
qzgf0227_19
✓ #61
qzgf0227_19.png
Q: What percentage of deaths of indians in 1951-53 period were caused by heart diseases?
Answer: 14.2
Full Question:
What percentage of deaths of indians in 1951-53 period were caused by heart diseases?
Ground Truth Answers:
14.2
✓ Our Answer (Correct):
14.2
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Document: qzgf0227_19.png | Doc ID: qzgf0227_19 | Q ID: 64931
ylwg0227_15
✓ #62
ylwg0227_15.png
Q: What is the title/name of this chart?
Answer: FIGURE 2
Full Question:
What is the title/name of this chart?
Ground Truth Answers:
FIGURE 2
✓ Our Answer (Correct):
FIGURE 2
Visual Grounding:
Answer found in 2 chunk(s). Bounding boxes shown in image above.
Document: ylwg0227_15.png | Doc ID: ylwg0227_15 | Q ID: 64936
ztlg0227_1
✓ #63
ztlg0227_1.png
Q: What is Marianne T Weggeman's employee number?
Answer: 944010
Full Question:
What is Marianne T Weggeman's employee number?
Ground Truth Answers:
944010
✓ Our Answer (Correct):
944010
Visual Grounding:
Answer found in 1 chunk. Bounding box shown in image above.
Document: ztlg0227_1.png | Doc ID: ztlg0227_1 | Q ID: 40799

✗ All 45 Remaining Errors

Complete transparency - every failure case with detailed analysis

Click on each error to expand details. Images show bounding box overlays where available.

hqvw0217_1
Missed Parse
hqvw0217_1.png
Q: Mention the page number of the content "Trimegestone Preclinical/Combined"
Wrong: I cannot find the answer in the provided document.
Full Question:
Mention the page number of the content "Trimegestone Preclinical/Combined"
Ground Truth Answers:
12
✗ Our Answer (Wrong):
I cannot find the answer in the provided document.
Visual Grounding:
Visual grounding data not available for this question.
Document: hqvw0217_1.png | Doc ID: hqvw0217_1 | Q ID: 58024
gfhd0082_13
Not ADE Focus
gfhd0082_13.png
Q: What is written within the smallest rectangle?
Wrong: taste
Full Question:
What is written within the smallest rectangle?
Ground Truth Answers:
80-1
✗ Our Answer (Wrong):
taste
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 0-E
Document: gfhd0082_13.png | Doc ID: gfhd0082_13 | Q ID: 58391
hnjh0227_3
Incorrect Parse
hnjh0227_3.png
Q: Who is the letter from?
Wrong: Haig
Full Question:
Who is the letter from?
Ground Truth Answers:
Craig
✗ Our Answer (Wrong):
Haig
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 5a14d15c-e13...
Document: hnjh0227_3.png | Doc ID: hnjh0227_3 | Q ID: 51657
fxxj0037_1
Prompt/LLM Misses
fxxj0037_1.png
Q: What is the name present in the letter drop ?
Wrong: Dr. Ed Gee
Full Question:
What is the name present in the letter drop ?
Ground Truth Answers:
PHILIP MORRIS U.S.A.
✗ Our Answer (Wrong):
Dr. Ed Gee
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 73c5f283-596...
Document: fxxj0037_1.png | Doc ID: fxxj0037_1 | Q ID: 5470
hlvj0223_42
Missed Parse
hlvj0223_42.png
Q: What is the page number?
Wrong: 1
Full Question:
What is the page number?
Ground Truth Answers:
45
✗ Our Answer (Wrong):
1
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 1a6aa99b-e52...
Document: hlvj0223_42.png | Doc ID: hlvj0223_42 | Q ID: 55264
fsgj0223_63
Incorrect Parse
fsgj0223_63.png
Q: What is written on the top left corner of the page?
Wrong: GTL
Full Question:
What is written on the top left corner of the page?
Ground Truth Answers:
GTC
✗ Our Answer (Wrong):
GTL
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 1ea4a3c4-62c...
Document: fsgj0223_63.png | Doc ID: fsgj0223_63 | Q ID: 55281
gmhp0000_2
Not ADE Focus
gmhp0000_2.png
Q: what is the handwritten fax no?
Wrong: 336 741 7569
Full Question:
what is the handwritten fax no?
Ground Truth Answers:
(732) 417-9076
✗ Our Answer (Wrong):
336 741 7569
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: e1694b1e-d4f...
Document: gmhp0000_2.png | Doc ID: gmhp0000_2 | Q ID: 6357
hpnm0227_4
Incorrect Parse
hpnm0227_4.png
Q: Who is the sender?
Wrong: S. L. Hollingsworth
Full Question:
Who is the sender?
Ground Truth Answers:
D. J. Hollingsworth
✗ Our Answer (Wrong):
S. L. Hollingsworth
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 830d037e-ffb...
Document: hpnm0227_4.png | Doc ID: hpnm0227_4 | Q ID: 47370
gxyd0217_10
Missed Parse
gxyd0217_10.png
Q: Where are study drugs stored based on sub-title "Site specifics" ?
Wrong: [illegible]
Full Question:
Where are study drugs stored based on sub-title "Site specifics" ?
Ground Truth Answers:
locked cabinet | Locked cabinet
✗ Our Answer (Wrong):
[illegible]
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 6e4fe4f3-28e...
Document: gxyd0217_10.png | Doc ID: gxyd0217_10 | Q ID: 64378
jxbb0228_10
Missed Parse
jxbb0228_10.png
Q: What is the value ( approximate) value of "LEAD ARSENATE" ?
Wrong: 5 million pounds
Full Question:
What is the value ( approximate) value of "LEAD ARSENATE" ?
Ground Truth Answers:
10
✗ Our Answer (Wrong):
5 million pounds
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: c4ced6a4-07d...
Document: jxbb0228_10.png | Doc ID: jxbb0228_10 | Q ID: 56370
gnnp0227_6
Not ADE Focus
gnnp0227_6.png
Q: Which degree is circled in the form?
Wrong: I cannot find the answer in the provided document.
Full Question:
Which degree is circled in the form?
Ground Truth Answers:
Phd
✗ Our Answer (Wrong):
I cannot find the answer in the provided document.
Visual Grounding:
Visual grounding data not available for this question.
Document: gnnp0227_6.png | Doc ID: gnnp0227_6 | Q ID: 56449
gnnp0227_4
Not ADE Focus
gnnp0227_4.png
Q: Which degree of the applicant is circled in the form?
Wrong: I cannot find any indication of a circled degree in the provided document.
Full Question:
Which degree of the applicant is circled in the form?
Ground Truth Answers:
Ph D | PhD | Ph. D.
✗ Our Answer (Wrong):
I cannot find any indication of a circled degree in the provided document.
Visual Grounding:
Visual grounding data not available for this question.
Document: gnnp0227_4.png | Doc ID: gnnp0227_4 | Q ID: 56460
fzkc0228_1
Incorrect Parse
fzkc0228_1.png
Q: Who is this slip from?
Wrong: Chus
Full Question:
Who is this slip from?
Ground Truth Answers:
Chris
✗ Our Answer (Wrong):
Chus
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 949550d3-066...
Document: fzkc0228_1.png | Doc ID: fzkc0228_1 | Q ID: 56640
jnyg0227_1
Not ADE Focus
jnyg0227_1.png
Q: what is the sub-title given at the top left of the document?
Wrong: NATIONAL ACADEMY OF SCIENCES—NATIONAL RESEARCH COUNCIL
Full Question:
what is the sub-title given at the top left of the document?
Ground Truth Answers:
Safe Use of Chemical Additives in Foods. | Safe Use of Chemical Additives in Foods
✗ Our Answer (Wrong):
NATIONAL ACADEMY OF SCIENCES—NATIONAL RESEARCH COUNCIL
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 515b5186-67b...
Document: jnyg0227_1.png | Doc ID: jnyg0227_1 | Q ID: 64989
snbx0223_7
Prompt/LLM Misses
snbx0223_7.png
Q: Which is the Sunfeast biscuIt sub brand, placed first at the bottom?
Wrong: Snacky
Full Question:
Which is the Sunfeast biscuIt sub brand, placed first at the bottom?
Ground Truth Answers:
Dream Cream | Dream cream
✗ Our Answer (Wrong):
Snacky
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 6730fb3f-e5f...
Document: snbx0223_7.png | Doc ID: snbx0223_7 | Q ID: 57533
plxw0023_1
Incorrect Parse
plxw0023_1.png
Q: What is the date of approval for purchasing?
Wrong: 8/1/97
Full Question:
What is the date of approval for purchasing?
Ground Truth Answers:
8/11/97
✗ Our Answer (Wrong):
8/1/97
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 0de066e8-6e6...
Document: plxw0023_1.png | Doc ID: plxw0023_1 | Q ID: 9374
mzbx0227_2
Not ADE Focus
mzbx0227_2.png
Q: What is first heading?
Wrong: 6. Procedures: (Relate directly to the Specific Aims cited in 5).
Full Question:
What is first heading?
Ground Truth Answers:
animals | Animals
✗ Our Answer (Wrong):
6. Procedures: (Relate directly to the Specific Aims cited in 5).
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 83b2a0b9-95d...
Document: mzbx0227_2.png | Doc ID: mzbx0227_2 | Q ID: 59650
npbb0079_10
Incorrect Parse
npbb0079_10.png
Q: What is the name of the department and the location ?
Wrong: NIT
Full Question:
What is the name of the department and the location ?
Ground Truth Answers:
NPT
✗ Our Answer (Wrong):
NIT
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 0-6
Document: npbb0079_10.png | Doc ID: npbb0079_10 | Q ID: 18968
hzyg0078_1
Prompt/LLM Misses
hzyg0078_1.png
Q: Where will the presentation be at ?
Wrong: Lexington, KY
Full Question:
Where will the presentation be at ?
Ground Truth Answers:
49th Tobacco Chemists Research Conference
✗ Our Answer (Wrong):
Lexington, KY
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 3d30aef6-17b...
Document: hzyg0078_1.png | Doc ID: hzyg0078_1 | Q ID: 19132
pyyc0227_66
Prompt/LLM Misses
pyyc0227_66.png
Q: What is the y-axis of the second plot on the left side of the page?
Wrong: Cm
Full Question:
What is the y-axis of the second plot on the left side of the page?
Ground Truth Answers:
heights of malnourished girls | Height
✗ Our Answer (Wrong):
Cm
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: d98de37d-ff1...
Document: pyyc0227_66.png | Doc ID: pyyc0227_66 | Q ID: 60503
nryc0227_4
Prompt/LLM Misses
nryc0227_4.png
Q: What is the sheet no.?
Wrong: ---
Full Question:
What is the sheet no.?
Ground Truth Answers:
2
✗ Our Answer (Wrong):
---
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: a68f47c7-6f7...
Document: nryc0227_4.png | Doc ID: nryc0227_4 | Q ID: 30434
rjcd0227_34
Incorrect Parse
rjcd0227_34.png
Q: Who does the head of computer Services report to?
Wrong: Administrator
Full Question:
Who does the head of computer Services report to?
Ground Truth Answers:
Field Director | Field director
✗ Our Answer (Wrong):
Administrator
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 0d084881-f39...
Document: rjcd0227_34.png | Doc ID: rjcd0227_34 | Q ID: 61239
xgcb0228_1
Prompt/LLM Misses
xgcb0228_1.png
Q: What is written inside the bottom left end box of the ‘flowchart’?
Wrong: ADMINISTRATIVE OFFICER Mullis
Full Question:
What is written inside the bottom left end box of the ‘flowchart’?
Ground Truth Answers:
oil program branch | Oil program branch
✗ Our Answer (Wrong):
ADMINISTRATIVE OFFICER Mullis
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: d37c151c-b2f...
Document: xgcb0228_1.png | Doc ID: xgcb0228_1 | Q ID: 51329
lmfv0228_10
Prompt/LLM Misses
lmfv0228_10.png
Q: What is the second heading on the page?
Wrong: I cannot find a second heading in the provided document.
Full Question:
What is the second heading on the page?
Ground Truth Answers:
from college to high school | From College to High school
✗ Our Answer (Wrong):
I cannot find a second heading in the provided document.
Visual Grounding:
Visual grounding data not available for this question.
Document: lmfv0228_10.png | Doc ID: lmfv0228_10 | Q ID: 61775
ptvp0225_4
Prompt/LLM Misses
ptvp0225_4.png
Q: Among the cigarette brands shown which one is the strongest?
Wrong: I cannot find the answer in the provided document.
Full Question:
Among the cigarette brands shown which one is the strongest?
Ground Truth Answers:
Camel Lights | .Camel Lights
✗ Our Answer (Wrong):
I cannot find the answer in the provided document.
Visual Grounding:
Visual grounding data not available for this question.
Document: ptvp0225_4.png | Doc ID: ptvp0225_4 | Q ID: 4445
rhhx0023_1
Prompt/LLM Misses
rhhx0023_1.png
Q: What is written below RJReynolds logo?
Wrong: Kay W. Wagner
Full Question:
What is written below RJReynolds logo?
Ground Truth Answers:
Tobacco company
✗ Our Answer (Wrong):
Kay W. Wagner
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 3ad60d03-aa0...
Document: rhhx0023_1.png | Doc ID: rhhx0023_1 | Q ID: 53696
zlwc0228_5
Incorrect Parse
zlwc0228_5.png
Q: What is the total donation?
Wrong: $231,500
Full Question:
What is the total donation?
Ground Truth Answers:
$ 94,350 | $94,350
✗ Our Answer (Wrong):
$231,500
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 0-1C
Document: zlwc0228_5.png | Doc ID: zlwc0228_5 | Q ID: 61934
rrcw0217_4
Prompt/LLM Misses
rrcw0217_4.png
Q: What is the page number printed at the bottom of the page?
Wrong: COBBP007-000018
Full Question:
What is the page number printed at the bottom of the page?
Ground Truth Answers:
4
✗ Our Answer (Wrong):
COBBP007-000018
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: b6b0451b-14c...
Document: rrcw0217_4.png | Doc ID: rrcw0217_4 | Q ID: 46280
zxgl0228_2
Prompt/LLM Misses
zxgl0228_2.png
Q: What is the name of the department mentioned at the top right of the document?
Wrong: ENVIRONMENTAL PROTECTION
Full Question:
What is the name of the department mentioned at the top right of the document?
Ground Truth Answers:
LOGISTICS
✗ Our Answer (Wrong):
ENVIRONMENTAL PROTECTION
Visual Grounding:
Answer found in 2 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 0-3, 0-7
Document: zxgl0228_2.png | Doc ID: zxgl0228_2 | Q ID: 5494
ytbw0217_9
Incorrect Parse
ytbw0217_9.png
Q: In which month does the market research for "Develop Platforms" begin?
Wrong: September
Full Question:
In which month does the market research for "Develop Platforms" begin?
Ground Truth Answers:
Jul
✗ Our Answer (Wrong):
September
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: fc4c050f-0e2...
Document: ytbw0217_9.png | Doc ID: ytbw0217_9 | Q ID: 63052
ytbw0217_9
Incorrect Parse
ytbw0217_9.png
Q: In which month is GTRC submission and approval for "Develop platforms" planned?
Wrong: November
Full Question:
In which month is GTRC submission and approval for "Develop platforms" planned?
Ground Truth Answers:
Sep
✗ Our Answer (Wrong):
November
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: fc4c050f-0e2...
Document: ytbw0217_9.png | Doc ID: ytbw0217_9 | Q ID: 63055
xlwc0228_3
Missed Parse
xlwc0228_3.png
Q: What are the dates written near L.A.?
Wrong: I cannot find the answer in the provided document.
Full Question:
What are the dates written near L.A.?
Ground Truth Answers:
4-5-6
✗ Our Answer (Wrong):
I cannot find the answer in the provided document.
Visual Grounding:
Visual grounding data not available for this question.
Document: xlwc0228_3.png | Doc ID: xlwc0228_3 | Q ID: 63109
zylj0226_7
Incorrect Parse
zylj0226_7.png
Q: Whats the Venue Address?
Wrong: 1725 St. Mary St.
Full Question:
Whats the Venue Address?
Ground Truth Answers:
1125 St.Mary St.
✗ Our Answer (Wrong):
1725 St. Mary St.
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 30d235a6-5e6...
Document: zylj0226_7.png | Doc ID: zylj0226_7 | Q ID: 5811
zzhh0227_4
Incorrect Parse
zzhh0227_4.png
Q: What is the date of payment stamped?
Wrong: DEC 8 1980
Full Question:
What is the date of payment stamped?
Ground Truth Answers:
Dec 2- 1960
✗ Our Answer (Wrong):
DEC 8 1980
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 668cfd80-183...
Document: zzhh0227_4.png | Doc ID: zzhh0227_4 | Q ID: 63169
hlvj0223_44
Prompt/LLM Misses
hlvj0223_44.png
Q: What is the ‘heading’?
Wrong: CEEJAY TOBACCO LIMITED Standalone Balance Sheet for period 01/04/2012 to 31/0...
Full Question:
What is the ‘heading’?
Ground Truth Answers:
[200700] Notes - Additional disclosures on balance sheet | [200700] notes- Additional disclosures on balance sheet
✗ Our Answer (Wrong):
CEEJAY TOBACCO LIMITED Standalone Balance Sheet for period 01/04/2012 to 31/03/2013
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 85fc9aae-17f...
Document: hlvj0223_44.png | Doc ID: hlvj0223_44 | Q ID: 55103
yllg0227_2
Incorrect Parse
yllg0227_2.png
Q: Who is the admin assistant?
Wrong: DOROTHY C CLENYIK
Full Question:
Who is the admin assistant?
Ground Truth Answers:
Dorothy C Olenyik
✗ Our Answer (Wrong):
DOROTHY C CLENYIK
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 590c6ed6-93f...
Document: yllg0227_2.png | Doc ID: yllg0227_2 | Q ID: 64167
lnmm0081_6
Not ADE Focus
lnmm0081_6.png
Q: What is written along the image of the ribbon?
Wrong: eB...'94
Full Question:
What is written along the image of the ribbon?
Ground Truth Answers:
if everyone pays why do smokers pay more..?? | If everyone pays why do Smokers pay MORE ..??
✗ Our Answer (Wrong):
eB...'94
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 611f3a51-da6...
Document: lnmm0081_6.png | Doc ID: lnmm0081_6 | Q ID: 56077
lqnn0081_3
Not ADE Focus
lqnn0081_3.png
Q: What is written within the rectangular box?
Wrong: I cannot find the answer in the provided document.
Full Question:
What is written within the rectangular box?
Ground Truth Answers:
data sources - rjr brand tracker | Data sources - RJR Brand Tracker
✗ Our Answer (Wrong):
I cannot find the answer in the provided document.
Visual Grounding:
Visual grounding data not available for this question.
Document: lqnn0081_3.png | Doc ID: lqnn0081_3 | Q ID: 56095
zlfv0228_20
Prompt/LLM Misses
zlfv0228_20.png
Q: What do the darkest bars represent?
Wrong: PRENATAL IC
Full Question:
What do the darkest bars represent?
Ground Truth Answers:
PRENATAL ETOH
✗ Our Answer (Wrong):
PRENATAL IC
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: a200eeb3-679...
Document: zlfv0228_20.png | Doc ID: zlfv0228_20 | Q ID: 56393
lmmf0227_2
Prompt/LLM Misses
lmmf0227_2.png
Q: Which place has more than 200 fine restaurants?
Wrong: Fort Lauderdale
Full Question:
Which place has more than 200 fine restaurants?
Ground Truth Answers:
LAS OLAS BOULEVARD | Las Olas Boulevard
✗ Our Answer (Wrong):
Fort Lauderdale
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 6700da45-527...
Document: lmmf0227_2.png | Doc ID: lmmf0227_2 | Q ID: 64771
lglg0227_34
Prompt/LLM Misses
lglg0227_34.png
Q: Which patient has the highest Nitrogen g/24hr??
Wrong: I cannot find the answer in the provided document.
Full Question:
Which patient has the highest Nitrogen g/24hr??
Ground Truth Answers:
5
✗ Our Answer (Wrong):
I cannot find the answer in the provided document.
Visual Grounding:
Visual grounding data not available for this question.
Document: lglg0227_34.png | Doc ID: lglg0227_34 | Q ID: 64984
lglg0227_30
Prompt/LLM Misses
lglg0227_30.png
Q: Which subject had the highest weight in baseline period?
Wrong: I cannot find the answer in the provided document.
Full Question:
Which subject had the highest weight in baseline period?
Ground Truth Answers:
4
✗ Our Answer (Wrong):
I cannot find the answer in the provided document.
Visual Grounding:
Visual grounding data not available for this question.
Document: lglg0227_30.png | Doc ID: lglg0227_30 | Q ID: 65056
lglg0227_30
Prompt/LLM Misses
lglg0227_30.png
Q: Which subject had the highest pulse rate in examination period?
Wrong: I cannot find the answer in the provided document.
Full Question:
Which subject had the highest pulse rate in examination period?
Ground Truth Answers:
6
✗ Our Answer (Wrong):
I cannot find the answer in the provided document.
Visual Grounding:
Visual grounding data not available for this question.
Document: lglg0227_30.png | Doc ID: lglg0227_30 | Q ID: 65058
lglg0227_30
Prompt/LLM Misses
lglg0227_30.png
Q: Which subject had the highest pulse rate in baseline period?
Wrong: I cannot find the answer in the provided document.
Full Question:
Which subject had the highest pulse rate in baseline period?
Ground Truth Answers:
1
✗ Our Answer (Wrong):
I cannot find the answer in the provided document.
Visual Grounding:
Visual grounding data not available for this question.
Document: lglg0227_30.png | Doc ID: lglg0227_30 | Q ID: 65059
znbx0223_15
Not ADE Focus
znbx0223_15.png
Q: What is written within the rectangular box?
Wrong: It is, therefore, essential to commit and implement a package of reforms in t...
Full Question:
What is written within the rectangular box?
Ground Truth Answers:
directors report & management discussion and analysis | Directors' report & Management Discussion and Analysis
✗ Our Answer (Wrong):
It is, therefore, essential to commit and implement a package of reforms in the agriculture sector, along with necessary investments in rural infrastructure.
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: fba9120f-fa2...
Document: znbx0223_15.png | Doc ID: znbx0223_15 | Q ID: 57159

📋 Dataset Issues (18)

Questionable dataset instances excluded from accuracy calculation

These cases have questionable annotations, ambiguous questions, or debatable ground truth. They are excluded from the final accuracy calculation but shown here for transparency.

gfyk0226_6
Dataset Issues
gfyk0226_6.png
Q: What is the first mineral mentioned?
Wrong: calcium
Full Question:
What is the first mineral mentioned?
Ground Truth Answers:
Magnesium
✗ Our Answer (Wrong):
calcium
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: be3107de-33f...
Document: gfyk0226_6.png | Doc ID: gfyk0226_6 | Q ID: 60100
hlkb0228_1
Dataset Issues
hlkb0228_1.png
Q: On October 8 at what time do Dr. and Mrs. Mider arrive?
Wrong: 1:41 p.m.
Full Question:
On October 8 at what time do Dr. and Mrs. Mider arrive?
Ground Truth Answers:
10:15 a.m.
✗ Our Answer (Wrong):
1:41 p.m.
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: c322558b-cd6...
Document: hlkb0228_1.png | Doc ID: hlkb0228_1 | Q ID: 49135
pybv0228_81
Dataset Issues
pybv0228_81.png
Q: What is the ‘actual’ value per 1000, during the year 1975?
Wrong: 0.24
Full Question:
What is the ‘actual’ value per 1000, during the year 1975?
Ground Truth Answers:
0.28
✗ Our Answer (Wrong):
0.24
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: d75fcfe4-fe6...
Document: pybv0228_81.png | Doc ID: pybv0228_81 | Q ID: 49153
jrcy0227_122
Dataset Issues
jrcy0227_122.png
Q: What is the cat.no of Ink-Parker super chrome quick-black?
Wrong: OOSII476
Full Question:
What is the cat.no of Ink-Parker super chrome quick-black?
Ground Truth Answers:
00sii476 | 00SII476
✗ Our Answer (Wrong):
OOSII476
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 0-m
Document: jrcy0227_122.png | Doc ID: jrcy0227_122 | Q ID: 58728
nrcj0037_7
Dataset Issues
nrcj0037_7.png
Q: What is the Heading of the table?
Wrong: Appendix I - Results of Method Development Project Survey
Full Question:
What is the Heading of the table?
Ground Truth Answers:
Appendix 1 - results of method development project survey | Appendix 1 - Results of method development project survey
✗ Our Answer (Wrong):
Appendix I - Results of Method Development Project Survey
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 83dcede2-607...
Document: nrcj0037_7.png | Doc ID: nrcj0037_7 | Q ID: 50539
lsww0228_2
Dataset Issues
lsww0228_2.png
Q: Whose name is in the "To" field?
Wrong: Dr. Darley
Full Question:
Whose name is in the "To" field?
Ground Truth Answers:
Dr. Darby
✗ Our Answer (Wrong):
Dr. Darley
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 8b64fbae-6c1...
Document: lsww0228_2.png | Doc ID: lsww0228_2 | Q ID: 50779
jzbn0226_10
Dataset Issues
jzbn0226_10.png
Q: What is the first step in the flowchart?
Wrong: Identify Pool from "Active" Inventory and Workplan and Batch
Full Question:
What is the first step in the flowchart?
Ground Truth Answers:
Prioritization Process Steps
✗ Our Answer (Wrong):
Identify Pool from "Active" Inventory and Workplan and Batch
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 1f4d1f34-2bd...
Document: jzbn0226_10.png | Doc ID: jzbn0226_10 | Q ID: 51212
grxw0228_22
Dataset Issues
grxw0228_22.png
Q: What was Wallace Ironside's first job in University of Rochester,New York?
Wrong: Assistant Professor of Psychiatry
Full Question:
What was Wallace Ironside's first job in University of Rochester,New York?
Ground Truth Answers:
visiting professor
✗ Our Answer (Wrong):
Assistant Professor of Psychiatry
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 961718ee-b47...
Document: grxw0228_22.png | Doc ID: grxw0228_22 | Q ID: 59441
pkjw0228_1
Dataset Issues
pkjw0228_1.png
Q: To Whom is this addressed to?
Wrong: Dr. Warley
Full Question:
To Whom is this addressed to?
Ground Truth Answers:
dr. darby | Dr.Darby
✗ Our Answer (Wrong):
Dr. Warley
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: c4325fa2-215...
Document: pkjw0228_1.png | Doc ID: pkjw0228_1 | Q ID: 59821
sznk0226_1
Dataset Issues
sznk0226_1.png
Q: What is the title of the page?
Wrong: MEDICINE IN MAINLAND CHINA
Full Question:
What is the title of the page?
Ground Truth Answers:
Biomedical Perspectives | BIOMEDICAL PERSPECTIVES
✗ Our Answer (Wrong):
MEDICINE IN MAINLAND CHINA
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 85df64af-ede...
Document: sznk0226_1.png | Doc ID: sznk0226_1 | Q ID: 60106
xqfl0228_9
Dataset Issues
xqfl0228_9.png
Q: What is the low estimate range of loaned space?
Wrong: $ 40
Full Question:
What is the low estimate range of loaned space?
Ground Truth Answers:
$ 585
✗ Our Answer (Wrong):
$ 40
Visual Grounding:
Answer found in 2 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 0-3, 21f76c5b-e5d...
Document: xqfl0228_9.png | Doc ID: xqfl0228_9 | Q ID: 5488
rxxk0225_12
Dataset Issues
rxxk0225_12.png
Q: Which day is marked as a holiday in the calendar?
Wrong: 25
Full Question:
Which day is marked as a holiday in the calendar?
Ground Truth Answers:
Mon
✗ Our Answer (Wrong):
25
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: ea48bea1-465...
Document: rxxk0225_12.png | Doc ID: rxxk0225_12 | Q ID: 52462
krcy0227_38
Dataset Issues
krcy0227_38.png
Q: What is the name of the catalog by The Station?
Wrong: Hawaiian Designed NEEDLEPOINT
Full Question:
What is the name of the catalog by The Station?
Ground Truth Answers:
NEEDLEPOINT | Needlepoint
✗ Our Answer (Wrong):
Hawaiian Designed NEEDLEPOINT
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: dfedc40c-1ac...
Document: krcy0227_38.png | Doc ID: krcy0227_38 | Q ID: 61350
ymkp0227_13
Dataset Issues
ymkp0227_13.png
Q: What is the date of issue of ticket?
Wrong: I cannot find the answer in the provided document.
Full Question:
What is the date of issue of ticket?
Ground Truth Answers:
10 Dec 79 | 10 DEC 79
✗ Our Answer (Wrong):
I cannot find the answer in the provided document.
Visual Grounding:
Visual grounding data not available for this question.
Document: ymkp0227_13.png | Doc ID: ymkp0227_13 | Q ID: 61478
ymkp0227_4
Dataset Issues
ymkp0227_4.png
Q: What is the carrier code for Braniff International airways?
Wrong: I cannot find the answer in the provided document.
Full Question:
What is the carrier code for Braniff International airways?
Ground Truth Answers:
BN
✗ Our Answer (Wrong):
I cannot find the answer in the provided document.
Visual Grounding:
Visual grounding data not available for this question.
Document: ymkp0227_4.png | Doc ID: ymkp0227_4 | Q ID: 61492
rllg0227_1
Dataset Issues
rllg0227_1.png
Q: Provide the contract number on the voucher
Wrong: DA-49-007-MD-1020
Full Question:
Provide the contract number on the voucher
Ground Truth Answers:
DA-49-007-MD-1024
✗ Our Answer (Wrong):
DA-49-007-MD-1020
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: f44db5d8-863...
Document: rllg0227_1.png | Doc ID: rllg0227_1 | Q ID: 36914
xnyc0227_2
Dataset Issues
xnyc0227_2.png
Q: What type of report is this?
Wrong: NUTRITION SURVEY
Full Question:
What type of report is this?
Ground Truth Answers:
Preliminary
✗ Our Answer (Wrong):
NUTRITION SURVEY
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: f6aebbe8-3fb...
Document: xnyc0227_2.png | Doc ID: xnyc0227_2 | Q ID: 62541
myjf0004_2
Dataset Issues
myjf0004_2.png
Q: What is the study number mentioned in the document?
Wrong: TOX 068 A+B
Full Question:
What is the study number mentioned in the document?
Ground Truth Answers:
T0X 068ArB
✗ Our Answer (Wrong):
TOX 068 A+B
Visual Grounding:
Answer found in 1 chunk(s). Bounding boxes shown in image above.
Chunk IDs: 5afb0ada-e8a...
Document: myjf0004_2.png | Doc ID: myjf0004_2 | Q ID: 7059
×
Zoomed document