Shapes surrounding one or more data elements. Ellipses and rectangles were most common; brackets marked axis-aligned ranges. We found enclosures to be self-sufficient: a rectangle around a bar can communicate a filter or RV task without requiring a second type.
Research Overview
This study evaluates how visualization students annotate grouped bar charts when answering high-level analytical questions. Through qualitative coding, we identify a taxonomy of annotation types and characterize how different types support specific low-level analytic tasks.
-
Taxonomy. We coded annotations from 20 student submissions and identified five primary annotation types: enclosure, connector, text, mark, and color, plus a residual category for special symbols. Each type is mapped to the low-level analytic tasks it was used to support.
-
Ensemble Annotations. We found that participants frequently combined types into ensembles, including 2-, 3-, and 4-annotation combinations for a single task. We characterize one-way and two-way dependency relationships between the constituent types.
-
Design Space Observations. We report cross-cutting patterns: within-subject consistency in type selection, task-driven ensemble use (sort always required ensembles), and annotation strategies specific to non-chronological data ordering.
Methodology
We used a course assignment in a mixed undergraduate/graduate data visualization course at the University of South Florida (Spring 2022). Students annotated three grouped bar charts based on the Georgia Department of Public Health (GDPH) COVID-19 visualization to support answering 12 high-level questions.
We recruited students from an Interactive Data Visualization elective at the University of South Florida (USF), with 21 undergraduate and 18 graduate students (39 total). The assignment was given halfway through the semester, before any lecture on bar charts or annotations.
Each student received three grouped bar charts with four high-level questions each (12 total). They were asked to annotate each chart to make the questions easy to answer without writing the answer directly. Any tool was permitted; work was done individually outside class.
Students also identified which low-level tasks each question required. These tasks form the axis of the taxonomy: each annotation type is characterized by the tasks it supports.
- RV Retrieve Value: look up a specific data value.
- Filter Filter: identify a subset satisfying a condition.
- CDV Compute Derived Value: calculate a value from multiple points.
- FE Find Extremum: identify the max or min in a set.
- Sort Sort: rank data items by value.
Two coders independently labeled each annotation, starting from a prior five-type taxonomy and refining iteratively.
We used two bar ordering conditions to investigate how chart structure affects annotation strategy.
- Chrono Bars ordered chronologically by date.
- Non-C Bars ordered highest to lowest, replicating the original GDPH chart.
This produced 31 chronological and 29 non-chronological charts across the 20 analyzed submissions.
Three representative submissions showing how differently students annotated the same kind of grouped bar chart. The notes below point to the exact boxes, arrows, labels, and highlights visible in each figure.
Large blue brackets, arrows, and handwritten notes are drawn directly on the chart. The long bracket under Apr 30-May 2 and the smaller boxes around May 3 and May 8 show which dates to focus on, while the arrows and notes like #, sort, and last explain what to compare or retrieve.
This example combines circles, guide lines, and handwritten questions. The red and green circles under the x-axis pick out specific dates, the horizontal lines around 40 and 48 mark thresholds to check against, and the box around May 8 isolates the bar group the student wants to sort.
This student uses clean color-coded boxes instead of freehand marks. Matching boxes at the top and bottom highlight Apr 30-May 2, May 3, and May 5, the numbers above selected bars give exact values, and the note at May 9 states the main takeaway.
Five Annotation Categories
Five annotation types were identified. Usage counts reflect instances observed across all 20 analyzed submissions. Primary tasks are those most frequently associated with each type; secondary tasks were observed less often.
Lines and arrows. Lines marked bar heights or trends; arrows pointed from text or enclosures to specific bars. We found that connectors almost never appeared alone; without an anchoring type, a connector has no referent and cannot communicate a task.
Words, phrases, and sentences. We identified three subtypes: description (explanatory prose about a task or derivation), value (a specific numeric label), and legend (a label for annotation-based groupings, often paired with color). Text was the only type we observed across all five tasks.
Small, non-enclosing symbols placed on or near data elements to indicate them without surrounding them. We observed checkmarks, circles, underlines, T-shapes, and Roman numerals. Marks were the least-used type despite functional overlap with enclosures for RV and filter tasks.
Color modifications applied to annotations or chart elements. We found two subtypes: highlight (translucent overlay, filter tasks only) and hue variation (distinct color to differentiate groups or questions). We did not observe color used for Sort tasks.
› Faded cards contain no examples of the selected task. Primary tasks are shown in full color; secondary tasks (less frequent) are marked with ›.
Ensemble Annotations
An ensemble is two or more annotation types used together for a single task instance. Ensembles arose when individual annotations were insufficient, either because the task was complex (e.g., sort) or because visual clutter required spatial separation between an annotation and its referent.
Ensembles were divided into three categories by the number of constituent annotation types: 2-annotation, 3-annotation, and 4-annotation. Within 2-annotation ensembles, we further distinguish one-way dependency (one type can stand alone; the other cannot) from two-way dependency (neither type is meaningful without the other for that task).
Key Findings
Patterns observed across the 20 coded submissions, organized by annotation usage, ensemble behavior, and task-specific constraints.
Cite This Work
@article{rahman2025annotationtaxonomy,
title = {Exploring annotation taxonomy in grouped bar charts:
{A} qualitative classroom study},
author = {Rahman, Md Dilshadur and Quadri, Ghulam Jilani
and Szafir, Danielle Albers and Rosen, Paul},
journal = {Information Visualization},
volume = {24},
number = {1},
pages = {79--94},
year = {2025},
publisher = {SAGE},
doi = {10.1177/14738716241270247}
}