Conference Paper
HEMVIP: Human Evaluation of Multiple Videos in Parallel
Patrik Jonell, Yoon Youngwoo, Pieter Wolfert, Taras Kucherenko, Gustav Henter
Issue Date
International Conference on Multimodal Interaction (ICMI) 2021, pp.707-711
Project Code
21HS1500, Development of Human-care Robot Technology for Aging Society, Lee Jae Yeon
In many research areas, for example motion and gesture generation, objective measures alone do not provide an accurate impression of key stimulus traits such as perceived quality or appropriateness. The gold standard is instead to evaluate these aspects through user studies, especially subjective evaluations of video stimuli. Common evaluation paradigms either present individual stimuli to be scored on Likert-type scales, or ask users to compare and rate videos in a pairwise fashion. However, the time and resources required for such evaluations scale poorly as the number of conditions to be compared increases. Building on standards used for evaluating the quality of multimedia codecs, this paper instead introduces a framework for granular rating of multiple comparable videos in parallel. This methodology essentially analyses all condition pairs at once. Our contributions are 1) a proposed framework, called HEMVIP, for parallel and granular evaluation of multiple video stimuli and 2) a validation study confirming that results obtained using the tool are in close agreement with results of prior studies using conventional multiple pairwise comparisons.
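As a minimal illustration of why rating several videos side by side covers all condition pairs at once, the sketch below derives every implied pairwise preference from a single rater's scores on one screen. The function name, the condition labels, and the 0-100 score range are assumptions made for this example; they are not taken from the HEMVIP implementation.

```python
from itertools import combinations


def pairwise_preferences(ratings):
    """Derive the implied preference for every pair of conditions.

    `ratings` maps a condition label to the score one rater gave it on a
    single screen (hypothetical 0-100 scale). Returns a dict mapping each
    unordered pair (a, b), with a < b, to the preferred label, or None
    when the two scores are tied.
    """
    prefs = {}
    for a, b in combinations(sorted(ratings), 2):
        if ratings[a] > ratings[b]:
            prefs[(a, b)] = a
        elif ratings[a] < ratings[b]:
            prefs[(a, b)] = b
        else:
            prefs[(a, b)] = None  # tie: no preference implied
    return prefs


# One screen with four hypothetical conditions rated in parallel.
screen = {"A": 72, "B": 40, "C": 72, "D": 15}
prefs = pairwise_preferences(screen)
for pair, winner in prefs.items():
    print(pair, "->", winner if winner is not None else "tie")
```

With four conditions, one screen yields all six pairwise outcomes, whereas a conventional pairwise study would need six separate comparisons to cover the same pairs.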
KSP Keywords
Gesture generation, Gold standard, Human evaluation, Perceived quality, User study, Validation study, Video stimuli, Objective measure, Pairwise comparisons, Subjective evaluation
