WG2 Task Force “Crowdsourcing”

WG2 Task Force“Crowdsourcing” Tobias Hossfeld, Matthias Hirth, Bruno Gardlo, Sebastian Egger, KatrienDe Moor, Patrick Le Callet,Christian Keimel, Klaus Diepold,Valentin Burger WG2 Mechanisms and Models

Agenda • Goal of this task force • Problem statement and key applications • Required methodology • Discussion: What are your interests in crowdsourcing? How can we foster collaboration, joint activities? WG2 Task Force „Crowdsourcing“

Goals of this Task Force • to identify the scientific challenges and problems for QoE assessment via crowdsourcing but also the strengths and benefits, • to derive a methodology and setup for crowdsourcing in QoE assessment, • to challenge crowdsourcing QoE assessment approach with usual “lab” methodologies, comparison of QoE tests • to develop mechanisms and statistical approaches for identifying reliable ratings from remote crowdsourcing users, and • to define requirements onto crowdsourcing platforms for improved QoE assessment. • Joint activities, collaboration within Qualinet WG2 Task Force „Crowdsourcing“

Problem Statement and Key Applications • (Un-)Reliability of users remotely conducting the user tests • Application-layer monitoring, e.g. if browser window has focus • Asking ‘consistency’ questions, questions about content of test • Monitoring of test environment • Technically, e.g. used hardware • Non-technically, e.g. if user gets tired (currently done by analyzing the user results, integrating appropriate questions) • Key applications • Web-based applications, like web browsing, online video streaming, etc. • If specific hardware or software is required, tests may not be possible with crowdsourcing WG2 Task Force „Crowdsourcing“

Required Methodology • Test design methodology • Consistency tests, content questions, application-based user monitoring • But not too much to avoid boring the users • Impact of Internet connection must not influence the tests (otherwise we need network monitoring at the end user site) • E.g. downloading the entire test (including videos), • E.g. adaptive streaming, if download is simply not possible (due to the amount of data, live streaming, etc.) • Statistical measures to identify unreliable users • Other tools to check users, e.g. panelcheck http://www.panelcheck.com • Comparison of different QoE testing methods • Lab studies • Crowdsourcing studies / social networking studies (different incentives to participate) • Field trials WG2 Task Force „Crowdsourcing“

UniWuE and Crowdsourcing Platforms • Collaboration between Uwue and Microworkers.com provider • We may specify new features  ask for integrating new features into platform • For example, to specify that 50% of the test subjects are younger than 30 years and 50% are older than 30 years • Support / Collaboration with Qualinet partners • introductions for the usage of the Microworkers.com platform, like the account and task creation • help during the task design, which highly affects the result quality. • initial pool of trustworthy workers for QoE tests  pool can be extended and adapted depending on the results of other users QoE tests. • we have ready-to-use hardware to run web based crowdsourcing task and validated mechanisms to integrate these tasks into the microworkers.com platform, like payment-key generation strategies. WG2 Task Force „Crowdsourcing“

Advantages of Crowdsourcing / Particular Interests • Building an open QoE panel for Qualinet • More information of users desirable, e.g. profiles from social networks and/or crowdsourcing platforms • Reliable users in panel are preferred for QoE tests, but open to everyone • How to build the panel, how to let it grow? • Allows investigating QoE over time or impact of context on QoE, e.g. same users conduct same QoE tests at several instants in time / different context; see next slide • Combining crowdsourcing and social networks (non-paid CS) • To get users and information about users • To conduct tests with special user groups, “demographic” features • Reliable QoE tests: Test design, Statistical measures, Monitoring • Comparison of different test methods WG2 Task Force „Crowdsourcing“

QoE over Time • “QoE over Time” means temporal aspects of QoE / time-dynamic modeling of QoE • Different aspects/viewpoints have to be taken into account • Waiting times: in the beginning (initial delay), during service consumption (stalling) • Single session: short-term memory effects (shown for web-browsing) • Per user: long-term memory effects, expectations are changing over time (2 years ago: 1Mbps; nowadays 16Mbps for DSL users) • Content: long duration videos and QoE testing of this (see NTU) • Beside explicit QoE measures, implicit measures are of interests • E.g. mouse movements (to check reliability, tiredness, etc.) • Cross-activity, e.g. with WG 1 subgroup “Web and cloud apps”

Collaboration and Joint Activities • Support by UniWue to setup tests in MicroWorkers.com platform • Update/comment working document sent to WG2 reflector • E.g. features desired for integrating in platforms • STSMs • E.g. joint tests for comparison crowdsourcing and lab • E.g. task design • Joint Activities • Standardization, e.g. updated subjective test methodologies and evaluation wrt. reliability What are your interests?

WG2 Task Force “Crowdsourcing”

WG2 Task Force “Crowdsourcing”

Presentation Transcript

Accredited Gemologists Association

Ian Graham Chairman JTF4, European Prevention Implementation Committee and IHF Council on CVD Prevention

CHILD PROTECTION TASK FORCE MEETING

White House Task Force on Recycling

Presentation to the Defense Task Force on Sexual Assault in the Military Services

Patenting in the Age of Crowdsourcing : An Expanded Opportunity for Third Party Participation

Responsible Care® Environment CODE OF MANAGEMENT PRACTICEs Task Force – 07 ECMP

School Security Special Task Force Recommendations

Information Technology Task Force Meeting

Assessing the Effect of Visualizations on Bayesian Reasoning through Crowdsourcing

Crowdsourcing using Mechanical Turk: Quality Management and Scalability

Regional Financing for Malaria Task Force (RFMTF) Update

Progress Report for the Task Force on IT Governance

Value at Risk

OECD Task Force on Financial Services Recommendations for SNA update issue 6A

Joint JAA/EUROCONTROL Task-Force on UAVs

Status of the JET device and planning of Task Force H in upcoming JET campaigns

SRC/ISMT FORCe: Factory Operations Research Center Task NJ-877

Task Force on Student Educational Capacity Meeting 1 - Kick-Off Nov. 16, 2009

Member Benefits Task Force