260 likes | 376 Vues
The ETISEO Project features a comprehensive video corpus designed for the evaluation of visual tracking and object recognition systems in challenging conditions. It includes realistic video sequences with varying lighting, occlusions, and diverse environments such as corridors, subway stations, and car parks. The dataset comprises multiple sequences showcasing different scenarios like vehicle parking, people entering/exiting vehicles, and pedestrian movements. With graduate difficulty levels and distinct contexts, this corpus aims to facilitate performance evaluation and comparison across algorithms.
E N D
ETISEO Project • Video corpus • dedicated to • the first evaluation
ETISEO Project To capturerealisticvideo sequences with graduate difficulties: • lighting variations, • occlusions … • covering predefined scenarios.
ETISEO Project • Video Providers • Corpus Data description • Video contents demonstration
Video providers Several data providers offeringlarge diversity. • Differentsites : • corridor, • sub-way, • - building entry, • - apron, • - car park. • Differentcontexts : • indoor / outdoor • single / multi-camera • big / small overlap area • visible / IR • actors / real scenes
Video Sensors Video providers : Silogic Toulouse-Blagnac Airport France Apron
Video providers : Silogic Apron Multi-view
Video providers : INRETS Building Entry Multi-view Car Park
Video providers : CEA Corridor Car street
Video providers : CEA Geometry : IR/Colour calibration Estimation of an homographic matrix between the 2 images (sensors close and aligned)
Video providers : RATP Subway Real scene
Corpus data • Video Providers • Corpus Data description • Video contents demonstration
Priority Sequences • Evaluation risk: • That participants process only part of the dataset. • Resulting in dispersed metrics. • No real evaluation or comparison being possible
Priority Sequences • Solution: • Create some mandatory sequences. • Participants should process at least these sequences. • Need some feedback to select these priority sequences. See the questionnaire.
Corpus data For each topic, the set contains sequences with graduate difficulty.
Corpus data • Covered scenarios : • vehicle parking • people getting in & out from a vehicle • people walking • people crossing • person entering a building • person entering a room • person meeting • abandoned baggage
Corpus data • Summary : • indoor / outdoor sequences, • 8 scenarios, • graduate difficulty, • 20 sequences, • 37 video clips, • 5.2 Go of data.
Corpus diffusion • Video format : • sequence of jpeg/bmp images for colour images • sequence of TIFF and jpeg images for IR. • Timestamp : • Image name contains the timestamp relative to the start of the sequence. • Calibration : • Provide as • - a calibration matrix used to generate 3D • - couples of 2D / 3D points.
Corpus diffusion • Structure: Contains the sequence ETI-VS2-RD-32 ETI-VS2-RD-32-C1 Camera folders C1-Format.xml ETI-VS2-RD-32-C1-00h00m36s000-0000.jpeg ETI-VS2-RD-32-C1-….jpeg ETI-VS2-RD-32-C1-00h02m40s719-1559.jpeg ETI-VS2-RD-32-C2 C2-Format.xml ETI-VS2-RD-32-C2-00h00m36s000-0000.jpeg ETI-VS2-RD-32-C2-….jpeg ETI-VS2-RD-32-C2-00h02m40s723-1559.jpeg
Corpus diffusion • Structure: • Each video resource will be store in a unique folder using the following convention: • ETI-VSindexdataset-Type-sequenceindex • Example: • ETI-VS2-RD-32
Corpus diffusion • C1-format.xml : • <?xml version="1.0" encoding="UTF-8"?> • <sequence name=”ETI-VS1-BC-32”> • <properties> • <format>JPEG</format> could be JPEG,TIFF… • <width>720</width> • <height>576</height> • <depth>24</depth> • <frames>1560</frames> # of frames (start=0) • <inter-frame-ms>80000</inter-frame-ms> average time in ms between frames • <maxsize>65232</maxsize> maximum image size • <camera>C1</camera> camera id • <sensor>color</sensor> color, IR, greyscale • </properties> • </sequence>
ETISEO Project • Video Providers • Corpus Data description • Video content demonstration
Video demonstration • Silogic / Apron: • Type : multi-view, played and real scenes. • Precision evaluation : scenes with real 3D ground truth. • person walking on the zone. C1, C2, C3, C4 • vehicle moving on the zone. C1, C2, C3, C4 • Real scenes : vehicles moving on the apron with large variation of conditions (outside environment). • GPU arrival « normal conditions ». C1 , C2 • GPU arrival « reflections ». C1 , C2 • GPU arrival « shadows » C1, C2
Video demonstration • INRETS / parking & building entry. • Type : multi-view, played scenes. • Scenes : vehicles parking, people entering/leaving cars, entering/leaving the building. Large variation of conditions (outside environment). • ETI-VS1-BE-18: C1,C2,C3,C4 • ETI-VS1-BE-19: C4 • ETI-VS1-BE-20: C1,C2,C3,C4
Video demonstration • INRETS / parking & building entry. • Type : multi-view, played scenes. • Scenes : Sequence ETI-VS1-BE-20 will be provided with different compression levels: • BMP, JPEG and MPEG.
Video demonstration • CEA / parking & building entry. • Type : single-view but visible/IR couple. Played scenes. • Road : vehicles and people. • ETI-VS1-RD-14: C1, C2 • ETI-VS1-RD-15: C1 , C2 • ETI-VS1-RD-16: C1 , C2 • ETI-VS1-RD-17: C1 , C2 • Corridor : People crossing, abandoned luggage. • ETI-VS1-BC-11 C1 , C2 • ETI-VS1-BC-12 C1 , C2 • ETI-VS1-BC-13 C1, C2
Video demonstration • RATP / subway. • Type : single-view. Played scenes mixed with real environment. • Corridor : People crossing, abandoned luggage. • ETI-VS1-MO-07 • ETI-VS1-MO-08 • ETI-VS1-MO-09 • Railway platform : People crossing, abandoned luggage. • ETI-VS1-MO-10