Argos: Evaluation of video surveillance tools
Thomas Fourès
Programme Techno Vision
General description
• Steering committee: CLIPS-IMAG, EURECOM, INA, IRIT, LABRI, LIP6, NOVELTIS, SFRS
General description
• Goals
  • Evaluation of analysis tools applied to heterogeneous content
• Content sets
  • Documentaries, TV news, video surveillance
• Results analysis
  • Comparing methods
  • Identifying remaining bottlenecks
Tasks and evaluation
Phase 1
• Shot segmentation
• Transition effects identification
• Camera motion identification
• Shot location
• Presence of persons
• Presence of text
Phase 2
• Phase 1 tasks, plus:
• Framing identification
• Same person identification
• Text-in-video transcription
• Story segmentation
• Behaviour identification
Evaluation
• Type of results
  • Segmentation: starting/ending frame
  • Feature detection: presence in a segment
• Comparison with the ground truth
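As a rough illustration of comparing segmentation output with the ground truth, the sketch below counts a submitted shot boundary as correct when it lies within a few frames of a reference boundary that has not been matched yet. The function name `match_boundaries`, the tolerance value and the greedy matching are assumptions made for this example; the slides do not state the matching rule Argos applies.

```python
def match_boundaries(submitted, reference, tolerance=2):
    """Count submitted boundaries that lie within `tolerance` frames of a
    reference boundary. Each reference boundary is matched at most once.

    Hypothetical helper: the tolerance and the greedy strategy are
    assumptions, not the Argos evaluation rule.
    """
    unused = sorted(reference)
    matched = 0
    for frame in sorted(submitted):
        for ref in unused:
            if abs(frame - ref) <= tolerance:
                unused.remove(ref)  # safe: we leave the inner loop right away
                matched += 1
                break
    return matched


# Starting frames of submitted segments vs. starting frames in a made-up reference.
correct = match_boundaries([256, 560, 900], [257, 558, 700])
print(correct)
```

The count of correct boundaries can then be divided by the number of submitted or reference boundaries to obtain precision or recall.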
Corpus
• 42 hours
Corpus
Content                 Duration  Codec   Resolution  Frame rate
Documentaries           10h24     MPEG-2  720x576     25 fps
TV news                 11h10     MPEG-1  352x288     25 fps
Video surveillance I    10h32     MPEG-2  352x288     25 fps
Video surveillance II   10h50     MPEG-1  352x288     24 fps
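The per-set durations can be added up to check them against the "42 hours" figure. This is only an arithmetic sketch; the dictionary and variable names are chosen for the example.

```python
# Durations as listed on the corpus slide, written as (hours, minutes).
durations = {
    "Documentaries": (10, 24),
    "TV news": (11, 10),
    "Video surveillance I": (10, 32),
    "Video surveillance II": (10, 50),
}

total_minutes = sum(h * 60 + m for h, m in durations.values())
hours, minutes = divmod(total_minutes, 60)
print(f"{hours}h{minutes:02d}")  # 42h56
```

The total is a little under 43 hours, which is consistent with the rounded figure of 42 hours.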
Implementation
• Web site: www.irit.fr/argos
• Public software resources
• Evaluation tools
• Annotation tools
• Ground truth
• Results submission
Implementation
Participants
• CLIPS-IMAG
• EURECOM
• IRISA
• IRIT
• LABRI
• LIFL
• LIP6
• LIRIS
• LISTIC
• VIPER – U. Geneva
• ENST
• Cost 292
Implementation
Time schedule
• End of January 2006: evaluations of the first phase
• March 2006: results
• September 2006: evaluations of the second phase
• November 2006: results
Results in XML schemas
• Segmentation description schema (T 1.1, 2.1, 2.10)
• Feature description schema (T 1.2, 1.3, 1.4, 1.5, 1.6, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9)
Segmentation
<?xml version="1.0" encoding="utf-16"?>
<xs:schema id="SegmentationEval" xmlns:xs="http://www.w3.org/2001/XMLSchema">
  <xs:complexType name="segBoundaryResultsType">
    <xs:sequence minOccurs="1" maxOccurs="unbounded">
      <xs:element name="segBoundaryResult" type="segBoundaryResultType" />
    </xs:sequence>
  </xs:complexType>
  <xs:complexType name="segBoundaryResultType">
    <xs:sequence minOccurs="1" maxOccurs="unbounded">
      <xs:element name="document" type="documentType" />
    </xs:sequence>
    <xs:attribute name="sysId" type="xs:string" use="optional" />
  </xs:complexType>
  <xs:complexType name="documentType">
    <xs:sequence minOccurs="1" maxOccurs="unbounded">
      <xs:element name="segment" type="segmentType" />
    </xs:sequence>
    <xs:attribute name="src" type="xs:string" use="required" />
    <xs:attribute name="sNum" type="xs:integer" use="required" />
  </xs:complexType>
  <xs:complexType name="segmentType">
    <xs:attribute name="FirstFrame" type="xs:integer" use="required" />
    <xs:attribute name="LastFrame" type="xs:integer" use="required" />
  </xs:complexType>
  <xs:element name="segBoundaryResults" type="segBoundaryResultsType" />
</xs:schema>
Segmentation
<segBoundaryResults>
  <segBoundaryResult sysId="CLIPS_run01">
    <document src="INA01" sNum="101">
      <segment FirstFrame="1" LastFrame="256" />
      <segment FirstFrame="313" LastFrame="558" />
    </document>
    <document src="INA02" sNum="101">
      <segment FirstFrame="1" LastFrame="117" />
      <segment FirstFrame="257" LastFrame="879" />
    </document>
  </segBoundaryResult>
  <segBoundaryResult sysId="CLIPS_run02">
    <document src="INA01" sNum="210">
      <segment FirstFrame="1" LastFrame="123" />
      <segment FirstFrame="263" LastFrame="558" />
    </document>
    <document src="INA02" sNum="210">
      <segment FirstFrame="1" LastFrame="256" />
      <segment FirstFrame="533" LastFrame="578" />
      <segment FirstFrame="579" LastFrame="879" />
    </document>
  </segBoundaryResult>
</segBoundaryResults>
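A submission in this format can be read with a standard XML parser. The sketch below uses Python's `xml.etree.ElementTree` on a shortened copy of the example above; the function name `read_segments` and the shape of the returned dictionary are assumptions made for illustration, not part of the Argos tools.

```python
import xml.etree.ElementTree as ET

# Shortened copy of the segmentation example from the slide.
SEG_SAMPLE = """<segBoundaryResults>
  <segBoundaryResult sysId="CLIPS_run01">
    <document src="INA01" sNum="101">
      <segment FirstFrame="1" LastFrame="256" />
      <segment FirstFrame="313" LastFrame="558" />
    </document>
  </segBoundaryResult>
</segBoundaryResults>"""


def read_segments(xml_text):
    """Return {(sysId, src): [(first_frame, last_frame), ...]}.

    Hypothetical helper, not part of the Argos evaluation tools.
    """
    root = ET.fromstring(xml_text)
    runs = {}
    for run in root.findall("segBoundaryResult"):
        sys_id = run.get("sysId")  # optional in the schema, may be None
        for doc in run.findall("document"):
            runs[(sys_id, doc.get("src"))] = [
                (int(seg.get("FirstFrame")), int(seg.get("LastFrame")))
                for seg in doc.findall("segment")
            ]
    return runs


segments = read_segments(SEG_SAMPLE)
print(segments)
```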
Detection
<?xml version="1.0" encoding="utf-16"?>
<xs:schema id="DetectionEval" xmlns:xs="http://www.w3.org/2001/XMLSchema">
  <xs:element name="VideoFeatureExtractionResults" type="VideoFeatureExtractionResultsType" />
  <xs:complexType name="VideoFeatureExtractionResultsType">
    <xs:sequence minOccurs="1" maxOccurs="unbounded">
      <xs:element name="VideoFeatureExtractionRunResult" type="VideoFeatureExtractionRunResultType" />
    </xs:sequence>
  </xs:complexType>
  <xs:complexType name="VideoFeatureExtractionRunResultType">
    <xs:sequence minOccurs="1" maxOccurs="unbounded">
      <xs:element name="VideoFeatureExtractionFeatureResult" type="VideoFeatureExtractionFeatureResultType" />
    </xs:sequence>
    <xs:attribute name="sysId" type="xs:string" use="optional" />
  </xs:complexType>
  <xs:complexType name="VideoFeatureExtractionFeatureResultType">
    <xs:sequence minOccurs="0" maxOccurs="unbounded">
      <xs:element name="item" type="itemType" />
    </xs:sequence>
    <xs:attribute name="fNum" type="xs:integer" use="required" />
  </xs:complexType>
  <xs:complexType name="itemType">
    <xs:attribute name="seqNum" type="xs:integer" use="required" />
    <xs:attribute name="segId" type="xs:string" use="required" />
    <xs:attribute name="textTranscription" type="xs:string" use="optional" />
  </xs:complexType>
</xs:schema>
Detection Result Submission
<VideoFeatureExtractionResults>
  <VideoFeatureExtractionRunResult sysId="IRIT_run01">
    <VideoFeatureExtractionFeatureResult fNum="2081">
      <item segId="INA01_00001" seqNum="2" />
      <item segId="INA01_00003" seqNum="1" />
    </VideoFeatureExtractionFeatureResult>
  </VideoFeatureExtractionRunResult>
</VideoFeatureExtractionResults>

<VideoFeatureExtractionResults>
  <VideoFeatureExtractionRunResult sysId="INA">
    <VideoFeatureExtractionFeatureResult fNum="2090">
      <item segId="INA01_00001" textTranscription="Journal" seqNum="1" />
      <item segId="INA01_00003" textTranscription="Les titres" seqNum="1" />
      <item segId="INA02_00002" textTranscription="Proche Orient" seqNum="1" />
      <item segId="INA02_00003" textTranscription="Climat" seqNum="1" />
    </VideoFeatureExtractionFeatureResult>
  </VideoFeatureExtractionRunResult>
</VideoFeatureExtractionResults>
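Detection results can be loaded in a similar way and grouped by feature number. The sketch below parses a shortened copy of the second example with `xml.etree.ElementTree`; `read_detections` and its return shape are hypothetical names chosen for this illustration.

```python
import xml.etree.ElementTree as ET

# Shortened copy of the text transcription example (fNum 2090).
DET_SAMPLE = """<VideoFeatureExtractionResults>
  <VideoFeatureExtractionRunResult sysId="INA">
    <VideoFeatureExtractionFeatureResult fNum="2090">
      <item segId="INA01_00001" textTranscription="Journal" seqNum="1" />
      <item segId="INA01_00003" textTranscription="Les titres" seqNum="1" />
    </VideoFeatureExtractionFeatureResult>
  </VideoFeatureExtractionRunResult>
</VideoFeatureExtractionResults>"""


def read_detections(xml_text):
    """Return {fNum: [(segId, seqNum, textTranscription or None), ...]}.

    Hypothetical helper, not part of the Argos evaluation tools.
    """
    root = ET.fromstring(xml_text)
    results = {}
    for feature in root.iter("VideoFeatureExtractionFeatureResult"):
        items = results.setdefault(int(feature.get("fNum")), [])
        for item in feature.findall("item"):
            items.append((
                item.get("segId"),
                int(item.get("seqNum")),
                item.get("textTranscription"),  # optional attribute
            ))
    return results


detections = read_detections(DET_SAMPLE)
print(detections[2090])
```

Each of the two examples on the slide is a separate XML document with its own root element, so each would be parsed separately.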
Evaluation: recall vs. precision
• "TRECVID-like" results
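For a feature detection task, recall and precision can be computed from the set of segments a system reports and the set marked in the ground truth. The sketch below uses the standard set-based definitions; the function name and the ground-truth segment identifiers are invented for the example, and the exact scoring procedure used by Argos is not given in the slides.

```python
def precision_recall(detected, ground_truth):
    """Set-based precision and recall over segment identifiers."""
    detected, ground_truth = set(detected), set(ground_truth)
    true_positives = len(detected & ground_truth)
    precision = true_positives / len(detected) if detected else 0.0
    recall = true_positives / len(ground_truth) if ground_truth else 0.0
    return precision, recall


# Segment identifiers below are made up for illustration.
precision, recall = precision_recall(
    detected={"INA01_00001", "INA01_00003", "INA02_00002"},
    ground_truth={"INA01_00001", "INA02_00002", "INA02_00003", "INA02_00004"},
)
print(precision, recall)
```

Here two of the three reported segments are correct (precision 2/3) and two of the four ground-truth segments are found (recall 1/2).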
Participation
• Corpus restricted to the video surveillance part
• Agreement
• Contact: joly@irit.fr
Result submission schemas: detection
fNum: PTTC
• P: phase = {1, 2}
• TT: task (two digits)
• C: feature number
Task 1.2: Transition effect identification
• 1021 Cut
• 1022 Complete transition by progressive interpolation (fades, morphing, ...)
• 1023 Complete transition by progressive inlay (wipes, iris, ...)
• 1024 Partial transition (compositing, inlays, ...)
• 1020 Other
Task 1.3: Camera motion identification
• 1031 Fixed shot
• 1032 Zoom / forward tracking
• 1033 Zoom / backward tracking
• 1034 Pan / tracking to the right
• 1035 Pan / tracking to the left
• 1036 High angle / pan / downward tracking
• 1037 Low angle / pan / upward tracking
• 1030 Other
Task 1.4: Shooting location identification
• 1041 Indoor
• 1042 Outdoor
• 1040 Other
Result submission schemas: detection (continued)
Task 1.5: Identification of the presence of persons on screen
• 1051 Presence of persons
• 1052 Absence of persons (not required)
Task 1.6: Identification of the presence of text on screen
• 1061 Presence of text
• 1062 Absence of text (not required)
Task 2.7: Framing identification
• 2071 Close-up (1 face, 1 object, ...)
• 2072 Medium shot (2 or 3 faces, 2 or 3 objects, ...)
• 2073 Long shot
• 2070 Other
Task 2.8: Identification of the different appearances of the same person
• 2081 Person 1
• 2082 Person 2
• 2083 Person 3
• …
Task 2.9: Transcription of the ASCII text present in a video
• 2090
Task 2.11: Identification of the behaviour of persons
• 2111 Running
• 2112 Stopping
• 2113 Dropping off an object
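The fNum codes listed here can be split back into their parts. The sketch below assumes, based on the listed values (for example 2111 for task 2.11), that the first digit is the phase, the next two digits the task number and the last digit the feature number; `decode_fnum` is a hypothetical helper, not part of the Argos tools.

```python
def decode_fnum(f_num):
    """Split a four-digit PTTC code into (phase, task, feature).

    Assumes P = first digit, TT = middle two digits, C = last digit.
    """
    text = str(f_num)
    if len(text) != 4 or not text.isdigit():
        raise ValueError(f"expected a four-digit code, got {f_num!r}")
    phase, task, feature = int(text[0]), int(text[1:3]), int(text[3])
    if phase not in (1, 2):
        raise ValueError(f"phase must be 1 or 2, got {phase}")
    return phase, task, feature


print(decode_fnum(1021))  # (1, 2, 1): phase 1, task 2, feature 1 (cut)
print(decode_fnum(2113))  # (2, 11, 3): phase 2, task 11, feature 3
```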