The ADVISOR behaviour module is able to recognize some predefined scenarios using the output of several modules. A frame-to-frame tracker is used to link the moving region found at time t with those found at the previous (and following) time instant. Such information is useful to build a first coarse temporal and spatial connection between moving regions.
Two long-term trackings (one for the individuals, that means for isolated persons, and the other for groups) reconstruct the trajectory (over a wider period of time) of individuals and groups, computing at the same time some useful information like speed or level of agitation. A crowd monitoring module gives information about overcrowding, direction of motion of crowd.
Another input for the behaviour recognition module is a priori knowledge of the structure of the filmed scene. This a priori knowledge (= context description) takes the form of a 3D description of the geometric structures of the scene (walls, objects, doors...) plus several information about object and zone properties. The behaviour recognition module uses all these outputs as input to analyse what is happening in a scene.
The behaviour recognition module is able to recognize a wide set of complex scenarios, describing actions and behaviours which can happen in image sequences, like "people are jumping above barriers" or "fighting in progress" or "vandalism against an equipment". The recognition is performed incrementally: each scenario can be split into some events (which have to be recognized separately) and some temporal constraints (which have to be fulfilled by the events). This is the first time that such recognition of complex spatio-temporal scenarios is performed in real time and on real scenes.
INRIA has shown the efficiency of the behaviour recognition module. The behaviour recognition module has been demonstrated and evaluated on four long videos (lasting several hours) containing interesting and normal behaviours. It has also been demonstrated on one live video at a Barcelona metro station. The behaviour recognition module has a high rate of correct recognition (89%) on the recorded sequences. INRIA managed to recognise 25 blocking behaviours out of 27, 30 fighting behaviours out of 35, 7 jumping over barrier behaviours out of 8, 4 vandalism behaviours out of 4 and 2 overcrowding behaviours out of 2.
The behaviour recognition module has low rate of false alarms (6,5%) on the recorded sequences. There were 0 false alarm for blocking, vandalism and overcrowding behaviour, 1 false alarm for jumping over barrier behaviour and 4 false alarms for fighting behaviour. There are still limitations due to the ability to model scenarios (e.g. the description of accurate gestures is not yet formalised), the position of cameras (e.g. an action partly in the field of view of the cameras may not be detected) and tracking errors (e.g tracking an individual in a crowd is still an open issue).