Community Research and Development Information Service - CORDIS

Periodic Report Summary 2 - SIIP (Speaker Identification Integrated Project)

Project Context and Objectives:
Fused Speaker Identification:

SIIP will develop a high performance innovative and sustainable Speaker Identification (SID) solution, running over a large voiceprint (voice sample) database (blacklist). This solution is based on integration and fusion of a series of state of the art Speech Analytic Algorithms which includes Voiceprint recognition, Gender Identification, Age identification, Language and Accent Identification, Keyword and Taxonomy spotting and voice cloning detection. This fusion will be the core technology of SIIP and will be named as ‘F-SID’ (Fused Speaker Identification) is the core technology of SIIP which will result in much higher true-positive identification of individuals, compared to each of these algorithms as stand-alone, reducing in a scale, the False-Positive/ False-Negative detection and increasing the reliability, confidence and judicial admissibility of the speaker identification.

SIIP solution adaptability:

SIIP solution will enable LEAs to run F-SID over all voice stream communication mediums that can be lawfully intercepted, by court warrant against a specific suspect, including: Internet VOIP apps (some of these applications are encrypted e.g. Skype, Tango, Viber and ooVoo are not encrypted) such as G-talk, Skype (although Skype is encrypted, LEA may use Black Box to decrypt it getting the encryption keys from the vendors and use the given keys to decrypt Skype inside the monitoring/decoder), Viber, Tango, ooVoo; PSTN; Cellular and SATCOM (satellite communication). Other voice communication mediums are “recorded” type and include social media such as YouTube or Facebook.

SIIP Solution scalability:

SIIP can be implemented at the headquarters level (where the monitoring center is placed) and at the tactical level (where the LEA investigation is done in geographical vicinity to the suspect location, by portable system that intercepts cellular channels, WiFi, xDSL or LAN), while exchanging data, in real time between these two levels (Tactical<>Headquarters).
Multisource of voiceprints enrolment (Voiceprints enrolment is done by voice sampling voice sampling): For better enrolment of suspect voiceprints, SIIP will make use of various enrolment methods:
(i) Enrolment of voiceprints at the suspect point of presence (POP) where high quality voice recording sensors and kits should be deployed (covertly) near the suspect physical presence. (ii) Enrolment of voiceprints from open source (public) social media (e.g. YouTube, Facebook) where individuals upload content which includes their voice. (iii) Enrolment of voiceprints from lawful intercepted calls from telephony/Internet peer to peer VOIP sources.

Rich metadata enrolment:

SIIP will enrol rich metadata that will be associated to the enrolled voiceprints. The rich metadata that includes for example, IP addresses, e-mail addresses, chat nick-names, social network identifiers, social network connections, cell phone identities and many more. This rich data will be enrolled from Internet open sources and from lawful intercepted telephony and Internet (VOIP) data.

International cooperation between LEAs:

The use of the “Sharing Center” would significantly improve effectiveness of international police cooperation and assists in the identification of suspects who can then be brought to justice.
SIIP consists of a centralizing secured database, to be located at the ‘INTERPOL’, for analysing voice samples (retrieving and analysing voice prints for identification purposes) enhanced with rich metadata of individuals subject to request for international police cooperation.

Privacy by design concept:

SIIP integrated solution will endorse the privacy by design concept and ensure that privacy is respected at the level of each module developed and at the level of the overall system integration. Safeguards will be included at different levels of the system to prevent misuse of the SIIP solution.

Project Results:
The concrete results of the SIIP project for the second year of its activities were:

• Implementation of end-user requirements
• Feed-back and interaction to other WPs
• Engagement with SIIP end-users
• Successful preparation and execution of the PoC

WP2 cooperated closely with consortium members from WP4, WP8, WP9 and WP10 and engaged in interactions with partners from other work packages and addressed all ethical, legal and societal issues that arose in the process.
• Submission of Deliverable 2.2 - Procedures for data collection, storage, access, sharing policies, protection, retention and destruction and security measures to prevent improper use.
• Submission of Deliverable 2.3 - Report and bibliography on Societal aspects of existing voice biometric sensors; Report on societal impact Speaker identification technologies: past, present and future.
• Submission of Deliverable 2.4 - Report on audio and voice recognition in the context of Information Ethics and Ethics-based review of pre-existing speaker-identification.
• Submission of Deliverable 2.6 - Special Clause 15 compliance documents.
• Organisation of panel at European Biometric Association annual conference on social and legal aspects of the SIIP project.

• Design and definition of the SIIP system architecture (located at LEA headquarters)
• Design and definition of SIIP portable system architecture (located at the tactical level)
• Design of SIIP Information Sharing Center (SISC) architecture at the INTERPOL
• Design and definition of cross component models based on non- functional requirements
• Information flow, logical connection between the different system components and User Interfaces definition

For the first time, the sizing of the system was tested under real conditions. Successful basic performance and interoperability tests have confirmed that the architectural design and sizing conform to the requirements

• SIIP Collection Database modules support the immediate consistency model. This means that even on big data, the DB queries will show the most recent data
• The Lawfully Intercepted Voice Simulator supports IPDR (Internet Protocol Detail Record) metadata
• The SIIP YouTube API is capable of fetching YouTube video comments
• Improved audio quality in social network multimedia content
• Full Integration of WP4 modules with the entire SIIP ecosystem

• H-SIIP enrollment database integrated
• Enhanced H-SIIP enrollment programming interface (encryption and sharing category of metadata)
• Enhanced T-SIIP technologies
• Model adaptation techniques has proven an accuracy improvement up to 58% on EER
• Integration of Low Signal Level Enhancement algorithm
• First version of the SISC module (initial developments)

• Enhanced SIIP voice processing modules
• First experimental results on information fusion
• Evaluation sets and metrics
• Defined and transcribed corpus for use with all modalities (NIST)
• Acoustic simulator development

• Development and release of the SISC service prototype
• Development and release of the SIIP Portal prototype
• Integration and testing of the core SIIP ecosystem services including the Secured Identified Call Records database and SIIP Mediation and Alerting Module
• Definition of the SIIP Portal and SISC service integration plan
• Release of D7.3 and D7.4 project deliverables

• Integration of 15 different components provided by 7 partners
• Implementation of the workflows allowing the orchestration and the execution of users requests via the portal
• Implementation of the PoC use case and scenario
• Installation of the PoC in end-users premises (PJ and Carabinieri)
• Submission of three deliverables

• Publications of the SIIP consortium
• Ongoing maintenance and update of the SIIP public website
• Presentation of the SIIP project in many conferences and workshops

Potential Impact:
Economic - SIIP breakthrough technologies for voice biometrics and speaker identification will enable LEA's a most accurate and fast detection of the suspect identity during lawful investigations and could highly contribute to identify terrorists or criminals threats in time, solve hostage takings and demand of ransom cases. This will promote to ameliorate the high costs of terror and crime, providing an effective suspect identification that can be also translated to substantial monetary savings. It will significantly shorten the time invested by LEAs in chasing the wrong leads.

The work achieved so far will contribute impacting significantly speech recognition technologies. The consortium partners expect SIIP to have the following impacts on society and economy:

Enhanced tools for fighting organized crime and terrorism - Speech recognition technology is an important tool in successfully prosecuting kinds of illegal activity involve actors who are forensically aware, sophisticated and able to pay large sums for technological help and inside information

Ensuring security research benefits society

Ensuring security research does not have negative impacts on society

Ensuring security research meets the needs of society

List of Websites:

Reported by



Follow us on: RSS Facebook Twitter YouTube Managed by the EU Publications Office Top