Understanding Europe’s Fashion Data Universe

Project information

FashionBrain

Grant agreement ID: 732328

DOI

10.3030/732328

Project closed

EC signature date 28 October 2016

Start date 1 January 2017

End date 31 December 2019

Funded under

INDUSTRIAL LEADERSHIP - Leadership in enabling and industrial technologies - Information and Communication Technologies (ICT)

Total cost

€ 2 794 448.75

EU contribution

€ 1 699 323.75

Coordinated by

THE UNIVERSITY OF SHEFFIELD
United Kingdom

Deliverables

The classification algorithm and its evaluation on fashion time series

As a result of task 5.3, this deliverable will consist of implemented algorithms that will be integrated within the data integration infrastructure developed within WP2 (T 2.3).

Time Series Operators for MonetDB

This deliverable will report the extended support for time series data processing in MonetDB, including integration with the software provided by D4.1. Corresponding software will be made available through the MonetDB open-source repository.

A set of aggregation algorithms and their experimental evaluation

As a result of task 3.2, this deliverable will consist of implemented algorithms that will be integrated within the data integration infrastructure developed within WP2 and will feed into WP5, WP6, and WP7.

Data integration solution

A MonetDB data integration solution for modeling and storing i) the different available datasets from all partners, ii) the taxonomy, iii) the extracted named entities and links. The deliverable will also include the extension of MonetDB with JSON support to include the management of semi-structured data. The proposed solution will be used in WP4, WP5, and WP6.

A set of crowdsourcing interfaces

This deliverable will consist of a set of Human Intelligence Task (HIT) designs, experimentally validated for object recognition in images, validation of named entity extraction, and image labeling. These designs will be available for tasks in WP5.

Named Entity Recognition and Linking methods

As a result of task 1.1, this deliverable will consist of implemented algorithms for entity extraction from textual documents and linking to the ontology defined in WP1. The result will feed into WP4, WP5, and WP6.

Report on text joins

This report will describe our methodology for learning text joins and a robust entity recognizer for the fashion domain.

Surveys design and crowdsourcing tasks

The tangible result of task 3.3 will be models generated by means of crowdsourcing which will be used to address our use-cases in WP5 and WP6.

Communication plan

This deliverable will contain a plan of communications relevant for the project dissemination and community building activities including planned activities to support standardisation and interoperability. Outcomes of these efforts will be reported in Periodic Activity Reports produced in WP7.

Project factsheet

A brief project Fact Sheet suitable for Web publishing will be published within one month from the start of the project. The Fact Sheet will outline the project's rationale and objectives, specify its technical baseline and intended target groups and application domains, and detail intermediate and final outputs. The Fact Sheet can be used by the Commission for its own dissemination and awareness activities throughout the project lifecycle, and may be published on EC and EC sponsored Web sites. The factsheet has to be maintained and updated until the end of the project; this will be documented in the regular reporting.

Relation Extraction with Stacked Deep Learning

This report integrates relation extraction and stacked deep learning for selected relations of the Zalando FDWH. We will investigate how much of the training should be executed in the database and how much may be shipped to a less expensive GPU-based architecture.

Software Requirements: SSM library for time series modelling and trend prediction

The most modern State Space Model (SSM) algorithms for time series analysis and probabilistic inference will be summarised in this deliverable and will be used as a basis for future software developments in the project. The output will be available for project internal and public use.

Showcase specification and dissemination summary

This deliverable will present the promotion and dissemination material produced, the demonstration workflows, and the fully functional data integration infrastructure, ready to be demonstrated to the public, including screencasts (as indicated in T7.2). We will grant the Commission the right to use the Showcase for its own dissemination and awareness activities (including Web based and electronic publications) after the completion of the project. The Showcase will feature a meaningful subset (software, data, etc.) of the functionality characterizing the project demonstrator(s) arrived at, along with relevant copyright notices and contact information, and suitable installation aids and run-time interfaces. We will also report on project activities undertaken to support standardisation of project results and collaboration with other projects and relevant initiatives, as well as on the results of outreach through the press, social media, and open-source communities using demos, use cases, and benchmark results realized during the project. As planned in T7.1, we will report on our contribution to the Big Data Value PPP activities.

Showcase specification

This deliverable will contain a specification of the FashionBrain data integration infrastructure, including the design of promotion material and the requirements for the software needed to run it.

Survey document of existing datasets and data integration solutions (M6)

This deliverable will consist of an overview of existing state-of-the-art solutions for data integration including infrastructures, algorithms, and datasets covering both academic research as well as industry solutions. This will be the result of Task 1.1.

Demo on text joins

This demo presents fully functional and documented text joins for the example of the Zalando FDWH. Given a fashion data warehouse, we will demo executing text joins for common (and often idiosyncratic) fashion entities, such as brands or products.

Early Demo on textual image search

This deliverable consists of a preliminary image search prototype based on textual entities. This is the basis for D6.5, which will extend the textual component with NLP and multilinguality.

Early Demo on Fashion Trend Prediction

This early demo will show how fashion trends (styles) can be detected in fashion time series over time.

Demo on Relation Extraction with Stacked Deep Learning

This demo integrates methods for stacked deep learning on typical crowd-based workflows for trend detection and brand monitoring.

Demo on Fashion Trend Prediction

This demo will show how a particular fashion trend (style) is detected in fashion time series over time. The prediction will be implemented as an operator in MonetDB.

Scalable Crowdsourced Social Media Annotation

This deliverable consists of a publicly available website with data visualization functionalities. We demonstrate that we analysed hundreds of fashion blogs and Instagram profiles and that we are able to continuously update the profiles with recently published images.

Demo on textual image search

This deliverable consists of an image search prototype system which uses all of the data collected, allows users to search by images, collects user feedback, and is able to periodically improve its results based on this interaction data. It extends the textual component of D6.3 with NLP and multilinguality, primarily targeting German, English, French, and Italian.

Product Taxonomy Linking

This deliverable extends D5.1 with a demo that integrates the linking of products to social media posts; that is, we recognise products across different social media channels.

Project Web site

Setting up the public project Web site targeted at a general audience. The site will provide project overviews and highlights; up-to-date information on intermediate and final project results, including public reports and publications as well as synthesis reports drawn from selected confidential material in non-proprietary formats (e.g. PDF); project events, including e.g. user group meetings, conferences and workshops; contact details, etc. The first point of access of the project's Web site will describe the goals of the project in simple, jargon-free language. The Web site will be maintained and updated until the end of the project. All open source components published will be extensively documented by means of textual documents and screencasts of professional quality illustrating how to download, install and operate the components in question. Documentation manuals and screencasts will be specifically identified as project deliverables and prominently published on the project's Web site.

Publications

Analysing Errors of Open Information Extraction Systems

Authors: Schneider, Rudolf; Oberhauser, Tom; Klatt, Tobias; Gers, Felix A.; Löser, Alexander
Published in: Conference on Empirical Methods on Natural Language Processing Workshop Proceedings, Issue 3, 2017, Page(s) 8, ISSN 0074-1963-7
Publisher: CEUR-WS.org

FashionBrain Project: A Vision for Understanding Europe's Fashion Data Universe

Authors: Checco, Alessandro; Demartini, Gianluca; Loeser, Alexander; Arous, Ines; Khayati, Mourad; Dantone, Matthias; Koopmanschap, Richard; Stalinov, Svetlin; Kersten, Martin; Zhang, Ying
Published in: 'Machine learning meets fashion' workshop at KDD 2017, Issue 2, 2017
Publisher: arXiv

IDEL: In-Database Entity Linking with Neural Embeddings

Authors: Kilias, Torsten; Löser, Alexander; Gers, Felix A.; Koopmanschap, Richard; Zhang, Ying; Kersten, Martin
Published in: IEEE BigComp2019, Issue 1, 2018, Page(s) 12
Publisher: arXiv

Let's Agree to Disagree: Fixing Agreement Measures for Crowdsourcing

Authors: Alessandro Checco, Kevin Roitero, Eddy Maddalena, Stefano Mizzaro and Gianluca Demartini
Published in: 2017
Publisher: AAAI

Contextual String Embeddings for Sequence Labeling

Authors: Alan Akbik, Duncan Blythe and Roland Vollgraf
Published in: 27th International Conference on Computational Linguistics, COLING 2018, 2018
Publisher: ICCL

Smart-MD - Neural Paragraph Retrieval of Medical Topics

Authors: Rudolf Schneider, Sebastian Arnold, Tom Oberhauser, Tobias Klatt, Thomas Steffek, Alexander Löser
Published in: Companion of the The Web Conference 2018 on The Web Conference 2018 - WWW '18, 2018, Page(s) 203-206, ISBN 9781450356404
Publisher: ACM Press
DOI: 10.1145/3184558.3186979

RelVis: Benchmarking OpenIE Systems

Authors: Rudolf Schneider, Tom Oberhauser, Tobias Klatt, Felix A. Gers, and Alexander Löser
Published in: International Semantic Web Conference (Posters, Demos & Industry Tracks) 2017, 2017
Publisher: Stanford University

ZAP: An Open-Source Multilingual Annotation Projection Framework

Authors: Alan Akbik and Roland Vollgraf
Published in: 11th Language Resources and Evaluation Conference, LREC 2018, 2018
Publisher: European Language Resources Association

Love at First Sight: MonetDB/TensorFlow

Authors: Ying Zhang, Richard Koopmanschap, Martin Kersten
Published in: 2018 IEEE 34th International Conference on Data Engineering (ICDE), 2018, Page(s) 1672-1672, ISBN 978-1-5386-5520-7
Publisher: IEEE
DOI: 10.1109/icde.2018.00208

FEIDEGGER: A Multi-modal Corpus of Fashion Images and Descriptions in German

Authors: Leonidas Lefakis, Alan Akbik, Roland Vollgraf
Published in: 2018
Publisher: European Language Resources Association

All That Glitters is Gold - An Attack Scheme on Gold Questions in Crowdsourcing

Authors: Alessandro Checco, Jo Bates and Gianluca Demartini
Published in: The sixth AAAI Conference on Human Computation and Crowdsourcing, 2018
Publisher: AAAI

The Projector: An Interactive Annotation Projection Visualization Tool

Authors: Alan Akbik, Roland Vollgraf
Published in: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2017, Page(s) 43-48
Publisher: Association for Computational Linguistics
DOI: 10.18653/v1/D17-2008

In-Database Machine Learning with MonetDB/TensorFlow

Authors: Torsten Kilias, Alexander Löser, Felix A. Gers, Richard Koopmanschap, Ying Zhang, Martin Kersten, Mark Raasveldt, Pedro Holanda, Hannes Mühleisen and Stefan Manegold
Published in: 11th Extremely Large Databases Conference, 2018
Publisher: XLDB2018

Investigating User Perception of Gender Bias in Image Search - The Role of Sexism

Authors: Jahna Otterbacher, Alessandro Checco, Gianluca Demartini, Paul Clough
Published in: The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval - SIGIR '18, 2018, Page(s) 933-936, ISBN 9781-450356572
Publisher: ACM Press
DOI: 10.1145/3209978.3210094

Investigating Stability and Reliability of Crowdsourcing Output

Authors: Rehab K. Qarout, Alessandro Checco, Kalina Bontcheva
Published in: CrowdBias 2018, 2018
Publisher: CEUR

All Those Wasted Hours - On Task Abandonment in Crowdsourcing

Authors: Lei Han, Kevin Roitero, Ujwal Gadiraju, Cristina Sarasua, Alessandro Checco, Eddy Maddalena, Gianluca Demartini
Published in: Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining - WSDM '19, 2019, Page(s) 321-329, ISBN 9781-450359405
Publisher: ACM Press
DOI: 10.1145/3289600.3291035

How Does BERT Answer Questions? - A Layer-Wise Analysis of Transformer Representations

Authors: Betty van Aken, Benjamin Winter, Alexander Löser, Felix A. Gers
Published in: Proceedings of the 28th ACM International Conference on Information and Knowledge Management - CIKM '19, 2019, Page(s) 1823-1832, ISBN 9781450369763
Publisher: ACM Press
DOI: 10.1145/3357384.3358028

RecovDB: Accurate and Efficient Missing Blocks Recovery for Large Time Series

Authors: Ines Arous, Mourad Khayati, Philippe Cudré-Mauroux, Ying Zhang, Martin Kersten, Svetlin Stalinov
Published in: 2019 IEEE 35th International Conference on Data Engineering (ICDE), 2019, Page(s) 1976-1979, ISBN 978-1-5386-7474-1
Publisher: IEEE
DOI: 10.1109/icde.2019.00218

Multilingual Sequence Labeling With One Model

Authors: Alan Akbik, Tanja Bergmann and Roland Vollgraf
Published in: NLDL 2019, 2019
Publisher: NLDL

FLAIR: An Easy-to-Use Framework for State-of-the-Art NLP

Authors: Alan Akbik, Tanja Bergmann, Duncan Blythe, Kashif Rasul, Stefan Schweter and Roland Vollgraf
Published in: NAACL-HLT 2019, 2019
Publisher: NAACL-HLT 2019

Platform-related Factors in Repeatability and Reproducibility of Crowdsourcing Tasks

Authors: Rehab Qarout, Alessandro Checco, Gianluca Demartini and Kalina Bontcheva
Published in: 2019
Publisher: AAAI

OpenCrowd: A Human-AI Collaborative Approach for Finding Social Influencers via Open-Ended Answers Aggregation

Authors: Ines Arous, Jie Yang, Mourad Khayati, Philippe Cudré-Mauroux
Published in: Proceedings of The Web Conference 2020, 2020, Page(s) 1851-1862, ISBN 9781-450370233
Publisher: ACM
DOI: 10.1145/3366423.3380254

Pooled Contextualized Embeddings for Named Entity Recognition

Authors: Alan Akbik, Tanja Bergmann and Roland Vollgraf
Published in: NAACL-HLT 2019, 2019
Publisher: NAACL-HLT

Implicit Bias in Crowdsourced Knowledge Graphs

Authors: Gianluca Demartini
Published in: Companion Proceedings of The 2019 World Wide Web Conference on - WWW '19, 2019, Page(s) 624-630, ISBN 9781-450366755
Publisher: ACM Press
DOI: 10.1145/3308560.3317307

Challenges for Toxic Comment Classification: An In-Depth Error Analysis

Authors: Betty van Aken, Julian Risch, Ralf Krestel, Alexander Löser
Published in: Proceedings of the 2nd Workshop on Abusive Language Online (ALW2), 2018, Page(s) 33-42
Publisher: Association for Computational Linguistics
DOI: 10.18653/v1/w18-5105

The Evolution of Power and Standard Wikidata Editors: Comparing Editing Behavior over Time to Predict Lifespan and Volume of Edits

Authors: Cristina Sarasua, Alessandro Checco, Gianluca Demartini, Djellel Difallah, Michael Feldman, Lydia Pintscher
Published in: Computer Supported Cooperative Work (CSCW), Issue 28/5, 2019, Page(s) 843-882, ISSN 0925-9724
Publisher: Kluwer Academic Publishers
DOI: 10.1007/s10606-018-9344-y

SECTOR: A Neural Model for Coherent Topic Segmentation and Classification

Authors: Arnold, Sebastian; Schneider, Rudolf; Cudré-Mauroux, Philippe; Gers, Felix A.; Löser, Alexander
Published in: Transactions of the Association for Computational Linguistics, Issue 2, 2019, ISSN 2307-387X
Publisher: MIT Press

Mind the gap

Authors: Mourad Khayati, Alberto Lerner, Zakhar Tymchenko, Philippe Cudré-Mauroux
Published in: Proceedings of the VLDB Endowment, Issue 13/5, 2020, Page(s) 768-782, ISSN 2150-8097
Publisher: VLDB Endowment
DOI: 10.14778/3377369.3377383

The Impact of Task Abandonment in Crowdsourcing

Authors: Lei Han, Kevin Roitero, Ujwal Gadiraju, Cristina Sarasua, Alessandro Checco, Eddy Maddalena, Gianluca Demartini
Published in: IEEE Transactions on Knowledge and Data Engineering, 2019, Page(s) 1-1, ISSN 1041-4347
Publisher: Institute of Electrical and Electronics Engineers
DOI: 10.1109/tkde.2019.2948168

Adversarial Attacks on Crowdsourcing Quality Control

Authors: Alessandro Checco, Jo Bates, Gianluca Demartini
Published in: Journal of Artificial Intelligence Research, Issue 67, 2020, Page(s) 375-408, ISSN 1076-9757
Publisher: Morgan Kaufmann Publishers, Inc.
DOI: 10.1613/jair.1.11332

Deadline-Aware Fair Scheduling for Multi-Tenant Crowd-Powered Systems

Authors: Djellel Difallah, Alessandro Checco, Gianluca Demartini, Philippe Cudré-Mauroux
Published in: ACM Transactions on Social Computing, Issue 2/1, 2019, Page(s) 1-29, ISSN 2469-7818
Publisher: ACM
DOI: 10.1145/3301003

Scalable recovery of missing blocks in time series with high and low cross-correlations

Authors: Mourad Khayati, Philippe Cudré-Mauroux, Michael H. Böhlen
Published in: Knowledge and Information Systems, 2019, ISSN 0219-1377
Publisher: Springer Verlag
DOI: 10.1007/s10115-019-01421-7

Crowd-Labeling Fashion Reviews with Quality Control

Authors: Chernushenko, I.; Gers, F.A.; Löser, A.; Checco, A.
Published in: arXiv, Issue 2, 2018
Publisher: arXiv
