OSCAR: Opinion Stream Classification with Ensembles and Active leaRners

1. Project details

  • Project full name: Opinion Stream Classification with Ensembles and Active leaRners
  • Project acronym: OSCAR
  • Funding period: 2017 - 2019
  • Funding body: German Research Foundation (Deutsche Forschungsgemeinschaft - DFG)

2. Team

Leibniz University Hannover & L3S Research Center

  • Prof. Dr. Eirini Ntoutsi
  • M.Sc. Damianos Melidis
  • M.Sc. Amir Abolfazli
  • M.Sc. Vasileios Iosifidis
  • M.Sc. Philip Naumann

Otto von Guericke University Magdeburg

  • Prof. Dr. Myra Spiliopoulou
  • M.Sc. Vishnu Unnikrishnan

3. Project overview

Motivation

“What other people think” has always been an important piece of information for our decision-making process. But the Internet and the Web allow us now to find answers to this question beyond the circle of our personal acquaintances. Traditional sentiment mining techniques focus on static data. However, as opinions accumulate from the social streams, changes might occur like changes in the general sentiment towards a subject or towards specific facets of this subject, as well as changes in the words used to express sentiment. Subjects also change over time. In OSCAR, we develop opinion stream mining methods that deal with change and adapt the learned models continuously.

Challenges & Highlights

The first part of OSCAR is on leveraging stream mining methods to deal with vocabulary/ feature changes. A change in the feature space means that the model built upon the old words must be updated. We will accumulate information on the usage and sentiment of each word to highlight the long-term interplay between word polarity and document polarity. Second, we will work on reducing the need for labeled documents. To this end we will develop active learning methods that learn and adapt polarity models on an evolving feature space. Third, we will work on dealing with different types of change simultaneously. To this purpose, we will use ensembles. We will dedicate some ensemble members to the identification of topic trends, others to changes in the vocabulary and others to temporal changes, including periodical ones.

Potential applications & future issues

The output of OSCAR will be a complete framework, encompassing active ensemble learning methods that deal with different forms of change and learn with limited expert involvement. Such a framework can be used in other stream classification tasks, beyond sentiment analysis.

4. Publications