toolkit.xml

Roman Klinger, 2015-08-14 14:08

<?xml version="1.0" encoding="UTF-8" ?>
<opendatametainfo>
  <title>The USAGE review corpus for fine-grained, multi-lingual opinion analysis</title>
  <organisation>
    <name>CITEC -- Center of Excellence Cognitive Interaction Technology, Bielefeld University</name>
    <url>https://cit-ec.de</url>
  </organisation>
  <description>
    This corpus consists of annotations of Amazon reviews for different product categories in German and English. The reviews themselves are not part of this data publication. The annotations are fine-grained and cover aspects and subjective phrases. In addition, the corpus marks which aspect is the target of a subjective phrase and gives the polarity of each subjective phrase. The corpus consists of 622 English and 611 German reviews for coffee machines, cutlery, microwaves, toasters, trash cans, vacuum cleaners, washing machines and dishwashers. The English part is annotated with more than 8000 aspects and 5000 subjective phrases, the German part with more than 6000 aspects and around 5000 subjective phrases (depending on the annotator). Each review is annotated independently by two annotators. Updates to these data will be available at http://www.roman-klinger.de/usagecorpus.
  </description>
  <version>v1.0.1</version>
  <date>2014-03-14</date>
  <creators>
    <creator>
      <name>Roman Klinger</name>
      <url>http://www.cit-ec.de/users/rklinger</url>
    </creator>
  </creators>
  <contributors>
    <contributor>
      <name>Philipp Cimiano</name>
      <url>http://www.cit-ec.de/users/cimiano</url>
    </contributor>
    <contributor>
      <name>Frederike Strunz</name>
    </contributor>
    <contributor>
      <name>Luci Fillinger</name>
    </contributor>
    <contributor>
      <name>Robin Schiewer</name>
    </contributor>
  </contributors>
  <downloadurls>
    <downloadurl>https://opensource.cit-ec.de/attachments/download/324/USAGE-corpus-1.0.1.tar.gz</downloadurl>
  </downloadurls>
  <keywords>
    <keyword>Sentiment Analysis</keyword>
    <keyword>Relation Extraction</keyword>
    <keyword>Text Mining</keyword>
    <keyword>Natural Language Processing</keyword>
    <keyword>Machine Learning</keyword>
    <keyword>Reviews</keyword>
    <keyword>Corpus</keyword>
  </keywords>
  <formats>
    <format>CSV</format>
  </formats>
  <acknowledgements>
    The development of this database has been funded by the it's OWL project (Intelligent Technical Systems Ostwestfalen-Lippe, http://www.its-owl.de/), a leading-edge cluster of the German Ministry of Education and Research, and by the Excellence Cluster EXC 277 Cognitive Interaction Technology. The Excellence Cluster EXC 277 is a grant of the Deutsche Forschungsgemeinschaft (DFG) in the context of the German Excellence Initiative.
  </acknowledgements>
</opendatametainfo>