Pairwise Neural Machine Translation Evaluation

Introduction
1. Automated Machine Translation (MT)
  1. Evaluation
    1. Needed
      1. Developing a new MT
      2. Comparing two MT
    2. Reference based MT
      1. Comparing the system output to one or more human reference tranlations
      2. Most Common
      3. Compute
        Absolute quality score
        Computing similarity between the machine and human translation
        Simplest case
        Computing word N-gram matches between the translation and the reference
        BLUE
        More advanced
        Take into account various aspects of linguistic similarity
        Better correlation with human judjment
    3. Human ranking
      1. Can be used to train automatic metrics
        Can be oriented to predict absolute scores
        Using
        Regression
        ?
        Ranking
        Special case
        Compare two hypotheses and referenec
        Decide which hypothesis is better
        Recent result
        Guzman
        Learning framework
        Using
        Preference kernel
        ?
        Vector machines (SVM)
        Syntactic structures
        Discourage-based structures
        High computational costs
        Training
        Testing
        Due to
        Using convolution kernels
        ?
        Over complex structures
        Simplification is needed!
Research
1. Framework for machine translation evaluation
  1. Novel!
  2. Goal
    1. Select a better translation from a pair of hypothesis, given the reference translation
  3. Using
    1. Neural networks
      1. Multi-layer
        Input layer
        Semantic info
        Syntactic info
        Lexical info
        Hidden layer
        Captures the interactions between the relevant input components
      2. Models the interaction between
        The two hypothesis
        Reference and Hypothesis
    2. Distributed vector representations
      1. Used for storing
        The Two hypothesis
      2. Based on
        Word embedding
        Sentence embedding
      3. Learned from
        Neural Networks
        Novel!
        Can be trained to optimize task-specific cost function
      4. Efficient
        Vector-based compression
        ?
  4. Experiments
    1. WMT12 metrics task
    2. Better results than by Guzman
    3. High correlation with human judjment
    4. Comparable with the best!
      1. DiscoTK
        Metric
        Combination based
        Much heavier
    5. Embeddings
      1. Syntactically oriented
      2. Semantically oriented
      3. Cumulative performance gains
        Over
        BLUE
        NIST
        METEOR
        TER
2. Simplification is needed!

Nächster

Pairwise Neural Machine Translation Evaluation

Beschreibung

Zusammenfassung der Ressource

ähnlicher Inhalt

	Erstellt von Ivan Zapreev vor mehr als 9 Jahre