NLTK：语料库级BLEU与句子级BLEU分数

：

>>> import nltk
>>> hypothesis = ['This', 'is', 'cat'] 
>>> reference = ['This', 'is', 'a', 'cat']
>>> references = [reference] # list of references for 1 sentence.
>>> list_of_references = [references] # list of references for all sentences in corpus.
>>> list_of_hypotheses = [hypothesis] # list of hypotheses that corresponds to list of references.
>>> nltk.translate.bleu_score.corpus_bleu(list_of_references, list_of_hypotheses)
0.6025286104785453
>>> nltk.translate.bleu_score.sentence_bleu(references, hypothesis)
0.6025286104785453

（注意：您必须在develop分支上提取最新版本的NLTK才能获得BLEU评分实施的稳定版本）

：

其实只要有一个参考，在你的整个语料库一个假设，既corpus_bleu()与sentence_bleu()应返回相同的值，如上面的例子。

在代码中，我们看到sentence_bleu实际上是鸭子类型corpus_bleu：

def sentence_bleu(references, hypothesis, weights=(0.25, 0.25, 0.25, 0.25),
                  smoothing_function=None):
    return corpus_bleu([references], [hypothesis], weights, smoothing_function)

如果我们查看以下参数sentence_bleu：

 def sentence_bleu(references, hypothesis, weights=(0.25, 0.25, 0.25, 0.25),
                      smoothing_function=None):
    """"
    :param references: reference sentences
    :type references: list(list(str))
    :param hypothesis: a hypothesis sentence
    :type hypothesis: list(str)
    :param weights: weights for unigrams, bigrams, trigrams and so on
    :type weights: list(float)
    :return: The sentence-level BLEU score.
    :rtype: float
    """

sentence_bleu的引用输入为list(list(str))。

因此，如果您有一个句子字符串，例如"This is a cat"，则必须对其进行标记以获取字符串列表，["This", "is", "a", "cat"]并且由于它允许多个引用，因此它必须是字符串列表的列表，例如，如果您有第二个引用，“这是一条猫”，您的输入sentence_bleu()将是：

references = [ ["This", "is", "a", "cat"], ["This", "is", "a", "feline"] ]
hypothesis = ["This", "is", "cat"]
sentence_bleu(references, hypothesis)

当涉及到corpus_bleu()list_of_references参数时，它基本上是一个列表，该列表包含了所有sentence_bleu()作为引用的内容：

def corpus_bleu(list_of_references, hypotheses, weights=(0.25, 0.25, 0.25, 0.25),
                smoothing_function=None):
    """
    :param references: a corpus of lists of reference sentences, w.r.t. hypotheses
    :type references: list(list(list(str)))
    :param hypotheses: a list of hypothesis sentences
    :type hypotheses: list(list(str))
    :param weights: weights for unigrams, bigrams, trigrams and so on
    :type weights: list(float)
    :return: The corpus-level BLEU score.
    :rtype: float
    """

除了查看内的doctest之外nltk/translate/bleu_score.py，您还可以查看unittest，nltk/test/unit/translate/test_bleu_score.py以了解如何使用内的每个组件bleu_score.py。

顺便说一句，由于使用（（）（https://github.com/nltk/nltk/blob/develop/nltk/translate/ .py＃L21）中的sentence_bleu导入，bleu``nltk.translate.__init__.py ****

from nltk.translate import bleu

将与以下相同：

from nltk.translate.bleu_score import sentence_bleu

并在代码中：

>>> from nltk.translate import bleu
>>> from nltk.translate.bleu_score import sentence_bleu
>>> from nltk.translate.bleu_score import corpus_bleu
>>> bleu == sentence_bleu
True
>>> bleu == corpus_bleu
False

其他 2022/1/1 18:40:07 有513人围观

撰写回答

你尚未登录，登录后可以

和开发者交流问题的细节

关注并接收问题和回答的更新提醒

参与内容的编辑和改进，让解决方法与时俱进

请先登录

NLTK：语料库级BLEU与句子级BLEU分数

撰写回答

推荐问题

对于HTML表单输入字段，disabled =“ disabled”和readonly =“ readonly”有什么区别？

Swift-使用downloadTaskWithURL下载视频

两套物品。A组的每个元素与B组的唯一匹配。在O（nlogn）时间内将A组的每个项目与B组的项目进行匹配

DropdownList数据源

带OGNL的Struts 2动态消息

Jenkins中不推荐使用JNLP Connections，将Windows从站连接到jenkins的新推荐方法是什么？

Jenkins：如何在Nginx反向代理后面配置Jenkins，以便JNLP从站进行连接

CSS Only饼图-如何在切片之间添加间距/填充？

类型“ Readonly <{}>”上不存在“ ValueChanging”

无法提交JPA事务：事务标记为rollbackOnly

CSS中是否存在`pointer-events：hoverOnly`或类似的东西？

SwiftUI MVVM协调器/路由器/ NavigationLink

尝试在脚本标签上触发onload事件

HTML5中是否有一个minlength验证属性？

在哪里可以下载JNLP.jar？

在ASP.net Web.Config中设置jsonSerialization maxJsonLength会产生500错误

window.onload与body.onload与document.onready [重复]

WebClient.DownloadString（）返回带有特殊字符的字符串

刷新页面后如何保持Dropdownlist值相同

如何使用Struts2标签和OGNL比较两个字符串？

分类汇总

您的鼓励是对我最大的支持