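Epoch vs. Iteration when training neural networks

What is the difference between an epoch and an iteration when training a multi-layer perceptron?

Current answer

As I understand it, training a neural network requires a large dataset made up of many data items. During training, the data items are fed into the network one at a time, and each such step is called an iteration; when the whole dataset has gone through, that is called an epoch.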

2012-01-06 22:41:25

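Other answers

Epoch and iteration describe different things.

Epoch

An epoch describes the number of times the algorithm sees the entire dataset. So each time the algorithm has seen all the samples in the dataset, one epoch has been completed.

Iteration

An iteration describes the number of times a batch of data has passed through the algorithm. In the case of neural networks, that means one forward pass and one backward pass. So every time you pass a batch of data through the network, you have completed one iteration.

Example

An example might make it clearer.

Say you have a dataset of 10 examples (or samples). The batch size is 2, and you have specified that the algorithm should run for 3 epochs.

Therefore, in each epoch you have 5 batches (10/2 = 5). Each batch gets passed through the algorithm, so you have 5 iterations per epoch. Since you have specified 3 epochs, you have a total of 15 iterations (5*3 = 15) for training.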
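
The arithmetic above can be checked with a minimal sketch. The names `dataset`, `batch_size`, and `num_epochs` are illustrative assumptions, and the loop body only counts batches instead of running a real forward and backward pass.

```python
# Illustrative values taken from the example above.
dataset = list(range(10))   # 10 samples (placeholders for real data)
batch_size = 2
num_epochs = 3

total_iterations = 0
for epoch in range(num_epochs):
    iterations_this_epoch = 0
    # Walk through the dataset one batch at a time.
    for start in range(0, len(dataset), batch_size):
        batch = dataset[start:start + batch_size]
        # A real training loop would run one forward pass and one
        # backward pass on `batch` here; this sketch only counts it.
        iterations_this_epoch += 1
    total_iterations += iterations_this_epoch
    print(f"epoch {epoch + 1}: {iterations_this_epoch} iterations")

print(f"total iterations: {total_iterations}")
```

Each pass of the inner loop is one iteration, and each pass of the outer loop is one epoch.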

2016-07-21 02:45:15

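In neural network terminology:

One epoch = one forward pass and one backward pass of all the training examples.

Batch size = the number of training examples in one forward/backward pass. The larger the batch size, the more memory you need.

Number of iterations = the number of passes, each pass using [batch size] examples. To be clear, one pass = one forward pass + one backward pass (the forward pass and the backward pass are not counted as two different passes).

Example: if you have 1000 training examples and your batch size is 500, it will take 2 iterations to complete 1 epoch.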
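
As a rough sketch, the number of iterations per epoch follows from the dataset size and the batch size. The helper `iterations_per_epoch` below is a hypothetical name, and rounding up assumes that a final, smaller batch is still processed when the dataset size is not a multiple of the batch size (some frameworks can drop it instead).

```python
import math


def iterations_per_epoch(num_examples: int, batch_size: int) -> int:
    """Hypothetical helper: batches needed to see every example once.

    Assumes a final partial batch is kept, so the result is rounded up.
    """
    if num_examples < 0 or batch_size <= 0:
        raise ValueError("num_examples must be >= 0 and batch_size must be > 0")
    return math.ceil(num_examples / batch_size)


print(iterations_per_epoch(1000, 500))  # the example above
print(iterations_per_epoch(1000, 300))  # uneven split: last batch is smaller
```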

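FYI: Tradeoff batch size vs. number of iterations to train a neural network

The term "batch" is ambiguous: some people use it to mean the entire training set, and some use it to mean the number of training examples in one forward/backward pass (as I did in this answer). To avoid that ambiguity and make clear that a batch corresponds to the number of training examples in one forward/backward pass, one can use the term mini-batch.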

2015-08-05 21:14:23

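Typically, you split your training set into small batches for the network to learn from, and training proceeds step by step through the layers, applying gradient descent all the way down. Each of these small steps can be called an iteration.

An epoch corresponds to the entire training set going through the entire network once. Limiting the number of epochs can be useful, e.g. to fight overfitting.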

2012-10-26 21:46:14

An epoch is one complete cycle in which the neural network has seen all the data. One might have, say, 100,000 images to train the model on; however, memory might not be sufficient to process all the images at once, so training is split into smaller chunks of data called batches, e.g. with a batch size of 100. All the images must be covered using multiple batches, so 1,000 iterations are needed to cover all 100,000 images (100 batch size * 1,000 iterations). Once the neural network has looked at the entire dataset, that is 1 epoch. One might need multiple epochs to train the model (say, 10 epochs).

2019-09-23 22:58:25

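An epoch is one pass over the training samples, processed as a sequence of subsets (mini-batches), for example in the gradient descent algorithm of a neural network. A good reference: http://neuralnetworksanddeeplearning.com/chap1.html

Note that the page has code for a gradient descent algorithm that uses epochs: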

import random  # needed for random.shuffle below

def SGD(self, training_data, epochs, mini_batch_size, eta,
        test_data=None):
    """Train the neural network using mini-batch stochastic
    gradient descent.  The "training_data" is a list of tuples
    "(x, y)" representing the training inputs and the desired
    outputs.  The other non-optional parameters are
    self-explanatory.  If "test_data" is provided then the
    network will be evaluated against the test data after each
    epoch, and partial progress printed out.  This is useful for
    tracking progress, but slows things down substantially."""
    if test_data: n_test = len(test_data)
    n = len(training_data)
    for j in range(epochs):  # one pass of this loop = one epoch
        # Reshuffle so each epoch cuts the data into different mini-batches.
        random.shuffle(training_data)
        mini_batches = [
            training_data[k:k+mini_batch_size]
            for k in range(0, n, mini_batch_size)]
        for mini_batch in mini_batches:  # one pass of this loop = one iteration
            self.update_mini_batch(mini_batch, eta)
        if test_data:
            print("Epoch {0}: {1} / {2}".format(
                j, self.evaluate(test_data), n_test))
        else:
            print("Epoch {0} complete".format(j))

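Look at the code. For each epoch, we shuffle the training data and split it into random subsets (mini-batches) that are fed to the gradient descent algorithm. The page also explains why training over epochs is effective. Please take a look.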
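
To make the shuffle-and-slice step concrete, here is a small standalone sketch, written for Python 3, that is not part of the book's code. The function `make_mini_batches` and the toy data are assumptions for illustration; it only shows that, within one epoch, the mini-batches together cover every training example exactly once.

```python
import random


def make_mini_batches(training_data, mini_batch_size, rng):
    """Hypothetical helper mirroring the slicing in SGD above.

    Shuffles a copy of the data, then cuts it into consecutive slices.
    """
    shuffled = list(training_data)
    rng.shuffle(shuffled)
    return [shuffled[k:k + mini_batch_size]
            for k in range(0, len(shuffled), mini_batch_size)]


toy_data = [(x, 2 * x) for x in range(10)]  # toy (input, target) pairs
rng = random.Random(0)                      # seeded only for repeatability

for epoch in range(2):
    mini_batches = make_mini_batches(toy_data, 4, rng)
    seen = [pair for batch in mini_batches for pair in batch]
    # Each mini-batch is one iteration; all of them together are one epoch.
    print(f"epoch {epoch}: {len(mini_batches)} iterations, "
          f"{len(seen)} examples seen")
```

With 10 examples and a mini-batch size of 4, each epoch has 3 iterations, the last one on a smaller batch of 2.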

2015-11-20 13:18:15
