偏差在神经网络中的作用是什么?

我知道梯度下降和反向传播算法。我不明白的是:什么时候使用偏见是重要的，你如何使用它?

例如，在映射AND函数时，当我使用两个输入和一个输出时，它不会给出正确的权重。然而，当我使用三个输入(其中一个是偏差)时，它给出了正确的权重。

当前回答

在我的硕士论文中的几个实验中(例如第59页)，我发现偏差可能对第一层很重要，但特别是在最后的完全连接层，它似乎没有发挥很大的作用。

这可能高度依赖于网络架构/数据集。

2017-08-01 17:09:42

其他回答

在我研究的所有ML书籍中，W总是被定义为两个神经元之间的连通性指数，这意味着两个神经元之间的连通性更高。

放电神经元向目标神经元或Y = w * X传递的信号越强，为了保持神经元的生物学特性，我们需要保持1 >= w >= -1，但在实际回归中，w最终会变成| w | >=1，这与神经元的工作方式相矛盾。

因此，我提出W = cos(theta)，而1 >= |cos(theta)|， Y= a * X = W * X + b而a = b + W = b + cos(theta)， b是一个整数。

2017-09-10 20:19:55

在神经网络中:

每个神经元都有一个偏向您可以将偏差视为阈值(通常是阈值的相反值) 输入层的加权和+偏置决定神经元的激活偏差增加了模型的灵活性。

在没有偏差的情况下，仅考虑来自输入层的加权和可能不会激活神经元。如果神经元没有被激活，来自该神经元的信息就不会通过神经网络的其余部分传递。

偏见的价值是可以学习的。

实际上，bias = - threshold。你可以把偏差想象成让神经元输出1有多容易，如果偏差很大，神经元输出1很容易，但如果偏差很大，就很难了。

总而言之:偏置有助于控制激活函数的触发值。

观看这段视频了解更多细节。

一些更有用的链接:

Geeksforgeeks

走向数据科学

2019-02-12 14:00:52

偏差有助于得到更好的方程。

想象一下，输入和输出就像一个函数y = ax + b，你需要在输入(x)和输出(y)之间画一条正确的线，以最小化每个点和直线之间的全局误差，如果你保持这样的方程y = ax，你将只有一个参数用于适应，即使你找到了最小化全局误差的最佳参数，它也会离你想要的值很远。

你可以说，偏差使方程更灵活，以适应最佳值

2020-05-21 00:09:49

Two different kinds of parameters can be adjusted during the training of an ANN, the weights and the value in the activation functions. This is impractical and it would be easier if only one of the parameters should be adjusted. To cope with this problem a bias neuron is invented. The bias neuron lies in one layer, is connected to all the neurons in the next layer, but none in the previous layer and it always emits 1. Since the bias neuron emits 1 the weights, connected to the bias neuron, are added directly to the combined sum of the other weights (equation 2.1), just like the t value in the activation functions.1

它不实用的原因是，您同时调整权重和值，因此对权重的任何更改都会抵消对先前数据实例有用的值的更改……在不改变值的情况下添加偏置神经元可以让你控制层的行为。

此外，偏差允许您使用单个神经网络来表示类似的情况。考虑由以下神经网络表示的AND布尔函数:

(来源:aihorizon.com)

W0对应于b。 W1对应x1。 W2对应于x2。

A single perceptron can be used to represent many boolean functions. For example, if we assume boolean values of 1 (true) and -1 (false), then one way to use a two-input perceptron to implement the AND function is to set the weights w0 = -3, and w1 = w2 = .5. This perceptron can be made to represent the OR function instead by altering the threshold to w0 = -.3. In fact, AND and OR can be viewed as special cases of m-of-n functions: that is, functions where at least m of the n inputs to the perceptron must be true. The OR function corresponds to m = 1 and the AND function to m = n. Any m-of-n function is easily represented using a perceptron by setting all input weights to the same value (e.g., 0.5) and then setting the threshold w0 accordingly. Perceptrons can represent all of the primitive boolean functions AND, OR, NAND ( 1 AND), and NOR ( 1 OR). Machine Learning- Tom Mitchell)

阈值是偏置，w0是与偏置/阈值神经元相关的权重。

2010-03-19 21:38:55

扩展zfy的解释:

一个输入，一个神经元，一个输出的方程如下:

y = a * x + b * 1    and out = f(y)

其中x是输入节点的值，1是偏置节点的值; Y可以直接作为输出，也可以传递给一个函数，通常是一个sigmoid函数。还要注意，偏差可以是任何常数，但为了使一切更简单，我们总是选择1(可能这太常见了，zfy没有显示和解释它)。

你的网络试图学习系数a和b来适应你的数据。所以你可以看到为什么添加元素b * 1可以让它更好地适应更多的数据:现在你可以改变斜率和截距。

如果你有一个以上的输入，你的方程将是这样的:

y = a0 * x0 + a1 * x1 + ... + aN * 1

请注意，这个方程仍然描述一个神经元，一个输出网络;如果你有更多的神经元，你只需在系数矩阵中增加一个维度，将输入相乘到所有节点，然后将每个节点的贡献相加。

可以写成向量化的形式

A = [a0, a1, .., aN] , X = [x0, x1, ..., 1]
Y = A . XT

即，将系数放在一个数组中，(输入+偏差)放在另一个数组中，你就有了你想要的解决方案，作为两个向量的点积(你需要转置X的形状是正确的，我写了XT a 'X转置')

所以最后你也可以看到你的偏差只是一个输入来代表输出的那部分实际上是独立于你的输入的。

2016-09-10 06:39:20

偏差在神经网络中的作用是什么?

推荐文章

最新文章

标签