在tf.nn中“SAME”和“VALID”填充之间的区别是什么?tensorflow的Max_pool ?

在我看来，'VALID'意味着当我们做max pool时，边缘外不会有零填充。

根据深度学习卷积算法指南，它说池操作符中不会有填充，即只使用tensorflow的“VALID”。但什么是'SAME'填充的最大池张量流量?

当前回答

快速的解释

VALID:不要应用任何填充，也就是说，假设所有的维度都是有效的，这样输入的图像就会被你指定的过滤器和stride完全覆盖。

SAME:应用填充到输入(如果需要)，以便输入图像被过滤器和步幅完全覆盖。对于stride 1，这将确保输出图像大小与输入相同。

笔记

This applies to conv layers as well as max pool layers in same way The term "valid" is bit of a misnomer because things don't become "invalid" if you drop part of the image. Sometime you might even want that. This should have probably be called NO_PADDING instead. The term "same" is a misnomer too because it only makes sense for stride of 1 when output dimension is same as input dimension. For stride of 2, output dimensions will be half, for example. This should have probably be called AUTO_PADDING instead. In SAME (i.e. auto-pad mode), Tensorflow will try to spread padding evenly on both left and right. In VALID (i.e. no padding mode), Tensorflow will drop right and/or bottom cells if your filter and stride doesn't full cover input image.

2018-01-23 01:56:38

其他回答

Padding on/off. Determines the effective size of your input. VALID: No padding. Convolution etc. ops are only performed at locations that are "valid", i.e. not too close to the borders of your tensor. With a kernel of 3x3 and image of 10x10, you would be performing convolution on the 8x8 area inside the borders. SAME: Padding is provided. Whenever your operation references a neighborhood (no matter how big), zero values are provided when that neighborhood extends outside the original tensor to allow that operation to work also on border values. With a kernel of 3x3 and image of 10x10, you would be performing convolution on the full 10x10 area.

2018-07-17 14:32:19

有效填充是没有填充。相同的填充在某种程度上是输出与输入大小相同的填充。

2022-06-29 21:45:47

有效填充:这是零填充。希望没有混淆。

x = tf.constant([[1., 2., 3.], [4., 5., 6.],[ 7., 8., 9.], [ 7., 8., 9.]])
x = tf.reshape(x, [1, 4, 3, 1])
valid_pad = tf.nn.max_pool(x, [1, 2, 2, 1], [1, 2, 2, 1], padding='VALID')
print (valid_pad.get_shape()) # output-->(1, 2, 1, 1)

相同填充:首先，这有点难以理解，因为我们必须分别考虑官方文档中提到的两个条件。

假设输入为，输出为，填充为，步幅为，内核大小为(只考虑单个维度)

案例01::

案例02::

被计算为可用于填充的最小值。由于的值是已知的，可以用这个公式求出值。

让我们来做这个例子:

x = tf.constant([[1., 2., 3.], [4., 5., 6.],[ 7., 8., 9.], [ 7., 8., 9.]])
x = tf.reshape(x, [1, 4, 3, 1])
same_pad = tf.nn.max_pool(x, [1, 2, 2, 1], [1, 2, 2, 1], padding='SAME')
print (same_pad.get_shape()) # --> output (1, 2, 2, 1)

这里x的维数是(3,4)那么如果取水平方向(3):

若取垂直方向(4):

希望这将有助于理解实际上相同填充是如何在TF中工作的。

2018-04-15 12:46:49

填充是一种增加输入数据大小的操作。在一维数据中，你只需要在数组前加上一个常数，在2-dim中，你用这些常数包围矩阵。在n-dim中，用常数包围n-dim超立方体。在大多数情况下，这个常数是零，它被称为零填充。

下面是一个应用于2-d张量的p=1的零填充的例子:

你可以为你的内核使用任意填充，但是有些填充值比其他填充值使用得更频繁:

有效的填充。最简单的情况，意味着根本没有填充。让你的数据保持原样。相同填充有时称为半填充。之所以称为SAME，是因为对于stride=1的卷积(或池化)，它应该产生与输入相同大小的输出。之所以叫HALF是因为对于一个大小为k的核 FULL填充是最大填充，它不会导致对刚刚填充的元素进行卷积。对于一个大小为k的核，这个填充值等于k - 1。

要在TF中使用任意填充，可以使用TF .pad()

2017-05-21 22:28:27

有三种填充选择:有效(无填充)，相同(或一半)，满。你可以在这里(Theano)找到解释: http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html

有效或无填充:

有效填充不涉及零填充，因此它只覆盖有效输入，不包括人工生成的零。对于内核大小为k的步幅s=1，则输出长度为((输入长度)- (k-1))。

相同或半填充:

当s=1时，相同的填充使输出的大小与输入的大小相同。如果s=1，补零的个数为(k-1)。

完全填充:

完全填充意味着内核运行整个输入，因此在结束时，内核可能只满足一个输入，其他为零。如果s=1，填充的零的数量是2(k-1)。如果s=1，则输出长度为((输入长度)+ (k-1))。

因此，填充的数量:(有效)<=(相同)<=(满)

2017-05-05 07:53:25

在tf.nn中“SAME”和“VALID”填充之间的区别是什么?tensorflow的Max_pool ?

推荐文章

最新文章

标签