IT数码 购物 网址 头条 软件 日历 阅读 图书馆
TxT小说阅读器
↓语音阅读,小说下载,古典文学↓
图片批量下载器
↓批量下载图片,美女图库↓
图片自动播放器
↓图片自动播放器↓
一键清除垃圾
↓轻轻一点,清除系统垃圾↓
开发: C++知识库 Java知识库 JavaScript Python PHP知识库 人工智能 区块链 大数据 移动开发 嵌入式 开发工具 数据结构与算法 开发测试 游戏开发 网络协议 系统运维
教程: HTML教程 CSS教程 JavaScript教程 Go语言教程 JQuery教程 VUE教程 VUE3教程 Bootstrap教程 SQL数据库教程 C语言教程 C++教程 Java教程 Python教程 Python3教程 C#教程
数码: 电脑 笔记本 显卡 显示器 固态硬盘 硬盘 耳机 手机 iphone vivo oppo 小米 华为 单反 装机 图拉丁
 
   -> 人工智能 -> 【吴恩达深度学习】04_week2_quiz Deep convolutional models -> 正文阅读

[人工智能]【吴恩达深度学习】04_week2_quiz Deep convolutional models

(1)Which of the following do you typically see as you move to deeper layer in a ConvNet?
[A] n H n_H nH? and n W n_W nW? decrease, while n C n_C nC? increases.
[B] n H n_H nH? and n W n_W nW? decrease, while n C n_C nC? also decreases.
[C] n H n_H nH? and n W n_W nW? increases, while n C n_C nC? also increases.
[D] n H n_H nH? and n W n_W nW? increases, while n C n_C nC? decreases.

答案:A
解析:典型的卷积网络结构如下。
LeNet-5网络结构如下:在这里插入图片描述
AlexNet网络结构如下:
在这里插入图片描述
VGG-16网络结构如下:
在这里插入图片描述

(2)Which of the following do you typically see in a ConvNet? (Check
all that apply.)
[A]Multiple CONV layers followed by a POOL layer.
[B]Multiple POOL layers followed by a CONV layer.
[C]FC layers in the last few layers.
[D]FC layers in the first few layers.

答案:A,C
解析:典型的卷积网络结构如上题所示

(3)In order to be able to build very deep networks, we usually only use pooling layers to downsize the height/width of the activation volumes while convolutions are used with “valid” padding. Otherwise, we would downsize the input of the model too quickly.
[A]True
[B]False

答案:B
解析:构建比较深的网络,卷积层一般用"same" padding 。如果用"valid" padding 的话每一次卷积操作都会使 height/width 缩小

(4)Training a deeper network (for example, adding additional layers to the network) allows the network to fit more complex functions and thus almost always results in lower training error. For this question, assume we’re referring to “plain” networks.
[A]True
[B]False

答案:B
解析:对于普通的网络,理论上随着网络深度的加深,应该训练的越来越好;但是实际情况随着网络深度的加深,训练误差会先减少然后增多。
在这里插入图片描述

(5)The following equation captures the computation in a ResNet block. What goes into the two blanks above?
a [ l + 2 ] = g ( W [ l + 2 ] g ( W [ l + 1 ] a [ l ] + b [ l + 1 ] ) + b [ l + 2 ] + _ _ _ _ _ _ _ ) + _ _ _ _ _ _ _ a^{[l+2]}=g(W^{[l+2]}g(W^{[l+1]}a^{[l]}+b^{[l+1]})+b^{[l+2]}+\_\_\_\_\_\_\_)+\_\_\_\_\_\_\_ a[l+2]=g(W[l+2]g(W[l+1]a[l]+b[l+1])+b[l+2]+_______)+_______
[A] z [ l ] z^{[l]} z[l] and a [ l ] a^{[l]} a[l], respectively
[B] a [ l ] a^{[l]} a[l] and 0, respectively
[C] 0 0 0 and a [ l ] a^{[l]} a[l], respectively
[D] 0 0 0 and z [ l + 1 ] z^{[l+1]} z[l+1], respectively

答案:B
解析:
a [ l + 2 ] = g ( z [ l + 2 ] + a [ l ] ) = g ( W [ l + 2 ] a [ l + 1 ] + b [ l + 2 ] + a [ l ] ) = g ( W [ l + 2 ] g ( z [ l + 1 ] ) + b [ l + 2 ] + a [ l ] ) = g ( W [ l + 2 ] g ( W [ l + 1 ] a [ l ] + b [ l + 1 ] ) + b [ l + 2 ] + a [ l ] ) \begin{aligned} a^{[ l+2 ]}&=g( z^{[ l+2 ]}+a^{[ l ]} ) \\ &=g( W^{[ l+2 ]}a^{[ l+1 ]}+b^{[ l+2 ]}+a^{[ l ]} ) \\ &=g( W^{[ l+2 ]}g( z^{[ l+1 ]} ) +b^{[ l+2 ]}+a^{[ l ]} ) \\ &=g( W^{[ l+2 ]}g( W^{[ l+1 ]}a^{[ l ]}+b^{[ l+1 ]} ) +b^{[ l+2 ]}+a^{[ l ]} ) \end{aligned} a[l+2]?=g(z[l+2]+a[l])=g(W[l+2]a[l+1]+b[l+2]+a[l])=g(W[l+2]g(z[l+1])+b[l+2]+a[l])=g(W[l+2]g(W[l+1]a[l]+b[l+1])+b[l+2]+a[l])?

(6)Which ones of the following statements on Residual Networks are true?(Check all that apply.)
[A]A ResNet with L L L layers would have on the order of L 2 L^2 L2 skip connections in total.
[B]The skip-connection makes it easy for the network to learn an identity mapping between the input and the output within the ResNet block.
[C]Using a skip-connection helps the gradient to backpropagate and thus helps you to train deeper networks.
[D]The skip-connections compute a complex non-linear function of the input to pass to a deeper layer in the network.

答案:B,C
解析:
在残差网络中 skip connections 的数量不会大于层数,故A错。注意和 DenseNet 区分。下面展示了两种常见的残差网络块。
如下图所示,skip-connections是直接短接过去,而没有进行计算。
在这里插入图片描述
即使是下图所示形式,CONV2D层只是用于调整维度,而没有进行非线性计算。
在这里插入图片描述
故D错。

(7)Suppose you have an input volume of dimension 64x64x16. How many parameters would a single 1x1 convolutional filter have (including the bias)?
[A] 2
[B] 17
[C] 1
[D] 4097

答案:B
解析:filter的维度为1x1x16,再加上1个bias,总参数个数为17 。

(8)Suppose you have an input volume of dimension n H × n W × n C n_H\times n_W\times n_C nH?×nW?×nC? . Which of the following statements you agree with? (Assume that “1x1 convolutional layer” below always uses a stride of 1 and no padding.)
[A]You can use a 1x1 convolutional layer to reduce n H n_H nH?, n W n_W nW?, and n C n_C nC?.
[B]You can use a pooling layer to reduce n H n_H nH?, n W n_W nW?, and n C n_C nC?.
[C]You can use a pooling layer to reduce n H n_H nH?, n W n_W nW?, but not n C n_C nC?.
[D]You can use a 1x1 convolutional layer to reduce n C n_C nC? but not n H n_H nH?, n W n_W nW?.

答案:C,D

(9)Which ones of the following statements on Inception Network are true? (Check all that apply.)
[A]Making an inception network deeper (by stacking more inception blocks together) should not hurt training set performance.
[B]A single inception block allows the network to use a combination of 1x1, 3x3, 5x5 convolutions and pooling.
[C]Inception networks incorporates a variety of network architectures (similar to dropout, which randomly chooses a network architecture on each step) and thus has a similar regularizing effect as dropout.
[D]Inception blocks usually use 1x1 convolutions to reduce the input data volume’s size before applying 3x3 and 5x5 convolutions.

答案:B,D
解析:参考GoogLeNet,为防止中间梯度消失,采用了辅助分类器,如图所示:
在这里插入图片描述
故A错。
Inception networks 不模仿dropout 随机选择网络结构,而是将这些网络结构堆叠起来,如图所示:
故
故C错。

(10)Which of the following are common reasons for using open-source implementations of ConvNets (both the model and/or weights)? (Check all that apply)
[A]Parameters trained for one computer vision task are often useful as pretraining for other computer vision tasks.
[B]The same techniques for winning computer vision competitions, such as using multiple crops at test time, are widely used in practical deployments (or production system deployments) of ConvNets.
[C]It is a convenient way to get working an implementation of a complex ConvNet architecture.
[D]A model trained for one computer vision task can usually be used to perform data augmentation even for a different computer vision task.

答案:A,C
解析:竞赛的一些技巧不适用于实际环境,因为太耗费算力资源,故B错。
一个训练好的计算机视觉任务模型可以通过迁移学习用于另一个任务,而不是数据增强,故D错。

  人工智能 最新文章
2022吴恩达机器学习课程——第二课(神经网
第十五章 规则学习
FixMatch: Simplifying Semi-Supervised Le
数据挖掘Java——Kmeans算法的实现
大脑皮层的分割方法
【翻译】GPT-3是如何工作的
论文笔记:TEACHTEXT: CrossModal Generaliz
python从零学(六)
详解Python 3.x 导入(import)
【答读者问27】backtrader不支持最新版本的
上一篇文章      下一篇文章      查看所有文章
加:2022-03-13 21:47:25  更:2022-03-13 21:51:37 
 
开发: C++知识库 Java知识库 JavaScript Python PHP知识库 人工智能 区块链 大数据 移动开发 嵌入式 开发工具 数据结构与算法 开发测试 游戏开发 网络协议 系统运维
教程: HTML教程 CSS教程 JavaScript教程 Go语言教程 JQuery教程 VUE教程 VUE3教程 Bootstrap教程 SQL数据库教程 C语言教程 C++教程 Java教程 Python教程 Python3教程 C#教程
数码: 电脑 笔记本 显卡 显示器 固态硬盘 硬盘 耳机 手机 iphone vivo oppo 小米 华为 单反 装机 图拉丁

360图书馆 购物 三丰科技 阅读网 日历 万年历 2025年1日历 -2025/1/9 16:19:35-

图片自动播放器
↓图片自动播放器↓
TxT小说阅读器
↓语音阅读,小说下载,古典文学↓
一键清除垃圾
↓轻轻一点,清除系统垃圾↓
图片批量下载器
↓批量下载图片,美女图库↓
  网站联系: qq:121756557 email:121756557@qq.com  IT数码