caffe

2024-03-30 10:44

Let CAFFE denote the Caffe root directory. The core code lives under $CAFFE/src/caffe and falls into four main parts: net, blob, layer, and solver.

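net.cpp:
A net defines the whole network, which is composed of many layers. net.cpp drives the forward and backward passes over the entire network during training: the forward pass computes each layer's outputs (and finally the loss) from bottom to top, and the backward pass computes each layer's gradients from top to bottom.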
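The forward/backward flow can be illustrated with a toy chain of layers. This is only a sketch in plain Python, not Caffe's actual code: ScaleLayer, SquareLossLayer, and ToyNet are hypothetical names, and each "blob" is reduced to a single float.

```python
# Toy stand-in for a net: a chain of layers run forward, then backward.
# ScaleLayer, SquareLossLayer and ToyNet are hypothetical, not Caffe classes.

class ScaleLayer:
    """y = w * x, with a single scalar weight w."""
    def __init__(self, w):
        self.w = w
        self.w_grad = 0.0

    def forward(self, x):
        self.x = x                         # cache the input for backward
        return self.w * x

    def backward(self, top_grad):
        self.w_grad = top_grad * self.x    # dL/dw
        return top_grad * self.w           # dL/dx, handed to the layer below


class SquareLossLayer:
    """L = 0.5 * (x - target)^2."""
    def __init__(self, target):
        self.target = target

    def forward(self, x):
        self.x = x
        return 0.5 * (x - self.target) ** 2

    def backward(self, top_grad):
        return top_grad * (self.x - self.target)


class ToyNet:
    def __init__(self, layers):
        self.layers = layers

    def forward(self, x):
        for layer in self.layers:              # bottom to top
            x = layer.forward(x)
        return x                               # output of the last layer (the loss)

    def backward(self):
        grad = 1.0                             # dL/dL
        for layer in reversed(self.layers):    # top to bottom
            grad = layer.backward(grad)
        return grad                            # gradient w.r.t. the net input


net = ToyNet([ScaleLayer(2.0), ScaleLayer(3.0), SquareLossLayer(target=5.0)])
loss = net.forward(1.0)        # 2*1 = 2, then 3*2 = 6, then 0.5*(6-5)^2
input_grad = net.backward()
print(loss, net.layers[0].w_grad, net.layers[1].w_grad, input_grad)
```

The forward pass only produces values; the gradients of the weights appear during the backward pass, which visits the same layers in reverse order.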
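layers:
The layers live in $CAFFE/src/caffe/layers. They are configured through protobuf: the .proto file defines the message types, and a .prototxt or .binaryproto file supplies the message values. Each layer entry carries a name, a type (data/conv/pool…), a connection structure (input blobs and output blobs), and layer-specific parameters (such as the kernel size of a conv layer). Defining a layer means defining its setup, forward, and backward steps.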
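To make the setup / forward / backward split concrete, here is a minimal sketch of a ReLU-like layer in plain Python. It is not Caffe's API: ToyReLULayer and make_blob are hypothetical, and a blob is modelled as a dict holding a "data" list and a "diff" list.

```python
# Hypothetical ReLU layer showing the setup / forward / backward split.
# Blobs are modelled as dicts with "data" and "diff" lists (an assumption).

def make_blob(values):
    return {"data": list(values), "diff": [0.0] * len(values)}


class ToyReLULayer:
    def setup(self, bottom, top):
        # Shape the output blob to match the input blob.
        n = len(bottom["data"])
        top["data"] = [0.0] * n
        top["diff"] = [0.0] * n

    def forward(self, bottom, top):
        # Input blob -> output blob.
        top["data"] = [max(0.0, x) for x in bottom["data"]]

    def backward(self, top, bottom):
        # Output gradient -> input gradient; it passes only where input > 0.
        bottom["diff"] = [g if x > 0 else 0.0
                          for g, x in zip(top["diff"], bottom["data"])]


bottom = make_blob([-1.0, 2.0, 3.0])
top = make_blob([])
layer = ToyReLULayer()
layer.setup(bottom, top)
layer.forward(bottom, top)
top["diff"] = [0.5, 0.5, 0.5]    # pretend gradient arriving from the layer above
layer.backward(top, bottom)
print(top["data"], bottom["diff"])
```

Setup runs once to size the outputs, while forward and backward run on every training iteration.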
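blob.cpp:
Data and derivatives move through the net in 4-dimensional blobs. A layer has many blobs, e.g.:
- For data, the blob size is Number * Channels * Height * Width, e.g. 256 * 3 * 224 * 224;
- For a conv layer, the weight blob size is Output nodes * Input nodes * Height * Width; e.g. the first conv layer of AlexNet has a 96 x 3 x 11 x 11 weight blob;
- For an inner product layer, the weight blob size is 1 * 1 * Output nodes * Input nodes, and the bias blob size is 1 * 1 * 1 * Output nodes. (A conv layer, like an inner product layer, has both a weight and a bias, which is why a network definition shows two blobs_lr values: the first for the weights, the second for the bias. Likewise there are two weight_decay values, one for the weights and one for the bias.)
  Within a blob, cpu/gpu_data() and mutable_cpu/gpu_data() give read-only and writable access to the stored values, and cpu/gpu_diff() and mutable_cpu/gpu_diff() do the same for the derivatives; these accessors also manage the memory, synchronizing between CPU and GPU as needed.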
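The sizes above are easy to check by hand. The sketch below models a 4D blob as one flat, row-major array with a parallel array for the derivatives. ToyBlob is a hypothetical stand-in written in plain Python, not Caffe's Blob class, though the index formula follows the usual N, C, H, W row-major layout.

```python
# Sketch of a 4D blob stored as one flat, row-major array.
# ToyBlob is a hypothetical stand-in, not Caffe's Blob class.

class ToyBlob:
    def __init__(self, num, channels, height, width):
        self.shape = (num, channels, height, width)
        self.count = num * channels * height * width
        self.data = [0.0] * self.count    # stored values
        self.diff = [0.0] * self.count    # derivatives, same shape as data

    def offset(self, n, c, h, w):
        # Flat index of element (n, c, h, w) in row-major order.
        _, C, H, W = self.shape
        return ((n * C + c) * H + h) * W + w


conv1_weights = ToyBlob(96, 3, 11, 11)       # AlexNet first conv layer weights
print(conv1_weights.count)                   # 34848
print(conv1_weights.offset(1, 0, 0, 0))      # 363, i.e. 3 * 11 * 11
print(256 * 3 * 224 * 224)                   # 38535168 values in the data blob
```

Each output filter of the first conv layer therefore occupies 363 consecutive values, and a batch of 256 RGB images at 224 x 224 holds about 38.5 million values.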
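solver.cpp:
The solver combines the loss with the gradients to update the weights. Main functions:
Init(),
Solve(),
ComputeUpdateValue(),
Snapshot(), Restore(), // snapshot (copy) and restore the network state
Test();
solver.cpp provides three solvers, i.e. three classes, to choose from: AdaGradSolver, SGDSolver, and NesterovSolver.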
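The three solvers differ in how a gradient is turned into a weight update. The following sketch shows the textbook form of each rule for a single scalar weight. The function names and signatures are hypothetical, and the exact arithmetic inside Caffe's ComputeUpdateValue() may be arranged differently.

```python
import math

# Textbook update rules for one scalar weight.
# Function names are hypothetical; this is not Caffe's implementation.

def sgd_step(w, grad, velocity, lr, momentum):
    """SGD with momentum: accumulate a velocity, then move along it."""
    velocity = momentum * velocity - lr * grad
    return w + velocity, velocity


def nesterov_step(w, grad, velocity, lr, momentum):
    """Nesterov momentum, in a common 'look-ahead' reformulation."""
    prev = velocity
    velocity = momentum * velocity - lr * grad
    return w - momentum * prev + (1 + momentum) * velocity, velocity


def adagrad_step(w, grad, history, lr, eps=1e-8):
    """AdaGrad: scale the step by the root of the summed squared gradients."""
    history = history + grad * grad
    return w - lr * grad / (math.sqrt(history) + eps), history


# One step of each rule from w = 1.0 with gradient 2.0.
w_sgd, v_sgd = sgd_step(1.0, 2.0, 0.0, lr=0.1, momentum=0.9)
w_nag, v_nag = nesterov_step(1.0, 2.0, 0.0, lr=0.1, momentum=0.9)
w_ada, h_ada = adagrad_step(1.0, 2.0, 0.0, lr=0.1)
print(w_sgd, w_nag, w_ada)
```

SGD and Nesterov carry a velocity between iterations, whereas AdaGrad carries a running sum of squared gradients, which is why each solver keeps its own per-weight history.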
As for the loss, a net can have several losses at the same time, and regularization (L1/L2) can be added.
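A rough sketch of how several losses and a regularization term might combine is given below. The helper names are hypothetical, the per-loss weights are an assumed way of combining the losses, and the L2 penalty is taken to be 0.5 * decay * w^2 so that its gradient is decay * w.

```python
# Hypothetical helpers: weighted sum of several losses, plus the extra
# gradient contributed by an L1 or L2 penalty on the weights.

def total_loss(losses, loss_weights):
    """Combine several loss values into one objective."""
    return sum(weight * loss for loss, weight in zip(losses, loss_weights))


def sign(x):
    return (x > 0) - (x < 0)


def regularized_grad(weights, grads, decay, kind="L2"):
    """Add the regularization gradient to the loss gradient."""
    if kind == "L2":      # penalty 0.5 * decay * w^2, gradient decay * w
        return [g + decay * w for w, g in zip(weights, grads)]
    if kind == "L1":      # penalty decay * |w|, gradient decay * sign(w)
        return [g + decay * sign(w) for w, g in zip(weights, grads)]
    raise ValueError("unknown regularization type: " + kind)


objective = total_loss([1.0, 2.0], [1.0, 0.5])
l2_grad = regularized_grad([2.0, -4.0], [0.0, 0.0], decay=0.5, kind="L2")
l1_grad = regularized_grad([2.0, -4.0], [0.0, 0.0], decay=0.5, kind="L1")
print(objective, l2_grad, l1_grad)
```

L2 pulls each weight toward zero in proportion to its size, while L1 applies a constant pull regardless of magnitude.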
