Caffe: Pretraining and Fine-Tuning

2023-11-07 20:50 • Mobile • 3072 views

1. Borrowing Weights from a Pretrained Network

To borrow the weights of an already trained model, we need to do two things:

Rename our layer to match the name of the original model's layer. Weights are assigned by layer name, so a layer that uses the original network's layer name receives its weights.

For example, if the original model has a layer named ip1, then we should name our layer ip1 as well:

layer {
  name: "ip1"
  type: "InnerProduct"
  bottom: "pool2"
  top: "ip1"
  param {
    lr_mult: 1
  }
  param {
    lr_mult: 2
  }
  inner_product_param {
    num_output: 500
    weight_filler {
      type: "xavier"
    }
    bias_filler {
      type: "constant"
    }
  }
}

Train our new hybrid model, passing the location of the pretrained weights:

caffe train --solver ourSolver.prototxt --weights theirModel.caffemodel
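The same step can also be driven from Python. The following is a minimal sketch, assuming a Caffe build with the pycaffe bindings installed; the function name train_with_borrowed_weights is made up for illustration, and the file names are the ones used in the command above.

```python
def train_with_borrowed_weights(solver_path, weights_path):
    """Load a solver, copy name-matched weights into its net, then train.

    Hypothetical helper for illustration; it needs the pycaffe bindings.
    """
    import caffe  # imported lazily so the module loads without Caffe

    # Build the solver (and its training net) from the solver prototxt.
    solver = caffe.get_solver(solver_path)
    # Copy weights from the .caffemodel into layers with matching names.
    solver.net.copy_from(weights_path)
    # Run the optimization as configured in the solver file.
    solver.solve()
    return solver
```

It would be called as train_with_borrowed_weights("ourSolver.prototxt", "theirModel.caffemodel").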

What About the Other Layers of Our Network?

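The other layers of our network, whose names do not appear in the pretrained model, are initialized just like any brand new layer: according to their weight_filler and bias_filler settings (the default filler is constant with value 0).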
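The name-matching rule can be illustrated with a small stand-alone sketch. It is a simplified model of the behaviour described above, not Caffe's own code: networks are plain dictionaries, and the layer names ip2 and ip2_new, the shapes, and the function copy_weights_by_name are all hypothetical. Caffe aborts when a matching name has a different parameter shape, so the sketch raises an error in that case.

```python
import numpy as np

def copy_weights_by_name(pretrained, target):
    """Copy parameters into `target` for every layer name present in both.

    Both arguments map layer name -> list of arrays, e.g. [weights, bias].
    Returns the names that were copied; other target layers are untouched.
    """
    copied = []
    for name, params in target.items():
        if name not in pretrained:
            continue  # no match: keep whatever the filler produced
        for dst, src in zip(params, pretrained[name]):
            if dst.shape != src.shape:
                raise ValueError(f"shape mismatch in layer {name!r}")
            dst[...] = src  # overwrite in place with pretrained values
        copied.append(name)
    return copied

# Hypothetical pretrained model with layers ip1 and ip2.
pretrained = {
    "ip1": [np.full((500, 800), 0.5), np.full(500, 0.1)],
    "ip2": [np.full((10, 500), 0.25), np.zeros(10)],
}
# Our network reuses the name "ip1" and adds a new layer "ip2_new".
ours = {
    "ip1": [np.zeros((500, 800)), np.zeros(500)],
    "ip2_new": [np.zeros((4, 500)), np.zeros(4)],
}
copied = copy_weights_by_name(pretrained, ours)
print(copied)
```

Only ip1 is copied here; ip2_new keeps its initial values because no pretrained layer has that name.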

2. Fine-Tuning: set a layer's lr_mult to 0 in the prototxt and that layer stops learning

Fine-tuning is the process of continuing to train specific sections of an already trained network to improve results.

Making Layers Not Learn

To stop a layer from learning further, set lr_mult to 0 in each of its param blocks in your prototxt.

For example:

layer {
  name: "example"
  type: "example"
  ...
  param {
    lr_mult: 0    # learning-rate multiplier for the weights; 0 freezes them
    decay_mult: 1
  }
  param {
    lr_mult: 0    # learning-rate multiplier for the bias; 0 freezes it
    decay_mult: 0
  }
}
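To see why a zero multiplier freezes a parameter, consider a simplified update rule. This sketch assumes plain SGD without momentum and L2 weight decay, where the effective rate is the solver's base learning rate times lr_mult and the effective decay is the solver's weight decay times decay_mult. The function sgd_step and all numeric values are illustrative, not taken from Caffe.

```python
def sgd_step(weight, grad, base_lr, lr_mult, weight_decay=0.0, decay_mult=1.0):
    """One plain SGD update with per-parameter multipliers (no momentum)."""
    local_rate = base_lr * lr_mult            # effective learning rate
    local_decay = weight_decay * decay_mult   # effective L2 decay
    # The decay term is scaled by the local rate as well, so a zero
    # lr_mult cancels the whole update even when decay_mult is 1.
    return weight - local_rate * (grad + local_decay * weight)

w = 0.8
frozen = sgd_step(w, grad=0.3, base_lr=0.01, lr_mult=0,
                  weight_decay=0.0005, decay_mult=1)
trained = sgd_step(w, grad=0.3, base_lr=0.01, lr_mult=1,
                   weight_decay=0.0005, decay_mult=1)
print(frozen, trained)
```

With lr_mult set to 0 the parameter is returned unchanged, while lr_mult of 1 moves it against the gradient.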

References:

https://github.com/BVLC/caffe/wiki/Fine-Tuning-or-Training-Certain-Layers-Exclusively

https://github.com/BVLC/caffe/wiki/Borrowing-Weights-from-a-Pretrained-Network
