How do I load a caffe model and convert to a numpy array?

Posted on

Question :

How do I load a caffe model and convert to a numpy array?

I have a caffemodel file that contains layers that are not supported by ethereon’s caffe-tensorflow conversion utility. I would like to generate a numpy representation of my caffemodel.

My question is, how do I convert a caffemodel file (I also have the prototxt, if that is useful) to a numpy file?

Additional info: I have python, caffe with python interfaces etc installed. I am clearly not experienced with caffe.

Asked By: MM.


Answer #1:

Here’s a nice function that converts a caffe net to a python list of dictionaries, so you can pickle it and read it anyway you want:

import caffe

def shai_net_to_py_readable(prototxt_filename, caffemodel_filename):
  net = caffe.Net(prototxt_filename, caffemodel_filename, caffe.TEST) # read the net + weights
  pynet_ = [] 
  for li in xrange(len(net.layers)):  # for each layer in the net
    layer = {}  # store layer's information
    layer['name'] = net._layer_names[li]
    # for each input to the layer (aka "bottom") store its name and shape
    layer['bottoms'] = [(net._blob_names[bi], net.blobs[net._blob_names[bi]].data.shape) 
                         for bi in list(net._bottom_ids(li))] 
    # for each output of the layer (aka "top") store its name and shape
    layer['tops'] = [(net._blob_names[bi], net.blobs[net._blob_names[bi]].data.shape) 
                      for bi in list(net._top_ids(li))]
    layer['type'] = net.layers[li].type  # type of the layer
    # the internal parameters of the layer. not all layers has weights.
    layer['weights'] = [net.layers[li].blobs[bi].data[...] 
                        for bi in xrange(len(net.layers[li].blobs))]
  return pynet_
Answered By: Shai

Leave a Reply

Your email address will not be published. Required fields are marked *