visualization of the conv layer kernels #158

tdomhan · 2014-02-25T11:56:17Z

This is basically an adopted version of the conv layer kernel visualization from: https://code.google.com/p/cuda-convnet/source/browse/trunk/shownet.py

Here's for example the first layer of the imagenet model, that's available for download:

split layers

Add TanH = hyperbolic tangent activation layer (popular for sparse autoencoders).

up when merging, I think)

Generalize architectures to arbitrary DAGs by split layers

Set leveldb options.max_open_files = 100 and Fix BVLC#13 and BVLC#38

remove test code (no longer needed and won't compile)

im2col and col2im learn to pad and padding layer is obsolete

note that pairs of params with the same layer name are the params & bias

kloudkl · 2014-02-25T13:19:07Z

That's great! Until now, we almost have everything that cuda-convnet provided and many more that it did not.

Considering the work in #142, this is better placed in tools/extra and renamed to something like visualize_conv_layer.py.

kloudkl · 2014-02-25T13:24:13Z

python/caffe/convlayervisualization.py

Please take a look at the note about main function by @Yangqing in #107.

Yes, please use a main function. Thank you.

sergeyk · 2014-02-25T18:43:48Z

Pinging @longjon who's been working on related things.

shelhamer · 2014-02-26T08:24:08Z

Please rebase on the latest dev and file this in tools/extras or work it into the imagenet example at examples/imagenet for merge. Thanks!

longjon · 2014-02-26T09:00:52Z

python/caffe/convlayervisualization.py

We should probably import pyplot (e.g., import matplotlib.pyplot as plt) instead of pylab, as the latter is really meant for interactive work (see http://matplotlib.org/faq/usage_faq.html#matplotlib-pylab-and-pyplot-how-are-they-related).

longjon · 2014-02-26T09:03:49Z

It would be nice to follow the docstring convention of the existing Python code (i.e., omit the first blank line), and more generally to follow PEP8 / the Google Python style guide (e.g., stick to an 80 character line width).

longjon · 2014-02-26T09:14:32Z

I'm also slightly concerned about reading the protobuf file using the Python protobuf wrapper -- even though it ought to be just as good as in C++, in practice it's slower, and can fail for very large files (including Alex-sized nets). So, I'd prefer if this code used pycaffe directly to read the net (i.e., by constructing a CaffeNet), and then used that interface to access the parameters; this would also mean this code wouldn't care about the underlying file format. (On the other hand, there is something to be said for being able to examine the net without any compiled dependency; but I think that will be more frustrating than it is worth.)

sergeyk · 2014-02-26T10:36:50Z

I second Jon's suggestions.

Yangqing · 2014-02-27T04:39:28Z

I would like to point out my earlier visualization codes at decaf:

https://github.com/UCB-ICSI-Vision-Group/decaf-release/blob/master/decaf/util/visualize.py

which should have almost everything you need. A visualization in the decaf
days is at:

http://nbviewer.ipython.org/github/UCB-ICSI-Vision-Group/decaf-release/blob/master/decaf/demos/notebooks/lena_imagenet.ipynb

Yangqing

On Wed, Feb 26, 2014 at 2:36 AM, Sergey Karayev [email protected]:

I second Jon's suggestions.

Reply to this email directly or view it on GitHubhttps://github.com//pull/158#issuecomment-36112008
.

tdomhan · 2014-02-27T16:53:20Z

Thanks for the input everyone. I'll pep8-ify the code and move it to tools/extra.

@Yangqing nice! I hadn't seen the decaf visualizations yet.

@longjon have you had bigger problems with the python protobuf. For me everything worked smoothly so far. That is the imagenet model loaded really quickly and I didn't experience any crashes so far.

longjon · 2014-02-28T00:05:15Z

Indeed, it's really slow for me. This is using protoc version 2.5.0. Do you see different behavior?

In [1]: from caffe.proto import caffe_pb2

In [2]: net = caffe_pb2.NetParameter()

In [3]: with open('/u/vis/jonlong/alexnet_train_iter_470000') as f:
   ...:     data = f.read()
   ...:     

In [4]: %time net.ParseFromString(data)
CPU times: user 2min 55s, sys: 1.12 s, total: 2min 56s
Wall time: 2min 56s

In [5]: from caffe import pycaffe

In [6]: %time pycaffe.CaffeNet('/u/vis/jonlong/caffe/examples/imagenet_deploy.prototxt', '/u/vis/jonlong/alexnet_train_iter_470000')
CPU times: user 1.26 s, sys: 313 ms, total: 1.57 s
Wall time: 1.83 s
Out[6]: <caffe.pycaffe.CaffeNet at 0x63da680>

shelhamer · 2014-02-28T00:09:39Z

I too have had terrible luck with python protobuf support.

I recommend reading the params through the python net interface: net.params()[idx].data for param blob at index idx. You might model your contribution after Yangqing's decaf example, which is a nice illustration of the filters and responses.

Yangqing · 2014-02-28T00:51:12Z

Python protobuf is quite slow indeed, esp with large protobuf values. So
for large files you might want to use the C interface unless it's easier to
code up in python - balance between coding time and run time :)

There is a fast python protobuf (essentially wrapper around C) but you may
need to install it separately, AFAIK it does not come with pip.

Yangqing

On Thu, Feb 27, 2014 at 4:09 PM, Evan Shelhamer [email protected]:

I too have had terrible luck with python protobuf support.

I recommend reading the params through the python net interface. You might
model your contribution after Yangqing's decaf example, which is a nice
illustration of the filters and responses.

Reply to this email directly or view it on GitHubhttps://github.com//pull/158#issuecomment-36307978
.

shelhamer · 2014-03-13T18:29:24Z

@tdomhan thanks for the visualization proposal, but due to the implementation details cited in this thread, @longjon will port the DeCAF visualization demo in #207.

tdomhan · 2014-03-15T09:28:15Z

no worries. I'm looking forward to @longjon 's visualizations then.

Use cuDNN routine FindEx to find the best algorithm.

Aravindh Mahendran and others added 30 commits February 18, 2014 00:49

Added tanh activation function layer.

68f51c8

Added a test for the tanh layer.

9a45a0a

add SplitLayer and Net::AddSplits to transform shared bottom blobs into

207fd71

split layers

make split_layer backward obey propagate_down

08bbf6a

add split layer tests

36db05f

add split layer tests

b2a78f6

add split layer insertion tests; move split insertion code to util file

0082ac4

some cleanup

9452a31

Merge pull request BVLC#116 from aravindhm/tanh

2fce080

Add TanH = hyperbolic tangent activation layer (popular for sparse autoencoders).

eliminate redundant code with get_split_blob_name method

ec19814

change \n's to less distracting spaces in hard-coded proto strings

10d7869

give first top split blob same name as bottom blob

26414dd

allow in place computation of SplitLayer 0th top blob

98180ff

change \" in test_split_layer to ' for readability

e948da1

remove pointlessly duplicated CheckGradientExhaustive calls (I screwed

6dc6f1a

up when merging, I think)

fix comment typo

6e8d332

add test for layer with two tops that are inputs to multiple layers

1800802

remove redundant add_bottom (immediately cleared and then re-added)

f77e25a

get_split_blob_name returns a string to remove some verbosity

03c57f6

get rid of messy snprintf string concatenation

e9228ce

eliminate some cruft by relying on std::map default initializations

fce8f71

remove unnecessary include

2fb43b4

fix split layer insertion bug with in-place layers

f757b6d

add imagenet no split insertion test

bf5e75e

add idempotence test

bb8e0cd

minor cleanup; only get blob_name if needed

26630fe

Merge pull request BVLC#129 from jeffdonahue/dags-by-split

5792f44

Generalize architectures to arbitrary DAGs by split layers

Set leveldb options.max_open_files = 100. Fix BVLC#13 and BVLC#38

9c5d807

Merge pull request BVLC#154 from sguada/leveldb_max_open_files

ee00953

Set leveldb options.max_open_files = 100 and Fix BVLC#13 and BVLC#38

implemented padding aware im2col and col2im functions

995351d

mavenlin and others added 10 commits February 25, 2014 13:00

add test code to test the padding aware im2col col2im functions

10488de

add code to measure timing

5e56898

remove the pad=0 case in conv_layer and im2col_layer

871d966

remove padding layers in imagenet definitions

dbab483

unified to padding aware version

07fabad

remove test code (no longer needed and won't compile)

remove padding_layer and its test

91943d1

remove cuda_timer as is no longer needed

ab820d7

Merge pull request BVLC#128 from mavenlin/pad-im2col

ae56141

im2col and col2im learn to pad and padding layer is obsolete

name blobs and params for their layers in python wrapper

37f340b

note that pairs of params with the same layer name are the params & bias

visualization of the conv layer kernels

bfa4d5a

kloudkl reviewed Feb 25, 2014
View reviewed changes

sergeyk assigned longjon Feb 25, 2014

shelhamer added the enhancement label Feb 25, 2014

longjon reviewed Feb 26, 2014
View reviewed changes

shelhamer closed this Mar 13, 2014

lukeyeager pushed a commit to lukeyeager/caffe that referenced this pull request Jun 8, 2016

Merge pull request BVLC#158 from pooyadavoodi/caffe-0.15

4ff8e07

Use cuDNN routine FindEx to find the best algorithm.

visualization of the conv layer kernels #158

visualization of the conv layer kernels #158

Uh oh!

Conversation

tdomhan commented Feb 25, 2014

Uh oh!

kloudkl commented Feb 25, 2014

Uh oh!

kloudkl Feb 25, 2014

Choose a reason for hiding this comment

Uh oh!

shelhamer Feb 25, 2014

Choose a reason for hiding this comment

Uh oh!

sergeyk commented Feb 25, 2014

Uh oh!

shelhamer commented Feb 26, 2014

Uh oh!

longjon Feb 26, 2014

Choose a reason for hiding this comment

Uh oh!

longjon commented Feb 26, 2014

Uh oh!

longjon commented Feb 26, 2014

Uh oh!

sergeyk commented Feb 26, 2014

Uh oh!

Yangqing commented Feb 27, 2014

Uh oh!

tdomhan commented Feb 27, 2014

Uh oh!

longjon commented Feb 28, 2014

Uh oh!

shelhamer commented Feb 28, 2014

Uh oh!

Yangqing commented Feb 28, 2014

Uh oh!

shelhamer commented Mar 13, 2014

Uh oh!

tdomhan commented Mar 15, 2014

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

9 participants