
Conversation

@tdomhan (Contributor) commented Feb 13, 2014

I added a sigmoid layer.

@kloudkl (Contributor) commented Feb 13, 2014

Several recent studies all recommend using ReLU. My own micro-benchmark experiments reach a similar conclusion: ReLU is not only several times faster, it also often produces better accuracy (a sketch of such a benchmark follows the references below).

[1] Krizhevsky, A., Sutskever, I., and Hinton, G. E. ImageNet Classification with Deep Convolutional Neural Networks. In NIPS 2012, Lake Tahoe, Nevada.
[2] Dahl, G. E., Sainath, T. N., and Hinton, G. E. Improving Deep Neural Networks for LVCSR Using Rectified Linear Units and Dropout. In ICASSP 2013.
[3] Sainath, T. N., Kingsbury, B., Mohamed, A., Dahl, G. E., Saon, G., Soltau, H., Beran, T., Aravkin, A. Y., and Ramabhadran, B. Improvements to Deep Convolutional Neural Networks for LVCSR. In ASRU 2013.
[4] Zeiler, M. D., Ranzato, M., Monga, R., Mao, M., Yang, K., Le, Q. V., Nguyen, P., Senior, A., Vanhoucke, V., Dean, J., and Hinton, G. E. On Rectified Linear Units for Speech Processing. In ICASSP 2013, Vancouver.
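
For what it's worth, here is a minimal sketch of the kind of single-threaded micro-benchmark meant above; the array size, input values, and timing scheme are illustrative, not the exact experiments:

#include <algorithm>
#include <chrono>
#include <cmath>
#include <iostream>
#include <vector>

int main() {
  const size_t n = 10 * 1000 * 1000;  // illustrative array size
  std::vector<float> in(n), out(n);
  for (size_t i = 0; i < n; ++i)
    in[i] = 0.001f * static_cast<float>(i % 2000) - 1.f;  // inputs in [-1, 1)

  // Time an elementwise sigmoid pass.
  auto t0 = std::chrono::steady_clock::now();
  for (size_t i = 0; i < n; ++i)
    out[i] = 1.f / (1.f + std::exp(-in[i]));
  auto t1 = std::chrono::steady_clock::now();
  float sig_sum = 0.f;
  for (size_t i = 0; i < n; ++i) sig_sum += out[i];  // keep the result live

  // Time an elementwise ReLU pass.
  auto t2 = std::chrono::steady_clock::now();
  for (size_t i = 0; i < n; ++i)
    out[i] = std::max(in[i], 0.f);
  auto t3 = std::chrono::steady_clock::now();
  float relu_sum = 0.f;
  for (size_t i = 0; i < n; ++i) relu_sum += out[i];

  using ms = std::chrono::duration<double, std::milli>;
  std::cout << "sigmoid: " << ms(t1 - t0).count() << " ms (sum " << sig_sum << ")\n"
            << "relu:    " << ms(t3 - t2).count() << " ms (sum " << relu_sum << ")\n";
  return 0;
}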

Member

Should put your copyright :)

Member

I am not sure about this, but could this lead to errors when, for example, x is a very large negative number? An if statement that handles the x > 0 and x < 0 cases separately might help (although it may hurt performance).
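
For reference, a minimal sketch of the branching variant being suggested (the name stable_sigmoid and the test values are just illustrative): both branches only ever pass a non-positive argument to exp(), so the intermediate value cannot overflow.

#include <cmath>
#include <iostream>

inline double stable_sigmoid(double x) {
  if (x >= 0) {
    return 1. / (1. + std::exp(-x));  // exp argument is <= 0
  } else {
    double e = std::exp(x);  // x < 0, so e is in (0, 1)
    return e / (1. + e);     // algebraically equal to 1 / (1 + exp(-x))
  }
}

int main() {
  std::cout << stable_sigmoid(-1000.) << " " << stable_sigmoid(0.) << " "
            << stable_sigmoid(1000.) << std::endl;  // prints 0 0.5 1
}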

Contributor Author

At least empirically we should be fine. I ran the following program:

#include <cmath>
#include <iostream>
#include <limits>

// Evaluate the sigmoid and print the result.
inline double sigmoid(double x) {
  double val = 1. / (1. + exp(-x));
  std::cout << "f(" << x << ") = " << val << std::endl;
  return val;
}

int main() {
  sigmoid(std::numeric_limits<double>::max());
  sigmoid(-std::numeric_limits<double>::max());
  sigmoid(std::numeric_limits<double>::min());
  sigmoid(0);
}

Which results in:

f(1.79769e+308) = 1
f(-1.79769e+308) = 0
f(2.22507e-308) = 0.5
f(0) = 0.5

Which is all as expected: for the large negative input, exp(-x) overflows to +inf and 1. / (1. + inf) evaluates to exactly 0 under IEEE 754 arithmetic, so no NaN shows up.

@Yangqing (Member)

It'll be nice to have a sigmoid layer in Caffe. The current MNIST LeNet example, if you look at it closely, is actually not LeNet but LeNet with the sigmoid replaced by ReLU. Having a SigmoidLayer would allow us to match standard baselines more strictly. I'll pull when the minor comments are addressed.

@tdomhan (Contributor Author) commented Feb 13, 2014

I agree. ReLU is probably more useful in practice, but it's always nice to have options to compare against.
I added it because I'm interested in multi-label classification, for which I would use a sigmoid as the final layer.
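
As an illustration (not part of this PR), a minimal sketch of why an elementwise sigmoid fits multi-label classification: each output acts as an independent probability, so several labels can be predicted at once, unlike softmax whose outputs sum to 1. The scores and the 0.5 threshold below are made-up examples.

#include <cmath>
#include <iostream>
#include <vector>

int main() {
  std::vector<double> scores = {2.0, -1.5, 0.3};  // one score per label
  for (size_t i = 0; i < scores.size(); ++i) {
    double p = 1. / (1. + std::exp(-scores[i]));  // independent per-label probability
    std::cout << "label " << i << ": p = " << p
              << (p > 0.5 ? "  (predicted present)" : "") << "\n";
  }
  return 0;
}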

Yangqing added a commit that referenced this pull request Feb 13, 2014
@Yangqing merged commit 89a0e8e into BVLC:master on Feb 13, 2014
@Yangqing (Member)

Thanks for taking care of this! Merged.

mitmul pushed a commit to mitmul/caffe that referenced this pull request Sep 30, 2014
cypof pushed a commit that referenced this pull request Sep 19, 2017