CPU info:
    CPU Model Name: Intel(R) Xeon(R) CPU E5-2630 v2 @ 2.60GHz
    Hardware threads: 24
    Total Memory: 268381192 kB
-------------------------------------------------------------------
=== Running /cygdrive/c/jenkins/workspace/CNTK-Test-Windows-W1/x64/release/cntk.exe configFile=C:\jenkins\workspace\CNTK-Test-Windows-W1\Examples\Image\GettingStarted/01_OneHidden.cntk currentDirectory=C:\Users\svcphil\AppData\Local\Temp\cntk-test-20160922162518.374503\Examples\Image\GettingStarted_01_OneHidden@release_gpu\TestData RunDir=C:\Users\svcphil\AppData\Local\Temp\cntk-test-20160922162518.374503\Examples\Image\GettingStarted_01_OneHidden@release_gpu DataDir=C:\Users\svcphil\AppData\Local\Temp\cntk-test-20160922162518.374503\Examples\Image\GettingStarted_01_OneHidden@release_gpu\TestData ConfigDir=C:\jenkins\workspace\CNTK-Test-Windows-W1\Examples\Image\GettingStarted OutputDir=C:\Users\svcphil\AppData\Local\Temp\cntk-test-20160922162518.374503\Examples\Image\GettingStarted_01_OneHidden@release_gpu DeviceId=0 timestamping=true forceDeterministicAlgorithms=true stderr=- trainNetwork=[SGD=[maxEpochs=3]]
CNTK 1.7+ (HEAD 216029, Sep 22 2016 16:13:35) on DPHAIM-22 at 2016/09/22 16:26:44

C:\jenkins\workspace\CNTK-Test-Windows-W1\x64\release\cntk.exe  configFile=C:\jenkins\workspace\CNTK-Test-Windows-W1\Examples\Image\GettingStarted/01_OneHidden.cntk  currentDirectory=C:\Users\svcphil\AppData\Local\Temp\cntk-test-20160922162518.374503\Examples\Image\GettingStarted_01_OneHidden@release_gpu\TestData  RunDir=C:\Users\svcphil\AppData\Local\Temp\cntk-test-20160922162518.374503\Examples\Image\GettingStarted_01_OneHidden@release_gpu  DataDir=C:\Users\svcphil\AppData\Local\Temp\cntk-test-20160922162518.374503\Examples\Image\GettingStarted_01_OneHidden@release_gpu\TestData  ConfigDir=C:\jenkins\workspace\CNTK-Test-Windows-W1\Examples\Image\GettingStarted  OutputDir=C:\Users\svcphil\AppData\Local\Temp\cntk-test-20160922162518.374503\Examples\Image\GettingStarted_01_OneHidden@release_gpu  DeviceId=0  timestamping=true  forceDeterministicAlgorithms=true  stderr=-  trainNetwork=[SGD=[maxEpochs=3]]
Changed current directory to C:\Users\svcphil\AppData\Local\Temp\cntk-test-20160922162518.374503\Examples\Image\GettingStarted_01_OneHidden@release_gpu\TestData
09/22/2016 16:26:47: Redirecting stderr to file -_trainNetwork_testNetwork.log
09/22/2016 16:26:47: -------------------------------------------------------------------
09/22/2016 16:26:47: Build info: 

09/22/2016 16:26:47: 		Built time: Sep 22 2016 16:13:35
09/22/2016 16:26:47: 		Last modified date: Thu Sep 22 13:24:23 2016
09/22/2016 16:26:47: 		Build type: Release
09/22/2016 16:26:47: 		Build target: GPU
09/22/2016 16:26:47: 		Math lib: mkl
09/22/2016 16:26:47: 		CUDA_PATH: C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v7.5
09/22/2016 16:26:47: 		CUB_PATH: C:\src\cub-1.4.1
09/22/2016 16:26:47: 		CUDNN_PATH: c:\NVIDIA\cudnn-5.1\cuda
09/22/2016 16:26:47: 		Build Branch: HEAD
09/22/2016 16:26:47: 		Build SHA1: 216029bfedd92253fd45034da1d1cc68c4d4c7f1
09/22/2016 16:26:47: 		Built by svcphil on liana-08-w
09/22/2016 16:26:47: 		Build Path: c:\jenkins\workspace\CNTK-Build-Windows\Source\CNTK\
09/22/2016 16:26:47: -------------------------------------------------------------------
09/22/2016 16:26:49: -------------------------------------------------------------------
09/22/2016 16:26:49: GPU info:

09/22/2016 16:26:49: 		Device[0]: cores = 2880; computeCapability = 3.5; type = "GeForce GTX 780 Ti"; memory = 3072 MB
09/22/2016 16:26:49: 		Device[1]: cores = 2880; computeCapability = 3.5; type = "GeForce GTX 780 Ti"; memory = 3072 MB
09/22/2016 16:26:49: 		Device[2]: cores = 2880; computeCapability = 3.5; type = "GeForce GTX 780 Ti"; memory = 3072 MB
09/22/2016 16:26:49: 		Device[3]: cores = 2880; computeCapability = 3.5; type = "GeForce GTX 780 Ti"; memory = 3072 MB
09/22/2016 16:26:49: -------------------------------------------------------------------

Configuration After Processing and Variable Resolution:

configparameters: 01_OneHidden.cntk:command=trainNetwork:testNetwork
configparameters: 01_OneHidden.cntk:ConfigDir=C:\jenkins\workspace\CNTK-Test-Windows-W1\Examples\Image\GettingStarted
configparameters: 01_OneHidden.cntk:currentDirectory=C:\Users\svcphil\AppData\Local\Temp\cntk-test-20160922162518.374503\Examples\Image\GettingStarted_01_OneHidden@release_gpu\TestData
configparameters: 01_OneHidden.cntk:dataDir=C:\Users\svcphil\AppData\Local\Temp\cntk-test-20160922162518.374503\Examples\Image\GettingStarted_01_OneHidden@release_gpu\TestData
configparameters: 01_OneHidden.cntk:deviceId=0
configparameters: 01_OneHidden.cntk:forceDeterministicAlgorithms=true
configparameters: 01_OneHidden.cntk:modelPath=C:\Users\svcphil\AppData\Local\Temp\cntk-test-20160922162518.374503\Examples\Image\GettingStarted_01_OneHidden@release_gpu/Models/01_OneHidden
configparameters: 01_OneHidden.cntk:outputDir=C:\Users\svcphil\AppData\Local\Temp\cntk-test-20160922162518.374503\Examples\Image\GettingStarted_01_OneHidden@release_gpu
configparameters: 01_OneHidden.cntk:precision=float
configparameters: 01_OneHidden.cntk:rootDir=..
configparameters: 01_OneHidden.cntk:RunDir=C:\Users\svcphil\AppData\Local\Temp\cntk-test-20160922162518.374503\Examples\Image\GettingStarted_01_OneHidden@release_gpu
configparameters: 01_OneHidden.cntk:stderr=-
configparameters: 01_OneHidden.cntk:testNetwork={
    action = "test"
    minibatchSize = 1024
    reader = {
        readerType = "CNTKTextFormatReader"
        file = "C:\Users\svcphil\AppData\Local\Temp\cntk-test-20160922162518.374503\Examples\Image\GettingStarted_01_OneHidden@release_gpu\TestData/Test-28x28_cntk_text.txt"
        input = {
            features = { dim = 784 ; format = "dense" }
            labels =   { dim = 10  ; format = "dense" }
        }
    }
}

configparameters: 01_OneHidden.cntk:timestamping=true
configparameters: 01_OneHidden.cntk:traceLevel=1
configparameters: 01_OneHidden.cntk:trainNetwork={
    action = "train"
    BrainScriptNetworkBuilder = {
        imageShape = 28:28:1
        labelDim = 10
        featScale = 1/256
        Scale{f} = x => Constant(f) .* x
        model = Sequential (
            Scale {featScale} :
            DenseLayer {200} : ReLU : 
            LinearLayer {labelDim}
        )
        features = Input {imageShape}
        labels = Input (labelDim)
        ol = model (features)
        ce   = CrossEntropyWithSoftmax (labels, ol)
        errs = ClassificationError (labels, ol)
        featureNodes    = (features)
        labelNodes      = (labels)
        criterionNodes  = (ce)
        evaluationNodes = (errs)
        outputNodes     = (ol)
    }
    SGD = {
        epochSize = 60000
        minibatchSize = 64
        maxEpochs = 10
        learningRatesPerSample = 0.01*5:0.005
        momentumAsTimeConstant = 0
        numMBsToShowResult = 500
    }
    reader = {
        readerType = "CNTKTextFormatReader"
        file = "C:\Users\svcphil\AppData\Local\Temp\cntk-test-20160922162518.374503\Examples\Image\GettingStarted_01_OneHidden@release_gpu\TestData/Train-28x28_cntk_text.txt"
        input = {
            features = { dim = 784 ; format = "dense" }
            labels =   { dim = 10  ; format = "dense" }
        }
    }   
} [SGD=[maxEpochs=3]]

09/22/2016 16:26:49: Commands: trainNetwork testNetwork
09/22/2016 16:26:49: precision = "float"

09/22/2016 16:26:49: ##############################################################################
09/22/2016 16:26:49: #                                                                            #
09/22/2016 16:26:49: # trainNetwork command (train action)                                        #
09/22/2016 16:26:49: #                                                                            #
09/22/2016 16:26:49: ##############################################################################

09/22/2016 16:26:49: 
Creating virgin network.
Node '<placeholder>' (LearnableParameter operation): Initializating Parameter[10 x 0] as glorotUniform later when dimensions are fully known.
Node '<placeholder>' (LearnableParameter operation): Initializating Parameter[200 x 0] as glorotUniform later when dimensions are fully known.

Post-processing network...

3 roots:
	ce = CrossEntropyWithSoftmax()
	errs = ClassificationError()
	ol = Plus()

Validating network. 15 nodes to process in pass 1.

Validating --> labels = InputValue() :  -> [10 x *]
Validating --> model.arrayOfFunctions[3].W = LearnableParameter() :  -> [10 x 0]
Validating --> model.arrayOfFunctions[1].arrayOfFunctions[0].W = LearnableParameter() :  -> [200 x 0]
Validating --> ol.x._.x.ElementTimesArgs[0] = LearnableParameter() :  -> [1 x 1]
Validating --> features = InputValue() :  -> [28 x 28 x 1 x *]
Validating --> _ol.x._.x = ElementTimes (ol.x._.x.ElementTimesArgs[0], features) : [1 x 1], [28 x 28 x 1 x *] -> [28 x 28 x 1 x *]
Node 'model.arrayOfFunctions[1].arrayOfFunctions[0].W' (LearnableParameter operation) operation: Tensor shape was inferred as [200 x 28 x 28 x 1].
Node 'model.arrayOfFunctions[1].arrayOfFunctions[0].W' (LearnableParameter operation): Initializing Parameter[200 x 28 x 28 x 1] <- glorotUniform(seed=2, init dims=[200 x 784], range=0.078087*1.000000, onCPU=true) { -0.03351783, ... }
.
Validating --> ol.x._.x.PlusArgs[0] = Times (model.arrayOfFunctions[1].arrayOfFunctions[0].W, _ol.x._.x) : [200 x 28 x 28 x 1], [28 x 28 x 1 x *] -> [200 x *]
Validating --> model.arrayOfFunctions[1].arrayOfFunctions[0].b = LearnableParameter() :  -> [200]
Validating --> ol.x._.x = Plus (ol.x._.x.PlusArgs[0], model.arrayOfFunctions[1].arrayOfFunctions[0].b) : [200 x *], [200] -> [200 x *]
Validating --> ol.x = RectifiedLinear (ol.x._.x) : [200 x *] -> [200 x *]
Node 'model.arrayOfFunctions[3].W' (LearnableParameter operation) operation: Tensor shape was inferred as [10 x 200].
Node 'model.arrayOfFunctions[3].W' (LearnableParameter operation): Initializing Parameter[10 x 200] <- glorotUniform(seed=1, init dims=[10 x 200], range=0.169031*1.000000, onCPU=true) { -0.12079262, ... }
.
Validating --> ol.PlusArgs[0] = Times (model.arrayOfFunctions[3].W, ol.x) : [10 x 200], [200 x *] -> [10 x *]
Validating --> model.arrayOfFunctions[3].b = LearnableParameter() :  -> [10]
Validating --> ol = Plus (ol.PlusArgs[0], model.arrayOfFunctions[3].b) : [10 x *], [10] -> [10 x *]
Validating --> ce = CrossEntropyWithSoftmax (labels, ol) : [10 x *], [10 x *] -> [1]
Validating --> errs = ClassificationError (labels, ol) : [10 x *], [10 x *] -> [1]

Validating network. 8 nodes to process in pass 2.


Validating network, final pass.




Post-processing network complete.

09/22/2016 16:26:50: 
Model has 15 nodes. Using GPU 0.

09/22/2016 16:26:50: Training criterion:   ce = CrossEntropyWithSoftmax
09/22/2016 16:26:50: Evaluation criterion: errs = ClassificationError


Allocating matrices for forward and/or backward propagation.

Memory Sharing: Out of 25 matrices, 10 are shared as 5, and 15 are not shared.

	{ model.arrayOfFunctions[1].arrayOfFunctions[0].W : [200 x 28 x 28 x 1] (gradient)
	  ol.x._.x : [200 x *] }
	{ ol.x : [200 x *]
	  ol.x._.x.PlusArgs[0] : [200 x *] (gradient) }
	{ model.arrayOfFunctions[1].arrayOfFunctions[0].b : [200] (gradient)
	  ol.x : [200 x *] (gradient) }
	{ model.arrayOfFunctions[3].W : [10 x 200] (gradient)
	  ol : [10 x *] (gradient) }
	{ ol.PlusArgs[0] : [10 x *]
	  ol.x._.x : [200 x *] (gradient) }


09/22/2016 16:26:50: Training 159010 parameters in 4 out of 4 parameter tensors and 10 nodes with gradient:

09/22/2016 16:26:50: 	Node 'model.arrayOfFunctions[1].arrayOfFunctions[0].W' (LearnableParameter operation) : [200 x 28 x 28 x 1]
09/22/2016 16:26:50: 	Node 'model.arrayOfFunctions[1].arrayOfFunctions[0].b' (LearnableParameter operation) : [200]
09/22/2016 16:26:50: 	Node 'model.arrayOfFunctions[3].W' (LearnableParameter operation) : [10 x 200]
09/22/2016 16:26:50: 	Node 'model.arrayOfFunctions[3].b' (LearnableParameter operation) : [10]

09/22/2016 16:26:50: No PreCompute nodes found, or all already computed. Skipping pre-computation step.

09/22/2016 16:26:50: Starting Epoch 1: learning rate per sample = 0.010000  effective momentum = 0.000000  momentum as time constant = 0.0 samples

09/22/2016 16:26:50: Starting minibatch loop.
09/22/2016 16:26:52:  Epoch[ 1 of 3]-Minibatch[   1- 500, 53.33%]: ce = 0.30956033 * 32000; errs = 9.431% * 32000; time = 2.2384s; samplesPerSecond = 14296.2
09/22/2016 16:26:53: Finished Epoch[ 1 of 3]: [Training] ce = 0.22936094 * 60000; errs = 6.970% * 60000; totalSamplesSeen = 60000; learningRatePerSample = 0.0099999998; epochTime=2.81153s
09/22/2016 16:26:53: SGD: Saving checkpoint model 'C:\Users\svcphil\AppData\Local\Temp\cntk-test-20160922162518.374503\Examples\Image\GettingStarted_01_OneHidden@release_gpu/Models/01_OneHidden.1'

09/22/2016 16:26:53: Starting Epoch 2: learning rate per sample = 0.010000  effective momentum = 0.000000  momentum as time constant = 0.0 samples

09/22/2016 16:26:53: Starting minibatch loop.
09/22/2016 16:26:54:  Epoch[ 2 of 3]-Minibatch[   1- 500, 53.33%]: ce = 0.09488918 * 32000; errs = 2.850% * 32000; time = 0.6253s; samplesPerSecond = 51174.2
09/22/2016 16:26:54: Finished Epoch[ 2 of 3]: [Training] ce = 0.09379909 * 60000; errs = 2.883% * 60000; totalSamplesSeen = 120000; learningRatePerSample = 0.0099999998; epochTime=1.18173s
09/22/2016 16:26:54: SGD: Saving checkpoint model 'C:\Users\svcphil\AppData\Local\Temp\cntk-test-20160922162518.374503\Examples\Image\GettingStarted_01_OneHidden@release_gpu/Models/01_OneHidden.2'

09/22/2016 16:26:54: Starting Epoch 3: learning rate per sample = 0.010000  effective momentum = 0.000000  momentum as time constant = 0.0 samples

09/22/2016 16:26:54: Starting minibatch loop.
09/22/2016 16:26:55:  Epoch[ 3 of 3]-Minibatch[   1- 500, 53.33%]: ce = 0.06256667 * 32000; errs = 1.972% * 32000; time = 0.6282s; samplesPerSecond = 50940.6
09/22/2016 16:26:55: Finished Epoch[ 3 of 3]: [Training] ce = 0.06414283 * 60000; errs = 1.988% * 60000; totalSamplesSeen = 180000; learningRatePerSample = 0.0099999998; epochTime=1.18468s
09/22/2016 16:26:55: SGD: Saving checkpoint model 'C:\Users\svcphil\AppData\Local\Temp\cntk-test-20160922162518.374503\Examples\Image\GettingStarted_01_OneHidden@release_gpu/Models/01_OneHidden'

09/22/2016 16:26:55: Action "train" complete.


09/22/2016 16:26:55: ##############################################################################
09/22/2016 16:26:55: #                                                                            #
09/22/2016 16:26:55: # testNetwork command (test action)                                          #
09/22/2016 16:26:55: #                                                                            #
09/22/2016 16:26:55: ##############################################################################


Post-processing network...

3 roots:
	ce = CrossEntropyWithSoftmax()
	errs = ClassificationError()
	ol = Plus()

Validating network. 15 nodes to process in pass 1.

Validating --> labels = InputValue() :  -> [10 x *1]
Validating --> model.arrayOfFunctions[3].W = LearnableParameter() :  -> [10 x 200]
Validating --> model.arrayOfFunctions[1].arrayOfFunctions[0].W = LearnableParameter() :  -> [200 x 28 x 28 x 1]
Validating --> ol.x._.x.ElementTimesArgs[0] = LearnableParameter() :  -> [1 x 1]
Validating --> features = InputValue() :  -> [28 x 28 x 1 x *1]
Validating --> _ol.x._.x = ElementTimes (ol.x._.x.ElementTimesArgs[0], features) : [1 x 1], [28 x 28 x 1 x *1] -> [28 x 28 x 1 x *1]
Validating --> ol.x._.x.PlusArgs[0] = Times (model.arrayOfFunctions[1].arrayOfFunctions[0].W, _ol.x._.x) : [200 x 28 x 28 x 1], [28 x 28 x 1 x *1] -> [200 x *1]
Validating --> model.arrayOfFunctions[1].arrayOfFunctions[0].b = LearnableParameter() :  -> [200]
Validating --> ol.x._.x = Plus (ol.x._.x.PlusArgs[0], model.arrayOfFunctions[1].arrayOfFunctions[0].b) : [200 x *1], [200] -> [200 x *1]
Validating --> ol.x = RectifiedLinear (ol.x._.x) : [200 x *1] -> [200 x *1]
Validating --> ol.PlusArgs[0] = Times (model.arrayOfFunctions[3].W, ol.x) : [10 x 200], [200 x *1] -> [10 x *1]
Validating --> model.arrayOfFunctions[3].b = LearnableParameter() :  -> [10]
Validating --> ol = Plus (ol.PlusArgs[0], model.arrayOfFunctions[3].b) : [10 x *1], [10] -> [10 x *1]
Validating --> ce = CrossEntropyWithSoftmax (labels, ol) : [10 x *1], [10 x *1] -> [1]
Validating --> errs = ClassificationError (labels, ol) : [10 x *1], [10 x *1] -> [1]

Validating network. 8 nodes to process in pass 2.


Validating network, final pass.




Post-processing network complete.

evalNodeNames are not specified, using all the default evalnodes and training criterion nodes.


Allocating matrices for forward and/or backward propagation.

Memory Sharing: Out of 15 matrices, 0 are shared as 0, and 15 are not shared.


09/22/2016 16:26:56: Minibatch[1-10]: errs = 2.560% * 10000; ce = 0.08479329 * 10000
09/22/2016 16:26:56: Final Results: Minibatch[1-10]: errs = 2.560% * 10000; ce = 0.08479329 * 10000; perplexity = 1.08849204

09/22/2016 16:26:56: Action "test" complete.

09/22/2016 16:26:56: __COMPLETED__
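
Note on derived figures: three numbers reported in this log can be recomputed from other values in the same log. The sketch below does so under stated assumptions. It assumes that the glorotUniform "range" is the usual Glorot bound sqrt(6 / (fan_in + fan_out)) and that "perplexity" is exp(ce). Both assumptions agree with the logged values but are not taken from the CNTK source. The function names are hypothetical helpers, not CNTK APIs.

```python
import math

# Parameter tensor shapes copied from the "Training 159010 parameters" section.
PARAMETER_SHAPES = {
    "model.arrayOfFunctions[1].arrayOfFunctions[0].W": (200, 28, 28, 1),
    "model.arrayOfFunctions[1].arrayOfFunctions[0].b": (200,),
    "model.arrayOfFunctions[3].W": (10, 200),
    "model.arrayOfFunctions[3].b": (10,),
}


def parameter_count(shapes):
    """Total number of scalar parameters across all tensors."""
    return sum(math.prod(shape) for shape in shapes.values())


def glorot_uniform_range(fan_out, fan_in):
    """Assumed bound: sqrt(6 / (fan_in + fan_out)), the standard Glorot form."""
    return math.sqrt(6.0 / (fan_in + fan_out))


def perplexity(cross_entropy):
    """Assumed relation: perplexity = exp(average cross-entropy in nats)."""
    return math.exp(cross_entropy)


if __name__ == "__main__":
    print(parameter_count(PARAMETER_SHAPES))        # log reports 159010
    print(round(glorot_uniform_range(200, 784), 6))  # log reports range=0.078087
    print(round(glorot_uniform_range(10, 200), 6))   # log reports range=0.169031
    print(round(perplexity(0.08479329), 8))          # log reports 1.08849204
```

If a future run changes the layer sizes or the final ce, the same helpers can be pointed at the new values to check that the log is internally consistent.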
