
Multilayer perceptron - backpropagation
<p>I have a school project to program a multilayer perceptron that classifies data into three classes. I have implemented the backpropagation algorithm from <a href="http://home.agh.edu.pl/~vlsi/AI/backp_t_en/backprop.html" rel="nofollow">http://home.agh.edu.pl/~vlsi/AI/backp_t_en/backprop.html</a>. I have checked my algorithm (by manually calculating each step of backpropagation) against the steps explained there, and it matches.</p> <p>For classification I use one-hot coding; my inputs are vectors with 2 values, and there are three output neurons (one per class). After each epoch I shuffle the input data. For the activation I use the sigmoid function. I tried to implement softmax too, but I could not find what the softmax derivative looks like. Is the softmax derivative needed for adjusting the weights? To check whether the network classified an input correctly, I compare whether the index of the output neuron with the maximal output corresponds to the index of the current input's one-hot vector that equals 1.</p> <p>But my implementation does not train the network. I have been working on this and debugging for several days, and searching the internet for what I am doing wrong, but I have not found an answer. I really don't know where I am making a mistake. My neural network trains successfully when I have 10 inputs, but with 100, 200, 400, or 800 inputs it starts cycling once about half of the inputs are classified correctly. As I said, my backpropagation algorithm itself is correct.
The whole C++ project (Visual Studio 2010, with the input files) is here: <a href="http://www.st.fmph.uniba.sk/~vajda10/mlp.zip" rel="nofollow">http://www.st.fmph.uniba.sk/~vajda10/mlp.zip</a></p> <p>Structures:</p> <pre><code>struct input {
    vector&lt;double&gt; x;
    vector&lt;double&gt; cls;
};

struct neuron {
    double output;
    double error;
    neuron(double o, double e): output(o), error(e) { };
};
</code></pre> <p>Global variables:</p> <pre><code>double alpha = 0.5;
vector&lt;vector&lt;input&gt;&gt; data;
vector&lt;vector&lt;neuron&gt;&gt; hiddenNeurons;
vector&lt;neuron&gt; outputNeurons;
vector&lt;vector&lt;vector&lt;double&gt;&gt;&gt; weights;
</code></pre> <p>Here is my code for the backpropagation algorithm:</p> <pre><code>for (int b = 0; b &lt; data[0].size(); b++) {
    // calculate output of hidden neurons
    for (int i = 0; i &lt; hiddenNeurons.size(); i++) {
        for (int j = 0; j &lt; hiddenNeurons[i].size(); j++) {
            double activation = neuronActivation(0, b, i, j);
            hiddenNeurons[i][j].output = sigmoid(activation);
        }
    }

    double partError = 0;
    // calculate output and errors on output neurons
    for (int k = 0; k &lt; outputNeurons.size(); k++) {
        double activation = neuronActivation(0, b, hiddenNeurons.size(), k);
        outputNeurons[k].output = sigmoid(activation);
        outputNeurons[k].error = data[0][b].cls[k] - outputNeurons[k].output;
        partError += pow(outputNeurons[k].error, 2);
    }
    error += sqrt(partError)/outputNeurons.size();

    // if classification is wrong
    if (data[0][b].cls[maxOutputIndex(outputNeurons)] != 1) {
        wrongClass++;

        // error backpropagation
        for (int i = hiddenNeurons.size()-1; i &gt;= 0; i--) {
            for (int j = 0; j &lt; hiddenNeurons[i].size(); j++) {
                hiddenNeurons[i][j].error = 0.0;

                if (i &lt; hiddenNeurons.size()-1) {
                    for (int k = 0; k &lt; hiddenNeurons[i+1].size(); k++) {
                        hiddenNeurons[i][j].error += hiddenNeurons[i+1][k].error * weights[i+1][j][k];
                    }
                }
                else {
                    for (int k = 0; k &lt; outputNeurons.size(); k++) {
                        hiddenNeurons[i][j].error += outputNeurons[k].error * weights[i+1][j][k];
                    }
                }
            }
        }

        // adjust weights
        for (int i = 0; i &lt; weights.size(); i++) {
            int n;
            if (i &lt; weights.size()-1) { n = hiddenNeurons[i].size(); }
            else { n = outputNeurons.size(); }

            for (int k = 0; k &lt; n; k++) {
                for (int j = 0; j &lt; weights[i].size(); j++) {
                    double y;
                    if (i == 0) { y = data[0][b].x[j]; }
                    else { y = hiddenNeurons[i-1][j].output; }

                    if (i &lt; weights.size()-1) {
                        weights[i][j][k] += alpha * hiddenNeurons[i][k].error * derivedSigmoid(hiddenNeurons[i][k].output) * y;
                    }
                    else {
                        weights[i][j][k] += alpha * outputNeurons[k].error * derivedSigmoid(outputNeurons[k].output) * y;
                    }
                }
            }
        }
    }
}
</code></pre> <p>Please, can anyone tell me what I am doing wrong, or give me advice on where to look for the mistake? I hope I have mentioned everything important. Please forgive my bad English.</p>