<?xml version="1.0" encoding="utf-8" ?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">

    <title type="text">perClass</title>
    <link rel="alternate" type="text/html" href="http://perclass.com/index.php/forums/" />
    <link rel="self" type="application/atom+xml" href="http://perclass.com/index.php/forums/atom/" />
    <updated></updated>
    <rights>Copyright (c) 2011</rights>
    <generator uri="http://expressionengine.com/" version="1.6.3">ExpressionEngine</generator>
    <id>tag:perclass.com,2011:11:05</id>


    <entry>
      <title>Leave one subject out</title>
      <link rel="alternate" type="text/html" href="http://perclass.com/index.php/forums/viewthread/351/" />      
      <id>tag:perclass.com,2011:index.php/forums/viewthread/.351</id>
      <published>2011-10-28T22:41:10Z</published>
      <updated></updated>
      <author><name>stonerain</name></author>
      <content type="html">
      <![CDATA[
        <p>Hi dear forum members,
<br />
I am trying to use loso function in the PRTools but I think I have been doing something wrong. I have 7 classes and I wrote that code;
</p>
<p>
</p><div class="codeblock"><code>
<span style="color: #0000BB">prior&nbsp;</span><span style="color: #007700">=&nbsp;</span><span style="color: #0000BB">&#91;1</span><span style="color: #007700">/</span><span style="color: #0000BB">7&nbsp;1</span><span style="color: #007700">/</span><span style="color: #0000BB">7&nbsp;1</span><span style="color: #007700">/</span><span style="color: #0000BB">7&nbsp;1</span><span style="color: #007700">/</span><span style="color: #0000BB">7&nbsp;1</span><span style="color: #007700">/</span><span style="color: #0000BB">7&nbsp;1</span><span style="color: #007700">/</span><span style="color: #0000BB">7&nbsp;1</span><span style="color: #007700">/</span><span style="color: #0000BB">7&#93;</span><span style="color: #007700">;<br /></span><span style="color: #0000BB">A&nbsp;</span><span style="color: #007700">=&nbsp;</span><span style="color: #0000BB">dataset</span><span style="color: #007700">(</span><span style="color: #0000BB">training_features</span><span style="color: #007700">,&nbsp;</span><span style="color: #0000BB">class_labels</span><span style="color: #007700">,&nbsp;</span><span style="color: #DD0000">'prior'</span><span style="color: #007700">,&nbsp;</span><span style="color: #0000BB">prior</span><span style="color: #007700">);<br /></span><span style="color: #0000BB">W&nbsp;</span><span style="color: #007700">=&nbsp;</span><span style="color: #0000BB">svc</span><span style="color: #007700">(</span><span style="color: #0000BB">&#91;&#93;</span><span style="color: #007700">,&nbsp;</span><span style="color: #DD0000">'p'</span><span style="color: #007700">,&nbsp;</span><span style="color: #0000BB">2</span><span style="color: #007700">);<br /><br /></span><span style="color: #0000BB">&#91;E&nbsp;C&nbsp;D&#93;&nbsp;</span><span style="color: #007700">=&nbsp;</span><span style="color: #0000BB">loso</span><span style="color: #007700">(</span><span style="color: #0000BB">A</span><span style="color: #007700">,&nbsp;</span><span style="color: #0000BB">W</span><span style="color: #007700">,&nbsp;</span><span style="color: #0000BB">class_nos</span><span style="color: #007700">);<br /></span><span style="color: #0000BB">confmat</span><span style="color: #007700">(</span><span style="color: #0000BB">D</span><span style="color: #007700">)</span>
</code></div><p>
</p>
<p>
Is that something wrong here?
<br />
ANy response will be appreciated. 
<br />
Thanks a lot.
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>training SVDD with negative examples (incsvdd)</title>
      <link rel="alternate" type="text/html" href="http://perclass.com/index.php/forums/viewthread/295/" />      
      <id>tag:perclass.com,2011:index.php/forums/viewthread/.295</id>
      <published>2011-03-24T21:05:56Z</published>
      <updated></updated>
      <author><name>claudio</name></author>
      <content type="html">
      <![CDATA[
        <p>Hello!
</p>
<p>
I have a question about training SVDD with negative examples in &#8220;incsvdd&#8221;; the negatives examples appears in the objective function of the optimizacion problem?, or just appears like restriction.
</p>
<p>
I have this doubt because when I utilize oversampling on the negative examples, I get the same performance. Therefore, I think that the negatives examples just appears like restriction,, but really I&#8217;m not sure. 
</p>
<p>
Could somebody help me?, 
<br />
Claudio
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Combining featrures</title>
      <link rel="alternate" type="text/html" href="http://perclass.com/index.php/forums/viewthread/308/" />      
      <id>tag:perclass.com,2011:index.php/forums/viewthread/.308</id>
      <published>2011-05-14T20:11:27Z</published>
      <updated></updated>
      <author><name>strantis</name></author>
      <content type="html">
      <![CDATA[
        <p>Hi All
<br />
I have a radar signal classification problem, where the extracted features are both images (time-frequency distributions) and vectors (statistical features). There is one image feature and three vector features for each object. I want to combine these features for using them in a single classifier. My intuition tells me that I just have to vectorize the images and then concatenate the resulting vector with the rest vector features (for the same object), thus creating a large vector feature for each object. However, I am afraid that combining the features there might be problems in the classification since the feature sets were generated using totally different procedures. Is there any idea for a better way of doing this? Thanks
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Question about SVM formulation in &#8216;Classification, Parameter Estimation and State Estimation&#8217;</title>
      <link rel="alternate" type="text/html" href="http://perclass.com/index.php/forums/viewthread/294/" />      
      <id>tag:perclass.com,2011:index.php/forums/viewthread/.294</id>
      <published>2011-03-18T22:22:39Z</published>
      <updated></updated>
      <author><name>tadam21</name></author>
      <content type="html">
      <![CDATA[
        <p>Hello,
</p>
<p>
I am currently working on my methods section for a paper and I want to show the development of the SVM classifier. I tried finding the dual form of Equation 5.54 by hand and I came up with sign disagreements when compared to 5.56. My relationships derived from the partial derivatives of L came out to be the same as 5.55. This derivation starts on page 169 for reference.
</p>
<p>
L = 1/2*||w||^2 + SUM an (cn [w^T*zn + b] - 1) (Eq 5.54)
<br />
w = SUM an*cn*zn
<br />
SUM cn*an = 0 (Eq 5.55)
</p>
<p>
Combining the three relationships,
</p>
<p>
L = 1/2*(SUM an*cn*zn)*(SUM am*cm*zm) - SUM an
</p>
<p>
In the book, L seems to be multiplied through by a negative sign, but I am not sure why. Did I do something wrong?
</p>
<p>
Also, I tried to find the dual formulation with the slack variables included and was wondering if it was correct (exercise 6 of chapter 5).
</p>
<p>
L = 1/2*w^2 + C*SUM En + SUM an (cn [w^T*zn + b] - 1 + En) + SUM vn*En (5.61)
<br />
Partial derivative relationships
<br />
dL/dw: w = SUM an*cn*zn
<br />
dL/db: SUM cn*an = 0 
<br />
dL/dC: SUM En = 0
</p>
<p>
L = 1/2*(SUM an*cn*zn)*(SUM am*cm*zm) - SUM an + SUM vn*En
</p>
<p>
The reason I ask is I am not sure how this would be solved as L must be maximized with respect to an but I don&#8217;t think En or vn is known yet. Am I missing something?
</p>
<p>
Thank you in advance,
</p>
<p>
Thomas
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>fracrej and C in SVDD in dd_Tools</title>
      <link rel="alternate" type="text/html" href="http://perclass.com/index.php/forums/viewthread/262/" />      
      <id>tag:perclass.com,2010:index.php/forums/viewthread/.262</id>
      <published>2010-11-04T16:19:05Z</published>
      <updated></updated>
      <author><name>benjamin</name></author>
      <content type="html">
      <![CDATA[
        <p>Hello forum,
</p>
<p>
I am using the dd_tools and especially the SVDD for classifying hyperspectral images. I am a user coming from the application side (without a deeper understanding in pattern recognition) and have some question regarding the theoretical background of the SVDD and the implementation in dd_tools.
<br />
If I understood right, the parameter &#8216;fracrej&#8217; is mainly influential in the determination of the parameter C while sigma for the flexibility of the hypersphere. I would like to understand how fracrej and C are related but also reading the dd_tools manual and the dissertation of Tax did not help. Does anybode have a simple explanation? I also wonder how the second regularization parameter C2 is optimized when I train the classifier with targets and outliers because I only define one fracrej for the target class. 
</p>
<p>
A nice explanation for the relation between C and fracrej would be highly acnowledged (-:
</p>
<p>
Thanx in advance,
<br />
ben
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Is it correct to calculate the average confusion matrix in kfolds&#63;</title>
      <link rel="alternate" type="text/html" href="http://perclass.com/index.php/forums/viewthread/167/" />      
      <id>tag:perclass.com,2010:index.php/forums/viewthread/.167</id>
      <published>2010-03-15T13:19:54Z</published>
      <updated></updated>
      <author><name>Jorge Amaral</name></author>
      <content type="html">
      <![CDATA[
        <p>Hi, 
</p>
<p>
 I want to compare the performance of two classifiers. Is it correct to divide the data in k-folds and calculate the confusion matrix for each fold and then average the K confusion matrixs? Is this approach valid for the area under the roc curve? 
</p>
<p>
If I am calculating the sensitivity and specificity, Is this approach is also valid?
</p>
<p>
Thanks,
</p>
<p>
Jorge
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Another question about performance estimation in an umbalanced classification problem</title>
      <link rel="alternate" type="text/html" href="http://perclass.com/index.php/forums/viewthread/196/" />      
      <id>tag:perclass.com,2010:index.php/forums/viewthread/.196</id>
      <published>2010-06-02T20:18:58Z</published>
      <updated>2010-06-02T20:27:01Z</updated>
      <author><name>Mariano Llamedo Soria</name></author>
      <content type="html">
      <![CDATA[
        <p>Anybody could tell me the correct way of evaluating the performance in a multi-class classification problem with umbalanced or imbalanced class presence ?
</p>
<p>
For example in this 3 class problem, I am used to have the following performance measures derived from the confusion matrix:
</p>
<p>
(this example is attached in results.txt, I dont know why the formatting is so ugly)
</p>
<p>
  True            | Estimated Labels
<br />
  Labels          | Normal Suprav Ventri| Totals
<br />
 -----------------|---------------------|-------
<br />
  Normal          |282094   1534   3991  |287619
<br />
  Supraventricular|  102    854    378  | 1334
<br />
  Ventricular     |  521   1309    742  | 2572
<br />
 -----------------|---------------------|-------
<br />
  Totals          |282717   3697   5111  |291525
</p>
<p>
Balanced Results 
<br />
-----------------
<br />
| Normal    || Supravent || Ventricul ||           TOTALS            |
<br />
|  Se   +P  ||  Se   +P  ||  Se   +P  ||   Acc   |   Se    |   +P    |
<br />
|  98%  78% ||  64%  55% ||  29%  49% ||   64%   |   64%   |   61%   |
</p>
<p>
Unbalanced Results 
<br />
-------------------
<br />
| Normal    || Supravent || Ventricul ||           TOTALS            |
<br />
|  Se   +P  ||  Se   +P  ||  Se   +P  ||   Acc   |   Se    |   +P    |
<br />
|  98% 100% ||  64%  23% ||  29%  15% ||   97%   |   64%   |   46%   |
</p>
<p>
(this example is attached in results.txt, I dont know why the formatting is so ugly)
</p>

<p>
As you can see, the presence of the 3 classes is VERY umbalanced (287619, 1334, 2572), and consequently I noticed that the positive predictive value (+P) is biased by this umbalance. Trying to avoid this, I calculated the &#8220;balanced&#8221; results, which means scaling all rows in the confusion matrix to sum the same (or balance the class presence), and then calculate the performance. As can be noted, the class sensitivity (Se) is not affected by this row-operations. The totals means the average Se and +P of all classes, and Acc is the trace of the confusion matrix divided by the total amount of examples (291525).
</p>
<p>
Thanks in advance for any comment.
<br />
Mariano.
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Classifier performance estimation and data division.</title>
      <link rel="alternate" type="text/html" href="http://perclass.com/index.php/forums/viewthread/182/" />      
      <id>tag:perclass.com,2010:index.php/forums/viewthread/.182</id>
      <published>2010-04-28T10:27:58Z</published>
      <updated></updated>
      <author><name>Mariano Llamedo Soria</name></author>
      <content type="html">
      <![CDATA[
        <p>I have the following doubt regarding to some pattern recognition experiments which I am doing. First of all, I am interested in doing a feature selection step, to obtain a classification model optimum in some sense, and finally estimating its performance. The final idea is porting one trained model to the &#8220;real world&#8221; to perform some classification task. Suppose I have a decent amount of data to perform a three-way partition (train-validation-test) or any other approach. My question is, which is the most correct approach when data is not a limitation ?
</p>
<p>
One idea could be dividing the dataset in three (train-validation-test). Then perform feature selection in train, estimating the criterion to maximize via crossvalidation. Then adjusting the operating point in the validation set, and finally estimating performance in the test set. Both last steps also crosssvalidating. The question of this approach is, when you need to port a trained model to the real world, where are you supposed to train this model ? In the whole data ? 
</p>
<p>
If you do the same, but avoiding crossvalidation. I mean, always train in the training set, evaluate in the validation set for feature selection and operating point determination, and te selected model, trained in the train set, and evaluated in the test set gives the final performance estimation. With this approach, it is clear that the model to port to reality is the only one I obtained from the train-validation sets ... BUT should I try different train-validation partitions previous to the final performance estimation ?
</p>
<p>
Thanks in advance for any help or comments,
<br />
Mariano.
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Final training for a neural netwrok after cross validation</title>
      <link rel="alternate" type="text/html" href="http://perclass.com/index.php/forums/viewthread/186/" />      
      <id>tag:perclass.com,2010:index.php/forums/viewthread/.186</id>
      <published>2010-05-03T20:10:30Z</published>
      <updated></updated>
      <author><name>Jorge Amaral</name></author>
      <content type="html">
      <![CDATA[
        <p>Hi, 
</p>
<p>
    I know that is possible to use cross validation to determine the number of hidden neurons. My question is: after I decided how many hidden neurons I use , what is the training and tuning set that i have to use. In other types of classifiers such as nearest neighbors, I just use the whole dataset, but I cannot do this with neural networks because it would cause overfitting. Should I use the best network in the crossvalidation? That is the one with the smallest validation error?
</p>
<p>
Thanks, 
</p>
<p>
      Jorge
</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Interpreting figures</title>
      <link rel="alternate" type="text/html" href="http://perclass.com/index.php/forums/viewthread/177/" />      
      <id>tag:perclass.com,2010:index.php/forums/viewthread/.177</id>
      <published>2010-04-11T01:08:34Z</published>
      <updated></updated>
      <author><name>faiyo</name></author>
      <content type="html">
      <![CDATA[
        <p>Hi,
<br />
I will be straight to the point. I am training a mixture of gaussians to estimate a persons age from their speech. I have 168 objects with <b>13</b> features in each, <b>4</b> classes of data. I trained the gaussians using <b>gaussm</b> and <b>mogc</b> and I used <b>plotm</b>. I know that gaussm trains one gaussian per calss of data object while mogc computes one over all gaussian density.
<br />
I provided prior probabilities when I created my dataset object and set the labtype to &#8216;soft&#8217; in order to use the expectation maximisation algorithm. The following is my code:
<br />
A = dataset(inputs,labels&#8217;);
<br />
A = setlabtype(A,&#8217;soft&#8217;);
<br />
A = setprior(A, [0.16666666666667 0.22619047619048 0.33333333333333 0.27380952380952]);
<br />
W1 = mogc(A,4,0,0.8);
<br />
figure(2);
<br />
scatterd(A,[10,5]);
<br />
plotm(W1,6,10);
</p>
<p>
and attached is the resulting plot using plotm as shown. 
<br />
Can anyone please explain to me how to interpret the plot? 
<br />
What does &#8220;feature 1&#8221; and &#8220;feature 2&#8221; mean? 
</p>
<p>
Kind regards,
<br />
Faiyo.
</p>
      ]]>
      </content>
    </entry>


</feed>
