Sebastian Kirsch: Blog

Struggling with the voted perceptron

Filed under:

studies

— Sebastian Kirsch @ 21:51

Deprecated: preg_replace(): The /e modifier is deprecated, use preg_replace_callback instead in /homepages/u37107/www.sebastian-kirsch.org/moebius/blog/wp-includes/functions-formatting.php on line 76

I’m currently struggling with the voted perceptron learning algorithm by Yoav Freund and Robert Schapire: I’ve now read three different versions of this paper, and some things about the algorithm strike me as “not quite right".

The first thing is a minor issue: The voted perceptron algorithm is specified as starting with setting k := 0 and v₁ := 0, but then immediately starts to make predictions with v_k without ever initializing v₀. That can’t be quite right. My guess is that they wanted to start with v₀ := 0 (because a prediction vector of 0 is pretty pointless.)

The next problem is a little more involved. As a decision function, they use the standard f(x) := sign(w ⋅ x) – but that’s not right, the classical perceptron algorithm uses f(x) := sign(w ⋅ x - b), where b is the threshold of the decision function. It’s just that the threshold is usually hidden in this equation, because the “zeroth element” of vector x is taken to be x₀ ≡ 1. This saves you the trouble of having to update the threshold by hand.

The voted perceptron does not have this implicit assumption, and being a kernel algorithm, it cannot have it – how would you specify that a component is equivalent to 1 in feature space?

According to everything I learnt in my lectures on neural networks and AI, a perceptron cannot work with a threshold of zero, because then the origin is part of all decision hyperplanes. A threshold of anything but zero is OK, because you can scale the vector w to get the right decision hyperplane. But the perceptron is supposed to be able to solve all linearly separable problems, not only those that are separable with a hyperplane containing the origin.

The only implementation of the voted perceptron I could find (in the Weka package) neatly sidesteps this problem by always using the polynomial kernel k(x, y) = (1 + x ⋅ y)^d; if no exponent is specified, they assume d = 1; and thus get their threshold. But this is not how the voted perceptron is specified. One solution would be to specify the algorithm with x ⋅ x’ + 1 in the non-kernelized form, and k(x,x’) + 1 in the kernelized form everywhere the scalar product/kernel is used.

It’s a strange thing that no publication makes note of this problem; I will have to talk it through with someone.

Comments (1)

Pälzer Grumbiere

Filed under:

german

— Sebastian Kirsch @ 13:40

Deprecated: preg_replace(): The /e modifier is deprecated, use preg_replace_callback instead in /homepages/u37107/www.sebastian-kirsch.org/moebius/blog/wp-includes/functions-formatting.php on line 76

Ein Motiv aus der aktuellen Werbekampagne für Schweizer Kartoffeln macht wohl gerade im Saarland die Runde – allerdings nicht im Zusammenhang mit Kartoffeln, sondern unter dem Titel “Miss Pfalz”. Ein neues Kapitel in der Rivalität zwischen Pfalz und Saarland.

Comments Off

Nerd Quotient

Filed under:

general

— Sebastian Kirsch @ 13:23

Deprecated: preg_replace(): The /e modifier is deprecated, use preg_replace_callback instead in /homepages/u37107/www.sebastian-kirsch.org/moebius/blog/wp-includes/functions-formatting.php on line 76

Hm …

Comments Off

S	M	T	W	T	F	S
« Dec				Feb »
						1
2	3	4	5	6	7	8
9	10	11	12	13	14	15
16	17	18	19	20	21	22
23	24	25	26	27	28	29
30	31

Tuesday, 11 January 2005

Struggling with the voted perceptron

Pälzer Grumbiere

Nerd Quotient