In machine learning, support vector machines (SVMs, also support vector networks[1]) are supervised max-margin models with associated learning algorithms that analyze data for classification and regression. Developed at AT&T Bell Laboratories,[1][2] SVMs are among the most studied models, being grounded in the statistical learning framework of VC theory proposed by Vapnik (1982, 1995) and Chervonenkis (1974).
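A minimal sketch of fitting such a linear max-margin classifier, assuming scikit-learn is available; the toy data are invented for illustration:

```python
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)
# Two linearly separable point clouds (hypothetical data).
X = np.vstack([rng.normal(-2, 0.5, (20, 2)),
               rng.normal(2, 0.5, (20, 2))])
y = np.array([0] * 20 + [1] * 20)

clf = SVC(kernel="linear", C=1.0)  # C trades margin width against training errors
clf.fit(X, y)

print(clf.support_vectors_)        # the training points that define the margin
print(clf.predict([[0.5, -0.3]]))  # class prediction for a new point
```

Only the support vectors (the points lying on or inside the margin) determine the decision boundary; the remaining training points could be removed without changing the fitted model.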
In addition to performing linear classification, SVMs can efficiently perform non-linear classification using the kernel trick: the data are represented only through pairwise similarity comparisons computed by a kernel function, which implicitly maps the inputs into a higher-dimensional feature space where linear classification can be performed.[3] Being max-margin models, SVMs are resilient to noisy data (e.g., misclassified examples). SVMs can also be used for regression tasks, where the objective uses an ε-insensitive loss that ignores residuals smaller than ε.
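A minimal sketch of both ideas, again assuming scikit-learn; the ring-shaped data, the sine target, and the hyperparameter values are illustrative assumptions:

```python
import numpy as np
from sklearn.svm import SVC, SVR

rng = np.random.default_rng(1)

# Non-linear classification: points inside vs. outside a unit circle are not
# linearly separable in 2-D, but an RBF kernel separates them implicitly in a
# higher-dimensional feature space.
X = rng.uniform(-2, 2, (200, 2))
y = (np.linalg.norm(X, axis=1) < 1.0).astype(int)
clf = SVC(kernel="rbf", gamma=1.0).fit(X, y)
print(clf.score(X, y))  # training accuracy on the ring problem

# ε-insensitive regression: residuals smaller than epsilon incur no loss,
# which keeps the solution sparse in the training points.
x = np.linspace(0, 6, 100).reshape(-1, 1)
t = np.sin(x).ravel() + rng.normal(0, 0.1, 100)
reg = SVR(kernel="rbf", epsilon=0.1).fit(x, t)
print(reg.predict([[1.5]]))
```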
The support vector clustering[4] algorithm, created by Hava Siegelmann and Vladimir Vapnik, applies the statistics of support vectors, developed in the support vector machines algorithm, to categorize unlabeled data.[citation needed] Such data sets call for unsupervised learning approaches, which attempt to find the natural clustering of the data into groups and then map new data according to these clusters.
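Support vector clustering has no standard scikit-learn implementation; the following is a minimal sketch of its two stages under the assumption that OneClassSVM serves as the support estimator. Two points are placed in the same cluster when the line segment between them stays inside the estimated support; the data and the gamma/nu values are illustrative:

```python
import numpy as np
from scipy.sparse.csgraph import connected_components
from sklearn.svm import OneClassSVM

rng = np.random.default_rng(2)
X = np.vstack([rng.normal(0, 0.3, (30, 2)),   # first group (unlabeled)
               rng.normal(3, 0.3, (30, 2))])  # second group (unlabeled)

# Stage 1: describe the support of the data in feature space.
ocsvm = OneClassSVM(kernel="rbf", gamma=1.0, nu=0.1).fit(X)

def same_cluster(a, b, n_points=20):
    # Two points share a cluster if the segment between them stays inside
    # the estimated support (non-negative decision function throughout).
    ts = np.linspace(0.0, 1.0, n_points)[:, None]
    segment = (1 - ts) * a + ts * b
    return bool(np.all(ocsvm.decision_function(segment) >= 0))

# Stage 2: cluster labels are the connected components of the graph whose
# edges join mutually reachable points.
n = len(X)
adjacency = np.zeros((n, n), dtype=bool)
for i in range(n):
    for j in range(i + 1, n):
        adjacency[i, j] = adjacency[j, i] = same_cluster(X[i], X[j])
n_clusters, labels = connected_components(adjacency, directed=False)
print(n_clusters, labels)
```

Points the one-class model treats as outliers fall outside the support and end up in singleton components, which mirrors the original algorithm's handling of bounded support vectors.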
The popularity of SVMs is likely due to their amenability to theoretical analysis, and their flexibility in being applied to a wide variety of tasks, including structured prediction problems. It is not clear that SVMs have better predictive performance than other linear models, such as logistic regression and linear regression.[citation needed]