摘 要在机器学习理论中支持向量机(SVM)有着重要的地位,无论是求解分类问题还是求解回归问题,SVM 都有着广泛的应用。本文简单的介绍了 SVM 的基本原理,讨论了 SVM 在文本分类中的应用,并详细的分析了如何利用 SVM 构造文本分类器。这里说明了文本分类的详细处理过程,并介绍了这些过程中的关键技术,如:分词技术、向量空间模型(VSM)、特征选取技术和 SVM 的交叉验证技术等等。结合着分析和讨论又概略的说明了利用 Microsoft Visual C++ 6.0 创建文本分类系统的过程,介绍了重要的类和关键处理函数的实现和优化,以及如何利用动态链接库来实现 C++到 Java 的迁移。最后给出了由本系统得到的实验数据和结论。关键字: 机器学习 文本分类 支持向量机(SVM)ABSTRACTSupport Vector Machines (SVM) has an important position in Machine learning theory, whether it is to solve the classification problem or request for the reunification issue, SVM has a wide range of applications. In this paper, a short introduction into the basic principles of SVM, a detailed discussion of the SVM in the text classification, and a careful analysis of how to make use of SVM to construct classifier for a text classification. Here's the text of the detailed classification process and introduced in the course of these key technologies, such as: segmentation technology, vector space model (VSM), features selection technology, cross-verification technology of the SVM and so on. With the analysis and discussion also briefly described the process of making use of Microsoft Visual C++ 6.0 to create the text classification system, introduced the realization and optimization of the key class and important functions, and how to use of dynamic link library to achieve the migration from C++ to Java. Finally, the experimental data and conclusions produced by this system are shown.Keywords: machine learning text classification SVM(support vector machine)毕...