Naïve Bayes vs. Decision Trees vs. Neural Networks in the Classification of Training Web Pages



IJCSI International Journal of Computer Science Issues, Vol. 4, No 1, 2009

Daniela Xhemali is an Engineering Doctorate (EngD) student at
Loughborough University, UK. She received a First Class
(Honours) BSc in Software Engineering from Sheffield Hallam
University in 2005 and an MSc with Distinction in Engineering,
Innovation and Management from Loughborough University in
2008. She has also worked in industry for two years
as a Software Engineer, programming multi-user, object-oriented
applications with large database back ends. Her current research
focuses on Web Information Retrieval and Extraction, specifically
on the use of Bayes Networks, Decision Trees and Neural
Networks in the classification of web pages as well as the use of
Genetic Programming and Evolution in the extraction of specific
web information.

Dr. Christopher J. Hinde is a Senior Lecturer at Loughborough
University. He is the Programme Director of the Computer Science
& Artificial Intelligence group as well as the Programme Director of
the Computer Science & E-business group. Dr. Hinde is also the
leader of the Intelligent and Interactive Systems Research division.
His research interests include artificial intelligence, fuzzy
reasoning, logic programming, natural language processing and
neural networks.

Dr. Roger G. Stone is a lecturer at Loughborough University. He is
DANS Coordinator and the Quality Manager at Loughborough
University. Dr. Stone is also a member of the Interdisciplinary
Computing Research Division. His research interests include web
programming, web accessibility, program specification techniques,
software engineering tools and compiling.
