Levenshtein Distance (Paperback)


In database record linkage or natural language processing tasks one usually encounters problems when working with data or texts containing noise, typos and other kinds of errors. In this thesis the use of modified Levenshtein edit distances to deal with these problems is investigated. For the task of linking distinct records representing the same entity in a database we used and extended the WEKA API for Machine Learning, obtaining good precision and recall results. For the task of searching and annotating occurrences of specified words in texts written in natural language we implemented an approximate Gazetteer for GATE, the General Architecture for Text Engineering.

R1,310

Or split into 4x interest-free payments of 25% on orders over R50
Learn more

Discovery Miles13100
Mobicred@R123pm x 12* Mobicred Info
Free Delivery
Delivery AdviceShips in 10 - 15 working days


Toggle WishListAdd to wish list
Review this Item

Donate to Against Period Poverty


Product Description

In database record linkage or natural language processing tasks one usually encounters problems when working with data or texts containing noise, typos and other kinds of errors. In this thesis the use of modified Levenshtein edit distances to deal with these problems is investigated. For the task of linking distinct records representing the same entity in a database we used and extended the WEKA API for Machine Learning, obtaining good precision and recall results. For the task of searching and annotating occurrences of specified words in texts written in natural language we implemented an approximate Gazetteer for GATE, the General Architecture for Text Engineering.

Customer Reviews

No reviews or ratings yet - be the first to create one!

Product Details

General

Imprint

Lap Lambert Academic Publishing

Country of origin

Germany

Release date

June 2010

Availability

Expected to ship within 10 - 15 working days

First published

June 2010

Authors

Dimensions

229 x 152 x 6mm (L x W x T)

Format

Paperback - Trade

Pages

96

ISBN-13

978-3-8383-6243-4

Barcode

9783838362434

Categories

LSN

3-8383-6243-8



Trending On Loot