Automatic Identification of Genre in Web Pages (Paperback)


Genre is a complex but intuitively understood concept. Home pages, FAQs, blogs, etc. are examples of genres currently thriving on the web. Automatically identifying web genres would help us find documents that are more relevant to our information needs. The aim of the research described in this book is to develop automatic genre classification algorithms. There are several challenges, however, that affect the modelling of these algorithms. First, genres on the web are instantiated in web pages, which can be considered documents of a new type, much more unpredictable and individualised than documents on paper. Second, the web is unstable and fluid, undergoing a fast-paced evolution, so genre identification is influenced by phenomena such as the formation of novel genres, genre hybridism, individualisation, intra-genre and inter-genre variation. Finally, the automatically extractable genre-revealing features used up to now are not adequate to define existing and novel web genres. The author argues that automatic identification of genre in web pages needs more flexible genre classification schemes. The main body of the book describes experiments that support this claim.

R2,126



Product Details

General

Imprint: Lap Lambert Academic Publishing
Country of origin: Germany
Release date: December 2011
Availability: Expected to ship within 10 - 15 working days
First published: December 2011
Dimensions: 229 x 152 x 19 mm (L x W x T)
Format: Paperback - Trade
Pages: 332
ISBN-13: 978-3-8473-0687-0
Barcode: 9783847306870
LSN: 3-8473-0687-1


