Computing Reviews, the leading online review service for computing literature.

Search

Automatic extraction of Web data records containing user-generated content
Song X., Liu J., Cao Y., Lin C., Hon H. CIKM 2010 (Proceedings of the 19th ACM International Conference on Information and Knowledge Management, Toronto, ON, Canada, Oct 26-30, 2010)39-48.2010.Type:Proceedings

Date Reviewed: Aug 16 2011

Finding similarity measurement techniques for Web data records containing user-generated content or narrative text is an active area of research. This paper explains one such attempt, even though the title of paper does not suggest that explicitly. The paper points out some of the limitations of the mining data records (MDR) approach by Liu et al. [1], and suggests improvements. The authors prove their point with a case study. While the authors have tried to provide as much information as possible, readers would require reasonable domain knowledge to follow the reported work. For example, they discuss “post” and “page” at great length without pointing out what they mean; similarly, the acronym DOM is used with no explanation. Interestingly, the reported approach does not seem to use the user-generated narrative content, but the metadata associated with the posts. Perhaps space constraints contributed to the missing information.

Reviewer: Sithu D. Sudarsan	Review #: CR139351 (1203-0310)

1)	Liu, B.; Grossman, R.; Zhai, Y. Mining data records in Web pages. In Proc. of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining ACM, 2003, 601–606.

Miscellaneous (H.3.m )

Would you recommend this review?

yes

Other reviews under "Miscellaneous":	Date

Shared processing with an advanced intelligent terminal Estall C., Smith F. Research and development in information retrieval (, King’s College, Cambridge,1661984. Type: Proceedings	Sep 1 1985

Bridging the gap between AI and IR Cooper W. Research and development in information retrieval (, King’s College, Cambridge,2651984. Type: Proceedings	Nov 1 1985

Generalized success-breeds-success principle leading to time-dependent informetric distributions Egghe L., Rousseau R. Journal of the American Society for Information Science 46(6): 426-445, 1995. Type: Article	Sep 1 1996

more...

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud^®
Terms of Use | Privacy Policy