Computing Reviews, the leading online review service for computing literature.

Search

Computer vision and natural language processing: recent approaches in multimedia and robotics
Wiriyathammabhum P., Summers-Stay D., Fermüller C., Aloimonos Y. ACM Computing Surveys49 (4):1-44,2017.Type:Article

Date Reviewed: Apr 21 2017

Robot learning operates at the crossroads of disciplines such as machine learning, robotics engineering, and developmental robotics for lifelong learning. Robot skills can be divided into four categories: sensorimotor (locomotion, grasping); interactive (joint manipulation of an object); linguistic; and autonomous self-exploration or exploration through guidance from a human teacher. Therefore, robot learning can be closely related to subject areas such as adaptive control, for improving sensorimotor skills via dynamically adapting controllers; reinforcement learning, for understanding, taking actions, and planning; and developmental robotics, for more degrees of autonomous learning modalities such as those existent in human children, where lifelong learning is expected to be cumulative and of progressively increasing complexity. In this context, the paper addresses the areas of computer vision and natural language understanding, with emphasis on robot learning, exceptionally well. In particular, it provides an excellent starting point for someone to do research in this area. Also, from a teaching point of view, it provides an excellent reading list for postgraduate students. Although the paper is well written, it takes a long time to reach the point when it concentrates on computer vision and natural language understanding specifically for robots. Instead of concentrating on related work about the integration of computer vision and natural language for robots, the authors first take two separate long journeys into the semantics of natural language processing (NLP) and computer vision and/or image annotation. There is a huge body of related work about the semantics in these two separate areas, and there are many other survey papers. Therefore, the reader may feel a bit disappointed having gone through this survey, particularly if he or she is already familiar with aspects of semantic computing in NLP and computer vision.

Reviewer: Epaminondas Kapetanios	Review #: CR145210 (1707-0481)

General (I.2.0 )

Applications (I.4.9 )

Applications (I.5.4 )

Natural Language Processing (I.2.7 )

Robotics (I.2.9 )

Vision And Scene Understanding (I.2.10 )

Would you recommend this review?

yes

Other reviews under "General":	Date

Artificial experts: social knowledge and intelligent machines Collins H., MIT Press, Cambridge, MA, 1990. Type: Book (9780262031684)	Apr 1 1991

Catalogue of artificial intelligence techniques Bundy A., Springer-Verlag New York, Inc., New York, NY, 1990. Type: Book (9780387529592)	Aug 1 1991

Knowledge and inference Nagao M., Academic Press Prof., Inc., San Diego, CA, 1990. Type: Book (9780125136624)	Oct 1 1991

more...

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud^®
Terms of Use | Privacy Policy