Abstract
In this paper, we propose a novel system that is able to automatically detect and classify highlights from baseball game videos in TV broadcast. The digest system gives complete indexes of a baseball game which cover all of the status changes in a game. We achieve this by seamlessly integrating image, audio and speech clues using a maximum entropy based method. What distinguishes our system from previous ones is that we emphasize on the integration of multimedia features and the acquisition of domain knowledge through machine learning process. Integration of multimedia features is important because with the current state-of-the-art image and audio analysis techniques, most image and audio features we can extract from videos are very low level, and detecting/classifying sports game highlights based on features from single medium are doomed to yield poor performances. Acquiring domain knowledge through learning process is preferred over heuristic rules because machine learning process is more powerful for discovering and expressing domain knowledge. We perform extensive experiments on game videos including various stadiums, teams and broad-casted by different TV stations.
| Original language | English |
|---|---|
| Pages | 347-350 |
| Number of pages | 4 |
| DOIs | |
| State | Published - 2002 |
| Event | 10th International Conference of Multimedia - Juan les Pins, France Duration: 1 Dec 2002 → 6 Dec 2002 |
Conference
| Conference | 10th International Conference of Multimedia |
|---|---|
| Country/Territory | France |
| City | Juan les Pins |
| Period | 1/12/02 → 6/12/02 |