This week I learned that my PhD thesis, entitled “The Authoring Metaphor to Machine Understanding of Multimedia“, is still actively used for research purposes, but certainly not in the way it was intended ;)  My office-mate Sander found a useful application of the research, see the picture on the right.

For those of you interested in a hard copy version of the booklet, I still have a few more left, just drop me an email. More visual evidence showcasing useful applications that build upon the research is welcome.

Farmacia In Linea Dal Canada. Prezzi A Basso Costo. Assistenza Web 24 Ore Su 24. Qualcuno In Particolare Vuole Diventare Genuino A Bologna A Casa Sotto go now Naturalmente! Con Grande Sconto. Fantastici Sconti Speciali Nei Fine Settimana.
Today I gave a talk at our neighbor’s institute: CWI, the national research center for mathematics and computer science in the Netherlands. It was my first talk on a new topic, coined for the moment as socio-video search, where we aim to connect machine-based multimedia tagging, with user-tagging of multimedia. Highlighting, in particular, recent papers by Arjan Setz, Xirong Li, Daragh Byrne, and Aiden Doherty. I would like to thank the CWI-colleague researchers for having me, and also for the lively discussion on the content, which I take as a positive sign. For those who are interested, the slides are available here. More comments welcome!

wordle-cees-pubs

A Wordle of all my publication title-words.

xirong-annotating-images

The paper entitled “Annotating Images by Harnessing Worldwide User-Tagged Photos” by Xirong Li, Cees Snoek, and Marcel Worring, which will appear in the proceedings of the forthcoming ICASSP2009 conference, is available online now. Automatic image tagging is important yet challenging due to the semantic gap and the lack of learning examples to model a tag’s visual diversity. Meanwhile, social user tagging is creating rich multimedia content on the web. In this paper, we propose to combine the two tagging approaches in a search-based framework. For an unlabeled image, we first retrieve its visual neighbors from a large user-tagged image database. We then select relevant tags from the result images to annotate the unlabeled image. To tackle the unreliability and sparsity of user tagging, we introduce a joint-modality tag relevance estimation method which efficiently addresses both textual and visual clues. Experiments on 1.5 million Flickr photos and 10 000 Corel images verify the proposed method.

Good news, our short course proposal on Video Search Engines was accepted for the forthcoming IEEE Conference on Computer Vision and Pattern Recognition. In this half-day course Arnold Smeulders and I will discuss the problems of video search, present methods how to achieve state-of-the-art performance, and indicate how to obtain improvements in the near future. We give an overview of the developments and future trends in the field on the basis of the TRECVID benchmark – the leading evaluation campaign for video search engines run by NIST – where we have consistently scored a top-three performance over the last five years. A course website with a detailed course description, overview of lecture topics, and related material will follow very soon is available online now.

Toky Tower by night

This week I finally had the opportunity to visit Japan. I attended the Pacific-Rim Symposium on Image and Video Technology 2009 for lecturing our tutorial on concept-based video retrieval. With more then 50 people present during the lecture, the tutorial was well attended. I was happy to see that the topic of video search is also of scientific interest on this part of the globe. Apart from interaction with people giving talks, posters and demo’s at the conference, I had some spare time to explore Tokyo and even made a short sightseeing day trip to Nikko. My experience with this part of Japan is quite positive. Tokyo is well organized, clean, and efficient, and despite the language barrier finding your way in this metro pole is relatively straightforward. In cases you do get lost the folks around here are always willing to help. Ooh, and did I mention the quality of the food already? I was lucky that my hosts took me out to some fancy Japanese restaurants for traditional dinner, including sake of course. Today I finished my trip with a visit to Shin’ichi Satoh‘s lab at NII, presenting the latest achievements of our team in TRECVID and social image retrieval. After, again, a marvelous dinner it is now time to fly back home, but I hope to return to Japan in the near future.

Any Site visitor it is possible to find true viagra pfizer in Charlotte in this online shop Viagra naturally. Yes you have right to remain benefit from the purchase. Lowest Prices and Satisfaction Guaranteed. healthperxplus Start Your Order Viagra-pills that are aimed at natural sexual arousal in men The drug is usually used to eliminate erectile dysfunction The active substance of sildenafil tablets causes the increased blood circulation in the genital area, resulting in increased and prolonged erections, as well as orgasmic experiences.
scoreboard

We organize another VideOlympics at CIVR2009. Similar to previous editions, the event will provide a real-time evaluation of several video retrieval systems in a single showcase. Where traditional evaluation campaigns, like TRECVID, focus primarily on the effectiveness of collected retrieval results, the VideOlympics also allows to take into account the influence of interaction mechanisms and the advanced visualizations in the interface. Specifically, we again aim for a showcase that goes beyond the regular demo session: it should be fun to do for the participants and fun to watch for the conference audience. New element in the 2009 VideOlympics will be the round with novice users, which supplements the regular round with expert users. The 2009 VideOlympics again promises to be an interesting event, if you want to participate please see the guidelines on http://www.videolympics.org

nwt121

More media attention for our research! This time in the form of a report on next-generation video search in the science-oriented magazine Natuurwetenschap & Techniek / Veen Magazines. Reporter Desiree Hoving managed to summarize the highlights, challenges, and limitations of video retrieval research in a very readable article, covering a lot of research performed at the University of Amsterdam. For those of you who are familar with the Dutch language, the report is available online here.


Validating the Detection of Everyday Concepts in Visual Lifelogs

The paper entitled “Validating the Detection of Everyday Concepts in Visual Lifelogs” by Daragh Byrne, Aiden R. Doherty, Cees G. M. Snoek, Gareth J. F. Jones, and Alan F. Smeaton, which will appear in the forthcoming SAMT 2008 conference, is available online now. It reports on experiments peformed on Microsoft SenseCam data. The Microsoft SenseCam is a small lightweight wearable camera used to passively capture photos and other sensor readings from a user’s day-today activities. It can capture up to 3,000 images per day, equating to almost 1 million images per year. It is used to aid memory by creating a personal multimedia lifelog, or visual recording of the wearer’s life. However the sheer volume of image data captured within a visual lifelog creates a number of challenges, particularly for locating relevant content. Within this work, we explore the applicability of semantic concept detection, a method often used within video retrieval, on the novel domain of visual lifelogs. A concept detector models the correspondence between low-level visual features and highlevel semantic concepts (such as indoors, outdoors, people, buildings, etc.) using supervised machine learning. By doing so it determines the probability of a concept’s presence. We apply detection of 27 everyday semantic concepts on a lifelog collection composed of 257,518 SenseCam images from 5 users. The results were then evaluated on a subset of 95,907 images, to determine the precision for detection of each semantic concept and to draw some interesting inferences on the lifestyles of those 5 users. We additionally present future applications of concept detection within the domain of lifelogging.

nl_header

After 9 years of research on broadcast video analysis and retrieval, I ‘finally’ made my debut on national TV. Yesterday, VARA Nieuwslicht had an item on using human cycles, in the form of computer games and social tagged content, for helping machines understand multimedia. Indeed, the topic of my granted VENI proposal. It was funny to see that the crew spent 2 hours in my office last week, but in the end only 1 minute remained. Unfortunately the demo of our video search engine for the Dutch television archive Beeld en Geluid was not broadcasted, maybe next time?