Picking out pictures

Imense, a high-tech Cambridge start-up, has developed software that it claims could help it become the Google of image searching.

Imense, a high-tech Cambridge start-up, has developed new software that it claims could help it become the Google of image searching.

Images and video make up over 70% of the digital data available on the Internet, an estimated 15 billion images, but traditional search engines can’t index this information directly. Instead, they rely on text descriptions entered by hand.

Imense’s key innovation is a new form of image retrieval that automatically analyses images in terms of their content without the need for human generated captions.

To test their software, the company has formed a partnership with a group of particle physicists using a massive computer Grid!

‘We built a prototype of our new image analysis and search technology, but simply weren’t able to test our software on sufficiently large numbers of photos. We knew we could search tens of thousands of pictures, but couldn’t afford to try it on hundreds of thousands or millions of images. This made it difficult for Imense to get the investment we needed to develop a commercially viable product. That’s where our partnership with the particle physics Grid came in,’ explained Dr David Sinclair, one of the founders of Imense.

Spread across 17 sites, the UK particle physics Grid (GridPP) has been built to analyse the petabytes of data expected from Europe’s newest particle accelerator, the Large Hadron Collider. But its 8000 computers have also been shared with other researchers, from geophysicists to biologists.

Last year, Sinclair attended a meeting arranged by STFC about Grid opportunities for industry, and realised that Grid technology could be the answer to Imense’s problem. Image analysis is a naturally parallel process which fits perfectly with the capabilities of the Grid used by STFC scientists to process data in particle physics.

Professor Andy Parker, director of the eScience Centre, Cambridge University, led the particle physics team working with Imense. ‘Our team helped Imense develop their software to run on the Grid using a tool called Ganga, and supported them as they analysed three million images. We also dealt with issues such as security and working with Grid managers at other universities, who were very helpful,’ he said.

Imense have now reaped the rewards of their Grid experience, with an investment of more than £500,000 to help them bring a product or service to market in the coming months.

Dr Sinclair says that the Grid played a major role in this. ‘Our work with the Grid has let us demonstrate that our software can handle millions of images, at a time when we were a small company and couldn’t supply the computing power needed ourselves. This in turn impressed the investors we spoke to, and led to funding for our company.’