Lynn Cherny (em-lyon)
Challenges:
What to display once you find it, and
How to display it.
There are only so many pixels on the screen.
the background.
Also: British Library images
French Street Name Signs Dataset
Images from Google Maps Streetview (> 1 million)
10K annotated cats
A: Machine Learning.
Source on Google Big Query
SELECT * FROM [bigquery-public-data:open_images.dict] as dict
INNER JOIN [bigquery-public-data:open_images.labels] as lab
ON lab.label_name = dict.label_name
INNER JOIN [bigquery-public-data:open_images.images] as image
ON image.image_id = lab.image_id
WHERE dict.label_display_name = 'champagne'
AND lab.confidence >= .7
LIMIT 100
BigQuery SQL to find images tagged with "champagne":
YouTube8m dataset from Google
Surveillance Video, VIRAT: "The dataset is designed to be realistic, natural and challenging for video surveillance domains in terms of its resolution, background clutter, diversity in scenes, and human activity/event categories"
Unsecured webcams:
The Sheep Market is a collection of 10,000 sheep created by workers on Amazon’s Mechanical Turk. Each worker was paid $.02 (US) to “draw a sheep facing left.”
"We collected Instagram posts and collected engagement logs per post such as the number of likes and comments. In addition to these features, we added visually meaningful tags such as facial emotion, brand logo, and the number of faces based on deep learning models."
Small data, big impact: Context.
For example, rather than thinking of “photography” as a single phenomenon, it is more precise to consider it as a collection of many different “photo cultures”, each with its set of distinct aesthetic rules and defining mechanisms. ... Using deep learning, we detect 1000 types of content in the dataset of 100,000 images [from Instagram]
Instagram and Contemporary Image, by Lev Manovich
Also see FotoForensics, imageforensic.org
Twitter API is very structured, including media entities tweeted:
True or False? Every scene in Hollywood movies by how "accurate" they are
Metadata: Mario Klingemann's tsne map of tags
Yelp review dataset, w2v model by me in gensim shown in Tensorboard
50 Years of Avengers Colors
and demos
the #champagne selfies...
#champagne with text...
Increasingly theoretic-technical problems:
Can the "eye of the machine" see things the human observer might overlook or not see?
In a useful way, not just in a "mistake" way.
Slides at https://ghostweather.slides.com/lynncherny