Posts

Showing posts from February, 2016

Exploring the world on-the-go using Google Cloud Vision and Twilio

Getting a computer to see and understand stuff is hard. Way hard. Like, hard for a long time. For a bit of context, slide 15 seems to be the definitive abbreviated history, dating all the way back to 1966. If you want to dig into the details, there's a free Udacity course on computer vision offered by Georgia Tech.

However, you can add computer vision and intelligence capabilities to your applications without a deep understanding of machine learning by using Google Cloud Vision. The Vision API lets you send across an image, along with instructions for what you're looking to find, and returns what it "sees" in your image. You can look for labels, logos, and landmarks, along with other things that don't start with the letter "l". If you're processing images that include people, you can look for faces or detect sentiment.

My first project with the Vision API is quite simple: let people text an image, and receive a reply with up to three things that the …
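
To make the "send an image plus instructions" idea a bit more concrete, here is a minimal sketch in Python of what a label-detection request body for the Vision API's REST `images:annotate` endpoint might look like, plus a small helper that pulls the top label descriptions out of a response. The helper names (`build_label_request`, `top_labels`) and the sample response are made up for illustration, and no network call is made; this is not necessarily how the project in this post does it.

```python
import base64
import json

# REST endpoint for the Vision API; a real call would also need an API key
# or OAuth credentials, which this sketch deliberately leaves out.
VISION_ENDPOINT = "https://vision.googleapis.com/v1/images:annotate"


def build_label_request(image_bytes, max_results=3):
    """Build the JSON body asking the Vision API for labels (hypothetical helper)."""
    return {
        "requests": [
            {
                # The image travels as base64-encoded text inside the JSON body.
                "image": {"content": base64.b64encode(image_bytes).decode("ascii")},
                # The "instructions": which kind of detection to run, and how many results.
                "features": [{"type": "LABEL_DETECTION", "maxResults": max_results}],
            }
        ]
    }


def top_labels(response, limit=3):
    """Return up to `limit` label descriptions from a response dict (hypothetical helper)."""
    responses = response.get("responses") or [{}]
    annotations = responses[0].get("labelAnnotations", [])
    return [annotation["description"] for annotation in annotations[:limit]]


if __name__ == "__main__":
    # Placeholder bytes stand in for a real image file.
    print(json.dumps(build_label_request(b"not a real image"), indent=2))

    # Invented response, shaped like the API's labelAnnotations output.
    sample_response = {
        "responses": [
            {
                "labelAnnotations": [
                    {"description": "dog", "score": 0.98},
                    {"description": "pet", "score": 0.95},
                    {"description": "mammal", "score": 0.91},
                    {"description": "grass", "score": 0.80},
                ]
            }
        ]
    }
    print(top_labels(sample_response))
```

Swapping `LABEL_DETECTION` for another feature type such as `LOGO_DETECTION`, `LANDMARK_DETECTION`, or `FACE_DETECTION` changes what the API looks for, while the shape of the request stays the same.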