IoT Cameras Need to be Quick, Clever, and Able to Assign Meaning

Photo-illustration: Stuart Bradford

November’s Internet of Everything column discussed the need to rethink cameras for an era of always-on operation at every corner. We’ll also have to rethink the way those cameras see.

Today, computer vision can track cars, faces, and production processes as accurately as most people can. When there’s a lot of data to sift through, computer-vision models are better than people.

But there are limits. Computers still need more time than a human to recognize a person or action. They can’t follow a person or object between multiple video cameras. They can be fooled easily. They can’t assign meaning to what they see. These are the limits engineers must overcome to make cameras more useful in manufacturing and in smart cities.

Today’s cameras can typically perform inference—using algorithms to match incoming images against a predefined model—at roughly 30 frames per second. The speed depends on the complexity of those computer-vision algorithms.

All inference is a trade-off among cost, speed, memory, and accuracy. A camera that can quickly infer what something is might sacrifice accuracy. Or it might need more memory, which raises the cost of the device.

Thirty frames per second is fine for finding a face in a concert crowd after the fact. However, more complicated computer-vision tasks, such as spotting errors in a manufacturing process, require computers to process images faster or risk slowing down production lines, says Sophie Lebrecht, the director of operations at Xnor.ai, a company building software to improve computer vision. Xnor.ai’s goal is to track images at 60 frames per second.
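The arithmetic behind those frame rates is worth spelling out: the frame rate fixes how long a camera has to finish inference on each frame before the next one arrives. The short Python sketch below is illustrative only. The function names and the 25-millisecond model latency are assumptions made for the example, not figures from Xnor.ai.

```python
def frame_budget_ms(fps: float) -> float:
    """Return the time available to process one frame, in milliseconds."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return 1000.0 / fps


def keeps_up(inference_ms: float, fps: float) -> bool:
    """True if a model with this per-frame latency can sustain the frame rate."""
    return inference_ms <= frame_budget_ms(fps)


for fps in (30, 60):
    print(f"{fps} fps -> {frame_budget_ms(fps):.1f} ms per frame")

# Hypothetical model that needs 25 ms per frame (an assumed figure).
print(keeps_up(25.0, 30))  # fits inside the 30 fps budget
print(keeps_up(25.0, 60))  # too slow for 60 fps
```

Doubling the frame rate halves the budget, from about 33 milliseconds to about 17, so a model that comfortably keeps pace at 30 frames per second can fall behind at 60.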

Speeding up the frame rate at which computers can process images is just the first step. The next is to build software that can track an object between cameras in a network. For example, finding a person on one surveillance camera would allow the network to track that person as they walked in front of other cameras, automatically and in real time.

For that, we need fast image processing of complex models, plus software that runs across the camera network and can pick up the same person or object as it passes from one camera’s view to the next. The goal would be to do this within a single network, without sending data to the cloud. It would require one algorithm to recognize the person and another to track that person through physical space. It might also require a software overlay on the cameras or new communications protocols.
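One way to picture the recognize-then-track split is a shared gallery of appearance descriptors that every camera on the local network consults. The Python sketch below is a toy under stated assumptions: the `CrossCameraTracker` class, the three-number descriptors, the camera names, and the 0.9 similarity threshold are all hypothetical, and a real system would get its descriptors from a trained recognition model rather than hand-typed values.

```python
import math
from typing import Dict, List


def cosine_similarity(a: List[float], b: List[float]) -> float:
    """Cosine of the angle between two descriptors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class CrossCameraTracker:
    """Hypothetical shared gallery that hands out one ID per person across cameras."""

    def __init__(self, threshold: float = 0.9) -> None:
        self.threshold = threshold
        self.gallery: Dict[int, List[float]] = {}   # track ID -> first descriptor seen
        self.sightings: Dict[int, List[str]] = {}   # track ID -> cameras, in order
        self._next_id = 0

    def observe(self, camera_id: str, descriptor: List[float]) -> int:
        """Match a detection to a known track, or start a new track."""
        best_id, best_score = None, self.threshold
        for track_id, known in self.gallery.items():
            score = cosine_similarity(descriptor, known)
            if score >= best_score:
                best_id, best_score = track_id, score
        if best_id is None:
            best_id = self._next_id
            self._next_id += 1
            self.gallery[best_id] = descriptor
            self.sightings[best_id] = []
        self.sightings[best_id].append(camera_id)
        return best_id


tracker = CrossCameraTracker(threshold=0.9)
a = tracker.observe("cam-lobby", [0.90, 0.10, 0.30])
b = tracker.observe("cam-hall", [0.88, 0.12, 0.31])  # similar appearance
c = tracker.observe("cam-hall", [0.10, 0.90, 0.20])  # different appearance
print(a, b, c)
print(tracker.sightings[a])
```

Here the first two detections receive the same ID even though they come from different cameras, while the third starts a new track. Everything runs in one process, standing in for software that stays on the local network rather than in the cloud.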

Cameras will also need to withstand “adversarial attacks,” which are a brand-new area of research. Just as humans can be fooled by optical illusions, a computer’s vision may be deceived by small, deliberate distortions of an otherwise normal image, causing a program to perceive something that’s not there.
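To make the idea concrete, here is a minimal Python sketch of one such trick applied to a toy linear detector. It nudges every pixel by a small, fixed amount in whichever direction raises the detector's score, which is the idea behind gradient-sign attacks. The four-pixel "image," the weights, the bias, and the step size are all invented for illustration and do not come from any real camera or model.

```python
from typing import List


def score(weights: List[float], bias: float, pixels: List[float]) -> float:
    """Toy linear detector: a positive score means 'object present'."""
    return sum(w * p for w, p in zip(weights, pixels)) + bias


def sign(value: float) -> int:
    """Return -1, 0, or 1 according to the sign of value."""
    return (value > 0) - (value < 0)


def perturb(weights: List[float], pixels: List[float], epsilon: float) -> List[float]:
    """Shift each pixel by at most epsilon in the direction that raises the score.

    For a linear model the gradient of the score with respect to the pixels
    is simply the weight vector, so the step follows sign(weight).
    Pixel values are clipped to the valid range [0, 1].
    """
    return [min(1.0, max(0.0, p + epsilon * sign(w)))
            for w, p in zip(weights, pixels)]


# Hypothetical detector parameters and a four-pixel "image".
weights = [0.8, -0.5, 0.3, -0.6]
bias = 0.05
image = [0.2, 0.6, 0.3, 0.4]

adversarial = perturb(weights, image, epsilon=0.15)

print(score(weights, bias, image) > 0)        # original: nothing detected
print(score(weights, bias, adversarial) > 0)  # perturbed: detector now fires
```

No pixel moves by more than 0.15 on a 0-to-1 scale, yet the detector's decision flips. Models used in practice are far more complex than this, but the same pressure applies: many tiny changes, each pushing in the most damaging direction, can add up to a different answer.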

Perhaps the most difficult task is creating software that allows computers to ascribe meaning to what they see. It’s one thing to recognize that a person is crawling; it’s another for a camera to infer whether a person crawling across the floor needs help or is trying to avoid detection.

From there, the cameras—and their software—will need to decide what to do next. We’re a long way off from that, but researchers at Alphabet have already done impressive work trying to teach computer-vision algorithms to find meaning. It’s possible that one day, computers may see even better than we do, and will harness what they see for our benefit.

This article appears in the December 2018 print issue as “Cognitive Cameras.”

Cameras will require new software to perform surveillance and monitoring tasks autonomously
