Web crawler start-ups raise serious privacy issues
In a small office in Ashburn, Va., ensconced among the government contractors that make up the Dulles Technology Corridor, a start-up called Babel Street is bringing government-style surveillance to an entirely new market.
The company’s web crawlers, offered under a subscription called Babel X, trawl some 40 online sources, scooping up data from popular sites such as Instagram and a Korean social media platform as well as inside “dark web” forums where criminals lurk.
Police departments investigating a crime might use the service to scan posts linked to a certain neighborhood over a specified period of time. Stadium managers use it to hunt for security threats based on electronic chatter.
The Department of Homeland Security, county governments, law enforcement agencies and the FBI use it to keep tabs on dangerous individuals, even when they are communicating in one of more than 200 languages, including emoji.
The firm, staffed by former government intelligence veterans, is part of an insular but thriving cottage industry of data aggregators that operate outside of military and intelligence agencies. The 100-person company said it is profitable, something that is rare for a tech start-up in its third year. (It declined to release financial details.) It recently took on $2.25 million from investors, bringing its total capital raised from investors to just over $5 million.
A U.S. subsidiary of the European software giant SAP is its largest institutional investor.
Businesses like Babel Street have to tread an ethical line to avoid igniting privacy concerns, even though the data they access is generally publicly available on the internet. Groups such as the American Civil Liberties Union regard the industry’s growth as a worrying proliferation of online surveillance.
“These products can provide a very detailed picture of a person’s private life,” said Matt Cagle, an ACLU lawyer who studies the issue.
Last year, Chicago-based social media aggregator Geofeedia was thrust into the national spotlight when the ACLU published a report alleging it had helped police departments track racially charged protests in Baltimore and Ferguson, Mo.
The report prompted Twitter, Facebook and Instagram to cut ties with Geofeedia, eliminating important data sources.
Perhaps as a result, Babel Street does not access individual people’s Facebook profiles.
Babel Street’s executives say they have avoided controversy by adhering to privacy standards and limiting law enforcement officers’ access to the social media information they collect.
“If someone has arrest powers, they get less access to the data than other customers,” said Jeff Chapman, a former Navy intelligence officer who founded Babel Street in 2014.
The Pentagon was Babel Street’s first customer. Agencies focused on counterterrorism would use the company’s technology to monitor terrorists’ online chatter to predict attacks.
Brand management has become an important line of business, as corporations face the increasingly difficult challenge of tracking their digital reputations. Some companies pay Babel Street to find out whether their intellectual property is being used without permission.