Shaun Bierweiler, vice president of U.S. public sector at Hortonworks, has said the company will work to provide several data technology platforms for the U.S. Census Bureau in support of the 2020Â Census project under a contract awarded in 2017.
Bierweiler told Datanami in an interview published Tuesday that the software company will offer its Hortonworks Data Platform, Hortonworks Data Flow and Hadoop distributionÂ tool to help the agency gather and store data as well as derive â€œactionable intelligenceâ€ from raw data.
â€œWhen you think about the approximately 326 million Americans that the Census Bureau is going to collect and store data on, you need a data platform thatâ€™s going to not just perform, but really operate at that industrial scale,â€ he said.
HDP will work to help the agency establish the primary data lake designed to store census-related information that includes structured and unstructured data.
â€œOnce you have a foundation that supports [marrying structured and unstructured data], really the opportunities are endless,â€ Bierweiler said.
â€œThey have the flexibility of no longer being confined to name, address, and Zip code.â€
The Census Bureau is expected to use Hadoop to perform data processing functions and HDF to help facilitate the flow of data from fieldworkers into the Census Data Lake.