Repository logo
Institutional Digital Repository
Shreenivas Deshpande Library, IIT (BHU), Varanasi

Computational scalability with Apache Flume and Mahout for large scale round the clock analysis of sensor network data

Loading...
Thumbnail Image

Date

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

In this paper, a typical scenario has been considered wherein gas sensor array responses from a WAN deployed sensor network are being received hourly, 24×7. From every sensor node, we are retrieving Static as well as Dynamic Responses with 16 sensing elements generating a.csv file of 9 MB size. Considering 1000 sensor nodes, the data received at the Hadoop Cluster at our Data Centre would be about 9 GB, which can be even more if more number of nodes, over larger geographical area and/or higher density of nodes is considered. Hence, (i) to receive and store such a huge data from a sensor network and (ii) to analyse the received data, we explored the suitability of Apache Flume and Apache Mahout to deliver high performance computational scalability on Hadoop Distributed File System. In this work, an implementation methodology for realization of such a scalable system has been presented by considering a sensor network for air pollution observation over a large geographical area, as an example. © 2015 IEEE.

Description

Keywords

Citation

Collections

Endorsement

Review

Supplemented By

Referenced By