关键词:部署;管理;web服务;Flume;可伸缩的;日志数据
摘 要:Machine-generated log data is valuable in locating causes of various hardware and software failures. The information derived from it can provide feedback in improving system architecture, reducing system degradation, and improving up-time. Recently, businesses have started using this log data for deriving business insight. Using a fault-tolerant architecture, Flume is a distributed, service for efficiently collecting, aggregating, and moving large amounts of log data. In this article, we will learn how to deploy and use Flume with a Hadoop cluster and a simple distributed web service.