Assume you are the Director of IT in an organization you work for, have knowledge of, or have researched.
In view of the need to handle increasingly larger amounts of data and to better cope with the ensuing big data V-characteristics, you need to convince the CEO that the organization needs to transform to a big-data-driven enterprise (BDDE).
The big data V-characteristics are:
Volume means that big data is always large in volume, and it doesn’t have to be a certain number of petabytes to qualify.
Velocity or speed refers to how fast the data is coming in but also to how fast you need to be able to analyze and utilize it.
Variety points to the number of sources or incoming vectors leading to your databases.
If you can’t trust the data itself, the source of the data, or the processes you are using to identify which data points are important, you have a Veracity problem.
Create a presentation (using, for example, PowerPoint) that addresses the following questions in order to convince the CEO of the benefits of becoming a BDDE:
Explain how a Hadoop Distributed File System (HDFS) manages to achieve scalability, durability, and high sequential read/write performance.
Explain why Hadoop does not have a single point of failure.
Describe the key benefits of MapReduce while considering principles, such as recovery, scalability, speed, and simplicity.
Discuss the benefits of using Apache Pig. Consider terms, such as simplicity, development time, expressiveness, and extensibility. For which categories of big data jobs should Pig be utilized, and why?
If someone decides to adopt HDFS, explain how the hardware price increases in relation to the amount of data the enterprise needs to store, and how this is achieved.
Finally, in one slide, make a convincing summary of your business case for becoming a Big-Data-Driven-Enterprise
Last Completed Projects
| topic title | academic level | Writer | delivered |
|---|
