
26 Data Intensive Computing

  Hello students, in this last blog of Unit 3 we look at data-intensive computing. Data-intensive computing covers problems where the data is very large in both count and size, ranging from a few megabytes to terabytes, petabytes, exabytes and zettabytes. Data at this scale is referred to as Big Data. Data volumes have exploded in recent years as technology has matured. Relational databases, built from tables, records, keys and relationships, struggle to manage Big Data, both because the data keeps growing exponentially and because of their strictly structured format and ACID restrictions. NoSQL databases were the answer for Big Data, that is, for data that is huge in volume and unstructured or semi-structured. Big Data can be split horizontally as well as vertically into managed datasets, which can be stored in repositories as required. One can query the datasets to learn about their content using metadata attributes ...
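
To make the idea of splitting and querying concrete, here is a minimal Python sketch. It is only an illustration under assumptions: the sample records, the field names (`id`, `region`, `name`, `clicks`) and the helper functions are all hypothetical and do not come from any particular Big Data product. A horizontal split keeps whole rows and divides them by the value of one field, a vertical split divides the columns and keeps the key in every piece, and a small catalog of metadata attributes lets us ask which datasets hold a given column.

```python
from collections import defaultdict

# Hypothetical sample records; the field names are illustrative only.
records = [
    {"id": 1, "region": "asia", "name": "A", "clicks": 10},
    {"id": 2, "region": "europe", "name": "B", "clicks": 7},
    {"id": 3, "region": "asia", "name": "C", "clicks": 3},
]


def split_horizontally(rows, key):
    """Horizontal split: group whole rows by the value of one field."""
    parts = defaultdict(list)
    for row in rows:
        parts[row[key]].append(row)
    return dict(parts)


def split_vertically(rows, id_field, column_groups):
    """Vertical split: divide the columns into groups, keeping the id in each."""
    return {
        name: [{id_field: row[id_field], **{c: row[c] for c in cols}} for row in rows]
        for name, cols in column_groups.items()
    }


def describe(name, rows):
    """Build simple metadata attributes (name, row count, columns) for a dataset."""
    return {"name": name, "rows": len(rows), "columns": sorted(rows[0]) if rows else []}


def find_datasets(catalog, column):
    """Query the metadata catalog for datasets that contain a given column."""
    return [meta["name"] for meta in catalog if column in meta["columns"]]


horizontal = split_horizontally(records, "region")
vertical = split_vertically(records, "id", {"profile": ["name"], "activity": ["clicks"]})

# The catalog stores metadata only, not the data itself.
catalog = [describe(f"region={k}", v) for k, v in horizontal.items()]
catalog += [describe(k, v) for k, v in vertical.items()]

print(find_datasets(catalog, "clicks"))
```

Note that the query touches only the metadata catalog, never the rows themselves, which is why metadata lookups stay cheap even when the datasets are very large.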