Blog

The future of the big data

Introduction Big data is everywhere on a much broader scale ushering the dawn of a new age. Gartner’s recent research report highlights that 80% of the data generated by corporations is coming from heterogeneous data sources in disorderly unstructured format. Enterprise and organizational applications produce 20% of the world’s data in a structured format. The traditional technologies no longer have the capabilities to handle such disorderly proliferation of dat...

Read more

Big data analytics for spatial processing

According to IBM, the world has generated big data more than 90% in the last two years. In today's world, the taxonomy of big data could be a misnomer. Big data refers to the term that deals with astounding amounts of data deluge generated every second represented by five Vs. Volume, Veracity, Velocity, Variety, and Value. Volume The proliferation of big data generated in the last few years is enormous. As of this article, the global online Internet population climbed t...

Read more

K-Nearest Neighbor Machine Learning algorithm

The German credit dataset can be downloaded from UC Irvine, Machine learning community to indicate the predicted outcome if the loan applicant defaulted or not. Applying the logistic regression with three variables duration, amount, and installment, K-means classification, and K-Nearest Neighbor machine learning algorithm. # Logistic regression # Load the file from the hard disk after setting the work directory germandata ...

Read more