Data Ecosystem

A data ecosystem is the set of elements that interact with one another to produce, manage, store, organize, analyze, and share data. To put it simply, an ecosystem is a group of elements that interact with one another. Ecosystems can be large, like the jungle in a tropical rainforest or the Australian outback, or tiny, like tadpoles in […]


Data Lifecycle

The data life cycle is a framework that outlines the stages that data goes through from its initial creation or capture to its eventual deletion or archival. Here are the typical steps in the data life cycle:

2. Data Ingestion
3. Data Storage
4. Data Processing
5. Data Analysis
6. Data Visualization and Reporting
7. […]



Centroid

The next concept that is crucial for understanding how clustering generally works is the idea of centroids. If you remember your high school geometry, the centroid of a triangle is its center point: the point where the three medians meet, whose coordinates are the average of the three vertices. Similarly, in clustering, a centroid is the center point of a cluster being formed. Now before going to the […]
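To make the idea concrete, here is a minimal sketch that computes one centroid per cluster. It assumes the centroid is taken as the coordinate-wise mean of the cluster's points (the convention used by k-means); the function name `compute_centroids` and the sample points and labels are made up for illustration.

```python
from collections import defaultdict


def compute_centroids(points, labels):
    """Return {label: centroid}, where each centroid is the
    coordinate-wise mean of the points carrying that label."""
    # Group the points by their cluster label.
    groups = defaultdict(list)
    for point, label in zip(points, labels):
        groups[label].append(point)

    centroids = {}
    for label, members in groups.items():
        dims = len(members[0])
        # Average each coordinate over all members of the cluster.
        centroids[label] = tuple(
            sum(p[d] for p in members) / len(members) for d in range(dims)
        )
    return centroids


# Hypothetical data: cluster 0 is a triangle, cluster 1 is a pair of points.
points = [(1.0, 1.0), (3.0, 1.0), (2.0, 4.0), (10.0, 10.0), (12.0, 10.0)]
labels = [0, 0, 0, 1, 1]
centroids = compute_centroids(points, labels)
print(centroids)
```

Cluster 0 here is a triangle, so its centroid is the same point the geometry definition gives: the average of the three vertices.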


Cost Function

We can measure the accuracy of our hypothesis function by using a cost function. It takes an average (actually a fancier version of an average) of the differences between the hypothesis's results for the inputs x and the actual outputs y. J(\theta_0, \theta_1) = \dfrac{1}{2m} \displaystyle \sum_{i=1}^{m} \left( \hat{y}_{i} - […]
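The excerpt cuts the formula off, so the following is only a sketch under stated assumptions: that the sum continues as the usual squared difference between each prediction and its actual output, and that the hypothesis is the linear function theta_0 + theta_1 * x. The names `hypothesis` and `cost` and the sample data are illustrative, not taken from the post.

```python
def hypothesis(theta_0, theta_1, x):
    # Assumed linear hypothesis: a straight line with intercept theta_0
    # and slope theta_1.
    return theta_0 + theta_1 * x


def cost(theta_0, theta_1, xs, ys):
    """Assumed squared-error cost: the sum of squared differences between
    predictions and actual outputs, divided by 2m."""
    m = len(xs)
    total = sum(
        (hypothesis(theta_0, theta_1, x) - y) ** 2 for x, y in zip(xs, ys)
    )
    return total / (2 * m)


# Hypothetical data lying exactly on the line y = x.
xs = [1.0, 2.0, 3.0]
ys = [1.0, 2.0, 3.0]

perfect = cost(0.0, 1.0, xs, ys)  # parameters that match the data exactly
worse = cost(0.0, 0.5, xs, ys)    # a slope that is too shallow
print(perfect, worse)
```

Under these assumptions the cost is zero when the line passes through every point and grows as the predictions drift away from the actual outputs.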
