The various elements that interact with one another in order to produce, manage, store, organize, analyze and share data To put it simply, an ecosystem is a group of elements that interact with one another. Ecosystems can be large, like the jungle in a tropical rainforest or the Australian outback. Or, tiny, like tadpoles in […]
The data life cycle is a framework that outlines the stages that data goes through from its initial creation or capture to its eventual deletion or archival. Here are the typical steps in the data life cycle: 2. Data Ingestion: 3. Data Storage: 4. Data Processing: 5. Data Analysis: 6. Data Visualization and Reporting: 7. […]
The languages of Data Science For anyone just getting started on their data science journey, the range of technical options can be overwhelming. There is a dizzying amount of choice when it comes to programming languages. Each has its own strengths and weaknesses and there is no one right answer to the question of which one you should […]
Let’s understand some of the factors that can impact the final clusters that you obtain from the K-means algorithm. This would also give you an idea about the issues that you must keep in mind before you start to make clusters to solve your business problem. Thus, the major practical considerations involved in K-Means clustering […]
Let’s go through the K-Means algorithm using a very simple example. Let’s consider a set of 10 points on a plane and try to group these points into, say, 2 clusters. So let’s see how the K-Means algorithm achieves this goal. [Note: If you don’t know what is meant by Euclidean distance, you’re advised to […]