Machine Learning Architect

Category: Machine Learning Basics

Where is the data stored in a trained ML model?

The final trained model doesn’t store the actual training data, but rather extracts patterns, relationships, and representations from it during training. How much of the training data is “stored” in a model depends on the type of model and how it’s designed, but here’s a general breakdown: Classical Machine Learning Models (e.g., Decision Trees, SVMs):…

October 21, 2024
Addressing Data Imbalance using GCP Native Tools

Sure! Here’s the content formatted as HTML: “`html GCP Tools for Handling Imbalanced Datasets GCP Tools for Handling Imbalanced Datasets in ML 1. AI Platform (Vertex AI) Custom Model Training: Vertex AI supports training with custom code, enabling you to implement techniques like oversampling, undersampling, or class weighting. Hyperparameter Tuning: You can tune parameters such…

September 27, 2024
F1 Score in Machine Learning

Understanding the F1 Score in Machine Learning The F1 score is a measure of a model’s accuracy that takes both precision and recall into account. It is the harmonic mean of precision and recall, giving a balanced view of the performance, especially for binary classification tasks. Precision and Recall Precision is the proportion of…

September 26, 2024
Machine Learning – Labeling Best Practices

Creating labels for a machine learning dataset is a critical step, especially for supervised learning tasks where models need to learn from **labeled** examples. Here’s how you can approach creating labels for different types of machine learning datasets: ### **Steps for Creating Labels** #### 1. **Understand the Problem Domain** – Before creating labels, you need…

September 18, 2024
Labeling vs Predictions in ML

In Machine Learning (ML), **labeling** and **prediction** refer to different stages in the machine learning workflow: 1. **Labeling**: – **Definition**: Labeling is the process of assigning correct or known output values (called labels) to data points. These labels are used during the training phase of supervised learning models. For example, in an image classification task,…

September 10, 2024
Machine Learning Model Parameters and Memory Usage

The **parameters** in a machine learning (ML) model directly affect the **memory usage** because they determine the amount of data the model needs to store and process during training and inference. The more parameters a model has, the more memory it consumes. Here’s a breakdown of how this works: ### 1. **Memory for Storing Parameters**…

September 10, 2024
Are parameters known prior to Training an ML Model?

### Are Parameters Known Prior to Training an ML Model? The process of determining the parameters of a model is known as **training**. During training, the model learns from the data by iteratively adjusting its parameters to optimize a given objective function (also known as a loss function). So – No, **parameters are not known…

September 10, 2024
Principal Component Analysis (PCA) versus Feature Crossing

Feature crossing and Principal Component Analysis (PCA) are both techniques used in machine learning to manipulate features, but they serve different purposes and operate differently. Feature Crossing Feature crossing involves creating new features by combining existing ones. This technique is particularly useful in linear models, where interactions between features can help the model capture more…

July 30, 2024
Real Time Object Detection Algorithms

YOLO provides real time object detection. Logistic Regression, Naive Bayes and SVC are not for Computer vision, but for conventional Machine Learning. Need a hands-on Data Architect, AI, ML or GCP Consultant? Need help with your data journey? Start the conversation today.

November 24, 2020
Reduce Dimensionality of Data using PCA

Reducing the number of Dimensions is a common step for pre-processing data. This enables us to overcome the downsides of dimensionality. Principal Component Analysis – PCA is a technique for achieving this but it works only on numerical data. Need a hands-on Data Architect, AI, ML or GCP Consultant? Need help with your data journey?…

November 24, 2020