Ai and ML Glossary
These are just a few of the many terms and concepts related to AI, LLM, and ML. As with any field, there are always new developments and emerging trends, so it’s important to stay curious and continue learning. Enjoy!
- Large language models, such as GPT-4 and its predecessors, are a type of AI that specializes in processing natural language. They are based on deep learning algorithms that are trained on vast amounts of text data to understand the structure and meaning of language. These models are able to generate coherent sentences, answer questions, and even write entire articles or stories.
- Artificial intelligence (AI) is a broader term that encompasses any technology that can simulate human intelligence, including but not limited to natural language processing. AI can be used in a wide range of applications, from image recognition and speech recognition to predictive analytics and robotics.
- Machine learning (ML) is a subfield of AI that focuses on building algorithms that can learn from data and make predictions or decisions based on that learning. ML algorithms can be supervised, unsupervised, or semi-supervised, and they can be used for tasks such as classification, regression, clustering, and reinforcement learning.
- Deep Learning: A subfield of ML that uses neural networks with multiple layers to learn complex patterns in data.
- Neural Networks: A set of algorithms that mimic the structure and function of the human brain, used in deep learning to learn from data.
- Supervised Learning: A type of ML where the algorithm is trained on labeled data to predict outcomes for new, unlabeled data.
- Unsupervised Learning: A type of ML where the algorithm learns to recognize patterns in unlabeled data, without being given specific outcomes to predict.
- Reinforcement Learning: A type of ML where the algorithm learns by receiving feedback in the form of rewards or penalties based on its actions.
- Natural Language Processing (NLP): A subfield of AI that focuses on enabling machines to understand and process human language.
- Chatbot: A computer program designed to simulate conversation with human users, often using NLP and ML.
- Computer Vision: A subfield of AI that focuses on enabling machines to interpret and understand visual data from the world around them.
- Image Recognition: A type of computer vision that involves using ML algorithms to classify and identify objects within digital images.
- Anomaly Detection: A technique that involves using ML algorithms to detect unusual or anomalous patterns in data.
- Clustering: A technique that involves grouping data points together based on their similarities or differences.
- Regression: A type of ML that involves predicting a continuous numerical value based on input variables.
- Classification: A type of ML that involves predicting a categorical value or label based on input variables.
- Overfitting: A phenomenon in which a model is too closely fitted to the training data, resulting in poor generalization to new, unseen data.
- Underfitting: A phenomenon in which a model is too simplistic and fails to capture the complexity of the data, resulting in poor performance on the training data.
- Big Data: Extremely large data sets that can be analyzed computationally to reveal patterns, trends, and associations, often used in ML algorithms.
- Data Preprocessing: The process of preparing and cleaning data to make it usable for ML algorithms, often involving tasks such as normalization, feature scaling, and missing value imputation.
- Feature Engineering: The process of selecting and transforming input features in a way that improves the performance of an ML algorithm.
- Hyperparameters: Parameters of an ML algorithm that are set before training, often based on trial and error or other tuning methods.
- Bias: In ML, bias refers to the tendency of an algorithm to consistently over- or under-predict outcomes due to flaws in the model or training data.
- Variance: In ML, variance refers to the tendency of an algorithm to produce different outcomes when trained on different data sets.
- Ensemble Learning: A technique that involves combining multiple ML algorithms to improve predictive performance, often used in complex or uncertain scenarios.
- Deep Reinforcement Learning: A subfield of RL that involves training deep neural networks to learn complex policies in response to feedback.
- Transfer Learning: A technique that involves using a pre-trained ML model as a starting point for a new task, often to reduce the amount of training data needed or to improve performance.
- Generative Adversarial Networks (GANs): A type of neural network that involves two networks working together in a competition to generate realistic synthetic data.
- Image Segmentation: A computer vision task that involves dividing an image into multiple segments or regions based on visual similarities or differences.
- Object Detection: A computer vision task that involves identifying and localizing specific objects within an image or video.