AI Glossary

S

AI Terms Starting With “S”

17 terms defined

Scaling Laws

Core AI

Empirical relationships discovered by AI researchers showing that model performance improves predictably as model size (parameters), training data, and compute are increased. Scaling laws, first formalized by OpenAI researchers in 2020, provided the theoretical foundation for the race to build ever-larger models. They suggest that simply making models bigger and training them on more data reliably produces smarter AI.

Related:ParametersTraining DataFoundation ModelLarge Language Model

Schema Markup

GEO

Structured data code (typically in JSON-LD format) added to a webpage that explicitly tells search engines and AI systems what the content is about — including the type of entity (person, organization, course, FAQ, product), its properties, and relationships. Schema markup is one of the most powerful GEO signals because it makes your content machine-readable.

Related:Generative Engine Optimization (GEO)Knowledge GraphTopical Authority

Search Engine Optimization (SEO)

GEO

The practice of optimizing a website to rank higher in traditional search engine results pages (SERPs). SEO encompasses on-page optimization (keywords, meta tags, content quality), technical SEO (site speed, mobile-friendliness, structured data), and off-page SEO (backlinks, brand mentions). In 2026, SEO and GEO are increasingly intertwined as AI reshapes search.

Related:Generative Engine Optimization (GEO)Schema MarkupTopical Authority

Semantic Search

GEO

Search technology that understands the meaning and intent behind a query rather than just matching keywords. Semantic search uses embeddings and knowledge graphs to find content that is conceptually relevant, even if it does not contain the exact search terms. Both modern SEO and GEO rely heavily on semantic relevance.

Related:EmbeddingKnowledge GraphGenerative Engine Optimization (GEO)

Semantic Similarity

Technical

A measure of how alike two pieces of text are in meaning, regardless of the exact words used. AI systems calculate semantic similarity by comparing the vector embeddings of text. High semantic similarity means two sentences convey the same idea even if they use different words. Semantic similarity is the foundation of semantic search, RAG systems, and recommendation engines.

Related:EmbeddingSemantic SearchVector Database

Sentiment Analysis

Technical

An AI technique that identifies and extracts the emotional tone (positive, negative, neutral) from text. Businesses use sentiment analysis to monitor customer reviews, social media mentions, and support tickets at scale. In real estate, sentiment analysis can track market sentiment from news articles and social media to identify emerging trends.

Related:Natural Language Processing (NLP)Natural Language Understanding (NLU)

Sora

AI Video

OpenAI's text-to-video AI model, released in 2024, capable of generating realistic and imaginative video scenes from text prompts. Sora can create videos up to one minute long with complex camera movements, detailed scenes, and multiple characters. It represents a major leap in AI video generation capability.

Related:Text-to-VideoOpenAIRunway

Sparse Model

Technical

An AI model architecture where only a subset of the model's parameters are activated for any given input, rather than using all parameters for every computation. Sparse models (like Mixture of Experts) are more computationally efficient than dense models of the same total parameter count because they route each input to only the most relevant 'expert' sub-networks.

Related:Mixture of Experts (MoE)ParametersInference

Speech Recognition

Technical

AI technology that converts spoken language into written text. Also called Automatic Speech Recognition (ASR) or Speech-to-Text (STT). Modern speech recognition systems (like OpenAI's Whisper) achieve near-human accuracy across multiple languages and accents. Speech recognition powers voice assistants, meeting transcription tools like Otter.ai and Fireflies.ai, and voice-controlled AI interfaces.

Related:Text-to-Speech (TTS)Voice AINatural Language Processing (NLP)

Stable Diffusion

Core AI

An open-source text-to-image diffusion model developed by Stability AI, released in 2022. Unlike Midjourney (closed API) or DALL-E (OpenAI only), Stable Diffusion can be downloaded and run locally on consumer hardware, making it the most widely deployed open-source image generation model. It spawned a massive ecosystem of fine-tuned models and tools.

Related:Diffusion ModelText-to-ImageOpen Source AIMidjourney

Streaming (AI)

Technical

The technique of sending AI-generated text to the user word-by-word (or token-by-token) as it is generated, rather than waiting for the complete response before displaying it. Streaming dramatically improves the perceived speed and responsiveness of AI interfaces. ChatGPT, Claude, and virtually all modern AI chat interfaces use streaming.

Related:InferenceTokenLatency

Structured Output

Technical

AI output formatted in a specific, machine-readable structure — such as JSON, XML, or a table — rather than free-form prose. Structured output is essential for AI integrations where the output needs to be processed by another system. Modern LLMs support structured output through JSON mode or function calling, ensuring the response always matches a predefined schema.

Related:Function CallingTool UseAPI (Application Programming Interface)

Suno AI

AI Apps

An AI music generation platform that creates full songs — with vocals, lyrics, and instrumentation — from a text prompt. Users describe a style, mood, and topic, and Suno generates a complete, radio-quality track in seconds. Suno is the leading consumer AI music tool and is widely used by content creators, marketers, and musicians for background music, jingles, and creative exploration.

Example: Typing 'upbeat country song about buying your first investment property' and Suno generating a complete 3-minute song with vocals.
Related:Generative AIText-to-Speech (TTS)AI Video

Supervised Learning

Technical

A machine learning approach where a model is trained on labeled examples — input-output pairs where the correct answer is provided. The model learns to map inputs to outputs by minimizing the difference between its predictions and the correct labels. Most practical AI applications, including spam filters, image classifiers, and recommendation systems, use supervised learning.

Related:Machine LearningTraining DataUnsupervised Learning

Synthesia

AI Apps

An AI video generation platform that creates professional talking-head videos with AI avatars in 140+ languages. Synthesia is widely used by corporate training teams, marketers, and HR departments to produce multilingual video content at scale without filming equipment or actors.

Related:AI AvatarHeyGenText-to-Video

Synthetic Data

Technical

Artificially generated data used to train or test AI models, created by AI systems rather than collected from the real world. Synthetic data is used when real data is scarce, expensive, private, or biased. Many frontier AI models now use AI-generated synthetic data as a significant portion of their training corpus.

Related:Training DataFine-tuningMachine Learning

System Prompt

Prompting

A special instruction given to an AI model before the conversation begins that sets its persona, behavior, constraints, and context. System prompts are used to customize AI assistants for specific roles — for example, instructing the AI to act as a customer service agent for a specific company, always respond in a certain format, or never discuss certain topics.

Example: "You are a professional real estate marketing copywriter. Always write in an energetic, benefit-focused style. Never use jargon."
Related:PromptPrompt EngineeringCustom GPT

Ready to Apply These AI Concepts?

Learn how to implement AI in your business, career, or real estate practice.