Text
Concept Extraction
Extract key concepts, definitions, and relationships from educational content across video, slides, and code
Note: This playground provides simulated output to showcase functionality. No input data is processed or stored on our servers. Use this demo to explore the feature extractor's capabilities before integrating it into your application.
Input
Enter the text you want to process
Required
URL pointing to content file. Supported formats: MP4, PDF, ZIP (code), TXT. Default: undefined
Required
Type of content to process. Default: undefined
Content domain for better extraction. Default: general
Extraction mode to use. Default: pattern-based
Extract concept prerequisite and related relationships. Default: true
Extract concept definitions from context. Default: true
Extract code/text examples demonstrating concepts. Default: true
Classify concepts by difficulty level. Default: false
Link concepts across video, slides, and code. Default: true
Generate semantic embeddings for concepts. Default: true
Minimum confidence threshold (0.0-1.0). Default: 0.7
Output
{"document_id": "doc_abc123","collection_id": "col_xyz789","source_object_id": "obj_def456","concepts": [{"id": "concept_001","name": "malloc","category": "memory_management","definition": "A C library function that allocates a specified number of bytes from the heap and returns a pointer to the allocated memory","confidence": 0.94,"sources": [{"type": "transcript","timestamp": 125.3,"text": "we use malloc to allocate memory dynamically...","confidence": 0.91},{"type": "slide","slide_number": 15,"title": "Dynamic Memory Allocation","confidence": 0.88},{"type": "code","file": "memory.c","line": 45,"snippet": "int *ptr = malloc(sizeof(int) * 10);","confidence": 0.96}],"difficulty_level": "intermediate","importance_score": 0.87,"frequency": 15,"examples": ["malloc(sizeof(int) * 10)","ptr = malloc(100)"],"prerequisites": ["pointers","heap_memory"],"related_concepts": ["free","calloc","realloc"],"parent_concepts": ["memory_management"],"child_concepts": [],"embedding": {"model": "bge-m3","dimension": 1024,"vector": [0.023,-0.142],"normalized": true}}],"graph_stats": {"total_concepts": 47,"categories": {"memory_management": 8,"data_structure": 12,"algorithm": 5,"function": 15,"principle": 7},"difficulty_distribution": {"beginner": 15,"intermediate": 22,"advanced": 10},"average_confidence": 0.84,"total_relationships": 156},"content": {"type": "video","filename": "cs50_lecture4_memory.mp4","domain": "programming","duration": 5430.5,"title": "CS50 2024 - Lecture 4 - Memory"}}
