Data-Efficient Learning
Select most informative data to label.
Query Strategies
Uncertainty: query least confident. Diversity: query representative samples. Expected model change: query would change model most.
Pool-Based
Label small initial set. Train model. Query from unlabeled pool. Repeat.
Applications
Labeling expensive: medical, scientific. Reduces labeling cost significantly.
Key Takeaways
- Query informative samples
- Uncertainty, diversity, change strategies
- Reduces labeling cost