Survey of Software#
Systematic coverage of general-purpose software libraries. Measured research on algorithms, data structures, ML, and infrastructure—so you can build with confidence instead of guessing.
What is this? → | The Vision → | Method →
Claude Skill → Let Claude consult this research library directly in conversations. Ask about library selection, and Claude fetches surveys, synthesizes recommendations, or runs live research on uncovered topics.
Newsletter: New library research published as it’s ready. Monthly digest of what’s new — subscribe to stay current →
1.001-009: Sorting & Searching Algorithms#
Completed: 9/9
- ✅ 1.001 Sorting Libraries - Timsort, parallel sorts, radix sorts
- ✅ 1.002 Fuzzy Search - FuzzyWuzzy, RapidFuzz, TheFuzz
- ✅ 1.003 Full-Text Search - Whoosh, Tantivy, MeiliSearch clients
- ✅ 1.004 Binary Search Trees - B-trees, AVL, Red-Black implementations
- ✅ 1.005 Spatial Search - PostGIS, Elasticsearch Geo, rtree
- ✅ 1.006 Graph Search - A*, Dijkstra, BFS/DFS implementations
- ✅ 1.007 Pattern Matching Algorithms - KMP, Boyer-Moore, Aho-Corasick, Rabin-Karp
- ✅ 1.008 Time Series Search - DTW, shapelet discovery
- ✅ 1.009 Similarity Search - LSH, MinHash, SimHash
1.010-019: Graph & Network Algorithms#
Completed: 10/10
- ✅ 1.010 Graph Analysis - NetworkX, graph-tool, igraph
- ✅ 1.011 Graph Database Clients - Neo4j, ArangoDB, TigerGraph
- ✅ 1.012 Community Detection - Louvain, label propagation, spectral
- ✅ 1.013 String Algorithms
- ✅ 1.014 Network Flow - Max flow, min cost flow
- ✅ 1.015 Graph Visualization - Graphviz, Gephi, Cytoscape
- ✅ 1.016 Social Network Analysis - Centrality, clustering coefficient
- ✅ 1.017 Bipartite Matching Libraries - NetworkX, scipy.optimize, lapjv
- ✅ 1.018 Graph Embedding - node2vec, DeepWalk, PyTorch Geometric, DGL
- ✅ 1.019 Dynamic Graphs - NetworkX temporal, DyNetx, teneto
1.020-029: Mathematical & Statistical Algorithms#
Completed: 7/10
- ✅ 1.020.1 Linear Programming - CVXPY, OR-Tools, PuLP optimization
- ✅ 1.021 Statistical Testing Libraries - Hypothesis testing, A/B testing
- ✅ 1.022 Optimization - scipy.optimize, OR-Tools, CVXPY, PuLP
- ✅ 1.023 Symbolic Math Libraries - SymPy vs SageMath
- ✅ 1.024 Random Number Generation Libraries - PCG, Mersenne Twister
- ✅ 1.025 Prime Factorization & Primality Testing Libraries - Factorization, primality testing
- 1.026 Combinatorics - Permutations, combinations, partitions
- ✅ 1.027 Numerical Integration Libraries - Quadrature methods
- 1.028 FFT Libraries - FFTW, pyFFTW, scipy.fft
- 1.029 Matrix Decomposition - SVD, eigenvalue, QR
1.030-039: String & Text Algorithms#
Completed: 12/15
- ✅ 1.030 String Matching - Exact, approximate, regex engines
- ✅ 1.031 Text Diff - Myers, patience diff, semantic diff
- ✅ 1.032 String Metrics - Edit distance, Jaro-Winkler, cosine
- ✅ 1.033 NLP Libraries - spaCy, Transformers, NLTK
- ✅ 1.033.1 Intent Classification - Zero-Shot, SetFit, sentence-transformers
- ✅ 1.033.2 Chinese Word Segmentation - Jieba, CKIP, pkuseg, LTP
- ✅ 1.033.3 CJK Tokenizers for LLMs - SentencePiece, tiktoken, HuggingFace
- ✅ 1.033.4 Named Entity Recognition (CJK) - spaCy, Stanford NER, BERT-NER
- ✅ 1.034 Email Libraries - smtplib, yagmail, MIME handling
- ✅ 1.035 Tokenization - SentencePiece, HuggingFace Tokenizers, tiktoken, YouTokenToMe
- ✅ 1.035.1 Chinese Tokenization - jieba, pkuseg, word segmentation strategies
- 1.036 Stemming & Lemmatization - Porter, Snowball, WordNet
- 1.037 String Compression - LZ4, Snappy, Zstandard for strings
- 1.038 Unicode Handling - Normalization, transliteration
- ✅ 1.039 Template Engines - Jinja2, Handlebars, Mustache
1.040-049: Data Structure Libraries#
Completed: 4/11
- ✅ 1.040 Collections - sortedcontainers, blist, pyrsistent
- 1.041 Probabilistic Structures - Bloom filters, count-min sketch
- ✅ 1.042 Tries - pygtrie, datrie, marisa-trie
- 1.043 Heaps & Priority Queues - Binary heap, Fibonacci heap
- ✅ 1.043.1 Task Queues - Celery, RQ, Huey, Dramatiq
- 1.044 Specialized Trees - Union-Find, Interval tree, Segment tree
- 1.045 Specialized Buffers - Circular buffer, ring buffer
- 1.046 Available for future use
- ✅ 1.047 Caching - Redis, Memcached, Varnish
- 1.048 Available for future use
- 1.049 Meta-Data Structures - Schema inspection, reflection
1.050-059: Compression & Encoding#
Completed: 3/10
- ✅ 1.050 Compression - zlib, brotli, zstd benchmarks
- 1.051 Image Compression - WebP, AVIF, JPEG XL
- 1.052 Video Codecs - FFmpeg bindings, av
- 1.053 Audio Compression - FLAC, Opus, MP3
- 1.054 Base Encoding - base64, base58, base32
- ✅ 1.055 Binary Serialization - msgpack, protobuf, flatbuffers
- ✅ 1.056 JSON Libraries - orjson, ujson, rapidjson
- 1.057 CSV Parsing - Performance comparison
- 1.058 XML Processing - lxml vs ElementTree vs xmltodict
- 1.059 YAML Libraries - PyYAML vs ruamel.yaml
1.060-069: Cryptography, Networking & Security#
Completed: 4/10
- ✅ 1.060 Cryptographic Libraries - cryptography vs PyNaCl
- ✅ 1.061 Hashing - hashlib, xxhash, blake3
- ✅ 1.062 Password Hashing - argon2-cffi vs bcrypt (covered in 1.060)
- ✅ 1.063 JWT Libraries - PyJWT, python-jose, Authlib
- 1.064 Threshold Signature / Secret Sharing - FROST, Shamir’s Secret Sharing, threshold ECDSA
- 1.065 P2P / Mesh Networking - libp2p, ZeroMQ, nanomsg, NAT traversal, relay discovery
- 1.066 Onion Routing / Anonymous Transport - Tor stem API, I2P SAM bridge, Nym mixnet
- 1.067 WebSocket Libraries - websockets, aiohttp WS, ws, Socket.IO, µWebSockets
- 1.068 API Proxy / MITM Frameworks - mitmproxy, Envoy, Traefik, transparent interception
- 1.069 Mobile Secure Storage / Keychain - iOS Keychain, Android Keystore, react-native-keychain
1.070-079: Machine Learning Algorithms#
Completed: 4/10
- 1.070 Clustering - scikit-learn vs hdbscan vs fastcluster
- ✅ 1.071 Dimensionality Reduction - UMAP, t-SNE, PCA
- 1.072 Anomaly Detection - Isolation Forest, LOF, autoencoders
- ✅ 1.073 Time Series Forecasting - Prophet, Darts, statsmodels
- ✅ 1.074 Gradient Boosting - XGBoost, LightGBM, CatBoost
- ✅ 1.075 Deep Learning Frameworks - PyTorch, TensorFlow, JAX, MXNet
- 1.076 Reinforcement Learning - Stable Baselines, Ray RLlib
- 1.077 AutoML - auto-sklearn vs TPOT vs H2O
- 1.078 Feature Engineering - Featuretools, tsfresh
- 1.079 Model Interpretation - SHAP vs LIME
1.080-089: Geometric & Spatial Algorithms#
Completed: 2/11
- ✅ 1.080 Image Processing - OpenCV, scikit-image, Pillow
- ✅ 1.080.1 QR Code Generation - qrcode, segno, pyqrcode
- 1.081 Convex Hull - Graham scan, Jarvis march
- 1.082 Voronoi & Delaunay - scipy.spatial, triangle
- 1.083 Point Cloud Processing - Open3D, PCL bindings
- 1.084 Mesh Processing - Trimesh, PyMesh
- 1.085 Collision Detection - Shapely, SAT algorithms
- 1.086 Path Planning - RRT, PRM, A* for robotics
- 1.087 Geographic Libraries - GeoPy, geopandas, Folium
- 1.088 Coordinate Transformation - pyproj, utm
- 1.089 Spatial Indexing - H3, S2, Geohash
1.090-099: Specialized Algorithm Domains#
Completed: 3/15
- 1.090 Bioinformatics - Sequence alignment, BLAST, BioPython
- 1.091 Computer Vision (category)
- 1.091.1 Object Detection - YOLO, Detectron2, MMDetection
- ✅ 1.091.2 Face Detection - MediaPipe, Dlib, InsightFace
- 1.091.3 OCR - Tesseract, EasyOCR, PaddleOCR
- 1.091.4 Scene Detection / Video Segmentation - PySceneDetect, TransNetV2
- 1.092 Signal Processing - scipy.signal vs librosa
- 1.092.1 Audio Processing - librosa, pydub, audioread
- 1.093 Quantum Computing - Qiskit vs Cirq vs Pennylane
- ✅ 1.094 Constraint Solving - Z3, OR-Tools, PySMT
- 1.095 Game Theory & Auctions - Nashpy, game theory solvers
- ✅ 1.096 Scheduling - APScheduler, schedule, Airflow
- 1.097 Recommendation Systems - Surprise, LightFM, Implicit
- 1.098 Voting & Consensus - Social choice algorithms
- 1.099 Distributed Algorithms - Raft, Paxos implementations
1.100-109: Text & Document Processing#
Completed: 5/13
- ✅ 1.100 Text Processing - Regex, string manipulation, NLP utilities
- ✅ 1.101 PDF Processing - ReportLab, WeasyPrint, PyPDF2
- ✅ 1.102 Document Parsing - python-docx, openpyxl, pandas Excel
- ✅ 1.103 Markdown Processing - Python-Markdown, Mistune, markdown-it-py, CommonMark
- 1.104 Syntax Highlighting - Pygments, highlight.js
- ✅ 1.104.1 Code Parsing & AST - libcst, ast, redbaron
- ✅ 1.104.2 Code Formatting - Code formatting and linting libraries for Python and JavaScript/TypeScript. Black, ruff, autopep8, Prettier, ESLint.
- 1.105 Translation & i18n - gettext, Babel, i18next
- ✅ 1.106 Speech Recognition & TTS - Whisper, Coqui, Piper
- 1.106.1 Speaker Diarization - pyannote, NeMo, who’s talking
- 1.107 OCR Libraries - Tesseract, EasyOCR
- 1.108 Spell Checking - SymSpell, LanguageTool
- 1.109 Content Extraction - BeautifulSoup, Scrapy, newspaper3k
1.110-119: User Interface & Frontend#
Completed: 12/16
- ✅ 1.110 Frontend Frameworks - React, Vue, Svelte, Angular
- 1.110.1 React Meta-Frameworks - Next.js, Remix, Gatsby
- 1.110.2 Vue Meta-Frameworks - Nuxt, VitePress
- 1.110.3 Svelte Meta-Frameworks - SvelteKit
- ✅ 1.110.4 Browser Python Execution
- ✅ 1.110.5 Static Site Generators - Hugo, Docusaurus, MkDocs, Jekyll
- ✅ 1.111 State Management - Redux, Zustand, Jotai, Pinia
- ✅ 1.112 CSS Frameworks - Tailwind, Bootstrap, Material UI
- ✅ 1.113 UI Component Libraries - shadcn/ui, Radix, Headless UI
- ✅ 1.114 Build Tools - Vite, Webpack, Turbopack, esbuild
- ✅ 1.115 Form & Validation - React Hook Form, Zod, Yup
- ✅ 1.116 Data Visualization - D3.js, Chart.js, Recharts
- ✅ 1.117 Animation - Framer Motion, GSAP, React Spring, Lottie
- ✅ 1.118 Testing - Jest, Vitest, Playwright, Cypress
- ✅ 1.119 Mobile Frameworks - React Native, Flutter, .NET MAUI, Ionic
- 1.119.1 NFC Libraries - iOS CoreNFC, Android NDEF, react-native-nfc-manager, Web NFC
- 1.119.2 Push Notification Libraries - ntfy, Gotify, UnifiedPush, expo-notifications, Web Push/VAPID
1.120-129: Simulation & Modeling#
Completed: 3/11
- ✅ 1.120 Discrete Event Simulation - SimPy, Salabim, Mesa
- 1.120.1 Network Protocol Simulation - ns-3, Shadow, Mininet for adversarial/Sybil testing
- 1.121 Agent-Based Modeling - Mesa, FLAME, Repast
- ✅ 1.122 Monte Carlo Simulation - scipy.stats, pymc
- 1.123 System Dynamics - PySD, BPTK-Py
- 1.124 Continuous Simulation - scipy.integrate, FEniCS
- 1.125 Network Simulation - ns-3, Mininet
- 1.126 Traffic Simulation - SUMO, MATSim
- ✅ 1.127 Financial Simulation - QuantLib, vectorbt
- 1.128 Physics Simulation - PyBullet, MuJoCo, Gazebo
- 1.129 Hybrid Simulation - Discrete + continuous combined
1.130-139: Business Application Platforms (Self-Hosted)#
Completed: 2/10
- ✅ 1.130 Self-Hosted CRM - Twenty CRM, Odoo, SuiteCRM
- ✅ 1.131 Project Management - Plane, Taiga, OpenProject
- 1.132 Collaboration - Nextcloud, Mattermost, Rocket.Chat
- 1.133 Documentation - BookStack, Wiki.js, Outline
- 1.134 E-Commerce - WooCommerce, Magento, Saleor
- 1.135 Analytics - Matomo, Plausible, Umami, PostHog
- 1.136 DevOps - GitLab, Gitea, Jenkins, Drone
- 1.137 Customer Support - Chatwoot, Zammad, Freescout
- 1.138 Marketing Automation - Mautic, Listmonk
- 1.139 Accounting/ERP - Odoo, ERPNext, Dolibarr
1.140-149: Language Learning & Linguistics#
Completed: 8/14
- ✅ 1.140 Classical Languages - CLTK, pyLatinam, PyWORDS
- ✅ 1.141 Spaced Repetition - SM-2, SM-18, FSRS algorithms
- ✅ 1.142 Flashcard Systems - Anki, genanki, AnkiConnect
- 1.143 Language Detection - langdetect, polyglot, fastText
- 1.144 Phonetics - IPA transcription, eSpeak, Forvo API
- ✅ 1.144.1 Pinyin/Zhuyin Conversion - pypinyin, dragonmapper, pinyin
- ✅ 1.144.2 Tone Analysis - librosa, praatio, pitch detection, tone sandhi
- 1.145 Grammar Checking - LanguageTool, GrammarBot
- 1.146 Vocabulary Databases - WordNet, ConceptNet, Wiktionary
- 1.147 Language Corpora - Brown Corpus, CoNLL datasets
- ✅ 1.148 Morphological Analysis - SudachiPy, pymorphy3, UDPipe
- ✅ 1.148.1 Chinese Morphological Analysis - HanLP, Stanza, LTP, cjklib
- ✅ 1.148.2 Classical Chinese Parsing - Stanford CoreNLP, ctext.org tools, Kanbun
- 1.149 SLA Tools - CEFR assessment, proficiency testing
1.150-159: Reading & Text Analysis (Language Learning)#
Completed: 3/15
- 1.150 Classical Text Corpora - Perseus, PHI5/PHI7, Latin Library
- 1.151 Vocabulary Frequency - DCC Core, Dickinson College
- 1.152 Reading Difficulty & Readability - Flesch-Kincaid, lexical coverage, i+1 detection
- ✅ 1.152.1 CJK Readability - Character frequency, HSK/TOCFL levels, graded readers
- 1.153 Sentence Parsing - Universal Dependencies parsers, Stanza, syntax trees
- ✅ 1.153.1 Chinese Dependency Parsing - UD Chinese, Stanford CoreNLP, HanLP
- 1.154 Graded Reader Generation - Text leveling, vocab substitution
- ✅ 1.154.1 Chinese Text Simplification - MCTS dataset, neural approaches
- 1.155 Parallel Text Alignment - Latin-English, sentence alignment
- 1.156 Text Annotation UI - Glossing, interlinear display
- 1.157 Reading Comprehension - Question generation, validation
- 1.158 Vocabulary Tracking - Learning curves, retention modeling
- 1.159 Adaptive Recommendation - Personalized difficulty
1.160-169: Character-Based Writing Systems (CJK)#
Completed: 7/8
- ✅ 1.160 Character Databases - Unihan, CHISE, IDS, CJKVI
- ✅ 1.161 Radical & Component Analysis - Character decomposition, semantic components
- ✅ 1.162 Handwriting Recognition (CJK) - Zinnia, Tegaki, Google Cloud Vision, Azure
- ✅ 1.163 Character Encoding - Big5, GB2312, GBK, GB18030, Unicode CJK
- ✅ 1.164 Traditional ↔ Simplified Conversion - OpenCC, HanziConv, zhconv-rs
- ✅ 1.165 Stroke Order & Writing (CJK) - Make Me a Hanzi, KanjiVG, animCJK
- ✅ 1.166 OCR (CJK-specific) - Tesseract, PaddleOCR, EasyOCR
- 1.167-1.169 Available for future use
1.170-179: Translation & Alignment#
Completed: 4/7
- ✅ 1.170 Machine Translation APIs - DeepL, Google Translate, Azure Translator, Amazon Translate
- ✅ 1.171 Sentence Alignment - Hunalign, Bleualign, vecalign
- ✅ 1.172 Translation Memory - TMX format, OmegaT, MemoQ
- ✅ 1.173 Terminology Extraction - KeyBERT, PyATE, YAKE, spaCy
- 1.174 Computer-Assisted Translation - CAT tool comparison
- 1.175 Literary Translation Tools - Context-aware translation
- 1.175-1.179 Available for future use
1.180-189: Database Libraries#
Completed: 11/11
- ✅ 1.180 Python ORMs - SQLAlchemy, Django ORM, SQLModel, Tortoise ORM
- ✅ 1.181 Database Migration - Alembic, Django migrations, Flyway
- ✅ 1.182 Database Diff & Schema Comparison - Atlas, skeema, migra
- ✅ 1.183 Database Testing - pytest fixtures, testcontainers, factory_boy
- ✅ 1.184 Connection Pool Libraries - SQLAlchemy pool, pgBouncer, HikariCP
- ✅ 1.185 Query Builder Libraries - SQLAlchemy Core, Knex.js, jOOQ
- ✅ 1.185.1 Database Schema Inspection - SQLAlchemy, Alembic
- ✅ 1.186 Time-Series Databases - TimescaleDB, InfluxDB, QuestDB
- ✅ 1.187 Embedded Databases - SQLite, DuckDB, LevelDB
- ✅ 1.188 Version-Controlled Databases - Dolt, TerminusDB, lakeFS
- ✅ 1.189 Database Observability - pg_stat_statements, pgBadger, PMM
1.190-199: Knowledge & Content Engineering#
Completed: 0/4
- 1.190 Knowledge Graph Construction - Building concept maps from extracted text
- 1.191 Corpus Analysis & Text Mining - Large document collection analysis
- 1.192 Book/Publishing Pipeline - Pandoc, Bookdown, manuscript-to-print
- 1.193 Digital Asset Management - Organizing large timestamped media collections
- 1.194-1.199 Available for future use
1.200-219: LLM & AI Stack#
Completed: 11/12
- ✅ 1.200 LLM Orchestration - LangChain, LlamaIndex, RAG
- ✅ 1.201 LLM Agent Frameworks - AutoGen, CrewAI, MetaGPT
- 1.202 Prompt Engineering - DSPy, guidance, LMQL
- ✅ 1.203 Vector Databases - ChromaDB, Pinecone, Qdrant
- ✅ 1.204 RAG Pipelines - Document loading, chunking, retrieval
- ✅ 1.205 LLM Evaluation - RAGAS, DeepEval, PromptFoo
- ✅ 1.206 RAG Chunking Patterns - Semantic, sliding window, hierarchical strategies
- ✅ 1.207 LLM Observability & Tracing - LangSmith, Helicone, LangFuse
- ✅ 1.208 Fine-tuning frameworks - Axolotl, LLaMA Factory, Unsloth, PEFT
- ✅ 1.209 Local LLM Serving - Ollama, vLLM, llama.cpp
- ✅ 1.210 Multilingual & CJK LLMs - BLOOM, XLM-RoBERTa, mBERT, ERNIE
- ✅ 1.211 CJK Embedding Models - M3E, text2vec-chinese, LaBSE, multilingual-e5
1.220-229: Calendar & Scheduling#
Completed: 8/8
- ✅ 1.220 CalDAV/iCalendar - icalendar, caldav, vobject
- ✅ 1.221 JavaScript Calendar UI - FullCalendar, Schedule-X, EventCalendar, tui.calendar
- ✅ 1.222 Calendar Aggregation - ical.js, tsdav, mergecal, recurring-ical-events
- ✅ 1.223 CalDAV Server Libraries - Radicale, Xandikos, custom CalDAV backends
- ✅ 1.224 Timezone Handling - pytz, zoneinfo, dateutil, Luxon, Temporal API
- ✅ 1.225 Meeting Scheduling - cal.com, Rallly, Easy!Appointments, CalDAV scheduling
- ✅ 1.226 Calendar Visualization - frappe-gantt, vis-timeline, cal-heatmap, echarts
- ✅ 1.227 Calendar Internationalization - Hijri, Jalali, Hebrew, CJK, ICU4X, CLDR
1.230-239: Social Networks & Messaging#
Completed: 2/10
- ✅ 1.230 Open Social Networks - ActivityPub, AT Protocol, Matrix, Nostr
- 1.231 AT Protocol/Bluesky - Decentralized social graph
- 1.232 Matrix Protocol - Federated chat & E2EE
- 1.233 Nostr - Simple relay protocol, Lightning payments
- ✅ 1.234 Bot SDK Frameworks - matrix-nio, discord.py, python-telegram-bot
- 1.235 Social Media APIs - Twitter, Reddit, Mastodon
- 1.236 Chat Webhooks - Slack, Discord, Teams
- 1.237 Push Notifications - FCM, APNs, WebPush
- 1.238 RSS/Atom Feeds - feedparser, aggregation
- 1.239 Social Graph Analysis - Follow relationships, communities
1.300-309: Civic & Government Data#
Completed: 6/10
- ✅ 1.300 Public Finance Modeling - Revenue forecasting, budget planning, fiscal analysis
- ✅ 1.301 Government Data Access - Open data APIs, FOIA tools, data.gov integration
- ✅ 1.302 Budget Document Parsing - PDF extraction, financial statement analysis
- ✅ 1.303 Civic Entity Resolution - Agency matching, jurisdiction mapping
- ✅ 1.304 Procurement & Contracts - Vendor analysis, contract tracking
- ✅ 1.305 Fiscal Health Metrics - Financial indicators, municipal credit analysis
- 1.306-1.309 Available for future use
Research Status#
Total Defined: 207 research slots Completed: 130 pieces (63%) Remaining: 77 pieces
Navigation: Use the sidebar to browse completed research, or select a category above.
Want to understand our approach? Read the Vision →
Want to replicate this research? See the Methodology →
© 2026 Ivan Schneider · Model Citizen Developer Licensed under CC BY 4.0