Hebrew Nlp Toolkit
Verified94/100Guide developers in using Hebrew NLP models and tools including DictaLM, DictaBERT, AlephBERT, and ivrit.ai. Use when user asks about Hebrew text processing, Hebrew NLP, "ivrit", Hebrew tokenization, Hebrew NER, Hebrew sentiment analysis, Hebrew speech-to-text, or needs to process Hebrew language text programmatically. Covers model selection, preprocessing, and Hebrew-specific NLP challenges. Do NOT use for Arabic NLP (different tools) or general English NLP tasks.
Trust score 94/100 (Verified) · 1,365+ installs · 2 GitHub contributors · MIT license
Natural language processing for Hebrew remains a significant technical challenge. Hebrew is a morphologically rich language with unvocalized script, making tasks like entity extraction, sentiment analysis, and part-of-speech tagging far more complex than in English. Available models are scattered and not always well documented.
npx skills-il add skills-il/localization@v1.2.0-hebrew-nlp-toolkit --skill hebrew-nlp-toolkit -a claude-codeInstall on Claude.ai, Claude Desktop, ChatGPT, Manus, or other platforms
- 1. Click "Download ZIP" to download the skill files.
- 2. Open Claude Desktop and go to Customize > Skills.
- 3. Click "+" and select "Upload a skill", then upload the ZIP file.
- 4. Start a new conversation. The skill will activate automatically when relevant.
When to Apply
- When tokenizing and morphologically processing Hebrew text
- When performing Named Entity Recognition (NER) on Hebrew to extract names, places, organizations
- When using Hebrew BERT models like DictaBERT or AlephBERT for embeddings
- When restoring nikud (diacritization) to unvocalized Hebrew text with Dicta Nakdan
- When handling Hebrew morphology challenges (binyanim, inflections, smichut)
Try These Prompts
How do I use DictaBERT for sentiment analysis of Hebrew texts? Provide a complete Python code example.
How do I perform named entity recognition (NER) in Hebrew using DictaBERT NER? I want to identify people, places, and organizations in text.
How do I restore nikud (diacritization) to unvocalized Hebrew text? Show me both the Dicta Nakdan model and the hosted API option.
What are the best ivrit.ai tools for Hebrew speech-to-text conversion? How do I integrate them into a Python project?
Frequently Asked Questions
Changelog
Added nikud restoration, hosted Dicta APIs and alternative frameworks; corrected the DictaLM 3.0 base-model lineage and tech-report link.
May 14, 2026
Corrected DictaLM 3.0 sizes (24B, 12B Nemotron, 1.7B), verified HuggingFace model IDs, fixed VRAM requirements, added Reference Links section, and included NeoDictaBERT bilingual embeddings.
Apr 14, 2026
Related Skills
Plan domestic travel in Israel with local transportation, accommodations, national parks, and cultural considerations. Use when user asks about traveling in Israel, Israeli hotel chains, bus routes, Israel Railways, Rav-Kav card, national parks, tiyul b'aretz, Dead Sea, Eilat, or trip planning within Israel. Covers Egged/Dan/Kavim buses, train schedules, Rashut HaTeva sites, Shabbat travel restrictions, and seasonal advice.
Write and edit professional content in Hebrew including marketing copy, UX text, articles, emails, and social media posts. Use when user asks to write in Hebrew, "ktov b'ivrit", create Hebrew marketing content, edit Hebrew text, write Hebrew UX copy, or optimize Hebrew content for SEO. Covers grammar rules, register from formal to dugri, mixed Hebrew/English, gendered language, nikud and numerals, and Hebrew SEO best practices. Do NOT use for Hebrew NLP/ML tasks (use hebrew-nlp-toolkit) or translation (use a translation skill).
Generate professional Hebrew documents including PDF, DOCX, and PPTX with full RTL support and proper Hebrew typography. Use when user asks to create Hebrew PDF, generate Israeli business documents, "lehafik heshbonit", "litstor hozeh", build Hebrew Word document, create Hebrew PowerPoint, or produce Israeli templates such as Heshbonit Mas (tax invoice), Hozeh (contract), Hatza'at Mechir (proposal), or Protokol (meeting minutes). Covers reportlab, WeasyPrint, python-docx, and pptxgenjs with bidi paragraph support. Do NOT use for OCR or reading existing documents (use hebrew-ocr-forms instead).
Use at your own risk. Terms of Use · Security
Want to build your own skill? Try the Skill Creator · Submit a Skill