Skip to content
View yohasebe's full-sized avatar

Highlights

  • Pro

Block or report yohasebe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
yohasebe/README.md

Yoichiro Hasebe

Researcher in cognitive linguistics, comparative linguistics, and corpus linguistics. I build tools and systems for language researchers, educators, and learners.

Projects

Apps

  • Monadic Chat -- Locally hosted AI chatbot platform on Docker (Mac/Win/Linux)
  • SpeechDock -- Mac menu bar app for speech-to-text with system audio capture, live subtitles, and translation

Web Services

  • TED Corpus Search Engine -- Full-text search across TED Talk transcripts
  • RSyntaxTree -- Syntax tree diagram generator for linguists
  • Paradocs -- Paragraph-oriented document presentation with sentence-level highlighting and TTS
  • jReadability.net -- Japanese text readability measurement and learning tools

Libraries and CLI Tools

  • ruby-spacy -- Ruby wrapper for the spaCy NLP library
  • WP2TXT -- Wikipedia dump to plain text converter
  • Code Packager -- Package a codebase into a single JSON file for LLM analysis
  • EngTagger / Lemmatizer -- English POS tagger and lemmatizer (Rubygems)

Blog

I write about my projects, linguistics, and other interests at yohasebe.com.

Contact

@yohasebe · yohasebe@gmail.com

Pinned Loading

  1. monadic-chat monadic-chat Public

    🤖 + 🐳 + 🐧 Monadic Chat is a locally hosted web application designed to create and utilize intelligent chatbots. By providing a Linux environment on Docker to LLMs, it enables code execution and adv…

    Ruby 67 2

  2. ruby-spacy ruby-spacy Public

    A wrapper module for using spaCy natural language processing library from the Ruby programming language via PyCall

    Ruby 67 5

  3. engtagger engtagger Public

    English Part-of-Speech Tagger Library; a Ruby port of Lingua::EN::Tagger

    Ruby 276 50

  4. wp2txt wp2txt Public

    A command-line tool to extract plain text from Wikipedia dumps with category and section filtering

    Ruby 193 37

  5. rsyntaxtree rsyntaxtree Public

    Syntax tree generator for linguistic research

    Ruby 120 20

  6. openai-chat-api-workflow openai-chat-api-workflow Public

    🎩 An Alfred 5 Workflow for using OpenAI Chat API to interact with GPT models 🤖💬 It also allows image generation/editing/understanding 🖼️, speech-to-text conversion 🎤, and text-to-speech synthesis 🔈

    319 11