bootjp

Principal Engineer and Independent Researcher specializing in distributed consensus algorithms (Raft) and distributed data stores ranging from KVS and databases to file systems. I have deep expertise in NewSQL architectures from Spanner and TiDB to Aurora and Aurora DSQL, including systems designed around the “Log is the Database” principle, while keeping my focus broad across distributed systems in general.

After hands-on work on large-scale delivery and low-latency systems at Supership and PLAID, I now lead partial microservice decomposition and the design/implementation of a loosely coupled, schema-driven event bus at hacomono to support its multi-product strategy. I build the integration foundation between the main product and new products in close collaboration with product development teams.

In AdTech and other low-latency, high-throughput, highly scalable domains, I have designed and operated delivery and analytics platforms as well as notification management systems for smartphone carrier services that apply client-centric consistency models. I specialize in building scalable systems resilient to traffic spikes.

I build robust and scalable system infrastructure by continuously cycling between theory (academic research and self-implementation) and practice (large-scale commercial development).

Creations

Main Self-Made Products

  • create-raft-redis — Code for “Building a Raft-based Redis-compatible KVS in Go”
  • go-kvlib — Library for Raft-compatible KVS created for the above material
  • dcache — Asynchronous distributed cache using CoreDNS plugin
  • disable_twitter_shortcut — Extension to disable keyboard shortcuts on Twitter Web
  • elastickv — An experimental project aimed at developing a distributed key-value store for cloud environments, similar to DynamoDB
  • echo_middleware_path_auth — Path-based authentication middleware for Echo
  • jenkins_consecutive_fail_detector — Monitoring tool to notify only consecutive failures of Jenkins jobs
  • adaptsize-go — Go implementation of size-aware admission policy (based on AdaptSize paper)
  • For other projects, please visit my GitHub Profile.

Books (Technical Doujinshi)

  • Learning by Building: Raft-based Redis-compatible KVS in Go — TechBookFest 16. A hands-on book focused on building a strongly consistent KVS using the Raft consensus algorithm with Redis as a subject. Beyond implementation, it explores the fundamental necessity of Raft in distributed systems, drawing on real-world failure cases like AWS to explain the design philosophy of resilient systems. Build a distributed KVS from scratch to gain a practical understanding of distributed consensus.

Others (Services / Non-GitHub)

Publications

International Peer-Reviewed Conference Papers

  • Yoshiaki Ueda, (Title TBD: Performance Optimization for LSM Trees)
    • In preparation

Domestic Non-Peer-Reviewed Papers

  • Yoshiaki Ueda, Toshinobu Usui, Construction of a Low-Latency Distributed KVS for Personalized Content Delivery
  • Yoshiaki Ueda, Proposal for Improving DNS Cache Hit Rate and Response Speed for Distributed Systems
    • Proceedings of the 84th National Convention of IPSJ, 2022-03-05, Preprint, Student Encouragement Award

Technical Books (Doujinshi)

  • Learning by Building: Raft-based Redis-compatible KVS in Go
    • TechBookFest 16 (Technical Doujinshi / 79 pages, ver 1.2)
    • Goal
      • An introductory hands-on book that teaches distributed systems, consistency models, and Raft by implementing a Redis-compatible KVS.
    • Structure
      • The first half covers distributed systems fundamentals, Redis/Redis Cluster strengths and limitations, consistency models, and Raft core concepts (leader election, log replication, safety).
      • The second half connects theory to implementation through a full Redis-compatible KVS build.
    • Hands-on implementation (Go)
      • Uses hashicorp/raft, implementing the datastore, Snapshot, and StateMachine (Apply/Snapshot/Restore).
      • Implements RESP with redcon and supports GET/SET/DEL.
      • Handles follower-side requests via MOVED redirection to the leader, validated in a 3-node setup.
    • Production context
      • Links the design choices to real strong-consistency/high-availability use cases, including PayPay’s TiDB adoption.
    • TechBookFest Product Page / BOOTH

Presentations

Experience

Principal Engineer (Distributed Systems)

hacomono Inc.

Leading platform modernization to support a multi-product strategy for a vertical SaaS in the wellness industry.

  • Planned and designed partial microservice decomposition on top of the existing platform to enable multi-product delivery.
  • Designed and implemented a loosely coupled, schema-driven event bus to connect the main product with new products. (Presentation)
    • Introduced reliable schema definitions using Protocol Buffers and common validation logic.
    • Improved developer productivity through CI-based compatibility checks (automatic detection of breaking changes) and internal SDK maintenance.
    • Established observability with SLI/SLO definition and measurement, and drove operations using Datadog dashboards/alerts.
    • Partnered with new product development teams to define event contracts, rollout flows, and operational rules.
    • Led cross-team platform adoption support through onboarding, operational workflow design, and consultation handling.
  • Planned and designed scalable architecture updates to handle sudden traffic spikes in SaaS.
  • Operational Improvements: Designing and implementing systems to detect unused index queries.
  • Formulating and evangelizing distributed system design principles internally and externally.
    • Message Queue Reliability: preventing data loss during Worker crashes by advocating for Visibility Timeouts and atomic processing (e.g., Lua scripts/Redis Stream). (Article)
    • Cloud-Native DB Selection: Evaluating trade-offs between consistency and scaling based on ‘Log is the Database’ and compute-storage separation architectures. (Article)
    • Time & Consistency in Distributed Systems: Systematizing and explaining the impact of clock skew on consistency, with case studies on Spanner (TrueTime) and TiDB (Timestamp Oracle). (Article)
  • As Principal Engineer, supporting across the organization as a consultant and reviewer for distributed system designs.
    • Regularly joining architecture and recurring project meetings across multiple teams to understand system-wide context and challenges.
    • Running 1:1s with engineering leaders and VP/executive stakeholders to align priorities and drive cross-team problem solving.
    • Connecting consultation paths across teams to remove decision and execution bottlenecks.

Sep 2024 - Present

Organizer

Distributed Systems Meetup (Tech Community on VRChat)

Operating a study community specialized in distributed systems on VRChat.

  • Continuing reading circles and explanations of the latest distributed systems/database papers such as Aurora Serverless v2, MemoryDB, AdaptSize, etc.
  • Responsible for material composition and progression design to absorb skill level differences from beginners to practitioners.
  • Contributing to improving distributed system literacy across the industry through community management.

2018 - Present

Independent Researcher (Distributed Systems / Databases / Distributed KVS / Distributed Storage)

Independent Research

Theoretical research and implementation of distributed systems, distributed storage, and distributed KVS, and paper writing.

  • Clarified behaviors and trade-offs of strong consistency systems through implementation and verification of distributed storage using Raft.
  • Writing and presenting papers on low-latency distributed KVS for personalized delivery (preparing a paper on KVS data tiering).
  • Technical verification through survey and implementation of distributed system related papers.

2018 - Present

Representative / Distributed Systems Engineer

btj.systems LLC

Backend development for distributed search engines and metaverse infrastructure.

  • Designed and built an Elasticsearch-compatible distributed search engine combining Lucene and Raft for a search SaaS that could no longer use Elasticsearch due to license changes.
  • Designed and developed backend services related to network synchronization communication and space provision on a metaverse platform.

2023 - 2024

Software Engineer (Distributed Systems / Delivery Infrastructure)

PLAID, Inc.

Responsible for developing the large-scale delivery and data processing infrastructure supporting the CX platform.

  • Replacement and performance improvement of delivery systems handling massive traffic.
  • Achieved significant cost reduction by redefining SLOs and establishing a robust operational framework for Preemptible Instances.
  • Acted as a technical advisor by researching academic papers on public clouds and providing feedback/insights to internal development teams.
  • Supported multiple teams across the organization as a technical consultation point for distributed data stores and event-driven architectures.
  • Organized design guidelines for operation-conscious loosely coupled architectures and shared them as documentation.

2021 - 2023

Software Engineer (Search Infrastructure / Large-scale Delivery / Notification Infrastructure)

Supership Inc.

Responsible for development and operation of search infrastructure, large-scale data delivery, and notification management for smartphone carriers.

  • Development of search engines and search-linked advertising systems, and construction/operation of large-scale aggregation infrastructure using Spark.
  • Design and development of a high-speed engine for mapping GPS coordinates to administrative areas.
  • Backend development and stable operation of notification management systems for carriers with tens of millions of users.
  • Performance optimization and infrastructure design for massive data delivery in high-traffic environments.

2018 - 2021

Software Engineer (AdTech / Access Analytics)

Software Development Company (Confidential)

Development of AdTech areas such as SSP/DSP and construction of script-tag based access analysis infrastructure.

  • Development and operation of SSP/DSP systems.
  • Design and implementation of Google Analytics-like access analysis tools based on script tags.

Before 2018

Representative / Developer

Rental Server Business (Student Startup)

Started a rental server business during junior high school, consistently handling service launch and operation.

  • Consistent implementation from server construction/operation to customer support.

Student Days (Junior High ~)

Individual Developer

Personal Development / Activities

Product development and tool creation based on technical interests.

  • Construction and operation of a self-made system to streamline paper input (PDF/HTML Markdown conversion, translation, audio generation pipeline using VOICEVOX).
  • Development and operation of VRChat related tools (resize.gg, auto-rejoin tool, etc.).
  • Technical trial and error and output to solve daily problems.

2010 - Present
Third-Party Notices