Hard Skills / Experience
- 5+ years of hands-on research or development roles.
- Deep expertise in at least one runtime (Node.js, Python, or Java/JVM), including understanding of internals (event loop, GC, tracing hooks, bytecode/JIT, etc.).
- Hands-on experience building in-process production components (SDKs, agents, profilers, monitoring/security tools) that must be safe, stable, and backward-compatible.
- Strong performance engineering skills – profiling CPU/memory, avoiding overhead, understanding how instrumentation affects runtime behavior.
- Defensive engineering mindset – experience designing systems that fail-open, degrade gracefully, protect the host application, and never introduce instability.
- Track record debugging production issues (latency, memory leaks, regressions, deadlocks) in real-world distributed systems.
- Solid understanding of modern backend architectures — experience with microservices, distributed systems, async and event-driven patterns, containers/orchestration (Docker/K8s), cloud runtimes, and the performance or reliability challenges they introduce.
- Proven ability to ship stable, resilient, maintainable systems in production.
Engineering Excellence / Mindset
- Ability to anticipate technical risks, identify bottlenecks, and drive long-term engineering improvements.
- Takes ownership of code quality, documentation, reliability, and observability.
- Comfortable working with product teams to balance technical trade-offs with user and business needs.
- Autonomous and proactive; capable of mentoring others or leading technical initiatives.
Bonus Points
- Background in security agents, observability tools, or other components deployed directly into customer environments.
- Experience with APM agents, JVM agents, Python tracing, V8 internals, or other instrumentation/profiling frameworks.
- Experience with telemetry systems (metrics, tracing, logging), including batching, rate-limiting, and safe data collection.
- Familiarity with sampling techniques, bytecode manipulation, eBPF, or low-overhead tracing.
- Exposure to safety-critical or high-throughput environments where reliability and minimal overhead are mandatory.
- Contributions to open-source instrumentation, tracing, or internals-related projects.
Requirements
- On-site role, 5 days a week.
- Based in Tel Aviv.