01
Product & UX
Designed the core loop, Weave chat, research memory timeline, and extension onboarding across VS Code, Cursor, and Windsurf.
AI systems for reliable technical R&D — an IDE extension that automates the idea-to-experimentation loop with verifiable research memory.
Long story short
On a three-founder team at Founders Inc., I own UI/UX for Weave — the agent layer that guides scientists from context to hypothesis, experiment, and export with an immutable research memory.
01
Designed the core loop, Weave chat, research memory timeline, and extension onboarding across VS Code, Cursor, and Windsurf.
02
Led a full UI revamp — light mode, Blend mode, waiting states, permissions, and webapp token flows.
03
Shipped dozens of UX fixes from a structured extension audit — formatting, numbering, chat scroll, and install flows.
How might we automate the idea-to-experimentation loop for technical R&D — with skepticism built in and a verifiable record of everything?
Out-of-the-box AI tools for R&D — like Claude Code or Benchling — are either too niche for cross-domain research or too broad to do anything rigorously. Manual R&D cannot keep pace as AI pushes more research than teams can absorb.
Frontier models lean expensive without break-even value, or cheap and jack-of-all-trades — both unviable for rigorous R&D.
Domain-specific tools excel in one area but hit a ceiling — constrained to a single field.
Teams lose time on irreproducible work while pressure to ship output keeps accelerating.
Rigour erodes; significant work stays invisible, unrecognised, and unrewarded.
R&D in AI/ML, bioinformatics, physics, and pharma looks different on the surface — but the underlying scientific method is the same.
Noteweave is my research partner that never forgets anything. Guides like an advisor, works like an assistant.
Build agent systems that improve as frontier models scale in reasoning — with inbuilt skepticism that eliminates unreliable research upfront. Automate the idea-to-experimentation loop with a human in the loop and verifiable records.
Agent architectures that scale with model reasoning
Skepticism as a feature — unreliable research filtered upfront
Human-in-the-loop idea → experiment automation
Immutable IP log and audit trail for every project
Every session follows the same scientific rhythm — from understanding context to exporting findings as research artifacts.
| Stage | Trigger | Output | Weave's role |
|---|---|---|---|
| Understand space | User input | Weave chat output | Intervenes when data ingestion is incomplete |
| Create hypothesis | User approval | Structured hypothesis | Surfaces candidates; user edits or rejects |
| Perform experiments | User approval | Experiment data | Live progress, early failure detection, full logging |
| Analyse results | Run completes | Findings | Summarises results and proposes next steps |
| Export findings | User intent | Research paper + IP log | Blocks export on null data with clear errors |
A failed experiment in Noteweave is never lost work. Every failure enriches the research memory and moves the project forward.
Research memory is Noteweave's primary USP — a scientific repository that functions as both knowledge layer and IP logging system. Events are captured automatically, ordered chronologically, and searchable.
No manual entry — the system logs as the project progresses.
Progressive event logs with tags — project start, context, hypotheses, experiments, evaluations.
Each log: timestamp, contributor, event description, and testimonial references.
Exportable, immutable audit trail — litigation-ready institutional record.
Weave is the AI capability layer between scientific knowledge and the user. Assistive, warm, and motivating — failures are framed as learning moments.
Observes actions quietly — logs context without interrupting.
Surfaces when attention is needed — explains errors, suggests recovery.
Drives progress — hypothesis creation, live experiment updates, artifact generation.
Converses in natural language and analyses results.
Logs all events without user action.
Generates structured, testable claims from context.
Requires approval
Hypotheses are structured, testable claims generated by Weave from ingested context — approved by the user before any experiment runs. Every run is logged with environment snapshots, reproducibility scores, and Weave notes.
Noteweave complements coding agents like Claude Code — using academic research to propose product improvements today, with solitary coding agents on the roadmap.
| Competitor | Cross-domain R&D | Verifiable IP log | Hypothesis → experiment loop | IDE-native |
|---|---|---|---|---|
| Noteweave | ||||
| Claude Code | ||||
| Benchling | ||||
| ChatGPT / Claude | ||||
| Perplexity |
100+ users
Extension users
Launched ~6 weeks ago · Founders Inc. residency
The first session must deliver one moment of value — no onboarding wizard, just a single prompt: What are you working on?
| Minute | What happens | Success signal |
|---|---|---|
| 0–1 | Clean project creation — one prompt | Project created |
| 1–3 | User pastes a paper or describes a problem; Weave confirms understanding | Context accepted |
| 3–6 | Weave surfaces a structured hypothesis | Hypothesis approved |
| 6–10 | First experiment runs or hypothesis is refined in detail | Experiment initiated |
I led a structured UI/UX audit and revamp across the VS Code extension, webapp, and marketing site — fixing chat behaviour, onboarding, permissions, and visual systems.
Smooth auto-scroll, clickable links, fixed numbering (1, 2, 3…), and line breaks after e.g.
Renamed Auto → Blend Mode as default; mode-switching no longer drops answers mid-chat
Added personality — Output Maxing…, Aura Farming…, Rizzing… — with fade instead of abrupt swaps
Download CTA moved above plans; workspace-first gating — no chat until a folder is open
State-of-the-art ML models powering Weave's reasoning — continuously improved as frontier models scale.
Automated, immutable IP log trail — defensible legal and institutional record of discovery.
Research memory accumulates value over time — leaving means losing continuity.
Users trust Weave when it flags unreliable research upfront — not when it generates more.
Optimising chat alone misses the value — the timeline, IP log, and export artifacts create retention.
A structured audit with Discord feedback turned dozens of small bugs into a cohesive UI system.
Paridhi was my Co-founder in 2 startups, which we scaled to users from 30+ countries. She has phenomenal grit and thinking. You are at a loss if you do not have her in your team!
Ready for next?