kanessa

The AI agent that lives beside your cursor.

Now in private beta - Windows

Kanessa hears you, watches your screen in real time, and quietly takes the work off your hands - from drafting emails to navigating apps you've never seen before.

No credit card · Free during beta · Local-first audio & screen processing

mail.app - Sarah Chen · Q4 deck
REC
SC
Sarah Chen
to me · Mon 9:42 AM
Inbox · Q4
Re: Q4 deck - feedback before Thursday?

Hey - just went through the Q4 deck you sent over. Overall it's looking sharp, but I had a few thoughts before the exec review on Thursday.

Could you send the revised version by EOD tomorrow? Happy to jump on a quick call if easier.

Thanks,
Sarah

Draft reply

Drafting reply...

Q4-deck.pdf
Send
Listening
you:
01/05 · Listening

Works alongside the apps you already use

Windows logoWindows
Chrome logoChrome
Slack logoSlack
Notion logoNotion
Figma logoFigma
Gmail logoGmail
Linear logoLinear
VS Code logoVS Code
Spotify logoSpotify
Windows logoWindows
Chrome logoChrome
Slack logoSlack
Notion logoNotion
Figma logoFigma
Gmail logoGmail
Linear logoLinear
VS Code logoVS Code
Spotify logoSpotify

Capabilities

One agent. Three senses. Infinite tasks.

Hears you, naturally.

Push to talk, wake word, or always-on. Kanessa understands context, tone and intent - no rigid commands.

Sees your screen, in real time.

Continuous vision of whatever you're looking at - apps, PDFs, dashboards. It knows what you mean by 'this'.

RUN

Acts on your laptop, end-to-end.

Clicks, types, navigates, fills forms, opens apps. Hand off the boring tasks and watch them get done.

How it works

From a sentence to a finished task.

Kanessa is built around a tight loop of listen → see → reason → act. Every step is observable, interruptible, and stays on your machine by default.

  1. 01
    Speak naturally

    Hold a key, say a word, or just start talking. Kanessa listens with low-latency on-device speech.

  2. 02
    It sees what you see

    Screen frames stream to the agent so it understands the context of your request - the actual pixels in front of you.

  3. 03
    It plans the steps

    Kanessa reasons about the task, breaks it into clicks, keystrokes and tool calls, then asks before anything destructive.

  4. 04
    It does the work

    Watch the cursor move. Take over any time. Get a summary of what changed when it's done.

Guides you through any app

Stuck in Figma or Photoshop? Kanessa walks you through it - step by step.

It watches what's on your screen, points at the right tool, and narrates the next move. Like having a senior designer over your shoulder.

Figma
Kanessa
Hero.fig
hero.psd
GUIDING
Frame Fill Stroke
0200400600800100012001400
Start a new frame · Press F
Step 1 · Figma
You said: "Help me design a hero section"
Frame · Hero
01 / 10

Showcase

Things people say to Kanessa.

Real prompts. No setup, no scripts, no integrations to configure first.

"Reply to Sarah and say I'll send the deck by Friday."

Drafted in Gmail · sent on confirmation

"Find a 30-min slot with the design team next week."

Pulled calendars · proposed 3 times

"Turn the spec on my screen into a Linear ticket."

Read screen · created KEN-241

"Clean up my desktop and group these into folders."

Sorted 47 files · 4 new folders

"Show me how to set up a hero section frame in Figma."

Opened Frame tool · drew 1440×900 · added headline layer

"Hey Kanessa, teach me how to use this new software to edit videos."

Opened timeline · imported clips · showed cut/trim/transition shortcuts

FAQ

Questions, answered.

Is Kanessa always listening and watching?+

Only when you want it to be. You can run it push-to-talk, with a wake word, or always-on. Screen capture is paused by default and only activates per session - with a clear on-screen indicator.

Where is my data processed?+

Audio and screen frames are processed locally where possible. When a task requires a cloud model, only the minimum necessary context is sent, encrypted in transit, and never used to train models.

Can it actually click buttons in any app?+

Yes. Kanessa controls the OS the way you do - via accessibility APIs and pixel-aware vision. It works in browsers, native apps, and even unfamiliar interfaces.

What if I want to take over?+

Move your mouse or hit Escape and Kanessa instantly pauses. It will summarize what it has done so far and wait for instructions.

Which platforms are supported?+

Windows is in private beta today. A Linux build is on the roadmap.

Give your cursor a brain.

Join the private beta. We're onboarding new users every week.

We'll never share your email.