Persistent memory
Hybrid retrieval with per-type staleness, so it remembers your stack, your priorities, and your tone across sessions instead of re-learning every conversation. This is the line between a tool you use and a worker who knows you.
Guide — — by Mahmoud Zalt
How an AI personal assistant works under the hood for engineers. Memory, credential isolation, integrations, and the control you need to trust it.
Most developers already use a code assistant. Copilot, Tabnine, something in the IDE that completes a function or scaffolds a CRUD endpoint. Those tools live inside the editor and operate on source. An AI personal assistant operates on everything around the code: the tickets, the email threads, the standup notes, the calendar tetris, the research you do before a design doc. It is the non-coding overhead, not the code itself. Conflating the two is the single most common evaluation mistake, and it leads people to pick a tool that solves a problem they did not have.
The second distinction is reactive versus autonomous. A chat window like ChatGPT or Claude is reactive. You prompt, it answers, you close the tab. An autonomous assistant runs without prompting: it triages the inbox at 7am, extracts action items from a thread, and blocks focus time before you open your laptop. Sistava is built on the autonomous model. You hire an Employee, give it Duties, connect its tools, and it works on a schedule and on events, not only when you type.
Strip away the marketing and a capable AI personal assistant is four layers: a reasoning model, a memory store, a tool layer, and a credential boundary. The model decides what to do. Memory gives it continuity across days, not just within one context window. The tool layer is how it reads your mailbox or writes a calendar event. The credential boundary is what keeps your OAuth tokens out of the model's reach. Understanding these four layers tells you everything about whether a given product is trustworthy.
Hybrid retrieval with per-type staleness, so it remembers your stack, your priorities, and your tone across sessions instead of re-learning every conversation. This is the line between a tool you use and a worker who knows you.
Email, calendar, search, and file access exposed as discrete, permissioned actions. The model invokes a tool; it does not get raw access to your accounts. Each tool is something you can grant or revoke.
Tokens and keys live behind a process boundary the model cannot read. A prompt injection cannot leak credentials it never had access to. This is the security property that matters most for proprietary work.
Work fires on a cron-like schedule and on inbound events (new email, new meeting), not only when you ask. That is what makes triage happen before you wake up.
When you evaluate any assistant, map it to these four layers. If the vendor cannot explain where credentials live relative to the model, that is your answer. Local-only tools win on data control but usually give up proactive behavior and real-world actions, because an offline model on your laptop is not going to watch your inbox at 3am. Cloud platforms give you the autonomy but make the credential boundary the thing you must verify.
For developers the integration list is the spec sheet. You want the assistant connected to where work actually happens, and you want each connection scoped tightly. The pattern that holds up is OAuth with least privilege: read and write on email and calendar, nothing else, and the scopes visible so you know exactly what was granted. A good assistant lets you exclude senders, subjects, or labels from processing entirely, so anything confidential never reaches the model.
An assistant that acts on your behalf is only useful if its failure modes are bounded. Three properties separate a production-grade assistant from a demo. First, every autonomous action should be reviewable and reversible: a drafted reply waits for approval, a scheduled event can be undone, and there is a log of what it did and why. Second, edge cases should escalate to you rather than guess. Third, mistakes should feed back into memory so the same error does not repeat. If a tool cannot show you its action history, you cannot trust it with your inbox.
| Dimension | Traditional | With Sista |
|---|---|---|
| Scope | Code assistant: source inside the editor | Personal assistant: email, calendar, tasks, research around the code |
| Trigger | On keystroke, inside the IDE | On schedule and on inbound events, runs without prompting |
| State | Per-session context window | Persistent memory across days, learns your tone and priorities |
| Credentials | Usually editor or repo scoped | Isolated behind a process boundary the model cannot read |
| Output | Completions and suggestions | Real actions: drafts sent, meetings booked, briefs delivered |
| Operations | Runs in your editor | Hosted, no servers or infra for you to run |
The table is the short version of the build-versus-buy question every engineer asks here. You could wire an open-source agent to your mailbox over a weekend. The honest tradeoff is that you then own the credential isolation, the retry logic, the memory store, the scheduler, and the on-call when it sends a bad email at 2am. Sistava ships those as the product. You keep the control surface and skip the operations.
The whole sequence is intentionally undramatic. There is no API key rotation, no plugin marketplace to audit, no infrastructure to babysit. You hire, you scope two OAuth connections, you write a few preference rules, and the Employee is doing real work before lunch. The work you skip is exactly the work that should never have needed a human: the triage, the scheduling back-and-forth, the first draft of the status update.
If you want the role-first framing instead of the architecture-first one, the solution page is the better read. It describes the same triage, scheduling, and briefing work as a job description rather than a feature list. That framing is useful even for engineers, because it forces the right question before you configure anything: what do I actually want off my plate. Once that is answered, the technical setup is almost incidental, because the Employee inherits the responsibility, not just a set of buttons.
Copilot is a code assistant. It completes and generates source inside your editor. An AI personal assistant works on everything around the code: email triage, calendar management, ticket follow-ups, research, and drafting. They are complementary, not competing. Copilot replaces typing; a personal assistant replaces the non-coding overhead that eats most of your day.
OAuth tokens and keys sit behind a process boundary the reasoning model never reads directly. The model invokes scoped tools; it does not hold raw credential values. That design means a prompt injection cannot leak a token it never had access to, which is the security property that matters when the assistant touches proprietary or regulated work.
Yes. You pick the underlying model per Employee, so you can match reasoning quality and cost to the job. You are not locked into a single provider's default, and you can change it later without redoing your setup.
No. Sistava is hosted, so there are no servers, schedulers, or memory stores for you to run or patch. You keep the control surface, scoped OAuth, approval workflows, exclusion rules, and action history, without owning the operations. A self-hosted open-source agent is possible, but then the retry logic, isolation, and on-call are yours.
You set exclusion rules during setup. Specific senders, subjects, or labels can be removed from processing entirely, so confidential threads never reach the model. Anything marked sensitive routes straight to you with no AI involvement, and you can adjust those rules at any time.
Every autonomous action is reviewable and reversible, and edge cases escalate to you rather than guess. When you correct a draft or fix a misscheduled meeting, that feedback feeds memory so the same mistake does not repeat. Start in approval mode and the blast radius of any early error is a draft you edit, not an email already sent.
Most people report drafts that read like their own writing by week two or three. The assistant learns from the examples you seed and from every edit you make. The more representative the samples and the more feedback you give in the first week, the faster it converges on your voice and your priorities.
The point of all this is not to add another tool to your stack. It is to delete the overhead that surrounds the engineering, the triage and scheduling and drafting that no one was hired to do. Map any assistant to its four layers, verify where the credentials live, keep the control dial in your hand, and let the Employee absorb the parts of the day that never needed a human. Then spend the recovered hours on the work that actually ships.