Review documents are prepared in minutes, not hours
What it does
Automation turns the labor-intensive collection of facts about an employee's work into a background process. An AI agent reads the data sources that HR already maintains — a task tracker, HRIS, report folders — and drafts a review using a unified template for each employee in the cycle. The manager spends time on judgment and feedback, not on reconstructing the quarter's timeline.
Step-by-step process
- Review cycle trigger. An HR admin initiates review preparation for a single employee or in batch for a department via the HRIS interface or a separate form.
- Artifact collection. The agent connects to HRIS and the file storage, exports the employee's data for the period: completed tasks, reports, 1-on-1 notes, previous review results, OKR or KPI goals.
- Source normalization. Documents in various formats — PDF, DOCX, spreadsheets, markdown — are converted to a unified text form via parsers before being passed to the LLM.
- Summarization.The AI model compresses lengthy artifacts into concise points organized by pre-agreed blocks: achievements, growth areas, goal alignment, engagement.
- Draft generation. The agent fills in the corporate performance review template — the same headings, tone, and section length as in the finalized documents from previous cycles.
- Export for approval. The draft is saved to the manager's folder in the file storage and to the employee's profile in HRIS for editing and final approval.
- Versioning. Each iteration is saved as a separate file — the manager can see what was added during manual editing and what came from the AI.
What this automation does not do
- It does not make decisions on promotion, bonuses, or termination. The draft is the basis for a conversation between the manager and HR, not the final verdict.
- It does not replace the 1-on-1 meeting. Work facts are collected automatically, but feedback is delivered to the employee by a real person.
- It does not assess soft skills without input data. If the sources contain no notes on teamwork, leadership, or communication — the agent does not infer, but marks the section as requiring manual input.
How it works
The automation is built on a low-code architecture — no custom backend. Core components: an orchestrator (workflow engine or Zapier), an LLM layer on an AI model, connectors to HRIS and file storage, a prompt and template store. The flow runs in three logical stages: context collection → AI processing → export and notifications. At each stage, data goes through validation — if a document is not found or a field is empty, the agent flags it and continues rather than failing.
Implementation steps
- Setting up HRIS connectors. A read-only API key is created for the HRIS (BambooHR, Hibob, Humaans, 1С:ЗУП or equivalent) with access to employee profiles, goals, and review history. For file storage — OAuth integration with Google Drive, Dropbox Business, or SharePoint.
- Digitizing the review template. The HR team provides the current corporate template — section structure, length, tone. The template is converted into a structured prompt with placeholders.
- Mapping sources to sections. For each review section (for example, "Quarterly achievements") sources are specified: a task tracker with Done status, closed tickets, git commits, reports from a specific folder.
- Setting up prompts and guardrails. The prompt is built with explicit instructions: stay within the bounds of facts, make no personal judgments, flag gaps in data, use a neutral tone.
- Test on 3-5 employees. A run on real data from people in the previous cycle, comparison with the original document, calibration of prompts to the company's style.
- Manual gate before sending. In all versions, the draft goes to the manager first, never directly to the employee.
- Logging and audit. Each run is saved: which sources were used, the prompt version, which LLM, how many tokens. Required for compliance and error review.
System components
Component | Purpose | Example |
|---|---|---|
Orchestrator | Flow logic, triggers, error handling | low-code platform, Zapier |
LLM | Summarization and draft generation | AI model |
HRIS connector | Profiles, goals, review history | BambooHR, Hibob |
File connector | Reports, 1-on-1 notes, templates | Google Drive, SharePoint |
Prompt store | Template and prompt versions | Notion, Git repository |
Security and compliance
Employee data is sensitive. The AI agent operates in the AI model's enterprise mode without sending data to public training datasets. Access is limited to HR and managers, logs are retained for GDPR audit. If needed, LLM processing is moved to self-hosted infrastructure. The data processing template is recorded in a DPIA (Data Protection Impact Assessment) before the first run.
Potential pitfalls
AI hallucinates when sources contradict each other — for example, a task is closed but marked as failed in the report. The agent must honestly flag contradictions rather than picking a convenient version. The second risk is uniformity of phrasing: if all drafts sound the same, trust in the tool drops. Addressed by prompt variability and randomized section order. The third risk is the manager signing off on the draft without edits, making AI the sole voice in the review. Mitigated by a mandatory manual gate and a manager checklist before finalization.
Prerequisites
Automation starts at a mid-maturity level of HR processes. You don't need a powerful HR infrastructure — you need regularity and unified sources: consistent places where employees' work artifacts end up.
Data and access
- An active HRIS with API or export — BambooHR, Hibob, Humaans, HR-Link, 1С:ЗУП, or equivalent.
- A file storage organized by employee — Google Drive, Dropbox Business, SharePoint, Notion.
- An agreed-upon performance review template in at least one language.
- A history of 3-5 past reviews for calibration to the company's style.
- A regular review cycle — quarterly, semi-annual, or annual.
Team readiness
- An HR manager or HRBP willing to invest 10-15 hours in setup and pilot testing.
- One technical person — internal or a partner — for configuring low-code connectors.
- An agreed data policy decision: where the review context is stored, who sees the logs, how long the history is retained.
Timeline
Complexity: simple (week). 2-4 weeks from start to first working cycle. Of these: one week for connectors and template digitization, one week for prompts and calibration, 1-2 weeks for testing with 3-5 employees and feedback from pilot managers.
Pain points
- Inconsistent Quality
- Time on Manual Reports
FAQ
How long does implementation take?
The base configuration launches in 2-4 weeks, provided the HRIS and file storage already exist. Week one — connecting connectors and digitizing the review template. Week two — configuring prompts and calibrating to the company's style. The final 1-2 weeks — testing with 3-5 employees from the previous cycle and refining based on feedback from pilot managers.
We don't have a full HRIS — just Google Sheets and document folders. Does this work?
Yes, with caveats. If employees and goals are tracked in a structured spreadsheet and work artifacts end up in predictable folders, a low-code integration connects directly to Google Drive and Sheets. Summarization quality depends on data structure — if the spreadsheets are a mess, the agent will reflect that. It is worth cleaning up the sources first, then implementing the AI agent.
What are the risks and what can go wrong?
Three main risks. The first — AI hallucinations when sources contradict each other (a task is closed but reported as failed). Resolved by an explicit instruction to flag contradictions. The second — uniform phrasing that undermines trust in the document. Resolved by prompt variability. The third — employee data leakage. Resolved by the enterprise mode of the AI model, access restrictions, and log auditing.
Does this work in our industry?
Automation is horizontal — not tied to industry specifics. Suitable for any company of 5-50 employees with a regular performance review cycle: IT, agencies, manufacturing, retail, healthcare. Specifics are reflected in the review template and source mapping — if you have engineers, a task tracker and git are connected; if you have sales reps — a CRM and deal reports.
How does this fit into the current performance review process?
The AI agent generates a draft before the manager's meeting with the employee. The draft is sent to the employee's card in the HRIS or the manager's folder, edited manually there, approved by HR, and handed out at the 1-on-1. The meeting itself, the promotion decision, and the feedback remain with people — Grow2.ai does not touch this stage.
How do you avoid bias in AI-generated reviews?
Bias comes from two sources: LLM training data and the prompt structure. We have no influence over the first — we use an enterprise LLM model with a known profile. We have full influence over the second: the prompt prohibits personal assessments, requires references to facts, and does not use demographic attributes. Before launch, we run a control test on past reviews — comparing tone by gender, age, and role.
Want this in your business?
Book a free audit — we'll show how this automation will work for you.