Media & images
View images in a first-party viewer and let agents read images, audio, and video into context.
Research isn’t only text. A figure, a recorded interview, or a lecture clip can be part of the work: Multilo opens images in its own viewer and lets agents read images, audio, and video into context.
The Image Viewer
Open images in a first-party viewer built into the file explorer. Any image in your project opens cleanly, the same way documents do.
Agents see images
Agents can see the images you bring into context, such as a chart, a diagram, or a scanned page, and reason about their contents rather than just acknowledging that a file exists. Attach an image in chat and ask about it.
Audio & video
Multilo transcribes audio and video on-device so agents can read what was said, which is useful for interviews, talks, and other recordings. The same machinery powers Speech.
Local first
Image viewing and transcription run on your machine; media is sent to a model only when an agent genuinely needs it to answer.