Calorai · Multi-modal AI
Type · Snap · Speak
Get free
[CR-87FE3A] · MULTI-MODAL · v2.4

Type. Snap. Speak. We log the calories.

Three input modes, one calorie tracker. Type a quick description, snap a photo, or speak it aloud — Calorai's AI handles food recognition, portion estimation and macro breakdown.

Type Snap Speak
● Type 120ms
Chat-style input
two eggs, sourdough, half an avocado
Got it. Logging now — 480 kcal, 24g protein. ✓
480 kcal
CR-87FE3A · TYPED
● Snap 1.2s
Photo recognition
DETECTED: 4 ITEMS
Chicken · Quinoa · Avocado · Spinach
685 kcal
CR-87FE3B · PHOTO
● Speak 0.8s
Voice transcription
"salmon and sweet potato"
540 kcal
CR-87FE3C · VOICE
[01] · How it works

Pick the input. Skip the typing.

Most calorie apps assume you'll happily type "190 g grilled chicken breast skinless" at 9pm. Calorai assumes you won't, and gives you two faster ways out.

[01] · Type

Chat what you ate.

"Two eggs, sourdough, avocado." The LLM parses your sentence, matches each item to the database, returns calories + macros. About a hundred milliseconds, no menus.

[02] · Snap

Photograph the plate.

The vision model identifies up to four items in one frame (composite meal detection), estimates portion size from visual cues, and returns a single entry. Editable in two taps.

[03] · Speak

Say it out loud.

Hold the mic, describe the meal. Whisper-based transcription shows what was heard, the LLM logs it. Built for the moment your hands are full. Try it →

[02] · Three modes, one log

Three inputs. Same entry.

● Typed 08:14 · 0.12s
Chat input
"chicken bowl with brown rice and avocado"
● Snapped 13:02 · 1.2s
Photo input
[AI VISION OUTPUT]
── Detected: 4 items
── Chicken: ~150g
── Brown rice: ~120g
── Avocado: ~60g
── Spinach: ~40g
● Spoken 19:24 · 0.8s
Voice input
"chicken bowl with brown rice and a bit of avocado"
● Calorai entry CR-87FE3A · 685 KCAL
Chicken bowl, brown rice, avocado
Same entry · 3 input paths · 1 timestamp
685 kcal
P 42g
C 78g
F 22g

The friction lives in how you log, not in the calories. Calorai removes it three different ways.

Pick your input mode
[03] · Who it's for

Four people who pick a different mode each meal.

Calorai isn't trying to be the only nutrition app you ever use. It's trying to be the one that survives because logging at lunch doesn't feel like a chore.

Typers who want speed

One sentence beats five database searches. The LLM parses "two eggs and a slice of sourdough" faster than you can find the right egg.

Snappers at restaurants

The dish on the menu doesn't exist in the MyFitnessPal database. The plate in front of you does — point the camera, get a working estimate.

Voice people, hands full

Driving, holding the baby, in the middle of cooking. Hold the mic for two seconds, the meal is logged before the rice is on the plate.

Mixers who use all three

Breakfast typed (it's always the same). Lunch photographed (variety). Dinner spoken (cooking with hands). The data doesn't care.

[04] · What's inside

The modules that actually shipped.

Six features built around the multi-modal input idea. Things on the roadmap that haven't earned their place are still on the roadmap.

● AI · Vision
±10%
Typical accuracy · Plated dishes

Composite meal detection.

The vision model lists up to four items in one frame — chicken + rice + avocado + spinach — and totals them as one entry.

● Sync

Apple Health · Health Connect.

Bidirectional sync writes meals and reads steps/workouts. Training-day calorie budget adjusts automatically — no manual offset math.

● Voice

Confirm-before-log.

Voice transcription always shows the heard text and the AI's interpretation before committing. Edit in two taps, log when right.

● Offline

Typed entries work offline.

Database-cached typed logging keeps working without signal. Photo and voice queue locally, sync when you're back online.

● Privacy

Voice clips deleted post-transcription.

Photos and voice clips are processed server-side and deleted unless you opt into model improvement. Analytics opt-out leaves the app fully functional.

[05] · Compare honestly

Calorai vs the apps people open at meals.

Where multi-modal input wins, where the established databases still win, and the spots where you should probably keep MyFitnessPal in the rotation.

Capability
Calorai
MyFitnessPal
Cronometer
Cal AI
Voice logging
Yes
No
No
No
Chat-style typed logging
LLM-parsed
Search-based
Search-based
Limited
Photo recognition
Composite
Pro only
No
Core feature
Verified database size
Growing
14M items
Verified
Growing
Barcode scanning
Limited
Yes
Yes
Yes
Time to log a typical meal
~3 sec
~40 sec
~45 sec
~6 sec
Honest read: if you eat mostly packaged food with barcodes, MyFitnessPal and Cronometer remain the right tool — their databases have a fifteen-year head start that Calorai isn't going to close on packaged inventory. The multi-modal advantage matters most on whole-food meals, home cooking, and restaurants where the dish doesn't appear in any database. If you log breakfast in 3 seconds three days in a row, the habit survives. Get Calorai →
[06] · From actual users

Three reviews. One we wouldn't have published.

The 4-star one is here because pretending nobody's frustrated insults the customer who is.

● Typed
★★★★★
"I chat-log breakfast in five seconds. That's the entire reason I'm still tracking — every previous app I'd quit by week two when typing got tedious."
Leo K.
Berlin · Daily typer · 4 mo
● Snapped
★★★★★
"Photo mode at restaurants is the killer feature — the dish isn't in any database, but the camera nails the entry close enough that I just edit the portion."
Amara T.
Lagos · Eats out 4×/wk · 6 mo
● Spoken
★★★★☆
"Voice is genuinely magical when it works. In a quiet kitchen it's flawless. On a busy street with traffic, it logged 'chicken thighs' as 'kitchen tights' — twice. Edit-and-fix takes a few taps, then you give up."
Rina S.
Tokyo · Voice user · 3 mo
[07] · The story behind it

Built because typing was killing the habit.

A lot of nutrition apps now have one good AI feature bolted onto a 2014-era logging UX. Calorai started from the input side, not the analytics side.

The hypothesis was uncomfortable: the calorie-tracking problem isn't a database problem, it's a typing problem. MyFitnessPal has 14 million food items and the best barcode coverage in the industry, and the median user still logs for 14 days and stops. Cronometer has the most verified nutrition data in any consumer app, and the drop-off curve looks the same.

People don't stop tracking because they ran out of foods to log. They stop because at the end of a 14-hour day, typing "200 g chicken thigh, skin on, baked" into a search bar feels like a third job. Calorai was built to answer one specific question: what if you could log a meal in the time it takes to say what you ate?

The architecture is three input pipelines that converge into one entry: an LLM-parsed chat field for typed input, a vision model trained on plated meals for photo input, and a Whisper-based transcription pipeline for voice. All three resolve into the same Calorai log structure — same shape, same macros, same Apple Health sync. The free tier covers typed and photo logging. Premium ($4.99/month) unlocks voice, offline-cached typing, composite meal detection and weekly analytics.

Honest paragraph Four real limits to know before you download. First: voice transcription is excellent in quiet rooms and shaky in noisy ones — a busy street or a loud kitchen sometimes mis-hears cooking vocabulary in funny ways. Second: the verified food database is newer and smaller than MyFitnessPal's — they have a fifteen-year head start on packaged inventory and we are not closing that gap on barcoded goods anytime soon. Third: correcting a misidentified item takes about three to five taps — not a deal-breaker, but not the one-tap experience the marketing wants. Fourth: there are several apps in the App Store using the "CalorAI" name from different developers — verify the developer name before downloading to make sure you get the multi-modal version.

None of those make Calorai the wrong choice — they make it a specific choice. If your nutrition is mostly whole-food and home-cooked, multi-modal input is meaningfully faster than database search. If most of what you eat is packaged with a barcode, MyFitnessPal is probably still the right tool. The two pair well. Start a week →

The facts

PlatformiOS · Android
Input modesType · Snap · Speak
Free tierType + Photo
Premium addsVoice · Offline · Analytics
Premium price$4.99/mo · 3-day trial
SyncApple Health · Health Connect
AI accuracy±10% photo · ±5% typed
Time to log~3 sec / meal
App rating4.5 ★ · 12.3k reviews
[08] · Questions, answered

The things people ask before they download.

What does "multi-modal" actually mean here?
Three input modes for logging the same meal: type a quick description ("two eggs and a slice of sourdough"), snap a photo, or hold the mic and speak it out loud. The result is the same Calorai entry regardless of which input you used. Use whichever fits the moment — typing for breakfast at home, photo for restaurants, voice for hands-busy lunch.
How accurate is the photo recognition?
Single plated dishes — bowl, plate, sandwich — typically land within 10–15% of true calories. Composite meals with multiple items in one frame are detected (the vision model lists each component), but portion-sizing for hidden ingredients in stews and casseroles remains the weak spot. The voice and typed inputs sidestep this entirely — you describe what you ate, the AI looks it up.
Is Calorai free?
The free tier covers daily typed and photo logging plus the daily ring. Premium ($4.99/month or discounted annual) unlocks voice logging, weekly trend analytics, offline mode, composite meal detection and unlimited entries. The 3-day free trial converts to a yearly subscription — check the App Store for current pricing on your region.
How does Calorai compare to MyFitnessPal?
MyFitnessPal wins on database size (14M items, 15-year head start) and barcode breadth — if most of your food has a barcode, log it there. Calorai wins on input speed — typing, photo or voice are all faster than searching a database. Use Calorai daily, MyFitnessPal when you want a deep packaged-food audit.
What are the honest limits of Calorai?
Four real ones. (1) AI accuracy drops on composite/hidden-ingredient dishes (stews, casseroles, layered burritos). (2) Voice transcription has cooking-vocabulary edge cases (rare ingredients, regional dish names) and struggles in noisy environments. (3) Correcting a misidentified item takes 3–5 taps — not bad, but not one tap either. (4) The verified food database is newer and smaller than MyFitnessPal's. We trade database depth for input speed — that's the design choice, not a bug.
What happens if voice transcription gets it wrong?
You see the AI's interpretation before it logs — "Did you mean: chicken thigh, 150g, with brown rice?" — and confirm or edit in two taps. The transcript is shown so you can verify what was heard. Voice fails most often on noisy environments (subway, busy cafe), where typing usually wins anyway.
Does Calorai work offline?
Photo and voice logging require connection (the AI processing runs server-side). Typed entries with cached database items work fully offline. Premium subscribers get expanded offline-cached entries that sync when reconnection happens.
What about privacy and data?
Photos and voice clips are processed on Calorai servers and not used for advertising. The App Store privacy disclosure shows some anonymised analytics shared for product improvement — you can opt out in Settings without losing functionality. Voice clips are deleted after transcription unless you opt into model improvement contributions.
Is there a real difference between the various "CalorAI" apps in the store?
Yes — and this is worth being honest about. Several apps in the App Store and Google Play use "CalorAI" branding from different developers (Wizards Dynamics, Pham Van Thien, Freehold Ventures, techxnix, Manuel Worlitzer). They share the photo-logging concept but differ in input modes, database depth and pricing. Verify the developer name before download to make sure you're getting the multi-modal version described here.
[09] · Get the app · CR-FINAL-87FE

Type. Snap. Speak. Eat.

Three input modes, three seconds, one calorie log. Free tier covers typed and photo logging — voice is unlocked with Premium. Apple Health and Health Connect ready out of the box.

Get Calorai free
iOS & Android · Apple Health · Health Connect · No ads