Research

Making rumen health visible, one thermal frame at a time.

Our approach combines optical gas imaging with purpose-built deep learning. We capture the gases cattle emit, teach lightweight networks to find and measure those plumes, and connect plume behavior to subacute ruminal acidosis — all in real time.

Data & Methodology

Three complementary data sources

We build from the controlled to the chaotic — establishing ground truth in the lab, then proving the system on live animals.

Controlled CO2 release chamber
01 CONTROLLED CO₂ RELEASE

A calibrated gas chamber

Known volumes of CO₂ are released into an insulated chamber and recorded with FLIR optical gas imaging cameras. Because every release is measured, each frame carries a precise, trustworthy label — the foundation for supervised plume segmentation.

This controlled corpus lets the models learn what a gas plume looks like thermally before they ever face a noisy barn.

02 IN-VITRO FERMENTATION

Recreating the rumen in a bottle

Using ANKOM gas-production arrays, we simulate rumen fermentation under controlled diet and pH conditions — including those that drive acidosis. The systems measure gas output directly, tying the chemistry of fermentation to the gas signatures our cameras observe.

This bridges the gap between what a camera sees and what is actually happening inside the rumen.

In-vitro rumen fermentation system
Field data collection with live cattle
03 FIELD DEPLOYMENT

Live cattle, real conditions

Finally, dual OGI cameras record cattle at the SIU farm. Models pre-trained on controlled data are fine-tuned on these real recordings — adapting to the motion, clutter, and thermal variability of an actual barn.

It is here that the system has to earn its accuracy, and where it ultimately delivers.

The detection pipeline

01 · CAPTURE

OGI thermal video

Cameras render CO₂ and methane as visible plumes.

02 · SEGMENT

Plume isolation

Lightweight networks mask the plume pixel-by-pixel.

03 · QUANTIFY

Emission metrics

Plume area and dynamics become fermentation proxies.

04 · CLASSIFY

SARA risk

Vision-language models flag acidosis risk.

Models

A system that grew across three papers

ISVC 2025 · SPRINGER

CarboFormer

Single-gas CO₂ segmentation

A lightweight semantic segmentation architecture purpose-built for CO₂ detection in optical gas imaging. CarboFormer pairs strong accuracy with a small footprint — pre-trained on controlled CO₂ release data and fine-tuned on real cattle recordings — so it can run in real time on edge hardware.

92.98%mIoU
5.07Mparameters
84.68FPS
WACV 2026 · WORKSHOP

FUME

Multi-gas emission network

FUME extends the system from one gas to many. A fused, unified network jointly segments CO₂ and methane plumes from livestock rumen emissions — recognizing that the two gases together tell a richer story about fermentation and animal health than either alone.

CO₂ + CH₄ jointly Unified architecture IEEE/CVF HARVEST-Vision
UNDER REVIEW

VLMDual

Vision-language distillation

The most recent step closes the loop from perception to diagnosis. VLMDual is a vision-language distillation framework that jointly performs classification and plume segmentation — distilling the knowledge of large vision-language models into an efficient network that can both see the plume and judge acidosis risk.

Joint classify + segment Knowledge distillation Smart Agricultural Technology
Benchmark

CarboFormer vs. the field

Semantic segmentation performance, pre-trained on controlled CO₂ release and fine-tuned on real cattle data. Best result in each column in maroon.

Model mIoU % mAcc % mFscore % mPrec. % mRecall % aAcc %
FastSCNN87.1891.2493.0095.1391.2495.31
BiseNetV287.9592.5993.4694.4292.5995.52
DDRNet88.2892.8693.6694.5292.8695.65
Swin-T91.1595.2695.3195.3695.2696.71
SegNeXt91.1695.4595.3195.1795.4596.71
DeepLabv3+91.6895.0395.6096.2095.0396.96
SegFormer-B092.3596.0095.9795.9596.0097.18
CarboFormer (Ours)92.9896.3196.0796.8495.3197.00
Source: Islam et al., “CarboFormer,” ISVC 2025. Pre-trained on controlled CO₂ release → fine-tuned on real cattle data.
Benchmark · Multi-Gas

FUME vs. the field

Joint CO₂ + methane segmentation, evaluated against efficient and OGI-specific models. Metrics are averaged across both gas modalities (three classes each). The FUME row is highlighted.

Model Acc % mIoU % Dice % HD95 ↓ ASD ↓ Params M FPS
BiSeNetV2 99.04 75.91 85.85 72.13 20.89 2.77 282.49
ENet 97.86 46.67 54.13 101.16 34.24 0.26 369.00
DDRNetSlim 94.98 56.97 70.55 140.50 46.13 3.39 602.41
ESPNetV2 90.81 51.88 58.00 92.32 26.55 2.22 387.60
DANet 48.50 47.23 54.49 103.85 34.65 1.43 334.45
RTFNet 46.47 48.56 55.55 107.90 33.22 35.01 257.73
MTINet 46.37 44.53 52.27 113.11 39.20 1.80 314.47
CMX 33.65 26.69 29.64 187.34 61.28 4.41 505.05
Gasformer 89.56 62.09 74.31 42.51 10.56 3.65 177.50
GasTwinFormer 86.27 47.80 55.77 47.77 14.18 3.35 158.20
CarboFormer 87.92 54.95 65.04 41.14 10.37 3.81 107.40
FUME (Ours) 98.82 80.99 89.23 46.58 14.00 1.28 326.80
Source: Islam et al., “FUME,” WACV 2026 Workshops (HARVEST-Vision), Table 1 (selected metrics). HD95 / ASD in pixels; ↓ lower is better.
Funding & Acknowledgment

This work is supported by the United States Department of Agriculture, National Institute of Food and Agriculture (USDA-NIFA), through the Capacity Building Grants for Non-Land-Grant Colleges of Agriculture.

GRANT NO.2023-70001-40997