Agentic Memory
Continual learning of AI agents — in-context learning, continual fine-tuning, and unlearning.

Context is the new weight. Low-latency control of what to remember, forget, and explore decides next-gen world-model-aware, self-improving AI. I work on continual learning, unlearning, memory management, and task adaptation — at the agentic context (in-context RL, harness) and model level (distillation, finetuning).
Continual learning of AI agents — in-context learning, continual fine-tuning, and unlearning.
In-context world models, adaptation to post-training task worlds, and adapting agents in evolving envs.
Efficient attention architectures, KV-cache compression, latent segmentation, and recurrent transformers.
Synthetic data training, risks of multi-agent interaction, post-training guardrails, and AI behavioral study.
photonics & metasurface design · dynamic systems
bio-inspired flight, sensing, and locomotion
policy learning for physical control and agent behavior
regularization design for variational autoencoders
diffusion models · vision-language model architecture
coordination, division of labor, and mutual verification across agents — continual adaptation at the population level
in-context learning, continual fine-tuning, unlearning, and memory scaffolds — adapting agents at the context and model level
predictive world models and the architectures to serve them in real time