Paper Bites

Small notes for reading research papers carefully.

Each Paper Bite is a focused Markdown note: why the paper matters, the core idea, how it works, what to inspect in the results, and practical takeaways.

Paper brief · 7 min read

Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment

Chain-of-Zoom treats extreme super-resolution as a sequence of zoom steps, using multi-scale prompts and preference alignment to guide details beyond a model's usual scale range.

Read

Paper brief · 6 min read

LoRA: Low-Rank Adaptation of Large Language Models

LoRA freezes a pretrained language model and learns small low-rank update matrices, making task adaptation much cheaper to train, store, and switch.

Read

Paper brief · 7 min read

Adding Conditional Control to Text-to-Image Diffusion Models

ControlNet adds spatial conditioning to pretrained text-to-image diffusion models while protecting the original model's generation quality through zero-initialized control branches.

Read

Paper brief · 7 min read

ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions

ViT-CoMer keeps the flexibility of a plain Vision Transformer, then adds a parallel convolutional branch so dense prediction tasks can use richer multi-scale features.

Read