GRPOLossProcessor
GRPOLossProcessor is a task processor wrapper designed for GRPO reinforcement learning training. It extends InputProcessor with GRPO-specific data preparation.
from twinkle.processor import GRPOLossProcessor

processor = GRPOLossProcessor(
    device_mesh=...,
    padding_free=False,
    framework='transformers',
)
model.set_processor(processor)
GRPOLossProcessor wraps the base InputProcessor and adds handling for GRPO-specific fields such as advantages, old log-probabilities, and reference log-probabilities that are required by the GRPO loss function.
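To make clear why these fields are needed, the sketch below shows how advantages, old log-probabilities, and reference log-probabilities typically enter a GRPO-style objective. It is a minimal illustration in plain Python, not twinkle's implementation: the function names `group_advantages` and `grpo_token_loss`, the defaults `clip_eps=0.2` and `beta=0.04`, and the choice of a population standard deviation are all assumptions made for this example.

```python
import math


def group_advantages(rewards, eps=1e-8):
    """Standardize rewards within one group of sampled completions.

    Hypothetical helper: uses the population standard deviation, which is
    an assumption; implementations differ on this detail.
    """
    mean = sum(rewards) / len(rewards)
    var = sum((r - mean) ** 2 for r in rewards) / len(rewards)
    std = math.sqrt(var)
    return [(r - mean) / (std + eps) for r in rewards]


def grpo_token_loss(logp, old_logp, ref_logp, advantage,
                    clip_eps=0.2, beta=0.04):
    """Loss (to be minimized) for a single token.

    logp:      log-probability under the policy being trained
    old_logp:  log-probability under the policy that sampled the data
    ref_logp:  log-probability under the frozen reference policy
    advantage: group-relative advantage of the completion
    """
    # Importance ratio between the current and the sampling policy.
    ratio = math.exp(logp - old_logp)
    clipped = max(min(ratio, 1.0 + clip_eps), 1.0 - clip_eps)
    # PPO-style clipped surrogate: take the more pessimistic of the two terms.
    surrogate = min(ratio * advantage, clipped * advantage)
    # Non-negative KL estimate against the reference policy.
    log_r = ref_logp - logp
    kl = math.exp(log_r) - log_r - 1.0
    return -(surrogate - beta * kl)


if __name__ == "__main__":
    advantages = group_advantages([1.0, 0.0, 0.0, 1.0])
    loss = grpo_token_loss(logp=-1.2, old_logp=-1.0, ref_logp=-1.1,
                           advantage=advantages[0])
    print(advantages, loss)
```

Each of the three extra fields appears exactly once in the loss: `old_logp` in the importance ratio, `advantage` as the weight on the surrogate, and `ref_logp` in the KL penalty. A processor for GRPO therefore has to carry all three through batching alongside the usual model inputs.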
For standard SFT tasks, use InputProcessor directly. Use GRPOLossProcessor when your training loop involves GRPO or its variants.