Posted on May 14

GRPO to get structured data

I saw this line "GRPO gives developers a way to teach models how to reason better, faster, and without breaking the bank." and just loved it.
I personally feel we will see more task specific models and GRPO is a good choice if the task is logical or format-sensitive..
Took a 1.5B Qwen2.5-Coder model, fine-tuned with GRPO, asking to extract structured JSON from OCR text based on 'any user-defined schema'. Needs more work but it works!

Here is the model, you can use it in combination with paddleocr:

https://huggingface.co/MayankLad31/invoice_schema

My interest in the intersection of AI and developer advocacy has been growing significantly.

Top comments (4)

Dotallio • May 14

Really cool use case! Did you run into any tricky edge cases with OCR errors or inconsistent fields during extraction?

Mayank Laddha • May 15

I was more focused on getting structured JSON based on 'any user-defined schema'..so basically user can define the schema. for ocr i used paddleocr but i guess we can use better ones..

Nathan Tarbert • May 14

Pretty cool seeing how GRPO levels things up for structured data. I love trying new stuff like this, especially when it's not just hype.

Mayank Laddha • May 14

thanks! yeah for me unless i try and see for myself, it is difficult to know how useful something is or does it really work?