SparkNLP - 1123 Introducing InternVL #14578

prabod · 2025-05-16T05:12:38Z

Description

This pull request introduces support for the InternVLForMultiModal model, a multimodal large language model designed for visual question answering. The changes include the addition of a new annotator, utility functions for image preprocessing, and test cases to validate functionality. This annotator can load InternVL 2, 2.5 and 3 family of models.

Motivation and Context

How Has This Been Tested?

Screenshots (if appropriate):

Types of changes

Bug fix (non-breaking change which fixes an issue)
Code improvements with no or little impact
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)

Checklist:

My code follows the code style of this project.
My change requires a change to the documentation.
I have updated the documentation accordingly.
I have read the CONTRIBUTING page.
I have added tests to cover my changes.
All new and existing tests passed.

Signed-off-by: Prabod Rathnayaka <prabod@rathnayaka.me>

DevinTDHa

Some minor changes needed for the python part (configProtoBytes not used), which I'll remove during the merge, but other than that looks good to me!

DevinTDHa · 2025-05-23T09:42:49Z

python/sparknlp/annotator/cv/internvl_for_multimodal.py

+
+ outputAnnotatorType = AnnotatorType.DOCUMENT
+
+ configProtoBytes = Param(Params._dummy(),


Config Protobytes not needed

DevinTDHa · 2025-05-23T09:43:43Z

python/test/annotator/cv/internvl_for_multimodal_test.py

+ image_assembler = ImageAssembler().setInputCol("image").setOutputCol("image_assembler")
+
+ imageClassifier = (InternVLForMultiModal \
+ .loadSavedModel("/mnt/research/Projects/ModelZoo/internVL/models/int4/OpenGVLab/InternVL2-1B", self.spark) \


We should change this to pretrained() for the master branch

* add intervl scala api * add internvl python api * internvl docs * update scala and python api for tests Signed-off-by: Prabod Rathnayaka <prabod@rathnayaka.me> * add notebook * InternVL: minor python adjustments --------- Signed-off-by: Prabod Rathnayaka <prabod@rathnayaka.me> Co-authored-by: Devin Ha <devin@trungducha.de>

prabod added 5 commits May 16, 2025 05:05

add intervl scala api

dee49e0

add internvl python api

f4b01be

internvl docs

f7a3480

update scala and python api for tests

d99b2dd

Signed-off-by: Prabod Rathnayaka <prabod@rathnayaka.me>

add notebook

132433d

prabod requested a review from DevinTDHa May 16, 2025 05:12

prabod self-assigned this May 16, 2025

prabod added the new-feature Introducing a new feature label May 16, 2025

DevinTDHa changed the base branch from master to release/602-release-candidate May 23, 2025 08:20

DevinTDHa approved these changes May 23, 2025

View reviewed changes

InternVL: minor python adjustments

237b6e1

DevinTDHa merged commit 56512b0 into release/602-release-candidate May 23, 2025
4 checks passed

DevinTDHa mentioned this pull request May 23, 2025

Release Spark NLP 6.0.2 #14583

Merged

10 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

SparkNLP - 1123 Introducing InternVL #14578

SparkNLP - 1123 Introducing InternVL #14578

Uh oh!

prabod commented May 16, 2025

DevinTDHa left a comment

DevinTDHa May 23, 2025

DevinTDHa May 23, 2025

Uh oh!

Labels

3 participants


		outputAnnotatorType = AnnotatorType.DOCUMENT

		configProtoBytes = Param(Params._dummy(),

SparkNLP - 1123 Introducing InternVL #14578

SparkNLP - 1123 Introducing InternVL #14578

Uh oh!

Conversation

prabod commented May 16, 2025

Description

Motivation and Context

How Has This Been Tested?

Screenshots (if appropriate):

Types of changes

Checklist:

DevinTDHa left a comment

Choose a reason for hiding this comment

DevinTDHa May 23, 2025

Choose a reason for hiding this comment

DevinTDHa May 23, 2025

Choose a reason for hiding this comment

Uh oh!

Labels

3 participants