feat(vertexai): Gemini multimodal output #8922

dlarocque · 2025-04-10T21:43:22Z

Adds new ResponseModality enum that allows users to specify which modalities should be included in a response.
Since we provide a text() accessor, a similar inlineDataParts() accessor was added to return all InlineDataPart[] in the first candidate.

API Proposal: https://goto.google.com/vinf-multimodal-output-api (internal)

changeset-bot · 2025-04-10T21:43:26Z

🦋 Changeset detected

Latest commit: b898f35

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 2 packages

Name	Type
firebase	Minor
@firebase/vertexai	Minor

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

github-actions · 2025-04-10T21:43:38Z

Vertex AI Mock Responses Check ⚠️

A newer major version of the mock responses for Vertex AI unit tests is available. update_vertexai_responses.sh should be updated to clone the latest version of the responses: v10.0

google-oss-bot · 2025-04-10T21:53:13Z

Size Report ¹

Affected Products

`@firebase/auth`

Type	Base (`ea1f913`)	Merge (`2a36e4a`)	Diff
browser	193 kB	193 kB	+209 B (+0.1%)
cordova	166 kB	166 kB	+209 B (+0.1%)
main	147 kB	147 kB	+194 B (+0.1%)
module	193 kB	193 kB	+209 B (+0.1%)
react-native	165 kB	165 kB	+194 B (+0.1%)

@firebase/auth-cordova
Type Base (ea1f913) Merge (2a36e4a) Diff
browser 166 kB 166 kB +209 B (+0.1%)
module 166 kB 166 kB +209 B (+0.1%)
@firebase/auth-web-extension
Type Base (ea1f913) Merge (2a36e4a) Diff
browser 142 kB 142 kB +209 B (+0.1%)
main 159 kB 159 kB +200 B (+0.1%)
module 142 kB 142 kB +209 B (+0.1%)
@firebase/auth/internal
Type Base (ea1f913) Merge (2a36e4a) Diff
browser 204 kB 204 kB +209 B (+0.1%)
main 173 kB 174 kB +200 B (+0.1%)
module 204 kB 204 kB +209 B (+0.1%)
@firebase/data-connect
Type Base (ea1f913) Merge (2a36e4a) Diff
browser 21.4 kB 21.7 kB +281 B (+1.3%)
main 23.7 kB 23.9 kB +266 B (+1.1%)
module 21.4 kB 21.7 kB +281 B (+1.3%)
@firebase/database
Type Base (ea1f913) Merge (2a36e4a) Diff
browser 249 kB 249 kB +62 B (+0.0%)
main 254 kB 254 kB +61 B (+0.0%)
module 249 kB 249 kB +62 B (+0.0%)
@firebase/database-compat/standalone
Type Base (ea1f913) Merge (2a36e4a) Diff
main 366 kB 366 kB +61 B (+0.0%)
@firebase/firestore
Type Base (ea1f913) Merge (2a36e4a) Diff
browser 384 kB 385 kB +243 B (+0.1%)
main 594 kB 595 kB +291 B (+0.0%)
module 384 kB 385 kB +243 B (+0.1%)
react-native 384 kB 385 kB +243 B (+0.1%)
@firebase/firestore-lite
Type Base (ea1f913) Merge (2a36e4a) Diff
browser 114 kB 114 kB +226 B (+0.2%)
main 157 kB 157 kB +401 B (+0.3%)
module 114 kB 114 kB +226 B (+0.2%)
react-native 114 kB 114 kB +227 B (+0.2%)
@firebase/functions
Type Base (ea1f913) Merge (2a36e4a) Diff
browser 14.0 kB 14.1 kB +73 B (+0.5%)
main 14.6 kB 14.7 kB +67 B (+0.5%)
module 14.0 kB 14.1 kB +73 B (+0.5%)
@firebase/storage
Type Base (ea1f913) Merge (2a36e4a) Diff
browser 58.0 kB 58.4 kB +389 B (+0.7%)
main 59.4 kB 60.0 kB +540 B (+0.9%)
module 58.0 kB 58.4 kB +389 B (+0.7%)
@firebase/util
Type Base (ea1f913) Merge (2a36e4a) Diff
browser 23.4 kB 23.5 kB +123 B (+0.5%)
main 29.5 kB 29.7 kB +193 B (+0.7%)
module 23.4 kB 23.5 kB +123 B (+0.5%)
@firebase/vertexai
Type Base (ea1f913) Merge (2a36e4a) Diff
browser 34.7 kB 35.9 kB +1.18 kB (+3.4%)
main 35.7 kB 36.9 kB +1.21 kB (+3.4%)
module 34.7 kB 35.9 kB +1.18 kB (+3.4%)

`bundle`

39 size changes

Type	Base (`ea1f913`)	Merge (`2a36e4a`)	Diff
auth (Anonymous)	77.7 kB	77.8 kB	+132 B (+0.2%)
auth (EmailAndPassword)	87.8 kB	87.9 kB	+131 B (+0.1%)
auth (GoogleFBTwitterGitHubPopup)	105 kB	105 kB	+249 B (+0.2%)
auth (GooglePopup)	102 kB	102 kB	+131 B (+0.1%)
auth (GoogleRedirect)	102 kB	102 kB	+131 B (+0.1%)
auth (Phone)	95.2 kB	95.3 kB	+132 B (+0.1%)
database (Append to a list of data)	150 kB	150 kB	+104 B (+0.1%)
database (Filtering data)	149 kB	149 kB	+104 B (+0.1%)
database (Listen for child events)	165 kB	165 kB	+104 B (+0.1%)
database (Listen for value events + Detach listeners)	165 kB	165 kB	+104 B (+0.1%)
database (Listen for value events)	165 kB	165 kB	+104 B (+0.1%)
database (Read data once)	165 kB	165 kB	+104 B (+0.1%)
database (Save data as transactions)	167 kB	167 kB	+104 B (+0.1%)
database (Sort data)	150 kB	151 kB	+104 B (+0.1%)
database (Write data)	149 kB	149 kB	+104 B (+0.1%)
firestore (CSI Auto Indexing Disable and Delete)	274 kB	275 kB	+229 B (+0.1%)
firestore (CSI Auto Indexing Enable)	274 kB	275 kB	+229 B (+0.1%)
firestore (Persistence)	306 kB	306 kB	+229 B (+0.1%)
firestore (Query Cursors)	251 kB	252 kB	+231 B (+0.1%)
firestore (Query)	249 kB	249 kB	+231 B (+0.1%)
firestore (Read data once)	237 kB	237 kB	+231 B (+0.1%)
firestore (Read Write w Persistence)	330 kB	331 kB	+231 B (+0.1%)
firestore (Realtime updates)	239 kB	239 kB	+231 B (+0.1%)
firestore (Transaction)	216 kB	216 kB	+231 B (+0.1%)
firestore (Write data)	216 kB	216 kB	+231 B (+0.1%)
firestore-lite (Query Cursors)	105 kB	105 kB	+270 B (+0.3%)
firestore-lite (Query)	101 kB	101 kB	+270 B (+0.3%)
firestore-lite (Read data once)	76.0 kB	76.3 kB	+270 B (+0.4%)
firestore-lite (Transaction)	101 kB	102 kB	+270 B (+0.3%)
firestore-lite (Write data)	85.6 kB	85.9 kB	+270 B (+0.3%)
functions (call)	34.9 kB	35.0 kB	+113 B (+0.3%)
storage (getBytes)	42.5 kB	42.8 kB	+302 B (+0.7%)
storage (getDownloadURL)	44.6 kB	44.9 kB	+302 B (+0.7%)
storage (getMetadata)	44.0 kB	44.3 kB	+302 B (+0.7%)
storage (list + listAll)	43.5 kB	43.8 kB	+302 B (+0.7%)
storage (updateMetadata)	44.3 kB	44.6 kB	+302 B (+0.7%)
storage (uploadBytes)	49.2 kB	49.5 kB	+302 B (+0.6%)
storage (uploadBytesResumable)	59.1 kB	59.4 kB	+302 B (+0.5%)
storage (uploadString)	49.4 kB	49.7 kB	+302 B (+0.6%)

`firebase`

16 size changes

Type	Base (`ea1f913`)	Merge (`2a36e4a`)	Diff
firebase-auth-compat.js	141 kB	141 kB	+207 B (+0.1%)
firebase-auth-cordova.js	138 kB	138 kB	+284 B (+0.2%)
firebase-auth-web-extension.js	120 kB	121 kB	+284 B (+0.2%)
firebase-auth.js	158 kB	158 kB	+284 B (+0.2%)
firebase-compat.js	797 kB	797 kB	+546 B (+0.1%)
firebase-data-connect.js	17.9 kB	18.2 kB	+302 B (+1.7%)
firebase-database-compat.js	164 kB	164 kB	+93 B (+0.1%)
firebase-database.js	187 kB	187 kB	+123 B (+0.1%)
firebase-firestore-compat.js	342 kB	342 kB	+223 B (+0.1%)
firebase-firestore-lite.js	132 kB	133 kB	+300 B (+0.2%)
firebase-firestore.js	443 kB	443 kB	+326 B (+0.1%)
firebase-functions-compat.js	10.5 kB	10.5 kB	+89 B (+0.9%)
firebase-functions.js	14.9 kB	15.0 kB	+126 B (+0.8%)
firebase-storage-compat.js	39.8 kB	40.1 kB	+279 B (+0.7%)
firebase-storage.js	46.4 kB	46.7 kB	+310 B (+0.7%)
firebase-vertexai.js	28.3 kB	29.2 kB	+955 B (+3.4%)

Test Logs

https://storage.googleapis.com/firebase-sdk-metric-reports/dIhHPMPItz.html

google-oss-bot · 2025-04-10T22:04:38Z

Size Analysis Report ¹

This report is too large (443,341 characters) to be displayed here in a GitHub comment. Please use the below link to see the full report on Google Cloud Storage.

Test Logs

https://storage.googleapis.com/firebase-sdk-metric-reports/uc142TVveY.html

github-actions · 2025-04-30T14:58:45Z

Changeset File Check ✅

No modified packages are missing from the changeset file.
No changeset formatting errors detected.

hsubox76 · 2025-04-30T16:26:06Z

packages/vertexai/src/types/enums.ts

+ *
+ * @beta
+ */
+export const ResponseModality = {


This is the code object we agreed we should be exporting instead of TS enums so I get that, but we've had build issues in the past mixing JS code in types files so we should probably put these in a separate file. Looks like it's not causing build issues now so maybe we can move it along with the others whenever we plan to convert all our enums to JS objects.

packages/vertexai/src/types/responses.ts

docs-devsite/vertexai.generationconfig.md

rachelsaunders · 2025-05-05T14:28:23Z

docs-devsite/vertexai.generationconfig.md

+
+Generation modalities to be returned in generation responses.
+
+- Multimodal response generation is only supported in some Gemini models and versions; see [model versions](https://firebase.google.com/docs/vertex-ai/models)<!-- -->. - Only image generation (`ResponseModality.IMAGE`<!-- -->) is supported.


Suggested change

- Multimodal response generation is only supported in some Gemini models and versions; see [model versions](https://firebase.google.com/docs/vertex-ai/models). - Only image generation (`ResponseModality.IMAGE`) is supported.

- Multimodal response generation is only supported by some Gemini models and versions; see [model versions](https://firebase.google.com/docs/vertex-ai/models). - Only image generation (`ResponseModality.IMAGE`) is supported.

rachelsaunders · 2025-05-05T14:28:52Z

packages/vertexai/src/types/requests.ts

+   * Generation modalities to be returned in generation responses.
+   *
+   * @remarks
+   *  - Multimodal response generation is only supported in some Gemini models and versions; see {@link https://firebase.google.com/docs/vertex-ai/models | model versions}.


Suggested change

* - Multimodal response generation is only supported in some Gemini models and versions; see {@link https://firebase.google.com/docs/vertex-ai/models | model versions}.

* - Multimodal response generation is only supported by some Gemini models and versions; see {@link https://firebase.google.com/docs/vertex-ai/models | model versions}.

dlarocque force-pushed the dl/gemini-image-out branch 2 times, most recently from 9a05c5b to 4f7f1ec Compare April 16, 2025 13:25

feat(vertexai): Gemini multimodal output

1d58a06

dlarocque force-pushed the dl/gemini-image-out branch from 4f7f1ec to 1d58a06 Compare April 30, 2025 14:38

dlarocque changed the title ~~[WIP] feat(vertexai): Gemini multimodal output~~ feat(vertexai): Gemini multimodal output Apr 30, 2025

Add changeset

172bc4b

dlarocque requested a review from hsubox76 April 30, 2025 14:57

dlarocque marked this pull request as ready for review April 30, 2025 14:57

dlarocque requested review from a team as code owners April 30, 2025 14:57

fix formatting

755d28b

add minor firebase bump

f0fbe8b

hsubox76 reviewed Apr 30, 2025

View reviewed changes

packages/vertexai/src/types/responses.ts Outdated Show resolved Hide resolved

hsubox76 approved these changes Apr 30, 2025

View reviewed changes

update docs

2d498c2

dlarocque requested a review from rachelsaunders May 2, 2025 20:41

update inlineDataParts docs to add plural

fea4dda

rachelsaunders requested changes May 5, 2025

View reviewed changes

docs-devsite/vertexai.generationconfig.md Outdated Show resolved Hide resolved

Refer to docs for gemini model support

b898f35

dlarocque requested a review from rachelsaunders May 5, 2025 14:26

rachelsaunders approved these changes May 5, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(vertexai): Gemini multimodal output #8922

feat(vertexai): Gemini multimodal output #8922

dlarocque commented Apr 10, 2025 •

edited

Loading

changeset-bot bot commented Apr 10, 2025 •

edited

Loading

github-actions bot commented Apr 10, 2025

google-oss-bot commented Apr 10, 2025 •

edited

Loading

`@firebase/auth`

`@firebase/auth-cordova`

`@firebase/auth-web-extension`

`@firebase/auth/internal`

`@firebase/data-connect`

`@firebase/database`

`@firebase/database-compat/standalone`

`@firebase/firestore`

`@firebase/firestore-lite`

`@firebase/functions`

`@firebase/storage`

`@firebase/util`

`@firebase/vertexai`

`bundle`

`firebase`

google-oss-bot commented Apr 10, 2025 •

edited

Loading

github-actions bot commented Apr 30, 2025 •

edited

Loading

hsubox76 Apr 30, 2025

rachelsaunders May 5, 2025

rachelsaunders May 5, 2025


		Generation modalities to be returned in generation responses.

		- Multimodal response generation is only supported in some Gemini models and versions; see [model versions](https://firebase.google.com/docs/vertex-ai/models)<!-- -->. - Only image generation (`ResponseModality.IMAGE`<!-- -->) is supported.

	* - Multimodal response generation is only supported in some Gemini models and versions; see {@link https://firebase.google.com/docs/vertex-ai/models \| model versions}.
	* - Multimodal response generation is only supported by some Gemini models and versions; see {@link https://firebase.google.com/docs/vertex-ai/models \| model versions}.

feat(vertexai): Gemini multimodal output #8922

Are you sure you want to change the base?

feat(vertexai): Gemini multimodal output #8922

Conversation

dlarocque commented Apr 10, 2025 • edited Loading

changeset-bot bot commented Apr 10, 2025 • edited Loading

🦋 Changeset detected

github-actions bot commented Apr 10, 2025

Vertex AI Mock Responses Check ⚠️

google-oss-bot commented Apr 10, 2025 • edited Loading

Size Report 1

Affected Products

@firebase/auth

@firebase/auth-cordova

@firebase/auth-web-extension

@firebase/auth/internal

@firebase/data-connect

@firebase/database

@firebase/database-compat/standalone

@firebase/firestore

@firebase/firestore-lite

@firebase/functions

@firebase/storage

@firebase/util

@firebase/vertexai

bundle

firebase

Test Logs

google-oss-bot commented Apr 10, 2025 • edited Loading

Size Analysis Report 1

Test Logs

github-actions bot commented Apr 30, 2025 • edited Loading

Changeset File Check ✅

hsubox76 Apr 30, 2025

Choose a reason for hiding this comment

rachelsaunders May 5, 2025

Choose a reason for hiding this comment

rachelsaunders May 5, 2025

Choose a reason for hiding this comment

dlarocque commented Apr 10, 2025 •

edited

Loading

changeset-bot bot commented Apr 10, 2025 •

edited

Loading

google-oss-bot commented Apr 10, 2025 •

edited

Loading

Size Report ¹

`@firebase/auth`

`@firebase/auth-cordova`

`@firebase/auth-web-extension`

`@firebase/auth/internal`

`@firebase/data-connect`

`@firebase/database`

`@firebase/database-compat/standalone`

`@firebase/firestore`

`@firebase/firestore-lite`

`@firebase/functions`

`@firebase/storage`

`@firebase/util`

`@firebase/vertexai`

`bundle`

`firebase`

google-oss-bot commented Apr 10, 2025 •

edited

Loading

Size Analysis Report ¹

github-actions bot commented Apr 30, 2025 •

edited

Loading