@@ -255,26 +255,26 @@ Low, high, and embedding models have different rate limits. To see which type of
255
255
<td>1</td>
256
256
</tr >
257
257
<tr >
258
- <th rowspan="4" scope="rowgroup" style="box-shadow: none" ><b>Azure OpenAI o1-mini </b></th>
258
+ <th rowspan="4" scope="rowgroup"><b>Azure OpenAI o1 and o3 </b></th>
259
259
<th style="padding-left: 0"><b>Requests per minute</b></th>
260
260
<td>Not applicable</td>
261
+ <td>1</td>
262
+ <td>2</td>
261
263
<td>2</td>
262
- <td>3</td>
263
- <td>3</td>
264
264
</tr >
265
265
<tr >
266
266
<th><b>Requests per day</b></th>
267
267
<td>Not applicable</td>
268
+ <td>8</td>
269
+ <td>10</td>
268
270
<td>12</td>
269
- <td>15</td>
270
- <td>20</td>
271
271
</tr >
272
272
<tr >
273
273
<th><b>Tokens per request</b></th>
274
274
<td>Not applicable</td>
275
275
<td>4000 in, 4000 out</td>
276
276
<td>4000 in, 4000 out</td>
277
- <td>4000 in, 4000 out</td>
277
+ <td>4000 in, 8000 out</td>
278
278
</tr >
279
279
<tr >
280
280
<th><b>Concurrent requests</b></th>
@@ -284,7 +284,7 @@ Low, high, and embedding models have different rate limits. To see which type of
284
284
<td>1</td>
285
285
</tr >
286
286
<tr >
287
- <th rowspan="4" scope="rowgroup" style="box-shadow: none"><b>Azure OpenAI o3 -mini</b></th>
287
+ <th rowspan="4" scope="rowgroup" style="box-shadow: none"><b>Azure OpenAI o1-mini, o3-mini, and o4 -mini</b></th>
288
288
<th style="padding-left: 0"><b>Requests per minute</b></th>
289
289
<td>Not applicable</td>
290
290
<td>2</td>
@@ -313,7 +313,7 @@ Low, high, and embedding models have different rate limits. To see which type of
313
313
<td>1</td>
314
314
</tr >
315
315
<tr >
316
- <th rowspan="4" scope="rowgroup" style="box-shadow: none"><b>DeepSeek-R1</b></th>
316
+ <th rowspan="4" scope="rowgroup" style="box-shadow: none"><b>DeepSeek-R1 and MAI-DS-R1 </b></th>
317
317
<th style="padding-left: 0"><b>Requests per minute</b></th>
318
318
<td>1</td>
319
319
<td>1</td>
0 commit comments