|
61 | 61 | "\n", |
62 | 62 | "### OpenAI Model Evolution \n", |
63 | 63 | "\n", |
64 | | - "\n", |
| 64 | + |
| 65 | + "\n", |
| 66 | + |
65 | 67 | "\n", |
66 | 68 | "### Key Characteristics\n", |
67 | 69 | "\n", |
|
79 | 81 | "\n", |
80 | 82 | "## 3A. Use Case: Long-Context RAG for Legal Q&A\n", |
81 | 83 | "\n", |
82 | | - "\n", |
| 84 | + |
| 85 | + "\n", |
| 86 | + |
83 | 87 | "## 🗂️ TL;DR Matrix\n", |
84 | 88 | "\n", |
85 | 89 | "This table summarizes the core technology choices and their rationale for **this specific Long-Context Agentic RAG implementation**.\n", |
|
133 | 137 | "id": "db9bad1b", |
134 | 138 | "metadata": {}, |
135 | 139 | "source": [ |
136 | | - "\n", |
| 140 | + |
| 141 | + "\n", |
| 142 | + |
137 | 143 | "\n", |
138 | 144 | "\n", |
139 | 145 | "## Agentic RAG System: Model Usage\n", |
|
1815 | 1821 | "================================================================================\n", |
1816 | 1822 | "\n", |
1817 | 1823 | "## 3B. Use Case: AI Co-Scientist for Pharma R&D\n", |
1818 | | - "\n", |
| 1824 | + |
| 1825 | + "\n", |
| 1826 | + |
1819 | 1827 | "\n", |
1820 | 1828 | "This section details how to build an AI system that functions as a \"co-scientist\" to accelerate experimental design in pharmaceutical R&D, focusing on optimizing a drug synthesis process under specific constraints.\n", |
1821 | 1829 | "\n", |
|
1855 | 1863 | "\n", |
1856 | 1864 | "The system employs a multi-agent architecture that emulates a high-performing scientific team. Different AI components, acting in specialized roles (such as ideation, critique, and learning from outcomes), collaborate using various models and tools to execute the workflow.\n", |
1857 | 1865 | "\n", |
1858 | | - "\n", |
| 1866 | + |
| 1867 | + "\n", |
| 1868 | + |
1859 | 1869 | "\n", |
1860 | 1870 | "### 2.1. **Scientist Input & Constraints:** \n", |
1861 | 1871 | "The process starts with the scientist defining the goal, target compound, and constraints." |
|
2463 | 2473 | "\n", |
2464 | 2474 | "## 3C. Use Case: Insurance Claim Processing\n", |
2465 | 2475 | "\n", |
2466 | | - "\n", |
| 2476 | + |
| 2477 | + "\n", |
| 2478 | + |
2467 | 2479 | "\n", |
2468 | 2480 | "Many businesses are faced with the task of digitizing hand filled forms. In this section, we will demonstrate how OpenAI can be used to digitize and validate a hand filled insurance form. While this is a common problem for insurance, the same techniques can be applied to a variety of other industries and forms, for example tax forms, invoices, and more.\n", |
2469 | 2481 | "\n", |
|
2493 | 2505 | "\n", |
2494 | 2506 | "The high level basic architecture of the solution is shown below.\n", |
2495 | 2507 | "\n", |
2496 | | - "\n", |
| 2508 | + |
| 2509 | + "\n", |
| 2510 | + |
2497 | 2511 | "\n", |
2498 | 2512 | "This task is complex and requires a wide variety of model capabilities, including vision, function calling, reasoning, and structured output. While `o3` is capable of doing all of these at once, we found during experimentation that `o4-mini` alone was not sufficient to achieve the necessary performance. Due to the higher relative costs of `o3`, we instead opted for a two-stage approach.\n", |
2499 | 2513 | "\n", |
|
2503 | 2517 | "\n", |
2504 | 2518 | "To demonstrate concretely how this works, let's look at a sample image of an insurance form.\n", |
2505 | 2519 | "\n", |
2506 | | - "\n", |
| 2520 | + |
| 2521 | + "\n", |
| 2522 | + |
2507 | 2523 | "\n", |
2508 | 2524 | "While the form itself is fairly straightforward, there is missing data and ambiguous information that will be difficult for a traditional OCR system to fill out correctly. First, notice that the zip code and county have been omitted. Second, the email address of the user is ambiguous \\-- it could be `[email protected]` or `[email protected]`. In the following sections, we will walk through how a well-designed solution can handle these ambiguities and return the correct form results.\n", |
2509 | 2525 | "\n", |
|
3186 | 3202 | "\n", |
3187 | 3203 | "## Adaptation Decision Tree\n", |
3188 | 3204 | "\n", |
3189 | | - "\n", |
| 3205 | + |
| 3206 | + "\n", |
| 3207 | + |
| 3208 | + |
3190 | 3209 | "\n", |
3191 | 3210 | "## Communicating Model Selection to Non-Technical Stakeholders\n", |
3192 | 3211 | "\n", |
|
3286 | 3305 | "\n", |
3287 | 3306 | "## Contributors\n", |
3288 | 3307 | "\n", |
3289 | | - "- Kashyap Coimbatore Murali\n", |
3290 | | - "- Nate Harada \n", |
3291 | | - "- Sai Prashanth Soundararaj \n", |
3292 | | - "- Shikhar Kwatra " |
| 3308 | + |
| 3309 | + "- [Kashyap Coimbatore Murali](https://www.linkedin.com/in/kashyap-murali/)\n", |
| 3310 | + "- [Nate Harada](https://www.linkedin.com/in/nate-harada/) \n", |
| 3311 | + "- [Sai Prashanth Soundararaj](https://www.linkedin.com/in/saiprashanths/)\n", |
| 3312 | + "- [Shikhar Kwatra](https://www.linkedin.com/in/shikharkwatra/)" |
| 3313 | + |
3293 | 3314 | ] |
3294 | 3315 | } |
3295 | 3316 | ], |
|
0 commit comments