@@ -974,18 +974,19 @@
 
 <div class="doc doc-contents first">
 
-<p>PDF client abstractions for OmniRead.</p>
-<hr />
-<h4 id="omniread.pdf.client--summary">Summary</h4>
+<h3 id="omniread.pdf.client--summary">Summary</h3>
+<p>PDF client abstractions for OmniRead.</p>
 <p>This module defines the <strong>client layer</strong> responsible for retrieving raw PDF
 bytes from a concrete backing store.</p>
 <p>Clients provide low-level access to PDF binaries and are intentionally
 decoupled from scraping and parsing logic. They do not perform validation,
 interpretation, or content extraction.</p>
-<p>Typical backing stores include:
-- Local filesystems
-- Object storage (S3, GCS, etc.)
-- Network file systems</p>
+<p>Typical backing stores include:</p>
+<ul>
+<li>Local filesystems</li>
+<li>Object storage (S3, GCS, etc.)</li>
+<li>Network file systems</li>
+</ul>
 
 
 
@@ -1014,14 +1015,20 @@ interpretation, or content extraction.</p>
 Bases: <code><span title="abc.ABC">ABC</span></code></p>
 
 
-<p>Abstract client responsible for retrieving PDF bytes
-from a specific backing store (filesystem, S3, FTP, etc.).</p>
+<p>Abstract client responsible for retrieving PDF bytes.</p>
+<p>Retrieves bytes from a specific backing store (filesystem, S3, FTP, etc.).</p>
 
 
 <details class="notes" open>
 <summary>Notes</summary>
 <p><strong>Responsibilities:</strong></p>
-<div class="language-text highlight"><table class="highlighttable"><tr><td class="linenos"><div class="linenodiv"><pre><span></span><span class="normal">1</span></pre></div></td><td class="code"><div><pre><span></span><code>- Implementations must accept a source identifier appropriate to the backing store, return the full PDF binary payload, and raise retrieval-specific errors on failure
+<div class="language-text highlight"><table class="highlighttable"><tr><td class="linenos"><div class="linenodiv"><pre><span></span><span class="normal">1</span>
+<span class="normal">2</span>
+<span class="normal">3</span>
+<span class="normal">4</span></pre></div></td><td class="code"><div><pre><span></span><code>- Implementations must accept a source identifier appropriate to
+the backing store.
+- Return the full PDF binary payload.
+- Raise retrieval-specific errors on failure.
 </code></pre></div></td></tr></table></div>
 </details>
 
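The responsibilities documented in this hunk (accept a source identifier, return the full PDF payload, raise a retrieval-specific error on failure) can be illustrated with a minimal Python sketch. The diff only confirms that the class derives from `abc.ABC`, so every name below (`BasePDFClient`, `fetch`, `PDFRetrievalError`, `InMemoryPDFClient`) is a hypothetical stand-in, not OmniRead's actual API.

```python
from abc import ABC, abstractmethod
from typing import Mapping


class PDFRetrievalError(Exception):
    """Hypothetical retrieval-specific error (name is an assumption)."""


class BasePDFClient(ABC):
    """Hypothetical abstract client: retrieves bytes, nothing more."""

    @abstractmethod
    def fetch(self, source: str) -> bytes:
        """Return the full PDF binary payload identified by `source`."""


class InMemoryPDFClient(BasePDFClient):
    """Toy backing store for illustration: a mapping of identifier to bytes."""

    def __init__(self, store: Mapping[str, bytes]) -> None:
        self._store = store

    def fetch(self, source: str) -> bytes:
        try:
            # No validation or parsing: the payload is returned untouched.
            return self._store[source]
        except KeyError as exc:
            # Translate the store-level failure into a retrieval-specific error.
            raise PDFRetrievalError(f"no PDF stored under {source!r}") from exc
```

Keeping the abstract method this narrow is one way to honour the stated decoupling from scraping and parsing: callers receive bytes and decide what to do with them.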
@@ -1165,7 +1172,9 @@ from a specific backing store (filesystem, S3, FTP, etc.).</p>
 <details class="notes" open>
 <summary>Notes</summary>
 <p><strong>Guarantees:</strong></p>
-<div class="language-text highlight"><table class="highlighttable"><tr><td class="linenos"><div class="linenodiv"><pre><span></span><span class="normal">1</span></pre></div></td><td class="code"><div><pre><span></span><code>- This client reads PDF files directly from the disk and returns their raw binary contents
+<div class="language-text highlight"><table class="highlighttable"><tr><td class="linenos"><div class="linenodiv"><pre><span></span><span class="normal">1</span>
+<span class="normal">2</span></pre></div></td><td class="code"><div><pre><span></span><code>- This client reads PDF files directly from the disk and returns
+their raw binary contents.
 </code></pre></div></td></tr></table></div>
 </details>
 
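The guarantee in this hunk (read a PDF directly from disk and return its raw bytes) might look roughly like the following in Python. This is a sketch under assumptions: the class name `FileSystemPDFClient`, the method name `fetch`, and the error type `PDFReadError` are hypothetical and do not appear in the diff.

```python
from pathlib import Path
from typing import Union


class PDFReadError(Exception):
    """Hypothetical retrieval-specific error (name is an assumption)."""


class FileSystemPDFClient:
    """Hypothetical filesystem-backed client returning raw PDF bytes."""

    def fetch(self, path: Union[str, Path]) -> bytes:
        try:
            # read_bytes() opens the file in binary mode and returns all of it.
            return Path(path).read_bytes()
        except OSError as exc:
            # Covers missing files, permission errors, directories, and so on.
            raise PDFReadError(f"could not read PDF at {path}") from exc
```

The sketch deliberately does not check for a `%PDF` header or otherwise inspect the content, in line with the documented rule that clients perform no validation or interpretation.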