Overview
Bank Statement Parser is the only open-source Python library that parses seven bank statement formats, including PDF via a hybrid LLM pipeline, behind a unified API. Single-format libraries (mt-940, ofxparse, pycamt) each handle one format. SaaS tools (Ocrolus, Parseur) offer cloud OCR but require sending data off-site and cost $49–$1,000+/month.
Open-Source Alternatives
Single-Format Libraries
Most open-source bank statement parsers handle only one format. If you need several formats, you must install and maintain separate libraries, each with its own API, output format, and release schedule.
| Library | Formats | PDF support | Output | Balance validation | Ledger export |
|---|---|---|---|---|---|
| Bank Statement Parser | 7 formats | Hybrid pipeline | pandas DataFrame | Golden Rule | hledger, beancount |
| mt-940 (WoLpH) | MT940 only | No | Python objects | No | No |
| ofxparse | OFX only | No | Python objects | No | No |
| pycamt | CAMT.053 only | No | Python objects | No | No |
| ofxtools | OFX v1/v2 only | No | Python objects | No | No |
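To illustrate the dispatch layer that a multi-library setup tends to need, here is a minimal sketch that guesses a statement's format from its leading content before handing it to a format-specific parser. The function name `sniff_format` and the routing table are hypothetical and are not part of any library listed above; the markers it checks (the `camt.053` namespace, the `OFXHEADER` preamble, and the MT940 `:20:` and `:60F:`/`:60M:` field tags) come from the formats themselves.

```python
import re

# Hypothetical mapping from a detected format to the single-format
# libraries named in the table above.
LIBRARY_FOR_FORMAT = {
    "camt.053": "pycamt",
    "ofx": "ofxparse or ofxtools",
    "mt940": "mt-940",
}


def sniff_format(text: str) -> str:
    """Guess the statement format from the first 2 KB of a file's text."""
    head = text.lstrip()[:2048]
    # CAMT.053 is XML whose namespace contains "camt.053".
    if head.startswith("<") and "camt.053" in head:
        return "camt.053"
    # OFX v1 starts with an OFXHEADER preamble; v2 is XML with an <OFX> root.
    if head.startswith("OFXHEADER") or re.search(r"<OFX>", head, re.IGNORECASE):
        return "ofx"
    # MT940 carries a :20: reference tag and a :60F:/:60M: opening balance tag.
    if re.search(r"^:20:", head, re.MULTILINE) and re.search(
        r"^:60[FM]:", head, re.MULTILINE
    ):
        return "mt940"
    return "unknown"


sample = ":20:STMT1\n:25:ACCOUNT\n:60F:C240101EUR100,00\n"
detected = sniff_format(sample)
print(detected, "->", LIBRARY_FOR_FORMAT.get(detected, "no parser"))
```

With single-format libraries, each branch of this dispatch ends in a different API and a different object model, which is the maintenance cost described above.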
vs pyiso20022
pyiso20022 generates Python dataclasses from the full ISO 20022 schema catalogue. It is a general-purpose ISO 20022 toolkit for working with PACS, PAIN, CAMT, and ADMI messages.
Bank Statement Parser is purpose-built for parsing bank statements into DataFrames, with production features:
| Feature | Bank Statement Parser | pyiso20022 |
|---|---|---|
| Purpose | Statement parsing + extraction + export | ISO 20022 schema toolkit |
| Output | pandas/Polars DataFrames | Python dataclasses |
| Formats | 7 (including PDF and non-ISO formats) | ISO 20022 only |
| PDF support | Hybrid pipeline (deterministic + LLM + vision) | No |
| Balance validation | Golden Rule + multi-currency | No |
| REST API | Built-in FastAPI | No |
| Enrichment | LLM-based categorisation | No |
| Ledger export | hledger + beancount | No |
| Streaming | Yes (bounded memory) | No |
| PII redaction | Built-in | No |
| Deduplication | Idempotent transaction hashes | No |
| CLI | Yes | No |
Use pyiso20022 if you need to work with the full ISO 20022 message catalogue. Use Bank Statement Parser if you need to parse bank statements into structured data for analysis, reconciliation, or reporting.
SaaS Alternatives
SaaS tools such as Ocrolus, Parseur, and Sensible offer bank statement parsing as a cloud service. They typically use OCR to process scanned PDFs and support hundreds of bank-specific layouts.
| Feature | Bank Statement Parser | SaaS tools |
|---|---|---|
| Data privacy | 100% local (LLMs via Ollama) | Data is sent to the cloud |
| Pricing | Free (Apache 2.0) | $49–$1,000+/month (as of Q1 2026) |
| Formats | 7 (structured + PDF) | Hundreds (via OCR) |
| PDF support | Yes: hybrid pipeline (deterministic + LLM + vision) | Yes (cloud OCR) |
| Balance validation | Golden Rule (automatic) | Manual / limited |
| Latency | <2 ms (structured), seconds (PDF+LLM) | 1-30 seconds |
| Throughput | 27,000+ tx/s (structured) | API rate-limited |
| REST API | Built-in FastAPI | Proprietary |
| Ledger export | hledger + beancount | No |
| Vendor lock-in | None | Yes |
| Compliance | Local processing, SBOM | Varies by provider |
LLM-Based Parsers
A number of tools (Inscribe, Unstract, Mozilla.ai blueprints) use large language models to parse bank statements, including scanned PDFs. When Chase redesigned its consumer statement layout in late 2025, template-based parsers broke, while LLM-based parsers adapted automatically.
Bank Statement Parser now has its own hybrid LLM pipeline (v0.0.5+) that runs entirely locally via Ollama. It combines the best of both approaches:
- Structured formats (XML, CSV, OFX, MT940): deterministic parsing, with 100% accuracy, sub-millisecond latency, and no LLM cost.
- PDF statements: three-stage routing (deterministic table extraction → text LLM → vision LLM) with automatic Golden Rule validation to catch extraction errors.
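The three-stage routing for PDFs can be sketched as a fallback chain gated by a balance check. This is an illustration under stated assumptions, not the library's implementation: it assumes the Golden Rule means "opening balance plus the sum of signed transaction amounts equals the closing balance", and it assumes that a failed check sends the document to the next stage. `Extraction`, `reconciles`, `route`, and the stub stages are all hypothetical names.

```python
from dataclasses import dataclass
from decimal import Decimal
from typing import Callable, Optional, Sequence, Tuple


@dataclass(frozen=True)
class Extraction:
    opening: Decimal
    closing: Decimal
    amounts: Tuple[Decimal, ...]  # signed: credits positive, debits negative


def reconciles(e: Extraction) -> bool:
    # Assumed reading of the Golden Rule: opening + sum(amounts) == closing.
    return e.opening + sum(e.amounts, Decimal("0")) == e.closing


Stage = Tuple[str, Callable[[bytes], Optional[Extraction]]]


def route(pdf: bytes, stages: Sequence[Stage]) -> Tuple[str, Extraction]:
    """Try each stage in order; accept the first result that reconciles."""
    for name, extract in stages:
        result = extract(pdf)
        if result is not None and reconciles(result):
            return name, result
    raise ValueError("no stage produced a reconciling extraction")


# Stub stages standing in for real extractors (hypothetical behaviour).
def table_stage(pdf: bytes) -> Optional[Extraction]:
    return None  # pretend no table was detected


def text_llm_stage(pdf: bytes) -> Optional[Extraction]:
    # Misses one transaction, so the balances do not reconcile.
    return Extraction(Decimal("100.00"), Decimal("70.00"), (Decimal("-25.00"),))


def vision_llm_stage(pdf: bytes) -> Optional[Extraction]:
    return Extraction(
        Decimal("100.00"),
        Decimal("70.00"),
        (Decimal("-25.00"), Decimal("-5.00")),
    )


stage_name, extraction = route(
    b"%PDF-stub",
    [
        ("table", table_stage),
        ("text-llm", text_llm_stage),
        ("vision-llm", vision_llm_stage),
    ],
)
print(stage_name)  # vision-llm
```

Using `Decimal` rather than `float` keeps the equality check exact, which matters when a one-cent discrepancy is the signal that an extraction went wrong.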
Unlike cloud-only LLM parsers, Bank Statement Parser's hybrid pipeline:
- Runs 100% locally (Ollama); no data leaves your machine.
- Validates every extraction with a balance check (the Golden Rule).
- Supports an interactive review mode for flagged discrepancies.
- Produces idempotent transaction hashes for safe incremental ingestion.
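As a rough picture of how idempotent transaction hashes make incremental ingestion safe, the sketch below hashes a canonical form of each transaction and skips any hash that has already been seen. The field choice, the normalisation rules, and the names `transaction_hash` and `ingest` are assumptions made for illustration; the library's actual hashing scheme is not described here.

```python
import hashlib
from decimal import Decimal
from typing import Dict, List, Set


def transaction_hash(
    account: str,
    booking_date: str,
    amount: Decimal,
    currency: str,
    description: str,
) -> str:
    """Return a stable SHA-256 digest of a normalised transaction."""
    canonical = "|".join(
        [
            account.strip().upper(),
            booking_date,
            f"{amount:.2f}",  # 10, 10.0 and 10.00 all hash identically
            currency.upper(),
            " ".join(description.split()).lower(),  # collapse whitespace
        ]
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def ingest(seen: Set[str], rows: List[Dict]) -> List[Dict]:
    """Return only rows whose hash is new, recording each hash in `seen`."""
    new_rows = []
    for row in rows:
        digest = transaction_hash(**row)
        if digest not in seen:
            seen.add(digest)
            new_rows.append(row)
    return new_rows


seen_hashes: Set[str] = set()
statement = [
    {
        "account": "ACC-001",
        "booking_date": "2026-01-05",
        "amount": Decimal("-4.50"),
        "currency": "EUR",
        "description": "Coffee  shop",
    }
]
first_pass = ingest(seen_hashes, statement)
second_pass = ingest(seen_hashes, statement)  # same file loaded again
print(len(first_pass), len(second_pass))  # 1 0
```

One limitation of this simplified scheme is that two genuinely distinct transactions with identical fields on the same day would collide; a real implementation would likely include a sequence number or bank reference to tell them apart.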
When to choose SaaS LLM parsers over Bank Statement Parser: You receive statements from hundreds of banks with widely varying PDF layouts and need immediate support without running local infrastructure.
When to choose Bank Statement Parser: You need local processing for compliance. You want balance validation. You need ledger export. You want zero running costs.
Benchmark methodology: Performance figures were measured on an Apple M2 with Python 3.12, using a CAMT.053 file containing 5,000 transactions (2.1 MB). Results are the mean of 100 runs. To reproduce locally, run `python -m bankstatementparser.bench`. SaaS latency figures are based on published API documentation as of April 2026.
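For readers who want to time their own files in the same spirit, a mean-of-N-runs measurement might look like the sketch below. It is a generic harness, not the code behind the bench module; `benchmark` and the stand-in `fake_parse` workload are hypothetical, and you would substitute a call that parses your statement and returns the number of transactions.

```python
import statistics
import time
from typing import Callable, Dict


def benchmark(parse: Callable[[], int], runs: int = 100) -> Dict[str, float]:
    """Time `parse` over `runs` calls; report mean latency and throughput."""
    durations = []
    tx_count = 0
    for _ in range(runs):
        start = time.perf_counter()
        tx_count = parse()  # must return the number of transactions parsed
        durations.append(time.perf_counter() - start)
    mean_s = statistics.fmean(durations)
    return {"mean_ms": mean_s * 1000.0, "tx_per_s": tx_count / mean_s}


def fake_parse() -> int:
    # Stand-in workload: pretend to process 5,000 transactions.
    return sum(1 for _ in range(5000))


stats = benchmark(fake_parse, runs=10)
print(f"{stats['mean_ms']:.3f} ms per run, {stats['tx_per_s']:.0f} tx/s")
```

`time.perf_counter` is used because it is a monotonic, high-resolution clock intended for measuring short intervals.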