Bank Statement Parser is an open-source Python library that parses bank statements in six formats into structured pandas DataFrames. All processing happens locally: no network calls, deterministic output, and automatic PII redaction.
Who Is This For?
- Treasury teams migrating from MT940 to CAMT.053 who need a parser that handles both the legacy and the new format during the transition.
- Fintech developers building reconciliation, reporting, or accounting pipelines who want a single dependency instead of stitching together mt940 + ofxparse + custom CSV logic.
- Compliance teams that need PII redaction by default and deterministic, auditable output that never sends data to external services.
- Anyone who would rather not send sensitive financial data to a third-party SaaS when a local open-source tool can do the job.
Supported Formats
| Format | Standard | File Types | Parser Class |
|---|---|---|---|
| CAMT.053 | ISO 20022 Bank-to-Customer Statement | .xml | `CamtParser` |
| PAIN.001 | ISO 20022 Credit Transfer Initiation | .xml | `Pain001Parser` |
| CSV | Generic bank exports | .csv | `CsvStatementParser` |
| OFX | Open Financial Exchange | .ofx | `OfxParser` |
| QFX | Quicken Financial Exchange | .qfx | `QfxParser` |
| MT940 | SWIFT standard | .mt940, .sta | `Mt940Parser` |
All formats produce uniform pandas DataFrames with consistent column names, which makes downstream processing format-agnostic.
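To illustrate why a shared schema matters, here is a minimal sketch in which two hand-built DataFrames stand in for the output of two different parsers. The column names (`booking_date`, `amount`, `currency`, `source_format`) and the helper `net_by_currency` are assumptions chosen for illustration, not the library's documented schema or API.

```python
import pandas as pd

# Hypothetical column names; the library's real schema may differ.
COLUMNS = ["booking_date", "amount", "currency", "source_format"]

# Stand-ins for what a CAMT.053 parser and an MT940 parser might return.
camt_df = pd.DataFrame(
    [["2024-01-02", 100, "EUR", "camt053"], ["2024-01-03", -40, "EUR", "camt053"]],
    columns=COLUMNS,
)
mt940_df = pd.DataFrame(
    [["2024-01-02", 250, "USD", "mt940"]],
    columns=COLUMNS,
)


def net_by_currency(frames):
    """Combine frames that share a schema and sum amounts per currency."""
    combined = pd.concat(frames, ignore_index=True)
    return combined.groupby("currency")["amount"].sum().to_dict()


totals = net_by_currency([camt_df, mt940_df])
```

Because both frames use the same columns, the aggregation never needs to know which format a row came from.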
Key Capabilities
- Automatic format detection: `detect_statement_format()` identifies the format; `create_parser()` instantiates the matching parser.
- Streaming parsing: process large files (50 MB+, 50K+ transactions) with bounded memory using `parse_streaming()`.
- Parallel processing: parse multiple files at once with `parse_files_parallel()`, which uses ProcessPoolExecutor.
- Deduplication: detect exact and suspected duplicates, with explainable confidence scores.
- In-memory parsing: `from_string()` and `from_bytes()` for SFTP and API workflows, without writing to disk.
- Secure ZIP handling: `iter_secure_xml_entries()` with compression-ratio limits, entry size caps, and rejection of encrypted entries.
- Export: CSV, JSON, Excel (`.xlsx`), and optional Polars DataFrames.
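As a rough idea of what content-based format detection can look like, the sketch below inspects the start of a file for well-known markers. The function `sniff_format` and its return labels are hypothetical and written for this example; this is not how `detect_statement_format()` is implemented, and its real signature is not shown in this document.

```python
def sniff_format(text: str) -> str:
    """Guess a statement format from its leading content (heuristic sketch)."""
    head = text.lstrip()[:2048]
    # ISO 20022 messages name their schema in the XML namespace.
    if "camt.053" in head:
        return "camt053"
    if "pain.001" in head:
        return "pain001"
    # OFX and QFX share a header; QFX files usually carry Intuit tags.
    if "OFXHEADER" in head or "<OFX>" in head:
        return "qfx" if "<INTU.BID>" in text else "ofx"
    # MT940 statements open with the :20: transaction reference tag,
    # sometimes preceded by a SWIFT header block on an earlier line.
    if head.startswith(":20:") or "\n:20:" in head:
        return "mt940"
    # Fall back to CSV when the first line contains a common delimiter.
    first_line = head.splitlines()[0] if head else ""
    if "," in first_line or ";" in first_line:
        return "csv"
    return "unknown"
```

A heuristic like this can misfire on unusual files, so a production detector would likely combine it with the file extension and a validation step.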
Security and Privacy
- PII redaction: names, IBANs, and addresses are masked by default in CLI output. Pass `--show-pii` to see the full values.
- XXE protection: XML parsing uses `resolve_entities=False`, `no_network=True`, `load_dtd=False`.
- ZIP bomb protection: compression-ratio limits (100:1 by default), entry size caps (10 MB), and rejection of encrypted entries.
- Path traversal prevention: a list of dangerous patterns and symlink resolution.
- Supply chain security: dependencies pinned with SHA-256 hashes, CycloneDX SBOM, build provenance attestation.
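The ZIP checks listed above can be sketched with the standard library alone. The function `check_zip_entries` is a hypothetical helper written for this example, not the library's `iter_secure_xml_entries()`; only the 100:1 ratio and 10 MB limits are taken from the text.

```python
import io
import zipfile

MAX_RATIO = 100                      # compression-ratio limit (100:1)
MAX_ENTRY_BYTES = 10 * 1024 * 1024   # 10 MB cap per entry


def check_zip_entries(data: bytes) -> list[str]:
    """Return entry names that pass all checks; raise ValueError otherwise."""
    safe = []
    with zipfile.ZipFile(io.BytesIO(data)) as archive:
        for info in archive.infolist():
            # Bit 0 of the general-purpose flags marks an encrypted entry.
            if info.flag_bits & 0x1:
                raise ValueError(f"encrypted entry rejected: {info.filename}")
            if info.file_size > MAX_ENTRY_BYTES:
                raise ValueError(f"entry too large: {info.filename}")
            # Skip the ratio check for empty entries to avoid dividing by zero.
            if info.compress_size and info.file_size / info.compress_size > MAX_RATIO:
                raise ValueError(f"suspicious compression ratio: {info.filename}")
            safe.append(info.filename)
    return safe
```

These checks rely on the sizes declared in the archive's own headers, so a stricter implementation would also count bytes while decompressing and stop once a limit is exceeded.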
Performance
| Metric | Value |
|---|---|
| CAMT.053 throughput | 27,000+ transactions/s |
| PAIN.001 throughput | 52,000+ transactions/s |
| Per-transaction latency (CAMT) | 37 microseconds |
| Per-transaction latency (PAIN.001) | 19 microseconds |
| Time to first result | < 2 ms |
| Memory footprint (1K-50K transactions) | Constant (streaming) |
| Test coverage | 100% branch coverage |
| Tests | 467 across 29 test files |
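The latency rows match the reciprocals of the throughput rows, which is what one would expect if the figures come from sequential, single-process runs. That is an assumption: the benchmark setup is not described here. The helper name below is hypothetical.

```python
def per_transaction_latency_us(throughput_per_second: float) -> float:
    """Average time per transaction in microseconds for a sequential parser."""
    return 1_000_000 / throughput_per_second


# Throughput figures are taken from the table above.
camt_latency = per_transaction_latency_us(27_000)   # about 37 microseconds
pain_latency = per_transaction_latency_us(52_000)   # about 19 microseconds
```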
Get Started
Get started with installation and examples
GitHub repository