Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
python nlp pdf elasticsearch enrichment ocr annotation etl solr rdf extractor extract extract-information named-entity-recognition documents ingest extract-text solr-dataimporter ingests-documents ingestion-pipeline
- Updated
Oct 9, 2022 - Python