Apache NiFi 101 Timothy Spann Developer Advocate / StreamNative Thursday 23-Sept-2021
Agenda Thursday 14:10 UTC Apache NiFi 101: Introduction and Best Practices Timothy Spann In this talk, we will walk step by step through Apache NiFi from the first load to first application. I will include slides, articles and examples to take away as a Quick Start to utilizing Apache NiFi in your real-time dataflows. I will help you get up and running locally on your laptop, Docker or in CDP Public Cloud.
Speaker Bio DZone Zone Leader and Big Data MVB @PaasDev https://github.com/tspannhw https://www.datainmotion.dev/ https://github.com/tspannhw/SpeakerProfile https://dev.to/tspannhw https://sessionize.com/tspann/ https://www.slideshare.net/bunkertor Timothy Spann Developer Advocate @
Powered by Apache Pulsar, StreamNative provides a cloud-native, real-time messaging and streaming platform to support multi-cloud and hybrid cloud strategies. Built for Containers Cloud Native StreamNative Cloud Flink SQL
StreamNative Solution Application Messaging Data Pipelines Real-time Contextual Analytics Tiered Storage APP Layer Computing Layer Storage Layer StreamNative Platform IaaS Layer Micro Service Notification Dashboard Risk Control Auditing Payment ETL
Architecture https://nifi.apache.org/docs/nifi-docs/html/overview.html
Flow File https://nifi.apache.org/docs/nifi-docs/html/overview.html Flow Files are content and key/value pairs for attributes that are each event/message/file that has been introduced into NiFi.
Processor https://nifi.apache.org/docs/nifi-docs/html/overview.html A Java component that runs in NiFi to route, process or manipulate data. You can build your own if it is not included in Standard NiFi or not in the open source. Controller Like a connection pools, connections, processes that ingest or work with outside data.
Connection https://nifi.apache.org/docs/nifi-docs/html/overview.html These link together NiFi processors. Process Groups Groups of processors. These are versionable and reusable components/modules.
Record Processors https://www.datainmotion.dev/2019/03/advanced-xml-processing-with-apache.html ● XML, CSV, JSON, AVRO and more ● Schemas or Inferred Schemas ● Easily convert between them ● Support SQL with Apache Calcite
Record Processors https://www.datainmotion.dev/2019/03/advanced-xml-processing-with-apache.html
Caching https://dev.to/tspannhw/flank-using-apache-kudu-as-a-cache-for-fda-updates-4knj
Provenance https://www.datainmotion.dev/2021/01/automating-starting-services-in-apache.html
Metrics, Status, Charts https://www.clouddataops.dev/data-flow-experience
DevOps https://www.datainmotion.dev/2021/01/automating-starting-services-in-apache.html https://nipyapi.readthedocs.io/en/latest/ nifi-toolkit/bin/cli.sh nifi list-param-contexts -u http:/ /edge2ai-1.dim.local:8080 nifi-toolkit/bin/cli.sh nifi pg-list -u http:/ /edge2ai-1.dim.local:8080 nifi-toolkit/bin/cli.sh nifi pg-set-param-context -u http:/ /edge2ai-1.dim.local:8080 ...
DevOps https://dev.to/tspannhw/automating-starting-services-in-apache-nifi-and-applying-parameters-5h4n https://github.com/tspannhw/ApacheConAtHome2020/blob/main/scripts/setupnifi.sh nifi pg-list nifi pg-status nifi pg-get-services nifi pg-enable-services -u http:/ /edge2ai-1.dim.local:8080 --processGroupId root nifi pg-start -u http:/ /edge2ai-1.dim.local:8080 -pgid LOOKTHISUP nifi list-param-contexts -u http:/ /edge2ai-1.dim.local:8080 -verbose nifi create-reporting-task -u http:/ /edge2ai-1.dim.local:8080 -verbose -i
Consume MQTT This could read from Apache Pulsar - MoP (MQTT on Pulsar)
Listen FTP Let Apache NiFi be your FTP server
All Data - Anytime - Anywhere - Multi-Cloud - Multi-Protocol Multi- inges t Multi- inges t Multi-ingest Merge Priority
● https://www.datainmotion.dev/2020/06/no-more-spaghetti-flows.html ● https://github.com/tspannhw/EverythingApacheNiFi ● https://www.datainmotion.dev/2019/03/apache-nifi-101.html ● https://nifi.apache.org/docs/nifi-docs/html/nifi-in-depth.html ● https://pierrevillard.com/best-of-nifi/ ● https://blogs.apache.org/nifi/ ● https://www.nifi.rocks/documents/nifi-expression-language-cheat-sheet.pdf ● https://dev.to/tspannhw/new-features-of-apache-nifi-1-13-0-45ln ● https://dev.to/tspannhw/tracking-satellites-with-apache-nifi-44o7 ● https://www.datainmotion.dev/2021/01/flank-using-apache-kudu-as-cache-for.html ● https://www.datainmotion.dev/2020/12/basic-understanding-of-cloudera-flow.html Deeper Content @PaasDev https://datainmotion.dev/ timothyspann
streamnative.io Pulsar Summit Europe October 6, 2021 Pulsar Summit Asia November 20-21, 2021 Contact us at partners@pulsar-summit.org to become a sponsor or partner

ApacheCon 2021: Apache NiFi 101- introduction and best practices