
Apache NiFi supports robust and scalable directed graphs of data routing, transformation, and system mediation logic. NiFi is based on technology previously called “Niagara Files”, which was in development and used at scale within the NSA for eight years before it was made available to the Apache Software Foundation through the NSA Technology Transfer Program.
Its main features are:
Some of the use cases include, but are not limited to:
We use Apache NiFi to automate the process because it is a reliable and secure way to move data from source to destination. It helps overcome practical constraints such as limited or expensive bandwidth while maintaining data quality and reliability. Users can monitor everything that happens to the data.
Let’s look at a simple ETL task: reading data from the local file system, converting the character set, and uploading the result to a database.
The upstream processor fetches the CSV file. The ConvertCSVToAvro processor then converts the CSV data into Avro format.
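A CSV-to-Avro conversion needs an Avro record schema that describes the columns. The sketch below is not NiFi code; it only shows, under simplifying assumptions, what such a schema could look like for a small CSV file. The function name `avro_schema_from_csv`, the record name `Customer`, and the sample data are hypothetical, and every column is typed as a nullable string purely for illustration.

```python
import csv
import io
import json


def avro_schema_from_csv(csv_text, record_name="Customer"):
    """Build a simple Avro record schema from a CSV header row.

    Assumption for this sketch: every column is a nullable string.
    A real flow would usually declare precise types per column.
    """
    header = next(csv.reader(io.StringIO(csv_text)))
    return {
        "type": "record",
        "name": record_name,
        "fields": [
            {"name": column.strip(), "type": ["null", "string"], "default": None}
            for column in header
        ],
    }


# Hypothetical sample input, standing in for the fetched CSV file.
sample_csv = "id,name,city\n1,Asha,Chennai\n2,Ravi,Bengaluru\n"
schema = avro_schema_from_csv(sample_csv)
print(json.dumps(schema, indent=2))
```

The printed JSON document is the kind of schema text that could be supplied to a CSV-to-Avro conversion step.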
After the CSV-to-Avro conversion, we convert the data once more, this time to JSON. The ConvertAvroToJSON processor converts the Avro records into JSON format. Some customization is possible at this step (optional).
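To make the shape of the data concrete, here is a minimal Python sketch of what the records look like once the CSV rows have ended up as JSON. It skips the intermediate Avro step and uses only the standard library, so it illustrates the result rather than reproducing what the NiFi processors do internally. The function name and sample data are hypothetical.

```python
import csv
import io
import json


def csv_to_json_records(csv_text):
    """Parse CSV text into a list of flat dicts, one per row."""
    return list(csv.DictReader(io.StringIO(csv_text)))


# Hypothetical sample input, standing in for the fetched CSV file.
sample_csv = "id,name,city\n1,Asha,Chennai\n2,Ravi,Bengaluru\n"
records = csv_to_json_records(sample_csv)

# Each row becomes one flat JSON object keyed by column name.
json_text = json.dumps(records)
print(json_text)
```

Each object is flat (no nested fields), which is the shape the later SQL conversion step works with.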
We now have a JSON FlowFile. The next step is to prepare it for loading into the target database using the ConvertJSONToSQL processor.
The incoming FlowFile contains the entire JSON document.
The processor takes the following parameters:
Database Connection: the connection to whichever database you want to load
Statement Type: INSERT, UPDATE, or DELETE
Table Name: the name of the target table
Schema Name: optional
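The parameters above can be pictured with a small Python sketch. It turns one flat JSON object into a parameterized INSERT statement plus a set of attributes that carry the values, loosely mirroring the `sql.args.N.value` attributes that ConvertJSONToSQL attaches to its output FlowFile. This is an illustration under assumptions, not NiFi's implementation: the function `json_to_insert` and the table name `customers` are hypothetical, only INSERT is handled, and the type attributes are omitted.

```python
import json


def json_to_insert(json_text, table_name, schema_name=None):
    """Turn one flat JSON object into a parameterized INSERT statement.

    Returns the SQL text with '?' placeholders and a dict of attributes
    named sql.args.<N>.value holding the values in column order.
    """
    record = json.loads(json_text)
    if not isinstance(record, dict):
        raise ValueError("expected a single flat JSON object")

    columns = list(record.keys())
    # Schema Name is optional; prefix the table only when it is given.
    qualified = f"{schema_name}.{table_name}" if schema_name else table_name
    placeholders = ", ".join("?" for _ in columns)
    sql = f"INSERT INTO {qualified} ({', '.join(columns)}) VALUES ({placeholders})"

    attributes = {
        f"sql.args.{index}.value": str(record[column])
        for index, column in enumerate(columns, start=1)
    }
    return sql, attributes


sql, attributes = json_to_insert(
    '{"id": "1", "name": "Asha", "city": "Chennai"}', "customers"
)
print(sql)
print(attributes)
```

Keeping the values out of the SQL text and passing them as parameters means the statement can be executed without string concatenation of user data.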
Finally, the PutSQL processor executes the statement. It connects to the target database and loads the FlowFile data from the local source into it.
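As a rough picture of this last step, the Python sketch below executes a parameterized statement against a database, reading the parameter values from `sql.args.N.value` style attributes. It uses an in-memory SQLite database from the standard library as a stand-in for the real target; the function `put_sql`, the `customers` table, and the sample values are all hypothetical, and this is not how PutSQL is implemented.

```python
import sqlite3


def put_sql(connection, sql, attributes):
    """Execute one parameterized statement using values from attributes.

    Parameters are read in order from sql.args.1.value, sql.args.2.value, ...
    """
    count = sum(
        1
        for key in attributes
        if key.startswith("sql.args.") and key.endswith(".value")
    )
    params = [attributes[f"sql.args.{i}.value"] for i in range(1, count + 1)]
    # The connection context manager commits on success and rolls back on error.
    with connection:
        connection.execute(sql, params)


# In-memory SQLite database used as a stand-in for the target database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (id TEXT, name TEXT, city TEXT)")

put_sql(
    conn,
    "INSERT INTO customers (id, name, city) VALUES (?, ?, ?)",
    {
        "sql.args.1.value": "1",
        "sql.args.2.value": "Asha",
        "sql.args.3.value": "Chennai",
    },
)

rows = conn.execute("SELECT id, name, city FROM customers").fetchall()
print(rows)
```

Running the statement inside a transaction means a failed insert leaves the table unchanged.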
Alex is a Big Data Evangelist and a Certified Big Data Engineer with many years of experience. He has helped clients optimize custom Big Data implementations, migrate legacy systems to the Big Data ecosystem, and build integrated Big Data and analytics solutions that let business leaders generate custom analytics without relying on IT.