PDI (Pentaho Data Integration, also known as Kettle) is an open source extract, transform, and load (ETL) tool that supports many common data sources, including databases, flat files, XML files, Excel files, and Access files.
Users can design data flows and define data format conversions by dragging and dropping steps in a graphical interface.
In addition to data transformations, Kettle supports many common operations in the form of jobs, such as sending and receiving email, FTP uploads and downloads, and file management.
By combining jobs and transformations, users can complete most data processing work.
Even non-developers can handle simple data processing tasks with Kettle.
For example, selecting products with a sales amount greater than 1,000 yuan from an Excel file and writing them to an Access file can be configured entirely through the graphical interface, without writing a line of code.
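For readers who think in code, the rule in this example can be sketched in a few lines of Python. This only illustrates the filter logic and is not how Kettle works internally. The product rows, the column names `name` and `sales_amount`, and the function `select_high_sales` are all hypothetical. An in-memory list stands in for the Excel file, and the returned list stands in for the Access table.

```python
# Hypothetical rows standing in for the Excel sheet; the column names
# "name" and "sales_amount" are assumptions made for this sketch.
products = [
    {"name": "Desk", "sales_amount": 1500.0},
    {"name": "Lamp", "sales_amount": 320.0},
    {"name": "Monitor", "sales_amount": 2100.0},
    {"name": "Chair", "sales_amount": 1000.0},
]

THRESHOLD = 1000.0  # yuan, taken from the example in the text


def select_high_sales(rows, threshold=THRESHOLD):
    """Keep only rows whose sales amount is strictly greater than the threshold."""
    return [row for row in rows if row["sales_amount"] > threshold]


# The selected rows stand in for what would be written to the Access file.
selected = select_high_sales(products)
for row in selected:
    print(row["name"], row["sales_amount"])
```

In Kettle the same rule is set up by connecting an input step, a filter step, and an output step on the canvas, so no code like this is needed.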
For developers, Kettle is a powerful tool: its built-in functions, extended by plug-ins, can handle almost any data processing task.
The Ganji.com information collector is a plug-in for the PDI platform that collects the title, phone number, posting time, and URL of Ganji.com postings, along with user-defined collection items.
















