Pig

Published: June 01, 2016

Pig, formally known as Apache Pig, is a high-level platform for creating programs that run on the Apache Hadoop system. The language for this platform is called Pig Latin. Pig can execute its Hadoop jobs in MapReduce, Apache Tez, or Apache Spark. Pig Latin abstracts programming away from the Java MapReduce idiom into a higher-level notation, much as SQL does for relational database management systems. Pig Latin can be extended with User Defined Functions (UDFs), which the user can write in Java, Python, JavaScript, Ruby or Groovy[2] and then call directly from the language. Apache Pig was originally developed at Yahoo Research around 2006 to give researchers an ad hoc way of creating and executing MapReduce jobs on very large data sets.
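As a rough illustration of the UDF mechanism mentioned above, here is a minimal sketch of a Python UDF. The file name udfs.py, the function to_upper, the alias myudfs and the input file names.txt are all hypothetical. The fallback decorator is only an assumption that lets the file be imported outside Pig; it is not part of Pig itself.

```python
# Hypothetical file name: udfs.py
try:
    # Provided by Pig when the script runs through its streaming_python engine.
    from pig_util import outputSchema
except ImportError:
    # Assumed stand-in so the file can also be imported and tested without Pig.
    def outputSchema(schema):
        def wrap(func):
            func.outputSchema = schema
            return func
        return wrap


@outputSchema("upper:chararray")
def to_upper(text):
    """Return the input string upper-cased; pass nulls (None) through unchanged."""
    if text is None:
        return None
    return text.upper()


# Possible use from a Pig Latin script (names are illustrative):
#   REGISTER 'udfs.py' USING streaming_python AS myudfs;
#   names   = LOAD 'names.txt' AS (name:chararray);
#   shouted = FOREACH names GENERATE myudfs.to_upper(name);
```

The decorator tells Pig which schema the function returns, so the result can be used like any other field in later statements.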

Pig is a tool that lets users write SQL-like instructions, familiar from relational databases, and run them on parallel query systems. This gives Pig the following features:

Easy to learn and easy to use.
Lets the user work at a high level of abstraction, without having to manage a non-relational database or parallel queries directly.
Fits well into the Hadoop ecosystem.
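
To give a feel for the level of abstraction involved, the sketch below mirrors a four-statement Pig Latin word count in plain Python. It is only a single-machine analogue, not Pig itself: Pig would compile the statements shown in the comments into parallel Hadoop jobs. The function name word_count and the file name input.txt are assumptions made for the example, and str.split only approximates Pig's TOKENIZE, which also splits on some punctuation.

```python
from collections import Counter


def word_count(lines):
    """Count word occurrences, mirroring the Pig Latin statements in the comments."""
    # lines   = LOAD 'input.txt' AS (line:chararray);
    counts = Counter()
    for line in lines:
        # words = FOREACH lines GENERATE FLATTEN(TOKENIZE(line)) AS word;
        words = line.split()
        # grouped = GROUP words BY word;
        # counts  = FOREACH grouped GENERATE group, COUNT(words);
        counts.update(words)
    return dict(counts)


print(word_count(["pig latin", "pig on hadoop"]))
# → {'pig': 2, 'latin': 1, 'on': 1, 'hadoop': 1}
```

In Pig Latin the user writes only the four commented statements; distributing the grouping and counting across the cluster is left to the platform.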

Material

https://pig.apache.org/
