Tutorials
Guidebook for tutorials. Check this when you don't know what tutorial suits your demand.
🙋 I'm very new to Dataverse
: Introduces very basic, but core steps to use Dataverse.
🙋I want to use my custom function
: If you want to use your custom function, you have to register the function on Dataverse. These will guide you from register to apply it on pipeline.
🙋I need to test my ETL process with samples
: When you want to get test(sample) data to quickly test your ETL process, or need data from a certain point to test your ETL process
🙋 I want to run it on EMR cluster
🙋Is there any real-world dataset to use Dataverse?
: Shows how to use common crawl data.
🙋 I want to use Pyspark UI
: Helps you to use Pyspark UI to monitor the spark job in Docker environment.
Last updated