📈 BigData Junction


Introduction #

At BigData Junction, we are your ultimate destination for mastering the vast landscape of big data technologies. Whether you are a beginner looking to dive into the world of data or an experienced professional aiming to sharpen your skills, we have something for everyone.

Explore comprehensive resources, tutorials, and guides on:

  • Python: Unlock the power of Python for seamless automation, cutting-edge data analysis and machine learning.
  • SQL: Master the art of querying and managing databases efficiently.
  • Spark & PySpark: Learn to process and analyze large data sets with the speed and simplicity of Apache Spark.
  • Airflow: Discover the best practices for orchestrating complex data workflows with Apache Airflow.
  • Data Warehousing: Dive into the architecture and management of data warehouses to optimize your data storage solutions.
  • Git: Understand version control and collaborative coding with Git.

Join us at BigData Junction to stay ahead in the ever-evolving field of data science and big data. Our content is designed to help you build robust, scalable, and efficient data solutions, ensuring you have the skills to excel in today’s data-driven world.

Stay curious, stay informed, and keep exploring with BigData Junction!

Index #

Here are the courses that I currently have notes available for, and their statuses:

Although I have personal notes for many other classes, they do not meet my quality standards for making them public at this time. If I have time at some point in the distant future, they may make an appearance, but don’t count on this happening anytime soon.

If you made your own notes/resources for a CS, Data Science, or EE course and would like me to put a link to them here, let me know (contributing)!

Basic Principles #

Here are some principles that I try to follow when creating notes. I’ll probably make a blog post at some point to go over this in more detail, but for now this outline should be enough to show what I hope to accomplish.

  1. Content is more fun when it’s important: Answer the question “why should I care about this?” before actually spending time on whatever topic is at hand. If answering it is a struggle, then it’s probably not important enough to need to remember in the future.
  2. Make it interactive: It’s way easier to concentrate on something if it’s directly applicable to a problem, question, or situation at hand. Interject conceptual notes with illustrated examples and practice problems whenever possible.
  3. Notes are rarely self-contained: It’s impossible to fully cover most topics on a single page, and topics may be deeply related to content from other courses. Link to external resources or further learning opportunities whenever possible, just in case it becomes necessary to research the topic further in the future.
  4. Type a lot of stuff really fast: For this verbose style of note-taking to be effective for me, I need to be able to completely put down thoughts on the page before I lose them. If you’re thinking of doing this on your own, I’d recommend getting good at touch typing, and hitting up monkeytype for some practice. I’m going against all the research that suggests handwriting is more effective than typing, because the purpose of my notes is not for memorizing or even remembering any of the content, but rather to create a complete repository of knowledge that I and others can easily search in the future.

About this website #

My notes are hosted on Netlify and are built on my custom Amethyst theme for Hugo. You can view the source code here.

All of the notes here are formatted in Markdown, and the majority was created using Obsidian. These notes are a small fraction of my Obsidian vault; I intend to publish other small bits of it in various places such as my blog, devlog, or mastodon if you’re curious.

If you’re interested in contributing, take a look at the contribution guide.

Contact me #

Want to chat with me about these notes, or something else? You can find my contact info here.