
Photo by Lieve Ransijn on Unsplash
This is a “curated” list of pointers I have found handy during my long coding sessions (a major part of which I still spend yak shaving someone else’s code).
Angular related: Link to heading
- Reloading current route in Angular 5 / Angular 6 (reload component when clicking browser’s back button)
- Angular material datatable example (sortable, selectable and filterable)
- Angular material checkbox + forms example
- How to prevent robots from crawling an Angular application
ng-number-picker
Python Dev and Data analysis: Link to heading
Matplotlib: Save images in high quality (EPS, PDF)Matplotlib: Prevent plot labels from getting cut off when saving themMatplotlib: How to put the legend out of the plotMatplotlib: Reverse the order of legendMatplotlib: Read the data and plotting with multiple markersMatplotlib: Using matplotlib in jupyter notebooks — comparing methods and some tips (%matplotlib <>)python re: Non-capturing groupspip 10 and apt: How to avoid “Cannot uninstall X” errors for distutils packagesscipy & scikit-learn: How to calculate Silhouette Score of the scipy’s fcluster using scikit-learn silhouette score?pandas: How to read a large csv file in chunkspandas: Parallel operations over a pandas DataFramepandas: Random selection per groupfolium: Map Tilesfolium: Gallery of examplesfolium: Marker clusterGeoPandas: Label Polygons (see second answer)GeoPandas: An Approach for Checking Overlaps and Gaps in Polygons using GeopandasGeoPandas: R-tree Spatial Indexing with Python (bounding box matches vs exact polygon matches)GeoPandas: Nearest Neighbour Analysis (Nearest polygon to point)ipyleaflet: Get coordinates out of map drawingsJupyterHow to check the source code of a moduleGeoalchemy2: query all users within X meterespandas: Normalize semi-structured JSON data into a flat table- Streaming data from Postgres into Python
- Creating a grid based on GeoHashes
- Time series distances: Dynamic Time Warping (DTW)
- Running
Flaskin production with Docker - Build a RESTful API with Flask – The TDD Way
Docker & Kubernetes related: Link to heading
- How To Build a Node.js Application with Docker
- Starting a shell in the Docker Alpine container
- How To Share Data between Docker Containers
- Docker Compose for Data Science (Jupyter + PostgreSQL)
kubectl: Pull docker image from private repository- Running
PostgreSQLusing Docker Compose (with persistent volume) - Using
docker-compose, how to execute multiple commands - Automatically build and push Docker images using GitLab CI
k8s: Pull an Image from a Private Registryk8s: Configure persistent volume storageK8s,juju: Kubernetes Core bundleK8S,juju: Restart K8s related services manually (for the juju bundle)K8S: How to fix — Kubernetesnamespacedeleting stuck in Terminating state- Build and Push to Docker With Maven
Git related Link to heading
Hadoop related: Link to heading
Decommisioning Hadoop datanodes: Link to heading
- Decommissioning slave nodes - IBM Knowledge Center
- Decommissioning DataNodes Using the Command Line - Cloudera
Fix connection issues between the namenode and datanodes: Link to heading
Java related: Link to heading
mavenAccessing resource files inside a jar: Java properties file in jar not found when running jarJetty,REST,CORS: Embedding Jetty Server with Jersey as RESTful container with CORS
KafKa related: Link to heading
Kafka Streams: How to compute an (windowed) average?Kafka Streams: Write a Kafka Streams ApplicationKSQL:ksql-pythonA python wrapper for the KSQL REST API
LaTeX related: Link to heading
PostgreSQL related: Link to heading
CREATE STATISTICS: The Postgres 10 feature you didn’t know about- PostgreSQL
ExplainVisualizer - The Internals of PostgreSQL: Ch. 3 Query Processing (See subsection 3.2.2.2 on
run costandselectivity) - Using
date_truncto group by hour with a timestamp field? - Insert trigger to Update another table using PostgreSQL
Spark related: Link to heading
- On why large executor memory does not imply better performance
- On the
spark.sql.autoBroadcastJoinThresholdsetting - SparkSQL - Shuffle Hash and Sort Merge Joins in Apache Spark
- Broadcast Hash Joins in Apache Spark
- Understanding Apache Spark on YARN
- Pyspark: Split multiple array columns into rows (
explodemethod) - Convert comma separated string to array in pyspark dataframe