This is a “curated” list of pointers I have found handy during my long coding sessions (a major part of which I still spend yak shaving someone else’s code).
Angular related: Link to heading
- Reloading current route in Angular 5 / Angular 6 (reload component when clicking browser’s back button)
- Angular material datatable example (sortable, selectable and filterable)
- Angular material checkbox + forms example
- How to prevent robots from crawling an Angular application
ng-number-picker
Python Dev and Data analysis: Link to heading
Matplotlib
: Save images in high quality (EPS, PDF)Matplotlib
: Prevent plot labels from getting cut off when saving themMatplotlib
: How to put the legend out of the plotMatplotlib
: Reverse the order of legendMatplotlib
: Read the data and plotting with multiple markersMatplotlib
: Using matplotlib in jupyter notebooks — comparing methods and some tips (%matplotlib <>
)python re
: Non-capturing groupspip 10 and apt
: How to avoid “Cannot uninstall X” errors for distutils packagesscipy & scikit-learn
: How to calculate Silhouette Score of the scipy’s fcluster using scikit-learn silhouette score?pandas
: How to read a large csv file in chunkspandas
: Parallel operations over a pandas DataFramepandas
: Random selection per groupfolium
: Map Tilesfolium
: Gallery of examplesfolium
: Marker clusterGeoPandas
: Label Polygons (see second answer)GeoPandas
: An Approach for Checking Overlaps and Gaps in Polygons using GeopandasGeoPandas
: R-tree Spatial Indexing with Python (bounding box matches vs exact polygon matches)GeoPandas
: Nearest Neighbour Analysis (Nearest polygon to point)ipyleaflet
: Get coordinates out of map drawingsJupyter
How to check the source code of a moduleGeoalchemy2
: query all users within X meterespandas
: Normalize semi-structured JSON data into a flat table- Streaming data from Postgres into Python
- Creating a grid based on GeoHashes
- Time series distances: Dynamic Time Warping (DTW)
- Running
Flask
in production with Docker - Build a RESTful API with Flask – The TDD Way
Docker & Kubernetes related: Link to heading
- How To Build a Node.js Application with Docker
- Starting a shell in the Docker Alpine container
- How To Share Data between Docker Containers
- Docker Compose for Data Science (Jupyter + PostgreSQL)
kubectl
: Pull docker image from private repository- Running
PostgreSQL
using Docker Compose (with persistent volume) - Using
docker-compose
, how to execute multiple commands - Automatically build and push Docker images using GitLab CI
k8s
: Pull an Image from a Private Registryk8s
: Configure persistent volume storageK8s
,juju
: Kubernetes Core bundleK8S
,juju
: Restart K8s related services manually (for the juju bundle)K8S
: How to fix — Kubernetesnamespace
deleting stuck in Terminating state- Build and Push to Docker With Maven
Git related Link to heading
Hadoop related: Link to heading
Decommisioning Hadoop datanodes: Link to heading
- Decommissioning slave nodes - IBM Knowledge Center
- Decommissioning DataNodes Using the Command Line - Cloudera
Fix connection issues between the namenode and datanodes: Link to heading
Java related: Link to heading
maven
Accessing resource files inside a jar: Java properties file in jar not found when running jarJetty
,REST
,CORS
: Embedding Jetty Server with Jersey as RESTful container with CORS
KafKa related: Link to heading
Kafka Streams
: How to compute an (windowed) average?Kafka Streams
: Write a Kafka Streams ApplicationKSQL
:ksql-python
A python wrapper for the KSQL REST API
LaTeX related: Link to heading
PostgreSQL related: Link to heading
CREATE STATISTICS
: The Postgres 10 feature you didn’t know about- PostgreSQL
Explain
Visualizer - The Internals of PostgreSQL: Ch. 3 Query Processing (See subsection 3.2.2.2 on
run cost
andselectivity
) - Using
date_trunc
to group by hour with a timestamp field? - Insert trigger to Update another table using PostgreSQL
Spark related: Link to heading
- On why large executor memory does not imply better performance
- On the
spark.sql.autoBroadcastJoinThreshold
setting - SparkSQL - Shuffle Hash and Sort Merge Joins in Apache Spark
- Broadcast Hash Joins in Apache Spark
- Understanding Apache Spark on YARN
- Pyspark: Split multiple array columns into rows (
explode
method) - Convert comma separated string to array in pyspark dataframe