Use OperSearch, Kendra, Transcribe and Glue to extract information from an S3 document pool.

Photo by Maksym Kaharlytskyi on Unsplash

In Tom Hanks’ movie Finch, I was amazed by Finch’s book processing robot. It automatically cuts the book spines off and feeds the pages to a scanner. The contents are then digitized and then put to good use — go watch the film and you know what I mean here…

Use Neo4j to investigate institutional holdings in the stock market

As a retail stock investor, I always keep an eye on what the “smart money” is buying or selling. “Smart money” refers to institutional investors — pension funds, mutual funds, hedge funds, banks, insurance companies, and other big investors. They are also called the whales of Wall Street or the…

How to build a metagenomic binning pipeline on AWS (Part 1)

Bioinformatics is leaping into the cloud

In the webinar “Scaling genomics workloads using HPC on AWS” on July 14, 2021, I learned that the heavyweights such as AstraZeneca and Illumina have already moved their genome analyses into the AWS cloud and have been reaping the great benefits ever since. The cloud reduced both the runtime and…

Three museums in Yokohama, Stavanger and Berlin taught me something unexpected

Everyone loves a good museum visit. It is an intensive learning session in our free time. Although it is the objects themselves that do all the talking, but let’s not overlook the contributions of the museum curators. They carefully select exhibits to educate and entertain the visitors. It is a…

Ridiculous sequencing results revealed how errors propagated from one research study to a global database

Garbage in, garbage out. But first you need to know what garbage looks like.

Figure 1. Carp in the soil. https://en.wikipedia.org/wiki/File:Cyprinus_carpio.jpeg

Last year, when we were working at a publication about three Cyanobacteria, my colleague Pia Marter told me that the our three metagenome-assembled genomes (MAG) contain some DNA fragments from Cyprinus carpio (common carp). My first…

Sixing Huang

Certified Neo4j Professional, German bioinformatician in BGI Shenzhen. I want to learn more about Cloud, machine learning, Japanese and to travel the world.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store