Listened to Why Kent State Is Always Bad
Why Kent State Is Always Bad By Split Zone Duo: College Football Podcast
Offseason Vibe Checks 2025: The Sun Belt By Split Zone Duo: College Football Podcast
Everyone on the England team is desperate to be picked for the match against Ireland, while the reigning champions are focused on retaining their title.
Watched on Trakt
Italy's new coach is determined to reverse their consistent losing streak. Could they ever stand a chance against the unpredictable Scottish team?
Watched on Trakt
Nobody is immune to the perils of the modern logo redesign. By Accidental Tech Podcast: Specials
The Bank: New names emerging for Canes recruiting class By The CanesInSight Podcast
Eighteen new castaways embark on the adventure of a lifetime when they are left stranded on the breathtaking islands of Fiji. Tribes must be the first to crack the code to earn essential camp supplies. Then, three castaways go on a journey away from their new tribes.
Watched on Trakt
Greg Davies and Alex Horne set their captive band of comics humiliating missions involving cling film, fashion for insects, and face-based shape-shifting.
Watched on Trakt
A young girl discovers an idealized parallel universe behind a secret door in her new home, unaware that it contains a sinister secret.
Watched on Trakt
LAX - OAK - SFO - BOS - LAX
Last Week Tonight with John Oliver 8x03 "Episode 212"
Watched on Trakt
Over the past few years, I've invested time and effort into extricating important data and content from external services, and bringing it into systems that I own and control. I've moved on from Facebook and Instagram, established tracking for my movie, TV, and podcast activity, automatically track my location in multiple ways, and much more. But for years now, one type of data has eluded me: my personal health data.
As of today, that has changed! I'd like to share with you what I've built.
My website now features my personal health metrics in several places. First, there is now a health section which shows both daily health metrics and historical metrics. You can go backward and forward in time and compare my daily metrics to historical min, max, and average values.
For the daily metrics, I use the familiar Apple Activity Rings format, and include supporting metrics across a variety of categories, including activity, heart health, and sleep analysis.
For the historical metrics, I am particularly proud of the visualization. Each metric has a bar representing the minimum, maximum, and average values, and the gradient that is used to fill the bar adjusts to reflect the position of the average value.
In addition, I have augmented my monthly summaries.
Each day is represented by an Activity Ring and can be clicked on to view detailed, in-context metrics for that day.
Overall, I am quite pleased with how this project has turned out. Navigating through health metrics is snappy, the visualizations are attractive and useful, and it fits in neatly with the rest of my site.
Now that we've walked through what these features look like in practice, let's discuss how I gather the data and make it useful.
I've owned an Apple Watch since the Series 2 was released, and have worn it fairly consistently ever since. As a result, I've amassed quite a lot of data on my iPhone in Apple Health. That data is accessible through the Health app, and also via the HealthKit APIs. While I am a pretty strong developer, my skillset doesn't include much in the way of iOS development. I've made a few attempts at building an iOS app that would allow me to extract my HealthKit data automatically, but never made it far before running out of steam.
A few weeks ago, I discovered an app called Health Auto Export (which I will refer to as HAE for the rest of this post), which neatly solves the problem. HAE has many great features, but the key feature is "API Export," which allows you to automatically have your HealthKit data sent to an HTTP endpoint in JSON or CSV format, with control over time period and aggregation granularity. With this app in hand, I set about creating an API to store, index, and make that data searchable.
HAE uses a simple but nested JSON data structure to represent health metrics. Because the data is structured, plain-text, and will mostly sit at rest, a data lake is a natural target for storing it. Data lakes on Amazon Web Services (AWS) are generally implemented with Amazon S3 for storage, as it is well-suited to the use case and is deeply integrated with AWS' data, analytics, and machine learning (DAML) services.
In order to keep most of the complexity out of my website, I decided to build a microservice which is entirely focused on getting data into the data lake and making it useful. I call this service Health Lake, and the source is available on GitHub.
Let's take a look at the first endpoint of Health Lake, which accepts data from HAE, transforms it to align with the requirements of AWS' DAML services, and stores it in S3 - HTTP POST /sync.
HAE structures its data in a nested format:
{
  "data": {
    "metrics": [
      {
        "units": "kcal",
        "name": "active_energy",
        "data": [
          {
            "date": "2021-01-20 00:00:00 -0800",
            "qty": 370.75
          },
          ...
        ]
      },
      ...
    ]
  }
}
As you can see, the data is nested fairly deeply. In order to simplify my ability to query the data, Health Lake transforms the data to a flatter structure, with each data point being formatted in JSON on a single line. On each sync, I create a single object that contains many data points, one per line, in a format like this:
{"name": "active_energy", "date": "2021-01-20 00:00:00 -0800", "units": "kcal", "qty": 370.75 }
...
Each sync object is stored in my target S3 bucket with the key format:
syncs/<ISO-format date and time of sync>.json
The prefix on the object key is critical, as it enables sync data to be indexed and queried independently of other data in the bucket.
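To make that concrete, here is a minimal sketch of what the /sync handler might look like, assuming a Flask app and boto3 (the bucket name is a placeholder, and the real implementation in the Health Lake repo may differ):

import json
from datetime import datetime, timezone

import boto3
from flask import Flask, request

app = Flask(__name__)
s3 = boto3.client("s3")
BUCKET = "my-health-lake"  # placeholder bucket name

@app.route("/sync", methods=["POST"])
def sync():
    """Flatten HAE's nested payload into one JSON object per line."""
    payload = request.get_json()
    lines = []
    for metric in payload["data"]["metrics"]:
        for point in metric.get("data", []):
            # Each line matches the flattened format shown above.
            lines.append(json.dumps({
                "name": metric["name"],
                "date": point["date"],
                "units": metric.get("units"),
                "qty": point.get("qty"),
            }))
    key = f"syncs/{datetime.now(timezone.utc).isoformat()}.json"
    s3.put_object(Bucket=BUCKET, Key=key, Body="\n".join(lines))
    return {"stored": key, "points": len(lines)}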
Now that we have data being sent to our data lake and stored in an efficient, standardized format, we can focus on making that data searchable. Very often, I use relational databases like MySQL or PostgreSQL to store data and make it searchable with SQL. AWS provides a few great services which allow you to treat your data lake as a series of database tables that can be queried using SQL.
The first service we'll leverage is AWS Glue, which provides powerful data integration capabilities:
AWS Glue is a serverless data integration service that makes it easy to discover, prepare, and combine data for analytics, machine learning, and application development. AWS Glue provides all of the capabilities needed for data integration so that you can start analyzing your data and putting it to use in minutes instead of months.
Data integration is the process of preparing and combining data for analytics, machine learning, and application development. It involves multiple tasks, such as discovering and extracting data from various sources; enriching, cleaning, normalizing, and combining data; and loading and organizing data in databases, data warehouses, and data lakes.
Using AWS Glue, I created a database called "health," and then created a "crawler," which connects to my data store in S3, walks through all of the data, and attempts to infer the schema based upon hints and classifiers. The crawler can be run manually on-demand, or scheduled to run on a regular basis to continuously update the schema as new fields are discovered.
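I configured my crawler in the AWS Glue console, but the equivalent setup through the API would look roughly like this (a sketch assuming boto3, with placeholder names for the crawler, IAM role, and bucket):

import boto3

glue = boto3.client("glue")

# Crawl the syncs/ prefix into the "health" database.
# The crawler name, role ARN, and bucket are placeholders.
glue.create_crawler(
    Name="health-lake-syncs",
    Role="arn:aws:iam::123456789012:role/health-lake-glue",
    DatabaseName="health",
    Targets={"S3Targets": [{"Path": "s3://my-health-lake/syncs/"}]},
    Schedule="cron(0 12 * * ? *)",  # optional daily re-crawl for new fields
)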
Upon the first run of the crawler, a new table called syncs was created in my health database, with a schema inferred from the flattened sync objects: string columns for name, date, and units, and a double for qty.
I wasn't able to get the crawler to recognize the date format properly, so I ended up creating a "view" that adds a proper timestamp column, using the following SQL statement:
CREATE OR REPLACE VIEW
history
AS SELECT
date_parse(substr(date, 1, 19), '%Y-%m-%d %H:%i:%s') as datetime,
*
FROM
syncs
Now that our data lake has been crawled, and a database, table, and view have been defined in our AWS Glue Data Catalog, we can use Amazon Athena to query our data using standard SQL. Athena is entirely serverless, so there is no infrastructure to manage, and you pay only for the queries that you run.
For our daily metric view, we need a summary of all metrics gathered on a specific day. To accomplish this, I added an endpoint to our microservice:
HTTP GET /detail/<YYYY-MM-DD>
In response to this request, the client receives a JSON data structure containing all data points for that day. Under the hood, the microservice runs the following SQL query:
SELECT * FROM history
WHERE
datetime >= TIMESTAMP 'YYYY-MM-DD 00:00:00'
AND
datetime <= TIMESTAMP 'YYYY-MM-DD 23:59:59'
Because I pay for every query I run on Athena, and to achieve good performance, I store the query results in S3, already formatted for the client, after running the query. I then implemented some intelligence to decide, for any given request, whether to pull from the cache or regenerate fresh data. Take a look at the source code for more detail.
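As a rough sketch of how that decision might work, assuming boto3, with the bucket, cache key layout, and result shaping as illustrative placeholders (the freshness logic in the actual source is smarter than a simple existence check):

import json
import time

import boto3

s3 = boto3.client("s3")
athena = boto3.client("athena")

BUCKET = "my-health-lake"  # placeholder bucket name
DATABASE = "health"
RESULTS = "s3://my-health-lake/athena-results/"  # Athena scratch location

def detail(day):
    """Return all data points for a day (YYYY-MM-DD), preferring the cache."""
    cache_key = f"cache/detail/{day}.json"
    try:
        obj = s3.get_object(Bucket=BUCKET, Key=cache_key)
        return json.loads(obj["Body"].read())
    except s3.exceptions.NoSuchKey:
        pass  # cache miss; fall through to Athena

    # The same query shown above, parameterized by day.
    query = (
        "SELECT * FROM history "
        f"WHERE datetime >= TIMESTAMP '{day} 00:00:00' "
        f"AND datetime <= TIMESTAMP '{day} 23:59:59'"
    )
    qid = athena.start_query_execution(
        QueryString=query,
        QueryExecutionContext={"Database": DATABASE},
        ResultConfiguration={"OutputLocation": RESULTS},
    )["QueryExecutionId"]

    # Poll until the query finishes (simplified; no timeout handling).
    while True:
        state = athena.get_query_execution(QueryExecutionId=qid)[
            "QueryExecution"]["Status"]["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            break
        time.sleep(0.5)

    rows = athena.get_query_results(QueryExecutionId=qid)["ResultSet"]["Rows"]
    # Reshape rows for the client, then cache the result for next time.
    result = {"day": day, "rows": rows}
    s3.put_object(Bucket=BUCKET, Key=cache_key, Body=json.dumps(result))
    return result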
To show our monthly summaries, we need to get data for each day of the month. Rather than sending a separate request and query for every single day of the month, I decided to add another endpoint to our microservice:
HTTP GET /summary/<YYYY-MM>
In response to this request, the client receives a JSON data structure containing all data points for the month, sorted by date. To accomplish this, I run the following SQL query:
SELECT * FROM history
WHERE
datetime >= TIMESTAMP 'YYYY-MM-01 00:00:00'
AND
datetime <= TIMESTAMP 'YYYY-MM-31 00:00:00'
The start and end of the range are actually calculated to ensure the proper end date, as not every month has the same number of days. Again, to save costs and improve performance, results are intelligently cached in our S3 bucket.
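For example, the inclusive end of the range can be computed with the standard library's calendar module (a sketch; the actual service may calculate this differently):

import calendar

def month_range(year, month):
    """Return the first and last timestamps of a month for the Athena query."""
    last_day = calendar.monthrange(year, month)[1]  # e.g. 28, 30, or 31
    start = f"{year:04d}-{month:02d}-01 00:00:00"
    end = f"{year:04d}-{month:02d}-{last_day:02d} 23:59:59"
    return start, end

# month_range(2021, 2) -> ('2021-02-01 00:00:00', '2021-02-28 23:59:59')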
Generating a global summary of all data points in the data lake was a bit more challenging. To make things more efficient, I created another view in my database for this purpose. Results are, again, intelligently cached.
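The exact view definition is in the repo, but given that the site displays historical min, max, and average values per metric, it plausibly aggregates along these lines (hypothetical SQL, written here as it would be submitted to Athena):

# Hypothetical definition; the real view lives in the Health Lake repo.
SUMMARY_VIEW_SQL = """
CREATE OR REPLACE VIEW summary AS
SELECT
    name,
    units,
    MIN(qty) AS min_qty,
    MAX(qty) AS max_qty,
    AVG(qty) AS avg_qty
FROM history
GROUP BY name, units
"""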
With all of this great data available to me, it was time to integrate it with my website, which uses the Known CMS. I have created a Known plugin that provides enhancements that are specific to my website. Using this plugin, I simply send requests to the Health Lake microservice, parse the JSON, and create my visualizations.
Overall, I am quite pleased that I have been able to integrate this data into my website, and more importantly, to free the data from its walled garden and place it under my control and ownership.
Dear Internet, On this week's show we talk about dealing with your girlfriend's terrible singing, having a coitus greenhouse, and potatoes in...
Tina competes with Gene and Louise to write a new school song for Wagstaff. Jimmy Pesto’s new steel awning creates a blinding glare for Bob and Linda.
Watched on Trakt
Comedian Tom Papa takes on body image issues, social media, pets, Staten Island, the "old days" and more in a special from his home state of New Jersey.
Watched on Trakt
Apple possibly adding default-app choice on iOS, shorter HTTPS-certificate lifespans, and Casey's humongous-gust-of-wind. By Accidental Tech Podcast
The Dakota boys are dredging towards a nugget trap they believe holds a jackpot of gold. After a rough season, could this be the life-changing fortune they desperately need?
Watched on Trakt
Parker's parents visit and offer help; Rick continues to battle frozen ground to reach his goal; shocking news forces Tony to make a drastic decision about his season.
Watched on Trakt
Test cook Dan Souza makes Bridget Sous Vide Seared Steaks. Equipment expert Adam Ried then reviews his top pick for sous vide machines, and test cook Elle Simone makes a show-stopping sous vide staple: Soft-Poached Eggs.
Watched on Trakt
Tonight, the three remaining families face a surprise head-to-head challenge for which they have to nominate one family member to cook on their own. There are pastry disasters, fruit controversy and raw egg whites as the cooks tackle a lemon meringue pie. For the second task, the families are reunited to make a two-course meal with lamb and almonds as the star ingredients. The pressure is on as only two families can make it through to Friday's play-offs.
Watched on Trakt
Biz meeting!