Figure 7: NumPy / datetime objects - date time chart - GitHub stars of GMT and the wrappers #10

yvonnefroehlich · 2025-01-04T08:50:02Z

This PR adds a JN for NumPy - array like data - arrays of integers / floats and datatime objects:

Read CSV files with GitHub stars over time
Use NumPy datetime object
Plot these arrays via Figure.plot
Add auto-legend

The GitHub stars of of GMT (the red from the GMT logo), GMT / MEX (the orange for the MATLAB logo), GMT.jl (the purple from the Julia logo], and PyGMT (the blue from the Python logo) are used as data. Data as CSV files are included for completeness / reference.

Preview:

gitnotebooks · 2025-01-04T08:50:06Z

Found 1 changed notebook. Review the changes at https://app.gitnotebooks.com/GenericMappingTools/pygmt-paper-figures/pull/10

seisman · 2025-10-17T03:14:26Z

My suggestions:

Instead of hard-coding the data in the scripts, I prefer to load it from the CSV file, i.e.

>>> import pandas as pd
>>> df = pd.read_csv("/home/seisman/Downloads/star-history-20251017.csv")
>>> df["Date"] = df["Date"].str.split(" \\(").str[0]
>>> df["Date"] = pd.to_datetime(df["Date"], format="%a %b %d %Y %H:%M:%S GMT%z")

The main purpose is to show that PyGMT can integrate well with existing Python workflows (load data from a file, process data, then visualize) and also can handle datetime correctly.

Use for loop to avoid code duplicate, for example

fig = pygmt.Figure()
for csvfile, color, label in zip(
    ["star-history-gmt.csv", "star-history-pygmt.csv", "star-history-gmtjl.csv"],
    ["238/86/52", "63/124/173", "170/121/193"],
    ["GMT", "PyGMT", "GMT.jl"],
): 
    df = pd.read_csv(csvfile)
    df["Date"] = df["Date"].str.split(" \\(").str[0]
    df["Date"] = pd.to_datetime(df["Date"], format="%a %b %d %Y %H:%M:%S GMT%z")

    fig.plot(x=df["Date"], y=df["Stars"], pen=color, no_clip=True)
    fig.plot(x=df["Date"], y=df["Stars"], fill=color, style="a0.35c", no_clip=True, label=label)
fig.legend(position="jTL")
fig.show()

…p | Remove codes for temporal aligement

yvonnefroehlich · 2025-10-17T13:07:11Z

Thanks for your detailed recommondations!

Instead of hard-coding the data in the scripts, I prefer to load it from the CSV file

Yes, I agree reading the data directly from the CSV files is better. I remember that reading the files did not work directly for me and that the codes in this PR were still the codes I used for the figure for the AGU talk 🙈.

Following your suggestions the example is updated in commit 9d3fb0d.

pygmt_paper_numpy_datetime.ipynb

Co-authored-by: Dongdong Tian <seisman.info@gmail.com>

seisman · 2025-11-05T09:07:53Z

The Jupyter Notebook appears to be broken, likely because one of my suggestions didn’t conform to the notebook’s JSON format. In other words, we should avoid making changes to notebooks directly through the GitHub web interface.

Edit: I just saw that you have fixed it. Thanks.

yvonnefroehlich · 2025-11-05T09:27:23Z

The Jupyter Notebook appears to be broken, likely because one of my suggestions didn’t conform to the notebook’s JSON format. In other words, we should avoid making changes to notebooks directly through the GitHub web interface.

Edit: I just saw that you have fixed it. Thanks.

Actually, I am wondering if it would make the discussion and review more convenient to first start with normal Python scripts and, after we agreed on (nearly) final versions of the figures and codes, copy things into a JN (and maybe split it over multiple cells)?

seisman · 2025-11-05T10:28:11Z

Actually, I am wondering if it would make the discussion and review more convenient to first start with normal Python scripts and, after we agreed on (nearly) final versions of the figures and codes, copy things into a JN (and maybe split it over multiple cells)?

Yes, I also free a normal Python script is more convenient.

yvonnefroehlich · 2025-11-05T12:03:04Z

Actually, I am wondering if it would make the discussion and review more convenient to first start with normal Python scripts and, after we agreed on (nearly) final versions of the figures and codes, copy things into a JN (and maybe split it over multiple cells)?

Yes, I also free a normal Python script is more convenient.

Added normal Python scripts for all figures (PRs).

yvonnefroehlich · 2025-11-21T14:32:11Z

Just wondering if we want to include GMT/MEX here to mention all wrappers (not sure how much work is currently done for the MATLAB one; also, never used it):

seisman · 2025-11-30T09:51:23Z

Fig7_PyGMT_datetime.py

+    )
+
+    fig.plot(x=df["Date"], y=df["Stars"], pen=color)
+    fig.plot(x=df["Date"], y=df["Stars"], fill=color, style="a0.35c", label=wrapper)


Perhaps we should use four different symbols, although the star symbol is appropriate for a plot showing GitHub star history.

Hm, I think I like to keep the stars. What does the others think?

The benefits of using different symbols are twofold: first, they demonstrate that PyGMT supports a wide variety of marker styles, and that auto-legend works well with them. Second, symbols with distinct markers are more effective for gray-scale printing and are more accessible for colorblind readers. For me, it's a little difficult to distinguish Julia's purple from Python's blue.

BTW, where did you get the colors?

The GMT red "238/86/52" is from the gmtlogo.c in the GMT repo

The Matlab one?

The Julia purple "170/121/193" should be "149/88/178" (https://github.com/JuliaLang/julia-logo-graphics).

The Python blue "63/124/173" should be "48/105/152" (https://www.brandcolorcode.com/python).

With the Python yellow "255/212/59", the figure looks good to me, even with the same symbols:

Agree with @seisman. Especially for gray-scale printing (some people still do that 😆) and colorblind readers different symbols make it much easier to distinguish between the different entries.

I suggest to avoid yellow color on a white background.

Would change the legend entry order so that it follows the order of the last shown data point of each entry. Makes it much easier to read the figure.:

GMT

PyGMT

GMT.jl

GMT/MEX

distinct markers are more effective for gray-scale printing and are more accessible for colorblind readers.

This is a fair argument. Changed it in commit c144d8c.

gray-scale printing (some people still do that 😆)

Yep, students xD (and then color it by hand).

BTW, where did you get the colors?

Good question. I think these are the RGB codes I used for the example for the AGU presentation one year ago. But though this are the offical RGB codes, but unfortuantely I do not remember what went wrong here 🙁. Thanks for spotting! For the MATLAB logo, I extracted the RGB code from the logo, as I did not directly finde a source for the offical RGB code. Edit: Also fixed in the codes for the AGU presentation.

I suggest to avoid yellow color on a white background.

Agree, yellow on white background is not optimal and it depends on the screen how good the contrast / visibility is. Thus would keep the Python's blue

Would change the legend entry order so that it follows the order of the last shown data point of each entry. Makes it much easier to read the figure.

Hm. The order is currently based on the start of the project; maybe this is helpful for descripting the figure in the text.

Yep, I got this. If people prefer this order I am OK with reordering this (see commit 0018c93). Probably, only a few people will recognize the order based on the start of the projects.

Another option would be to get rid of the whole legend and add the repo labels at the end of each corresponding line. Just depends on if we want to show legend creation or not...

This example shows the auto-legend feature, so I feel we should keep it.

Would keep the auto-legend feature. I think it's important to show that PyGMT can do this. For simple automazied plots updating the legend manually can get a bit inconvient. GMT-specific legend files or passing this synthax via an io.StringIO object would need some more explanation.

BTW, where did you get the colors?

Good question. I think these are the RGB codes I used for the example for the AGU presentation one year ago. But though this are the offical RGB codes, but unfortuantely I do not remember what went wrong here 🙁. Thanks for spotting! For the MATLAB logo, I extracted the RGB code from the logo, as I did not directly finde a source for the offical RGB code. Edit: Also fixed in the codes for the AGU presentation.

Regarding the MATLAB color: I still did not find an offical RGB code for the orange in the MATLAB logo. On this website (https://de.mathworks.com/help/matlab/visualize/creating-the-matlab-logo.html) MATLAB shows how users can create the logo on their own. There they use the color s.FaceColor = [0.9 0.2 0.2], but the afterwards applied lighting effects change the apperance of the color.

star_history_gmt.csv

michaelgrund · 2025-12-01T10:29:04Z

Fig7_PyGMT_datetime.py

+    ["238/86/52", "253/131/68", "170/121/193", "63/124/173"],
+    strict=False,
+):
+    df = pd.read_csv(f"star_history_{file}.csv")


I also made a comment regarding the data source in the manuscript: Afaik it's not possible to directly load the data via URL? Would make the workflow much more professional than loading (outdated) individual csv's.

it's not possible to directly load the data via URL?

I think the answer is no; however, Fig. 4 already demonstrates how to load a dataset via a URL, so reading CSV files in this example is not that bad.

In fact, I'm not happy with the star-history data, as the stars are unevenly sampled.

GitHub provides an API to retrieve star history. The Python script below returns the timestamp when each user starred the PyGMT project. The code is about 30 lines long, so it may be too long to include in the manuscript. Nevertheless, we could use the script to obtain the full history of project stars and then resample the data at every three-month intervals.

import requests import pandas as pd owner, repo = "GenericMappingTools", "pygmt" headers = {"Accept": "application/vnd.github.v3.star+json"} timestamps = [] page = 1 while True: r = requests.get( f"https://api.github.com/repos/{owner}/{repo}/stargazers", headers=headers, params={"per_page": 100, "page": page}, ) data = r.json() if not data: break timestamps += [s["starred_at"] for s in data] # full ISO 8601 timestamp page += 1 timestamps.sort() # ISO strings sort chronologically df = pd.DataFrame({"timestamp": timestamps}) df["cumulative_stars"] = range(1, len(df) + 1) print(df)

Would be nice, but for me it still looks like, it is only possilbe to get a link which creates an live-updated time chart for the project GitHub README. And the CSV files one can download contain the date of downloading in the file name. Also the temporal sampling is a bit interesting, as it is not equally spaced and different for different repos; would prefer to have the data points for all repos at the same dates.

Nevertheless, we could use the script to obtain the full history of project stars and then resample the data at every three-month intervals.

I guess @seisman means to group the data into 3-months buckets and just show for each project quarterly numbers. This would allow to have the same time-spacing for all repos.

It seems like @seisman and I wrote our comments nearly at the same time 🙃, and we are both pointing out that the star-history data does not have an equal time spacing. And I am actually having the impression that the data points that are reported in the CSV file change over time.

Tried the request script from Dongdong. I do not think we have to show this directly in the manuscript. We can include it as a separate cell in the JN, but for the example itself, it should be OK to load the saved CSV files and focus on the plotting.

I'm thinking about which columns to store in the CSV file and what to include in the manuscript:

Date and Stars at three-month intervals. It can be plotted directly, so the example code can focus on PyGMT plotting.

Date and Stars for each GitHub user. Stars go from 1 to the max. In the example code, we'd need to resample it to three-month intervals before plotting.

UserName and Date for each GitHub user. This is basically the raw data. We need to do data processing to get the cumulative stars before plotting. This also makes the code follow a clearer read → process → visualize workflow.

yvonnefroehlich added 3 commits January 4, 2025 09:34

Add draft of JN for numpy - array, datatime

ba6f420

Include output figure

c0b903b

Reduce resolution of output image in JN

248a5bd

yvonnefroehlich added 6 commits January 5, 2025 18:40

Include GMT.jl

414c8b8

Add CSV files [temporary for reference]

c73b38b

Adjust filename

b46752c

Reduce resolution of output image in JN

2150e2a

Fix date in filename

e525ea6

Add plot for aligned time series

7434785

seisman changed the title ~~Figure XY: NumPy~~ Figure 8: Plotting time series Oct 17, 2025

yvonnefroehlich added 2 commits October 17, 2025 14:55

Update star history data | Reade data from file | Reduce code via loo…

9d3fb0d

…p | Remove codes for temporal aligement

Remove wrong data point

80cf207

seisman reviewed Oct 18, 2025

View reviewed changes

seisman mentioned this pull request Nov 5, 2025

Figure 3: Backgroundmaps - coast, remote dataset, tilemap, 3-D - relief of Iceland #8

Merged

4 tasks

yvonnefroehlich and others added 8 commits November 5, 2025 09:45

Remove unneeded import of numpy

40bb6ce

Co-authored-by: Dongdong Tian <seisman.info@gmail.com>

Use upper-case letter for y label

6853dce

Co-authored-by: Dongdong Tian <seisman.info@gmail.com>

Extend x-axis in negative direction to avoid no_clip=True

9188574

Co-authored-by: Dongdong Tian <seisman.info@gmail.com>

Add a box around the legend

37d95e3

Co-authored-by: Dongdong Tian <seisman.info@gmail.com>

Remove using no_clip parameter

8b22b2f

Co-authored-by: Dongdong Tian <seisman.info@gmail.com>

Fix file

abcc381

Remove date from files for star history

2550708

Make legend box look nice and use Box class

305321c

Move to normal python script (temporaly)

331718c

seisman mentioned this pull request Nov 10, 2025

Tracking the status of the examples included in the paper #14

Open

4 tasks

yvonnefroehlich added 2 commits November 12, 2025 21:38

Rename files and save output figure

c0ef9ed

Merge remote-tracking branch 'origin/main' into fig/numpy

2ee2822

seisman changed the title ~~Figure 8: Plotting time series~~ Figure 7: Plotting time series Nov 20, 2025

Adjust figure number

29a78d5

yvonnefroehlich changed the title ~~Figure 7: Plotting time series~~ Figure 7: NumPy / datetime objects - date time chart - GitHub stars of GMT, GMTjl and PyGMT Nov 21, 2025

yvonnefroehlich added 5 commits November 21, 2025 13:34

Adjust annotations for y axis

3118402

Update starts data csv files

7ab6240

Update starts data csv files

6aafdce

Add gmtmex

38d582d

Sort by starting time of project

268dd4b

yvonnefroehlich changed the title ~~Figure 7: NumPy / datetime objects - date time chart - GitHub stars of GMT, GMTjl and PyGMT~~ Figure 7: NumPy / datetime objects - date time chart - GitHub stars of GMT and the wrappers Nov 21, 2025

yvonnefroehlich added 3 commits November 27, 2025 10:37

Follow coding style

38b04a2

Merge remote-tracking branch 'origin/main' into fig/numpy

81891b2

Merge remote-tracking branch 'origin/main' into fig/numpy

f9ff990

seisman reviewed Nov 30, 2025

View reviewed changes

yvonnefroehlich added 2 commits November 30, 2025 16:03

merge remote-tracking branch 'origin/main' into fig/numpy

caa4edc

Import Box class

fd86eb7

seisman reviewed Dec 1, 2025

View reviewed changes

star_history_gmt.csv Show resolved Hide resolved

michaelgrund reviewed Dec 1, 2025

View reviewed changes

yvonnefroehlich added 5 commits December 1, 2025 12:31

Fix RGB codes for colors

79389d5

Use different symbols

c144d8c

Test requesting github stars data via Dongdong's code

2e8a472

Merge remote-tracking branch 'origin/main' into fig/numpy

73ea374

Reorder based y value of laster record

0018c93

yvonnefroehlich mentioned this pull request Dec 2, 2025

011_agu_FTLJG_2024: Correct RGB code for Python's blue yvonnefroehlich/gmt-pygmt-plotting#130

Merged

yvonnefroehlich added 2 commits December 2, 2025 11:09

Adjust color for MATLAB

3da5bff

Remove code for old order

8f9dd85

Figure 7: NumPy / datetime objects - date time chart - GitHub stars of GMT and the wrappers #10

Are you sure you want to change the base?

Figure 7: NumPy / datetime objects - date time chart - GitHub stars of GMT and the wrappers #10

Uh oh!

Conversation

yvonnefroehlich commented Jan 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gitnotebooks bot commented Jan 4, 2025

Uh oh!

seisman commented Oct 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yvonnefroehlich commented Oct 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

seisman commented Nov 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yvonnefroehlich commented Nov 5, 2025

Uh oh!

seisman commented Nov 5, 2025

Uh oh!

yvonnefroehlich commented Nov 5, 2025

Uh oh!

yvonnefroehlich commented Nov 21, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

michaelgrund Dec 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

yvonnefroehlich Dec 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

yvonnefroehlich Dec 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

yvonnefroehlich Dec 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

yvonnefroehlich commented Jan 4, 2025 •

edited

Loading

seisman commented Oct 17, 2025 •

edited

Loading

yvonnefroehlich commented Oct 17, 2025 •

edited

Loading

seisman commented Nov 5, 2025 •

edited

Loading

michaelgrund Dec 1, 2025 •

edited

Loading

yvonnefroehlich Dec 1, 2025 •

edited

Loading

yvonnefroehlich Dec 2, 2025 •

edited

Loading

yvonnefroehlich Dec 1, 2025 •

edited

Loading