More

jlev · 2025-09-11T16:08:24 1757606904

This change https://github.com/eggert/tz/commit/baea52df7ac9c4b53857556f... was a huge pain in my ass in 2013.

I was working in Libya on voter registration tools with the UN and the High National Election Commission. The government decided to not implement a planned TZ change, and didn't inform the public until the day of. Not the hardest thing we dealt with, that was a full country internet shutoff by a mob outside our data centre (https://www.bbc.com/news/world-africa-25481794). Sometimes the politics of a project are more complicated than the technology...

We did implement an all-SMS voter registration system, which was pretty cool. Hasn't been used much since, but it's all open source. https://github.com/hnec-vr

ta1243 · 2025-09-11T17:43:08 1757612588

> I was working in Libya

Complete tangent, but I don't think many Americans know this (I'm assuming you're an American)

If you aren't American, you now are now ineligible to go to America as a tourist without an expensive hasslesome visit to a US embassy. (No online ESTA)

I have friends that have gone to countries like Libya, and Syria to do similar international work. An British engineer I know recently went to Syria for a few days.

I pointed out that he is no longer allowed to go to the US without going to the embassy for a visa. He's ineligible for an ESTA.

He said "fine, work will have to pay for it".

I then pointed out this is for the rest of his life. He regularly holidays in Florida. He might leave the media or change jobs so they no longer pay for a visa.

I've been asked to go to Iraq in the past, but I've said no because of this. Was a very expensive weekend for my friend.

Another friend is in the British Army, he's gone to various places as part of both British and NATO deployment, not using his personal passport - but using travel orders. He managed to avoid going to Iraq which is lucky for him, means he can still get an ESTA.

elAhmo · 2025-09-11T18:35:10 1757615710

Many Americans (and non-Americans) also don't know that not all countries are supported with ESTA. So, for me, being a citizen of a '3rd world country' in Europe, I have to visit the embassy.

Although I am resident in EU and haven't been to any of those 'flagged' countries.

jlev · 2025-09-11T18:02:14 1757613734

I am an American, and most of the other folks on the project were as well. I did go through an interview with Customs officers when I returned to the US, but it didn't affect my ability to use GlobalEntry. I don't have clearance, but it might cause questions if I did apply for that in the future.

I've also been to Syria as a tourist several times, and at one point had to maintain a second passport for visiting Israel or the West Bank. You can't travel to most of the Arab world if you have Israeli stamps, but you can get another book from the US to keep them separate.

Sorry for your friend. Borders are bullshit.

ta1243 · 2025-09-12T11:34:01 1757676841

Second passport is fine, but they stopped stamping at Tel Aviv many years ago. I still got a stamp last time I went to gaza, but that was about a decade ago and pre current passport. Obviously not a concern now)

jlev · 2025-09-05T19:42:47 1757101367

I did something similar for my wedding website back in 2013. We used a mail-in service that produced a decent TTF, and then I converted it to a WOFF. Still online at https://ruthandjosh.net/story/ (warning, millennial cringe ahead)

wvbdmp · 2025-09-06T00:00:45 1757116845

Hell yeah, keep that site up. What a marvel in this age of link rot.

jlev · 2025-09-06T02:02:17 1757124137

Thank you, I’m never letting the domain go.

Lu2025 · 2025-09-06T02:56:03 1757127363

It's a lovely story, no cringe detected.

dotancohen · 2025-09-05T21:08:32 1757106512

Thank you for that cringe! What a great way to end the week. Shabbat shalom.

Weetile · 2025-09-05T22:05:15 1757109915

So is it true that Ruth had champagne on the flight without you?

chneu · 2025-09-06T07:07:41 1757142461

this rules.

i actually think the design/layout is kind of timeless.

aswinmohanme · 2025-09-05T19:57:16 1757102236

warmed my heart, wishing you a great life together

jinushaun · 2025-09-07T03:01:00 1757214060

I’ll take millennium cringe over gen-z nihilism.

ezequiel-garzon · 2025-09-06T13:42:26 1757166146

This is awesome! A mere upvote wouldn't be enough to point this out, thanks for sharing.

jsjddnnsndn · 2025-09-06T01:29:04 1757122144

Nice story. Most people dont meet in a "fate" kind of way but you did.

loveiswork · 2025-09-06T11:41:46 1757158906

Incredible, loved the blog

cauliflower2718 · 2025-09-06T16:01:02 1757174462

This is so cute!! I hope you two are having a lovely life together.

caminanteblanco · 2025-09-05T22:36:48 1757111808

So wholesome, and here's a belated congratulations!

jszymborski · 2025-09-06T00:36:36 1757118996

Love it!!

stavros · 2025-09-05T23:11:12 1757113872

Well that was delightful.

faxmeyourcode · 2025-09-06T03:07:00 1757128020

love this, what a great handmade feel

jlev · 2025-06-30T20:06:43 1751314003

A modest proposal to turn Alameda's former naval station into an AI and robotics hub.

"All it would take" is

- Exercising the Endangered Species Act 7(j) exemption to reactivate this plot of land, a former US Naval base that closed in 1997

- Invoking the National Emergencies Act to maximally accelerate federal permitting across all domains, and

- Transferring ownership of the land from the VA to the Department of Defense.

jlev · 2025-03-13T17:59:32 1741888772

Aaron Swartz, cofounder of Reddit and inventor of RSS and Markdown, was hounded to death by an overzealous prosecutor for downloading articles from JSTOR, with the intent to learn from them. He was charged with over a million dollars in fines and could have faced 35 years in prison.

He and Sam Altman were in the same YC class. OpenAI is doing the same thing at a larger scale, and their technology actually reproduces and distributes copyrighted material. It's shameful that they are making claims that they aren't infringing creator's rights when they have scraped the entire internet.

https://flaminghydra.com/sam-altman-and-aaron-swartz-saw-the... https://en.wikipedia.org/wiki/Aaron_Swartz

falcor84 · 2025-03-13T21:16:22 1741900582

I'm familiar with Aaron Swartz's case, and that is actually why I phrased it as "books". In any case, while tragic, Swartz wasn't prosecuted for copyright infringement, but rather for wire fraud and computer fraud due to the manner in which he bypassed protections in MIT's network and the JSTOR API. This wouldn't have been an issue if he downloaded the articles from a source that freely shared them, like sci-hub.

h2zizzle · 2025-03-14T13:47:49 1741960069

It would be incredibly naive to assume that the scraping done for these models did not at any point circumvent protections.

The fundamental contention is that both accessed, saved and distributed material that they didn't have a "right" to access, save, and distribute. One was made a billionaire for it and another was driven to suicide. It's not tragic, it's societal malpractice.

kgdiem · 2025-03-13T18:37:48 1741891068

Will what OpenAI & others serve as precedent for Alexandra Elbakyan of SciHub and avenge Aaron?

Cynically, I imagine it will not but I hope that it could.

concerndc1tizen · 2025-03-13T19:20:52 1741893652

You could argue that they are avenging him in doing exactly what he did, or worse, and not being punished for it. They are establishing precedent.

Dylan16807 · 2025-03-13T19:35:30 1741894530

I'm responding specifically to this sentence:

> It's shameful that they are making claims that they aren't infringing creator's rights when they have scraped the entire internet.

Scraping the Internet is generally very different from piracy. You are given a limited right to that data when you access it, and you can make local copies. if further use does something sufficiently non-copying, then creator rights aren't being infringed.

mirekrusin · 2025-03-13T19:53:10 1741895590

Can you compress the internet including copyrighted material and then sell access to it?

At what percentage of lossy compression it becomes infringement?

Dylan16807 · 2025-03-13T20:26:02 1741897562

> Can you compress the internet including copyrighted material and then sell access to it?

Define access?

If you mean sending out the compressed copy, generally no. For things people normally call compression.

If you want to run a search engine, then you should be fine.

> At what percentage of lossy compression it becomes infringement?

It would have to be very very lossy.

But some AI stuff is. For example there are image models with fewer parameters than source images. Those are, by and large, not able to store enough data to infringe with. (Copying can creep in with images that have multiple versions, but that's a small sliver of the data.)

codedokode · 2025-03-14T01:12:23 1741914743

Commercial audio generation models were caught reproducing parts of copyrighted music in a distorted and low-quality form. This is not "learning", just "imitating".

Also, as I understand they didn't even buy the CDs with music for training; they got it somewhere else. Why do organizations that prosecute people for downloading a movie do not want to look if it is ok to make a business on illegal copies of copyrighted works?

Dylan16807 · 2025-03-14T01:48:00 1741916880

I said "some" for a reason.

a_wild_dandan · 2025-03-13T20:50:21 1741899021

When you identify where the infringing party has stored the source material in their artifact.{zip,pdf,safetensor,connectome,etc}. In ML, this discovery stage is called "mechanistic interpretability", and in humans it's called "illegal."

Dylan16807 · 2025-03-13T22:54:12 1741906452

It's not that clear cut. Since they're talking about taking lossy compression to the limit, there are ways to go so lossy that you're not longer infringing even if you can point exactly at where it's stored.

Like cliff's notes.

yieldcrv · 2025-03-13T19:07:50 1741892870

It was overzealous prosecution of the breaking into a closet to wire up some ethernet cables to gain access to the materials

Not the downloading with intent

And apparently the most controversial take on this community is the observation that many people would have done the trial, plea and time, regardless of how overzealous the prosecution was

triceratops · 2025-03-13T22:05:53 1741903553

> breaking into a closet

"The closet's door was kept unlocked, according to press reports"

When's the last time a kid with no record, a research fellow at Harvard, got threatened with 35 years for a simple B&E?

yieldcrv · 2025-03-14T01:55:25 1741917325

They threaten

Its the plea or sentencing where that stuff gets taken into account for a reduction to community service

DrillShopper · 2025-03-14T13:20:01 1741958401

I'm glad you still have that much faith in the system. That's much more faith than I have in the system (and more faith than I had in the system back then, too).

apetresc · 2025-03-13T20:59:58 1741899598

Wasn’t John Gruber the inventor of Markdown?

andsoitis · 2025-03-15T10:34:17 1742034857

> for downloading articles from JSTOR, with the intent to learn from them

For context, according to sources, he downloaded 4.8 million articles.

falcor84 · 2025-03-15T15:51:05 1742053865

Maybe he was about to train an LLM on them /s

tzs · 2025-03-13T21:51:19 1741902679

35 years is a press release sentence. The way DOJ calculates sentences when they write press releases ignores the alleged facts of the particular case and just uses for each charge the theoretically maximum possible sentence that someone could get for that charge.

To actually get that maximum typically requires things like the person is a repeat offender, drug dealing was involved, people were physically harmed, it involved organized crime, it involved terrorism, a large amount of money was involved, or other things that make it an unusual big and serious crime.

The DOJ knows exactly what they are alleging the defendant did. They could easily looks at the various factors that affect sentencing for the charge and see which apply to that case and come up with a realistic number but that doesn't make it sound as impressive in the press release.

Another thing that inflates the numbers in the press releases is that defendants are often charged with several related charges. For many crimes there are groups of related charges that for sentencing get merged. If you are charged with say 3 charges from the same group and convicted on all you are only sentenced for whichever one of them has the longest sentence.

If you've got 3 charges from such a group in the press release the DOJ might just take the completely bogus maximum for each as described above and just add those 3 together.

Here's a good article on DOJ's ridiculous sentence numbers [1].

Here's a couple of articles from an expert in this area of law that looks specifically at what Swartz was charged with and what kind of sentence he was actually looking at [2][3].

Why do you think Swartz was downloading the articles to learn from them? As far as I've seen know one knows for sure what he was intending.

If he wanted to learn from JSTOR articles he could have downloaded them using the JSTOR account he had through his research fellowship at Harvard. Why go to MIT and use their public JSTOR WiFi access, and then when that was cut off hide a computer in a wiring closet hooked into their ethernet?

I've seen claims that he wanted to do was meta research about scientific publishing as a whole which could explain why he needed to download more than he could download with his normal JSTOR account from Harvard, but again why do that using MIT's public WiFi access? JSTOR has granted more direct access to large amounts of data for such research. Did he talk to them first to try to get access that way?

[1] https://web.archive.org/web/20230107080107/https://www.popeh...

[2] https://volokh.com/2013/01/14/aaron-swartz-charges/

[3] https://volokh.com/2013/01/16/the-criminal-charges-against-a...

codedokode · 2025-03-14T01:16:52 1741915012

He might have wanted other people to have access to the knowledge, and for free. In comparison, AI companies want to sell access to the knowledge they got by scraping copyrighted works.

jlev · 2025-03-05T14:39:06 1741185546

I used to work at a company that used Sentinel-2 data and a large scale AI model to detect changes in land use and land cover anywhere in the world. They provide free global data at 10m resolution on an annual basis, or paid versions at 3m resolution over a custom timeframe.

https://www.impactobservatory.com/

jlev · on Feb 4, 2025

This seems like it goes against most standard development practices.

jlev · on April 25, 2024

Phrenology 2.0!

jlev · on Oct 2, 2023

You can tell that this author knows their shit because the blog post is entirely illustrated with furry memes.

jlev · on Sept 11, 2023

I did a project at the MIT Center for Future Civic Media on building a hyper-local radio platform, and we'd always use Car Talk as an example of the kind of interactions we wanted to enable. Like, what if you were a vet in rural Uganda, and you wanted to have a show for farmers to describe their problems and get expert help. Call it Goat Talk, and we'd descend into bleats and baahs.

jlev · on Aug 10, 2023

This was an informal thing for a long time, but I didn't know that there's now an actual certificate. I may have to go back and request mine retroactively.

JohnFen · on Aug 10, 2023

Any pirate worthy of the title would just forge their own.

p1mrx · on Aug 10, 2023

https://www.google.com/search?q=mit+pirate+certificate&tbm=i...

Do what you want 'cause a pirate is free.