Author of SingleFile here. One of the major differences is that Monolith doesn't use a web browser to take page captures; as a result, it doesn't support JavaScript, for example. SingleFile, on the other hand, requires a Chromium-based browser to be installed. It should also produce smaller pages and can generate ZIP or self-extracting ZIP files, though it will take longer to capture a page. Note that since version 2, executable files of the CLI tool are available for download [1].
SingleFile is amazing - I use it tens of times every day across desktop and mobile. I can’t recall a single instance of it breaking. Thank you sincerely for your excellent work.
Thanks a lot! Believe me, there have been a lot of bugs (900+ issues closed to date) because saving a web page is actually hard. You were lucky not to suffer ;)
I bet! The proof of that must surely be in how poor a job formats like .webarchive do of it.
SingleFile just makes this one really complex, really important thing trivially easy, and in a portable format. For anyone curating a knowledge base it’s an absolute godsend.
I didn’t see any donation instructions on your GitHub - I for one would certainly love to chip in if you could point me in the right direction?
Anybox (on Mac and iOS) also supports SingleFile, presenting itself as a WebDAV server that archives can be saved to. It’s flawless and hugely convenient in my experience.
Just stumbled across Monolith and SingleFile recently and it's fascinating to see how these tools approach the challenge of web archiving in different ways. SingleFile seems to be a powerhouse, especially for those who rely heavily on JavaScript-laden pages. The ability to produce smaller pages and even generate ZIP files is pretty handy for content archiving and sharing.
That said, Monolith's approach of not requiring a web browser could be a game changer for simpler projects or where installing a Chromium-based browser isn't viable. It strikes me as a more straightforward, lightweight solution, albeit with the clear trade-off of not supporting JavaScript.
Has anyone run into situations where one tool clearly outperformed the other in real-world usage? I'm particularly curious about the impact on performance and convenience when choosing between these two, especially for mobile use. Also, kudos to the authors and contributors of these tools. The tech community benefits greatly from such innovations that help preserve and share knowledge.
Is this an LLM-generated comment? The structure of this response seems too close to the “while X, it’s also important to Y” construction that LLMs like to use.
Anyway, to answer your question: lots of pages need JavaScript to work correctly, so SingleFile is the better option.
I use SingleFile to save a copy of every article / post / SO & forum discussion I find interesting or useful. I sort them into two buckets: work, and not-work.
I’ve been doing this for 10+ years (before SingleFile I used things like .pdf, plain .html, and .webarchive files, although these all have drawbacks).
In the pre-LLM era, I would then interface with these almost exclusively through a search front-end. I use HoudahSpot on Mac and easySearch on iOS. That lets me see everything interesting I’ve read on a particular subject just by typing it in (with the usual caveats that apply to basic keyword search, although in practice that alone has proven very effective). Because it’s just a folder of essentially zipped .html files, there’s no lock-in.
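Because the archive is just a folder of HTML files, the search layer doesn't need anything exotic; even a stdlib-only keyword scan works as a crude fallback to the dedicated search tools above. A minimal sketch in Python (the folder layout and query are hypothetical, and the tag stripping is deliberately naive):

```python
import re
from pathlib import Path

def keyword_search(folder: str, query: str) -> list[Path]:
    """Return archived HTML files whose visible text mentions the query (case-insensitive)."""
    pattern = re.compile(re.escape(query), re.IGNORECASE)
    hits = []
    for path in Path(folder).glob("**/*.html"):
        text = path.read_text(errors="ignore")
        # Crudely strip tags so matches inside markup/attributes don't count.
        text = re.sub(r"<[^>]+>", " ", text)
        if pattern.search(text):
            hits.append(path)
    return sorted(hits)
```

Real desktop search tools add ranking, snippets, and indexing, but the point stands: plain files on disk mean any tool, however simple, can read them.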
Now that we’ve got LLMs, I plug those 10+ years of files straight into my RAG pipeline using llama-index. It’s quite nice :)
Sorry for the ignorance, but if the forum posts require a login to access, then you won't be able to use SingleFile, right?
Also, how does the quality of the output compare to a .pdf? I'm used to printing PDFs from Chrome for articles that I want to save, but the layout can become awkward sometimes, and navigation bars can appear several times and hide portions of the text.
I like this feature of Chrome, but it's not consistently reliable.
If you use the browser extension, then pages requiring login are no problem because you are already logged in.
The output compared to PDF is like night and day: high fidelity versus low fidelity. At this point, I only use PDF if for some reason I specifically need that format.
SingleFile operates in the context of your browser, so it saves pages with your cookie jar, meaning you will be authenticated; specifically, it'll capture pages as you see them.
In most cases SingleFile's output looks identical to the real thing, though I generally only use it on simpler sites such as recipes and technical blogs.
I was about to post a similar question: What does this mean for those using the Firefox versions of the extensions (SingleFile as well as the version that zips the result)?
For me it bridged the gap that warped into existence between the time when "take screenshot" existed on Firefox and when web pages figured out that some people did this to archive pages and started putting crap in to either mess with the layout or otherwise "break" the resulting file.
It snapshots a web page to a single HTML file. At least that's what I use it for. I use it both to archive stuff and to have proof that some site published something.
The next order up would be ArchiveBox or whatever archive.org uses (the name escapes me) - a very heavy caching proxy that can save entire websites into a single directory in a way that wget/curl and all the other crawlers cannot.
If you care that the exact layout and everything is perfect, right now I think SingleFile is aces.
It takes whatever is in the DOM of the page you are viewing and sticks it into a single HTML file that can be served later and will reproduce the source page with high fidelity.
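The core trick behind that single-file output is resource inlining: external assets referenced by the DOM get rewritten into data: URIs embedded directly in the markup, so the HTML file carries everything it needs. A stdlib-only sketch of the idea (this is not SingleFile's actual code; the page fragment and image bytes are stand-ins for fetched content):

```python
import base64

def to_data_uri(content: bytes, mime: str) -> str:
    """Encode raw resource bytes as a data: URI suitable for inlining into HTML."""
    return f"data:{mime};base64,{base64.b64encode(content).decode('ascii')}"

# Hypothetical page fragment and bytes standing in for a downloaded image.
html = '<img src="logo.png">'
png_bytes = b"fake-png-bytes"

# Rewrite the external reference so the asset lives inside the document itself.
inlined = html.replace('src="logo.png"',
                       f'src="{to_data_uri(png_bytes, "image/png")}"')
```

A real tool does this for every image, stylesheet, script, and font the page references, which is why the resulting file stays faithful to what you saw.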
I use it to export an HTML file that I can stick in my logseq archive for later. So much better than just printing to a PDF!
https://www.npmjs.com/package/single-file-cli