Rost Glukhov - Accordionist
Thursday, 29 May 2025
Convert HTML content to Markdown using LLM and Ollama
https://www.glukhov.org/post/2025/05/html-to-markdown-using-llm/
ReaderLM-v2
I have tried the next such model version - reader-lm-v2. ReaderLM-v2 is built on Qwen2.5-1.5B-Instruction. I can confirm: it works, but the conversion is somehow slow-ish…
Can you imagine the 500KB html webpage that you need to convert extract a text from? Maybe there is 100000 tokens? or let it be even 10k tokens.
I took a sample page of 121KB and conversion time on my PC is: ~1sec.
Calling Ollama Commandline
#!/bin/bash
MODEL="milkey/reader-lm-v2:latest"
INPUT_FILE="prompt.html"
OUTPUT_FILE="response.md"
# Read file content as prompt
PROMPT="Extract the main content from the given HTML and convert it to Markdown format.\nhtml:\n $(cat "$INPUT_FILE")"
# Call Ollama and save the response
ollama run "$MODEL" "$PROMPT" > "$OUTPUT_FILE"
echo "Ollama response saved to $OUTPUT_FILE"
Saturday, 23 March 2019
Бирюзовые колечки и цыганочка под баян
Бирюзовые да золотые колечки & Цыганочка - под баян: https://youtu.be/fwAZfO8jbfo
Tortoise an gold rings and Ciganochka on the botton accordion
Monday, 18 March 2019
Клубника со льдом под баян
Клубника со льдом [Крематорий] - под баян: https://youtu.be/NqQK-h94zMk
Tuesday, 5 March 2019
Russian Melbourne festival - full clip
Full video
https://youtu.be/3ujlN1O30M4
Russian Melbourne Festival 2019
Русский фестиваль в Мельбурне 2019
Saturday, 2 March 2019
Russian Festival in Melbourne 2019
https://youtu.be/Owe964RCwjI
Tuesday, 26 February 2019
Pakofesta 2019 parade in Geelong, Australia -- Russian part
https://youtu.be/-Qu_GO2Zi38
Thursday, 7 February 2019
Nice to see you!
Welcome, my dear friend, here I'll publish news about music I play and learn and some sheetmustic if you are interested in the same.
Subscribe to:
Comments (Atom)