📝
davidgasquez committed May 15, 2024
1 parent 2623c25 commit 840440f
Showing 8 changed files with 38 additions and 36 deletions.
1 change: 1 addition & 0 deletions Emergence.md
@@ -4,6 +4,7 @@

When things interact, they often birth new, unpredictable forms. **The sum total of a system is more than its component parts**. E.g: biological [[evolution]], Conway's Game of Life, Wikipedia, Minecraft.

<!-- markdownlint-disable-next-line MD033 -->
<iframe width="640" height="360" src="https://www.youtube.com/embed/o_ZuWbX-CyE" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>

You can't top-down plan emergent [[systems]], but you can provoke them into being. [Here are some attributes that emergent systems often have](http://gordonbrander.com/pattern/provoking-emergence/):
2 changes: 1 addition & 1 deletion Finances.md
@@ -2,7 +2,7 @@

- Today's savings buy tomorrow's liberty. Think long term. [A dollar today is worth more than a dollar tomorrow](https://putanumonit.com/2017/02/10/get-rich-slowly/). People will take that dollar and do something profitable with it. Financial safety is freedom.
- Saving money and investing money are both good [[Habits]]. Small amounts of money invested regularly for many decades without deliberation is one path to wealth.
* [Be boring with most of your portfolio](https://twitter.com/VitalikButerin/status/1743991285214527532).
- [Be boring with most of your portfolio](https://twitter.com/VitalikButerin/status/1743991285214527532).
- Spend (relatively) more money on things you interact with the most, time-wise (as well as intensity-wise).
- Up to a point, being richer will not necessarily make you happier.
- Spend on others, especially people you are close to. Positive feedback loop: Prosocial spending makes you happier, and happiness makes you more likely to spend pro-socially.
1 change: 1 addition & 0 deletions Hobbies.md
@@ -18,6 +18,7 @@
- Number of people in a rush
- Pigeons QoL
- Taxi cars brand

## Resources

- [The NoSurf Activities List](https://nosurf.net/activity-list/)
1 change: 0 additions & 1 deletion News.md
@@ -22,4 +22,3 @@
- When we're talking about very unpopular beliefs, polls can only give a weak signal. Any possible source of noise ([Lizardman's Constant](https://slatestarcodex.com/2013/04/12/noisy-poll-results-and-reptilian-muslim-climatologists-from-mars/)) can easily overwhelm the signal. Beware of [badly designed polls](https://slatestarcodex.com/2020/05/28/bush-did-north-dakota/).
- Uncertainty doesn't sell. Nuance doesn't sell. Long, complex lectures don't sell. A video of someone saying "it's complicated" will never perform the way one would of someone using confident, flippant, polarizing rhetoric, and that's a huge problem.
- Main rule of fast-moving situations (e.g: early days of [[COVID-19]]): No one knows anything.
-
26 changes: 13 additions & 13 deletions Open Data.md
@@ -142,55 +142,55 @@ Package managers have been hailed among the most important innovations Linux bro

> I'm not super clear on these answers! Please [reach out](https://davidgasquez.github.io/) if you want to chat about it.
1. What would be a great use case to start with?
### 1. What would be a great use case to start with?

I'd say [chain related data](https://davidgasquez.github.io/blockchain-data-pipelines/). It is open and people are eager to get their hands on it. I'm [working on that area](https://davidgasquez.github.io/gitcoin-data/), so I might be biased.

2. Why should people use this instead of doing their own thing?
### 2. Why should people use this instead of doing their own thing?

[If everybody could converge to it, e.g: _"datapackage.json_" as a metadata and schema description standard, then, an ecosystem of utilities and libraries for processing data would take advantage of it](https://news.ycombinator.com/item?id=15346836).

3. What is the incentive for people to adopt it?
### 3. What is the incentive for people to adopt it?

I wonder if there are ways to use novel mechanisms (e.g: DAOs) to incentivize people. Also, companies like [Golden](https://golden.com/) and [index.as](https://index.as/) are doing interesting work on monetizing data curation.

4. How can LLMs help "building bridges"?
### 4. How can LLMs help "building bridges"?

LLMs could infer schema, types, and generate some metadata for us. [[Large Language Models|LLMs can parse unstructured data (CSV) and also generate structure from any data source (scraping websites)]], making it easy to [create datasets from random sources](https://tomcritchlow.com/2021/03/29/open-scraping-database/).

They're definitely blurring the line between structured and unstructured data too. Imagine pointing an LLM at a GitHub repository with some CSVs and getting an auto-generated `datapackage.json`.
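
To make the idea of an auto-generated descriptor more concrete, here is a minimal sketch that does not use an LLM at all: it infers column types from a CSV with a simple heuristic and builds a dictionary loosely shaped like a `datapackage.json`. The function names (`infer_type`, `describe_csv`) and the type heuristic are assumptions for illustration, and the output is not validated against the Data Package specification. An LLM could plausibly fill in the parts this cannot, such as descriptions and titles.

```python
import csv
import io


def _all_parse(values, cast):
    """Return True if every value can be converted with `cast`."""
    for value in values:
        try:
            cast(value)
        except ValueError:
            return False
    return True


def infer_type(values):
    """Guess a field type from string samples (illustrative heuristic only)."""
    samples = [v for v in values if v != ""]
    if not samples:
        return "string"
    if _all_parse(samples, int):
        return "integer"
    if _all_parse(samples, float):
        return "number"
    return "string"


def describe_csv(name, csv_text):
    """Build a descriptor dict loosely shaped like a datapackage.json."""
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = list(reader)
    fields = [
        {"name": col, "type": infer_type([row[col] or "" for row in rows])}
        for col in (reader.fieldnames or [])
    ]
    return {
        "name": name,
        "resources": [
            {"name": name, "path": f"{name}.csv", "schema": {"fields": fields}}
        ],
    }
```

Calling `describe_csv("blocks", text)` and passing the result to `json.dumps` would give a starting descriptor that a person (or a model) can then enrich.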

5. How can we stream/update new data reliably? E.g: some datasets like Ethereum `blocks` could be updated every few minutes.
### 5. How can we stream/update new data reliably? E.g: some datasets like Ethereum `blocks` could be updated every few minutes

I don't have a great answer. Perhaps just push the new data into partitioned datasets?
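
One way to picture "pushing new data into partitioned datasets" is the sketch below. It assumes a Hive-style `date=<value>/` directory layout and names each batch file after a hash of its contents, so pushing the same batch twice does not duplicate data. The layout, the JSON batch format, and the `write_batch` helper are all hypothetical choices made for this example.

```python
import hashlib
import json
from pathlib import Path


def write_batch(root, partition, rows):
    """Write `rows` as an immutable batch file under root/date=<partition>/.

    The file name is derived from the batch contents, so re-pushing an
    identical batch is a no-op instead of creating a duplicate.
    """
    payload = json.dumps(rows, sort_keys=True).encode("utf-8")
    digest = hashlib.sha256(payload).hexdigest()[:16]
    part_dir = Path(root) / f"date={partition}"
    part_dir.mkdir(parents=True, exist_ok=True)
    path = part_dir / f"batch-{digest}.json"
    if not path.exists():
        path.write_bytes(payload)
    return path
```

Readers then only need to list the partitions they have not seen yet, rather than re-downloading the whole dataset.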

7. Is it possible to [mount large amount of data](https://rclone.org/commands/rclone_mount/) ([FUSE](https://github.com/datalad/datalad-fuse)) from a remote source and get it dynamically as needed?
### 7. Is it possible to [mount large amount of data](https://rclone.org/commands/rclone_mount/) ([FUSE](https://github.com/datalad/datalad-fuse)) from a remote source and get it dynamically as needed?

It should be possible. I wonder if we could mount all datasets locally and explore them as if they were on your laptop.

8. Can new table formats play efficiently with IPFS?
### 8. Can new table formats play efficiently with IPFS?

Parquet could be a great fit if we figure out how to deterministically serialize it and integrate with IPLD. This will reduce their size as unchanged columns could be encoded in the same CID.
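
The deduplication idea can be sketched without Parquet or IPLD. The example below serializes each column deterministically and hashes it, so a column that did not change between two versions of a table gets the same digest. The SHA-256 hex digest is only a stand-in for a real CID, and the in-memory `dict` table and both function names are assumptions for illustration.

```python
import hashlib
import json


def column_digests(table):
    """Map each column name to a digest of its deterministically serialized values.

    `table` is a dict of column name -> list of values. The digest is a
    stand-in for a content identifier, not an actual IPFS CID.
    """
    digests = {}
    for name, values in table.items():
        payload = json.dumps(values, separators=(",", ":")).encode("utf-8")
        digests[name] = hashlib.sha256(payload).hexdigest()
    return digests


def changed_columns(old, new):
    """Return the columns of `new` whose content differs from `old`."""
    old_digests = column_digests(old)
    new_digests = column_digests(new)
    return sorted(
        name for name, digest in new_digests.items()
        if old_digests.get(name) != digest
    )
```

Only the columns returned by `changed_columns` would need to be stored again; the rest could be referenced by their existing identifiers.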

Later on I think it could be interesting to explore running [`delta-rs`](https://github.com/delta-io/delta-rs) on top of IPFS.

9. How to work with private data?
### 9. How to work with private data?

Homomorphic encryption?
Not sure. Homomorphic encryption?

9. How could something like [Ver](https://raulcastrofernandez.com/data-discovery-updates/) works?
### 9. How could something like [Ver](https://raulcastrofernandez.com/data-discovery-updates/) works?

If you can envision the table you would like to have in front of you, i.e., you can write down the attributes you would like the table to contain, then the system will find it for you. This probably needs a [[Knowledge Graphs]]!

10. How can a [[Knowledge Graphs]] [help with the data catalog](https://docs.atomicdata.dev/usecases/data-catalog.html)?
### 10. How can a [[Knowledge Graphs]] [help with the data catalog](https://docs.atomicdata.dev/usecases/data-catalog.html)?

It could help users connect datasets. With good enough core datasets, it could be used as an LLM backend.

11. [How would a Substack for databases look like](https://tomcritchlow.com/2023/01/27/small-databases/)?
### 11. [How would a Substack for databases look like](https://tomcritchlow.com/2023/01/27/small-databases/)?

An easy tool for creating, maintaining and publishing databases with the ability to restrict parts or all of it behind a paywall. Pair it with the ability to send email updates to your audience about changes and additions.

12. Curated and small data (e.g: at the community level) is not reachable by Google. How can we help there?
### 12. Curated and small data (e.g: at the community level) is not reachable by Google. How can we help there?

Indeed! With LLMs on the rise, community curated datasets become more important as they don't appear in the big data dumps.

10 changes: 5 additions & 5 deletions Piano.md
@@ -9,9 +9,9 @@
- On the academic side, [Intro Theory by Dmitri Tymoczko](https://dmitri.mycpanel.princeton.edu/teaching.html). His course works on a much more fundamental level than any other music theory course.
- [Study Music](https://github.com/vpavlenko/study-music) is a great GitHub repo with lots of music theory resources.

## Youtube Channels:
## Youtube Channels

- [Bill Hilton](https://www.youtube.com/user/billhiltonbiz)
- [Nahre Sol](https://www.youtube.com/channel/UC8R8FRt1KcPiR-rtAflXmeg)
- [Andrew Furmanczyk](https://www.youtube.com/channel/UCpzgTNTgQsR9YYsyOm3k3KQ)
- [Dan the composer](https://www.youtube.com/user/danthecomposer/)
- [Bill Hilton](https://www.youtube.com/user/billhiltonbiz)
- [Nahre Sol](https://www.youtube.com/channel/UC8R8FRt1KcPiR-rtAflXmeg)
- [Andrew Furmanczyk](https://www.youtube.com/channel/UCpzgTNTgQsR9YYsyOm3k3KQ)
- [Dan the composer](https://www.youtube.com/user/danthecomposer/)
3 changes: 2 additions & 1 deletion Social Media Issues.md
@@ -1,10 +1,11 @@
# Social Media Issues

<!-- markdownlint-disable-next-line MD033 -->
<iframe width="640" height="360" src="https://www.youtube.com/embed/k1BneeJTDcU" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>

- Internet algorithms are [[Systems|complex profit-maximizing systems]] that want to spoon feed you whatever you're most likely to click on. This is a win-win, symbiotic relationship, until it's not. When the algorithm lures your primitive mind into acting against you, the relationship is parasitic. [The algorithm will learn to show things that will further confirm and strengthen your existing viewpoints](https://www.ted.com/talks/eli_pariser_beware_online_filter_bubbles).
- Each app is competing against the other apps. Only the apps that gain your [[Focus|attention]] survive. Over time, your attention is more and more hacked by these apps. All exist to sell your attention to advertisers. Each one has a team optimizing the attention hacking.
- Social Media apps might be dangerous due to the amount of data they track. Data is not the new gold, it is the new oil, and it damages the social environment. [If you feel you are being watched, you change your behavior](https://www.socialcooling.com/). [Loss of privacy leads to loss of freedom](https://robindoherty.com/2016/01/06/nothing-to-hide.html). This may limit our desire to speak or think freely thus bring about chilling effects on [society—or social cooling](https://reasonandmeaning.com/2017/10/31/what-is-social-cooling/).
- Social Media apps might be dangerous due to the amount of data they track. Data is not the new gold, it is the new oil, and it damages the social environment. [If you feel you are being watched, you change your behavior](https://www.socialcooling.com/). [Loss of privacy leads to loss of freedom](https://robindoherty.com/2016/01/06/nothing-to-hide.html). This may limit our desire to speak or think freely thus bring about "chilling effects" on [society—or social cooling](https://reasonandmeaning.com/2017/10/31/what-is-social-cooling/).
1. Your data is collected and scored.
2. Your digital reputation may affect your opportunities.
3. People start changing their behavior to get better scores.