Skip to content

Commit

Permalink
Merge branch 'main' of github.com:davidgasquez/handbook
Browse files Browse the repository at this point in the history
  • Loading branch information
davidgasquez committed May 24, 2023
2 parents b63cefe + 25a50d6 commit caa3c24
Show file tree
Hide file tree
Showing 4 changed files with 12 additions and 8 deletions.
4 changes: 2 additions & 2 deletions Data/Dashboards.md
Original file line number Diff line number Diff line change
Expand Up @@ -15,8 +15,8 @@
- Why this dashboard exists.
- Who it's for.
- When it was built, and if and when it's set to expire .
- What features it's tracking via links to team repositories, project briefs, screenshots, or video walkthroughs.
- Take Aways.
- What features it's tracking via links to team repositories, project briefs, screenshots, or video walk-throughs.
- Take-aways.
- Metadata (owner, related OKRs, TTL, …).
- Make them so its easy to go one layer down (X went down in Y location, or for Z new users, etc).
- Recreate dashboard from first principles periodically.
Expand Down
12 changes: 7 additions & 5 deletions Data/Data Culture.md
Original file line number Diff line number Diff line change
Expand Up @@ -19,7 +19,7 @@
- The modern data team needs to have *real organizational power*—it needs to be able to say "no” and mean it. If your data team does not truly have the power to say no to stakeholders, it will get sent on all kinds of wild goose chases, be unproductive, experience employee churn, etc.
- Data should report to the CEO. Ideally at least with some weekly metrics split into (a) notable trends, (b) watching close, and (c) business as usual.
- If data is the most precious asset in a company, does it make sense to have only one team responsible for it?
- [People talk about data as the new oil but for most companies it’s a lot closer uranium](https://news.ycombinator.com/item?id=27781286). Hard to find people who can to handle or process it correctly, nontrivial security/liabilities if PII is involved, expensive to store and a generally underwhelming return on effort relative to the anticipated utility.
- [People talk about data as the new oil but for most companies it’s a lot closer uranium](https://news.ycombinator.com/item?id=27781286). Hard to find people who can to handle or process it correctly, nontrivial security/liabilities if PII is involved, expensive to store and a generally underwhelming return on effort relative to the anticipated utility.
- [The pain in data teams come from needing to influence PMs/peers with having little control of them. Data teams need to become really great internal marketers/persuaders](https://anchor.fm/census/episodes/The-evolution-of-the-data-industry--data-jobs-w-Avo-CEO-and-Co-founder-Stefania-Olafsdottir-e16hu1l). That said, it shouldn't be the data team job to convince the organization to be data driven. That's not an effective way of spending resources.
- People problems are orders of magnitude more difficult to solve than data problems.
- **Integrate data where the decision is made**. E.g: Google showing restaurant scores when you're looking something for dinner.
Expand All @@ -42,7 +42,7 @@
- You won't have the best allocation of resources in a reactive team. Data teams need extra [[slack]]. [Balance user requests with actual needs](https://scientistemily.substack.com/p/product-management-skills-for-data).
- How can we measure the data team impact?
- Making a [[Writing a Roadmap|roadmap]] can help you telling if you are hitting milestone deadlines or letting them slip.
- Embedded data team members need to help other teams build their roadmap too.
- Embedded data team members need to help other teams build their roadmap too.
- Also, having a changelog ([do releases!](https://betterprogramming.pub/great-data-platforms-use-conventional-commits-51fc22a7417c)) will help show the team impact on the data product across time.
- [Push for a *centralization of the reporting structure*, but keeping the *work management decentralized*](https://erikbern.com/2021/07/07/the-data-team-a-short-story.html).
- Unify resources (datasets, entities, definitions, metrics). Have one source of truth for each one and make that clear to everyone. That source of truth needs heavy curation. Poor curation leads to confusion, distrust and…. lots of wasted effort.
Expand Down Expand Up @@ -84,11 +84,12 @@
- [Data ownership is a hard problem](https://www.linkedin.com/posts/chad-sanderson_heres-why-data-ownership-is-an-incredibly-activity-6904107936533114880-gw8n/). Data is fundamentally generated by services (or front-end instrumentation) which is managed by engineers. CDC and other pipelines are built by data engineers. The delineation of ownership responsibilities is very rarely established, with each group wanting to push 'ownership' onto someone else so they can do the jobs they were hired for.
- [Becoming a data-driven organization is a journey, which unfolds over time and requires critical thinking, human judgement, and experimentation](https://hbr.org/2022/02/why-becoming-a-data-driven-organization-is-so-hard). Fail fast, learn faster.
- [Path to create a data-driven organization](https://twitter.com/_abhisivasailam/status/1520274838450888704):
- 1. Get a well-placed leader with influence to message, model, and demand data-driven execution .
- 2. Hire/fire based on data aptitude and usage.
- 3. Create mechanisms that force analytical conversations. Sometimes there is no way around spending an afternoon breaking down metrics by different segments until you find The Thing.
- 1. Get a well-placed leader with influence to message, model, and demand data-driven execution.
- 2. Hire/fire based on data aptitude and usage.
- 3. Create mechanisms that force analytical conversations. Sometimes there is no way around spending an afternoon breaking down metrics by different segments until you find The Thing.
- [Start small. Don't try to wrangle data for the entire company until you have the tools and process down for one team](https://data-columns.hightouch.io/your-first-60-days-as-a-first-data-hire-weeks-3-4/).
- Difficulty to work with data scales exponentially with size.
- [Rule of thumb; your first customer as a data person should be growth](https://twitter.com/josh_wills/status/1577699871335010304).
- [Data is used largely to answer these questions](https://roundup.getdbt.com/p/bring-back-scenario-analysis):
- What already happened? Descriptive statistics.
- What will happen? Predictive statistics.
Expand All @@ -98,3 +99,4 @@
- Analytics: Making sense of data to guide decisions
- Make your [modeling technique](https://data-columns.hightouch.io/untitled-2/) explicit.
- Have a documentation [entry-point for Data](https://github.com/mozilla/data-docs).
- [For self-serve, aim to own as little as possible but keep in mind you can't make people do what you want but cans top them for doing what you don't want](https://youtu.be/wyW6hQGZxgY).
2 changes: 1 addition & 1 deletion Large Language Models.md
Original file line number Diff line number Diff line change
Expand Up @@ -17,7 +17,7 @@
- Designing prompts is an iterative process that requires a lot of experimentation to get optimal results. Start with simple prompts and keep adding more elements and context as you aim for better results.
- Be very specific about the instruction and task you want the model to perform. The more descriptive and detailed the prompt is, the better the results.
- Avoid saying what not to do but say what to do instead.
- Follow [Prompt Engineering Guide](https://www.promptingguide.ai/) and [OpenAI Best Practices](https://help.openai.com/en/articles/6654000-best-practices-for-prompt-engineering-with-openai-api).
- Follow [Prompt Engineering Guide](https://www.promptingguide.ai/), [Brex's Prompt Engineering Guide](https://github.com/brexhq/prompt-engineering), and [OpenAI Best Practices](https://help.openai.com/en/articles/6654000-best-practices-for-prompt-engineering-with-openai-api).

#### Prompt Creator

Expand Down
2 changes: 2 additions & 0 deletions Web3.md
Original file line number Diff line number Diff line change
Expand Up @@ -42,6 +42,8 @@

### Blockchain Indexing Projects

There is an emerging landscape of [Data and Blockchain](https://primodata.org/landscape.html).

#### Companies

- [Tokenflow](https://docs.tokenflow.live/)
Expand Down

0 comments on commit caa3c24

Please sign in to comment.