22 comments

  • sshine 1 hour ago
    The number one power move I have is Nix integration. The availability of tooling, secrets, environment and the ability for the agent to modify its own environment is... well, I don't know how people live without it. I guess you guys still install things using commands and hope everything you need is present on the next machine? Developer machine, CI environment, deployment environment: They're all derived from a single source, and compiling and running always works on every machine.

    In Claude I use /branch and /rename a lot (context checkpoints, fork, go back)

    I use sandboxing almost exclusively: https://github.com/nix-tools/bubblebox -- it's a generalisation of Numtide's claudebox with a few fixes and some feature additions (more coming). This is best compared to always running your Claude in Docker containers, except there's no Docker runtime. Works fine in WSL and nix-darwin, too.

    • oulipo2 31 minutes ago
      For those who don't want the complexity of Nix, Mise is a good compromise
      • _kblcuk_ 1 minute ago
        +100. I also dig fnox (encrypted-secrets-in-git) and hk (pre-hooks manager that is actually fast and stays out of the way) by the same author, pretty much default for any project I start nowadays.

        Though I also use nix to manage my machines :-D

      • arcanemachiner 7 minutes ago
        For those who don't know: Mise is a version manager (among other things), and is said to be an improvement over its predecessor, asdf:

        https://mise.en.dev

        https://asdf-vm.com

  • netdevphoenix 1 hour ago
    What happens when you have a codebase made with claude using this setup and claude is down for let's say 8 hours? Are you able to efficiently, smoothly and productively take over the codebase?
    • ThunderBee 26 minutes ago
      You could say the same thing about any always online software suite and it would be equally fair as we move into more agentic development workflows.

      EX. Sure, you could go back to the old ways of using a drafting table for your engineering work if CAD went down but it would be exponentially slower…

      Personally with my workflow I spend 30-60 minutes per Claude feature spec doc when I’m pair planning. If Claude goes down I would just prepare spec docs on my own until it came back online and then rapidly review them before calling the coding workflow.

    • redhale 33 minutes ago
      Just use a fallback, like Codex CLI. Takes a little effort upfront to ensure your configuration is wired correctly for both harnesses, but it is pretty easy to get them 90% identical (there will almost always be some experimental / edge case features that differ across harnesses, but in my experience those are negligible in practice).
    • stavros 1 hour ago
      What happens when you have a codebase made with gcc for let's say 8 hours? Are you able to efficiently, smoothly and productively take over the assembly code?
      • sokoloff 0 minutes ago
        [delayed]
      • Planktonne 1 hour ago
        1. When and how would gcc go down?

        2. How often do you think that happens, compared to Claude?

        • stavros 57 minutes ago
          You can use a local model, which will go down exactly as often as gcc will. We may still have hopeful notions of being able to understand the codebase, but the reality seems to be that the codebases we don't understand will be the ones that will win out in the market, because they'll be cheaper while still only having about as many bugs as they had when people wrote them.
          • Planktonne 56 minutes ago
            We're explicitly not talking about local models here; we're talking about Claude.
            • stavros 54 minutes ago
              Because you're better able to take over the codebase a local model wrote than one Claude wrote? The original question was about taking over an LLM-written codebase, it doesn't sound to me like the argument was about a codebase that Claude, specifically, wrote.
              • notachatbot123 44 minutes ago
                The original question is:

                > What happens when you have a codebase made with claude using this setup and claude is down for let's say 8 hours?

                So: - A codebase made with Claude - Using this [Claude] setup - Claude is down

              • JoRyGu 45 minutes ago
                Brother, look at the first comment in the chain you replied to. It very specifically was about Claude.
                • stavros 44 minutes ago
                  Well, in that case, it's also very specifically about this guy's codebase, so none of us can really say anything on this.
      • ares623 1 hour ago
        wat?
      • IceDane 52 minutes ago
        Is this really a position you want to take in public with your real name and identity and everything plastered over your profile?
        • stavros 50 minutes ago
          What can I say, we can't all be geniuses.
  • rkuska 2 hours ago
    Regarding:

    ``` # Development Workflow

    *Always use `bun`, not `npm`.*

    # 1. Make changes

    # 2. Typecheck (fast)

    bun run typecheck

    # 3. Run tests

    bun run test -- -t "test name" # Single suite bun run test:file -- "glob" # Specific files

    # 4. Lint before committing

    bun run lint:file -- "file1.ts" bun run lint

    # 5. Before creating PR

    bun run lint:claude && bun run test ```

    I have these things in pre-commit, this way the targets are always ran and the agent is forced to fix them (I ask claude to commit changes). The agents are erratic and very often skip these steps. Anything that can be deterministic I keep as scripts.

    Regarding commits; both codex and claude are terrible at writing them. I have in my user CLAUDE.md:

    ``` Pattern: `type(scope): message` where type is `fix`, `feat`, `chore`, `docs`, `refactor`, or `style`; scope marks what is affected; message is a short lowercased description.

    Keep subject and body lines under 72 characters. Always write a body explaining what, how, and why in continuous human-readable text. For fixes include the error message being fixed. No first-person speech. Re-read the actual git diff before writing — the message must describe what changed, not what was planned.

    Use following command to create commit:

    ```bash git commit -F - <<'EOF' type(scope): subject line

    Body paragraph explaining what, how, and why. EOF ```

    ```

    Without it would write the body as a single long sentence; when asked to fix lines it would just insert \n (newlines), which were not respected and were instead just rendered as characters.

    Another thing I find helpful is VOCABULARY.md. Very often the agent would assume (connect?) a different thing than what I had in mind, with VOCABULARY I make sure when I say "thing" claude and I have both the same "understading" (connection?) what "thing" is.

    • trick-or-treat 1 hour ago
      Isn't it simpler to use claude's vocabulary? I don't see a good use case for this.
    • hansmayer 54 minutes ago
      I mean at this point, you should just write a few deterministic orchestration scripts to automate away the boring parts and write the code yourself. Why are we wasting our time on making the wonder shit-machine work?
  • pantulis 14 minutes ago
    The post goes to the point. Somehow this must be buried in Anthropic's documentation but I miss this kind of back-to-basic posts. Even if they are LLM-penned.
  • thedeadp12t 33 minutes ago
    In the recent weeks, I think the harness/model came to a point that you can just ask it to do stuff and it just does. You can use plan mode, you can also use superpowers, or whatever other skill, but given that you'll review something anyway, why not work directly with code instead of silly amounts of md files?
  • egorthinks 42 minutes ago
    Claude Code with skills is undoubtedly powerful and useful, but it doesn't always work as expected.

    I always get the best results when I have live feedback with it.

  • big-chungus4 3 hours ago
    Out of curiosity, how much does it cost to daily drive Claude like this?
    • rethab 31 minutes ago
      I only use opus 4.7 and am on the 100$/mo plan. I usually make sure the context does not grow beyond 30-40% of the 1m tokens. On heavy coding days where I do something pretty similar to this, I would occasionally run into the five hour limit, but that happens like once per week and then it wouldn't take too long to reset. Note that I use caveman, but I'm not sure to what extent that really helps.
    • iammjm 2 hours ago
      about 10-22€/month is the minimum since you need Claude Code, which means you either need the pro subscription (22€) or an API with some credit on it
    • ares623 3 hours ago
      isn't it $20/month /s
  • Uptrenda 15 minutes ago
    Nerds and their tendency to over-complicate everything. What is wrong with just an IDE with a simple claude integration?
  • sandrello 1 hour ago
    To me, this kind of talk exhibits the very cultish and con side of the whole genAI train. In a way, it does a poor job especially when the intent is positive about the technology, it sheds a bad look on it.

    Generally, and more so with paid products, one should expect to get something that is ready to be used, tuned by who's selling it at the best of their efforts. Instead, this is basically saying that the product is actually not much more than an empty box, and that it is your responsibility to augment it with third-party plugins and markdown texts that make it finally useful. And you better be carefully selecting the skills you install, you don't want to end up with second tier material made by GithubInfluencerA, you definitely need the work of GithubInfluencerB.

    In the end, it's what is giving companies fuel to keep the hype running, because it allows to counter every possible argument or doubt about the technology, especially the ones made in good faith. No matter the problem you're facing, the blame is definitely on you, the user, for not setting up the tool in the right way.

    I'm struggling in a lot of ways in accepting LLMs, but if I'll ever come completely sold on them and take this technology seriously, it won't be before this mood has gone away.

    • gorgmah 57 minutes ago
      I see this kind of first-gen coding agents a bit like the AI-era microsoft excel: you need to be a poweruser to use it correctly, otherwise you'll end up failing catastrophically. Hence the amount of different ways to use it.

      Having an "unfinished" product is also a great marketing tool for companies like anthropic: each skill/plugin/guide that you see on the internet is boosting their SEO + social validation metrics.

    • redhale 20 minutes ago
      I understand and sympathize with this point of view.

      I would just say this: there is a difference between advice for using a product, and for _optimizing_ your use of a product. Between a user and a power user.

      I think devs probably disproportionately like to see themselves as power users of any given tool, and thus with coding agents, there are 1000 "systems" being thrown out on GitHub on any given day. Generally speaking, it is safe to avoid these, especially if you're new to the tool.

      But saying the fact that people are into optimizing their setups indicates some fundamental deficiency of the tool misses the point, I think.

      Claude Code and Codex CLI (and OpenCode, and I'm sure many others) are _remarkably_ effective right out of the box. The teams behind these tools must make them _generically_ useful so that they are accessible to as many people, and as many use cases, as possible. That is part of why, when you become familiar with the tool, there is typically going to be a level of customization you can apply to it to optimize it for _your_ use cases, beyond the generic out of the box configuration.

      Similarly, I don't think it would be fair to critique VS Code simply because most power users augment it with a suite of extensions. In fact, it's customizability/extensibility is part of what makes it great.

  • maipen 5 minutes ago
    > Delegate, do not pair-program. Cat Wu (Claude Code team): “The model performs best if you treat it like an engineer you’re delegating to, not a pair programmer you’re guiding line by line.” Write a crisp brief upfront, then let it run.

    This is also how you get a slop codebase that you won’t easily understand.

    It becomes a labyrinth that only the Agent knows. It’s not a catastrophe when your making prototypes or projects like you see on X.

    But if you are expanding your codebase or trying to build something more professional and maintainable. I find it important to explicitly spec things bit by bit so I can understand and some what keep my writing style in this codebase. But this is only productive when you have a fast model otherwise it kills your chain of thought while you wait for the output.

    If the model is slow, delegation is probably the only way.

  • hansmayer 1 hour ago
    Oh great! Another AI slop article about "working" with AI (= working for AI). Do you notice how much bloody work you put in the boring parts, only to leave out the most creative aspect of software engineering to a slot-machine?
    • omgmajk 59 minutes ago
      Written by an LLM, deployed by an agent to the blog, posted to HN by a bot, upvoted by more bots to market "AI".
  • niraj898 3 hours ago
    Honestly, claude code has saved so many hours of finding bugs for developers
  • jamesdeakee 39 minutes ago
    [flagged]
  • claud_ia 40 minutes ago
    [flagged]
  • Bolin-Weng_666 12 minutes ago
    [flagged]
  • rtolkachev 56 minutes ago
    [flagged]
  • rtolkachev 57 minutes ago
    [flagged]
  • Boussettah 1 hour ago
    [flagged]
  • Ozzie-D 1 hour ago
    [flagged]
  • bhupendraTale05 5 hours ago
    [flagged]
  • arps18 5 hours ago
    [flagged]
  • coolness 3 hours ago
    [dead]