diff --git a/src/SUMMARY.md b/src/SUMMARY.md
index 106db508e..68bcd8d15 100644
--- a/src/SUMMARY.md
+++ b/src/SUMMARY.md
@@ -144,6 +144,7 @@
     - [ADTs and Generic Arguments](./ty_module/generic_arguments.md)
     - [Parameter types/consts/regions](./ty_module/param_ty_const_regions.md)
 - [`TypeFolder` and `TypeFoldable`](./ty-fold.md)
+- [Normalization and Aliases](./normalization/normalization.md)
 - [Parameter Environments](./param_env/param_env_summary.md)
     - [What is it?](./param_env/param_env_what_is_it.md)
     - [How are `ParamEnv`'s constructed internally](./param_env/param_env_construction_internals.md)
@@ -166,7 +167,6 @@
         - [Coinduction](./solve/coinduction.md)
         - [Caching](./solve/caching.md)
         - [Proof trees](./solve/proof-trees.md)
-        - [Normalization](./solve/normalization.md)
         - [Opaque types](./solve/opaque-types.md)
         - [Significant changes and quirks](./solve/significant-changes.md)
     - [`Unsize` and `CoerceUnsized` traits](./traits/unsize.md)
diff --git a/src/normalization/normalization.md b/src/normalization/normalization.md
new file mode 100644
index 000000000..091671d3a
--- /dev/null
+++ b/src/normalization/normalization.md
@@ -0,0 +1,116 @@
+# Normalization and Aliases
+
+<!-- toc -->
+
+## What is normalization
+
+In Rust there are a number of types that are considered equal to some "underlying" type, for example inherent associated types, trait associated types, free type aliases (`type Foo = u32`), and opaque types (`-> impl RPIT`). Alias types are represented by the [`TyKind::Alias`][tykind_alias] variant, with the kind of aliases tracked by the [`AliasTyKind`][aliaskind] enum.
+
+Normalization is the process of taking these alias types and determining the underlying type that they are equal to. For example given some type alias `type Foo = u32`, normalizing `Foo` would give `u32`.
+
+[tykind_alias]: https://doc.rust-lang.org/nightly/nightly-rustc/rustc_type_ir/enum.TyKind.html#variant.Alias
+[aliaskind]: https://doc.rust-lang.org/nightly/nightly-rustc/rustc_type_ir/enum.AliasTyKind.html
+
+## Entry points to normalization
+
+When interfacing with the type system it will often be the case that it's necessary to request a type be normalized. There are a number of different entry points to the underlying normalization logic and each entry point should only be used in specific parts of the compiler.
+
+An additional complication is that the compiler is currently undergoing a transition from the old trait solver to the new trait solver. As part of this transition our approach to normalization in the compiler has changed somewhat significantly, resulting in some normalization entry points being "old solver only" slated for removal in the long-term once the new solver has stabilized.
+
+Here is a rough overview of the different entry points to normalization in the compiler.
+- `infcx.at.normalize`
+- `tcx.normalize_erasing_regions`
+- `infcx.query_normalize`
+- `traits::normalize_with_depth(_to)`
+
+### `infcx.at.normalize/deeply_normalize/structurally_normalize`
+
+[`normalize`][normalize]/[`deeply_normalize`][deeply_normalize]/[`structurally_normalize`][structurally_normalize] are the main normalization entry points for normalizing during various analysis' such as type checking, impl wellformedness checking, collecting the types of RPITITs, etc. It's able to handle inference variables during normalization and will return any nested goals required for the normalization to hold. 
+
+These normalization functions are often mirrored on other contexts that wrap an [`InferCtxt`][infcx], such as [`FnCtxt`][fcx] or [`ObligationCtxt`][ocx]. They behave largely the same except that these wrappers can either handle providing some of the arguments to the normalize functions or handle the returned goals itself.
+
+Due to the new normalization approach of the new solver the `normalize` method is a no-op under the new solver and is slated for removal once the new solver is stabilized. Under the new solver the intention is to delay normalization up until matching on the type is actually required, at which point `structurally_normalize` should be called. In some rare cases it is still desirable to eagerly normalize a whole value ahead of time and so `deeply_normalize` exists.
+
+When matching on types during HIR typeck we would like to emit an error if the type is an inference variable as we do not know what type it will wind up being inferred to. The `FnCtxt` type (used during HIR typeck) has a method for this, [`fcx.structurally_resolve`][structurally_resolve], when the new solver is enabled it will *also* attempt to normalize the type via `structurally_normalize`.
+
+Due to this there is a pattern in HIR typeck where a type is first normalized via `normalize` (doing nothing in the new solver), and then `structurally_resolve`'d (normalizing in the new solver, but erroring on inference variables under both solvers). This pattern should be preferred over calling `structurally_normalize` during HIR typeck.
+
+[normalize]: https://doc.rust-lang.org/nightly/nightly-rustc/rustc_trait_selection/infer/at/struct.At.html#method.normalize
+[deeply_normalize]: https://doc.rust-lang.org/nightly/nightly-rustc/rustc_trait_selection/traits/normalize/trait.NormalizeExt.html#tymethod.deeply_normalize
+[structurally_normalize]: https://doc.rust-lang.org/nightly/nightly-rustc/rustc_trait_selection/traits/trait.StructurallyNormalizeExt.html#tymethod.structurally_normalize_ty
+[infcx]: https://doc.rust-lang.org/nightly/nightly-rustc/rustc_trait_selection/infer/struct.InferCtxt.html
+[fcx]: https://doc.rust-lang.org/nightly/nightly-rustc/rustc_hir_typeck/fn_ctxt/struct.FnCtxt.html
+[ocx]: https://doc.rust-lang.org/nightly/nightly-rustc/rustc_trait_selection/traits/struct.ObligationCtxt.html
+[structurally_resolve]: https://doc.rust-lang.org/nightly/nightly-rustc/rustc_hir_typeck/fn_ctxt/struct.FnCtxt.html#method.structurally_resolve_type
+
+### `tcx.normalize_erasing_regions`
+
+[`normalize_erasing_regions`][norm_erasing_regions] is generally used by parts of the compiler that are not doing type system analysis' as this normalization entry point does not handle inference variables, lifetimes, or any diagnostics. Lints and codegen make heavy use of this entry point as they typically are working with fully inferred aliases that can be assumed to be well formed (or atleast, are not responsible for erroring on). 
+
+[norm_erasing_regions]: https://doc.rust-lang.org/nightly/nightly-rustc/rustc_middle/ty/struct.TyCtxt.html#method.normalize_erasing_regions
+
+### `infcx.query_normalize`
+
+[`infcx.query_normalize`][query_norm] is very rarely used, it has almost all the same restrictions as `normalize_erasing_regions` (cannot handle inference variables, no diagnostics support) with the main difference being that it retains lifetime information. For this reason `normalize_erasing_regions` is the better choice in almost all circumstances as it is more efficient due to caching lifetime-erased queries.
+
+In practice `query_normalize` is used for normalization in the borrow checker, and elsewhere as a performance optimization over `infcx.normalize`. Once the new solver is stabilized it is expected that `query_normalize` can be removed from the compiler as the new solvers normalization implementation should be performant enough for it to not be a performance regression.
+
+[query_norm]: https://doc.rust-lang.org/nightly/nightly-rustc/rustc_trait_selection/infer/at/struct.At.html#method.query_normalize
+
+### `traits::normalize_with_depth(_to)`
+
+[`traits::normalize_with_depth(_to)`][norm_with_depth] is only used by the internals of the old trait solver. It is effectively calling into the internals of how normalization is implemented by the old solver. Other normalization entry points cannot be used from within the internals of the old trait solver as it would result in handling goal cycles and recursion depth incorrectly.
+
+When the new solver is stabilized, the old solver and its implementation of normalization will be removed (of which this function is part of).
+
+[norm_with_depth]: https://doc.rust-lang.org/nightly/nightly-rustc/rustc_trait_selection/traits/normalize/fn.normalize_with_depth.html
+    
+# Alias handling
+
+> FIXME: This section is somewhat incomplete and could do with expansion
+
+## Ambiguous vs Rigid Aliases
+
+Aliases can either be "ambiguous" or "rigid". 
+
+When an alias cannot yet be normalized due to containing inference variables, such as `<_ as Iterator>::Item`, we consider it to be an "ambiguous" alias where "ambiguous" refers to the fact that it is not clear what type implements `Iterator` yet.
+
+On the other hand if we can determine the source for how the self type (`_` in the previous example) implements the trait (e.g. `Iterator`) but the source does not determine the underlying type of the associated type (e.g. `Item`) then it is considered a "rigid" alias.
+
+We generally consider types to be rigid if their "shape" isn't going to change, for example `Box` is rigid as no amount of normalization can turn a `Box` into a `u32`, whereas `<vec::IntoIter<u32> as Iterator>::Item` is not rigid as it can be normalized to `u32`.
+
+If an alias is not ambiguous and also is not rigid then it is either not well formed (the self type does not implement the trait), or it can simply be normalized to its underlying type.
+
+## Alias Equality
+
+In the old solver equating two aliases simply equates the generic arguments of the aliases. This is incorrect as two ambiguous aliases may wind up having their generic arguments inferred differently but still normalizing to the same rigid type.
+
+To work around this the old solver eagerly normalizes all types to ensure the unnormalized types are never encountered. We also (in the new solver too) normalize ambiguous aliases to some new inference variable `?r` to represent its normalized (rigid) form, then emit a goal to defer proving that the alias normalizes to `?r`
+
+This has a few advantages:
+- Matching on a `Ty` after normalization will only encounter rigid aliases or inference variables, not ambiguous aliases
+- When we *are* able to normalize the ambiguous alias we can wait until it normalized to something rigid instead of a different ambiguous alias before inferring `?r`.
+- In cases where multiple aliases wind up being required to be equal to `?r` inference can be stronger as the first alias to be normalized to a rigid type can infer `?r`.
+- In the old solver we never encounter ambiguous aliases and so cannot wind up accidentally equating two ambiguous aliases generic arguments
+
+There are some shortcomings with this around higher ranked types containing ambiguous aliases that make use of the bound variables, e.g. `for<'a> fn(<?x as Trait<'a>>::Assoc)`. Creating an inference variable `?r` to represent the normalized form of `<?x as Trait<'a>>::Assoc` is problematic as `?r` would be unable to name the lifetime `'a` due to being in a [lower universe][universes] even though there could exist some type to infer `?x` to that would implement `for<'a> Trait<'a, Assoc = &'a u32>`.
+
+In both the old and new solver we do not normalize aliases to inference variables if they make use of bound vars from a higher ranked type. In the old solver [this is unsound](https://github.com/rust-lang/rust/issues/102048) in coherence due to equality of aliases simply equating the generic arguments regardless of whether the alias is rigid.
+
+In the new solver this is not a soundness bug as we do not equate the arguments of aliases unless they are known to be rigid. 
+
+[universes]: https://rustc-dev-guide.rust-lang.org/borrow_check/region_inference/placeholders_and_universes.html#what-is-a-universe
+
+## Normalization as a side effect of equality
+
+Under the new solvers approach to normalization and equality of aliases we check equality of aliases with a [`PredicateKind::AliasRelate`][aliasrelate] goal that can be deferred until furthur inference progress has been made, if necessary.
+
+`AliasRelate(lhs, rhs)` is implemented by first structurally normalizing both the `lhs` and the `rhs` and then relating the resulting rigid types (or inference variables). Importantly, if `lhs` or `rhs` ends up as an alias, this alias can now be treated as rigid and gets unified without emitting a nested `AliasRelate` goal: [source][structural-relate].
+
+This means that `AliasRelate` with an unconstrained `rhs` ends up acting as a function which fully normalizes `lhs` before assigning the resulting rigid type to an inference variable. This is used by `fn structurally_normalize_ty` both [inside] and [outside] of the trait solver.
+
+[aliasrelate]: https://doc.rust-lang.org/nightly/nightly-rustc/rustc_middle/ty/type.PredicateKind.html#variant.AliasRelate
+[structural_norm]: https://github.com/rust-lang/rust/blob/2627e9f3012a97d3136b3e11bf6bd0853c38a534/compiler/rustc_trait_selection/src/solve/alias_relate.rs#L140-L175
+[structural-relate]: https://github.com/rust-lang/rust/blob/a0569fa8f91b5271e92d2f73fd252de7d3d05b9c/compiler/rustc_trait_selection/src/solve/alias_relate.rs#L88-L107
+[inside]: https://github.com/rust-lang/rust/blob/a0569fa8f91b5271e92d2f73fd252de7d3d05b9c/compiler/rustc_trait_selection/src/solve/mod.rs#L278-L299
+[outside]: https://github.com/rust-lang/rust/blob/a0569fa8f91b5271e92d2f73fd252de7d3d05b9c/compiler/rustc_trait_selection/src/traits/structural_normalize.rs#L17-L48
diff --git a/src/solve/normalization.md b/src/solve/normalization.md
deleted file mode 100644
index 99dc20c46..000000000
--- a/src/solve/normalization.md
+++ /dev/null
@@ -1,127 +0,0 @@
-# Normalization in the new solver
-
-> FIXME: Normalization has been changed significantly since this chapter was written.
-
-With the new solver we've made some fairly significant changes to normalization when compared
-to the existing implementation.
-
-We now differentiate between "one-step normalization", "structural normalization" and
-"deep normalization".
-
-## One-step normalization
-
-One-step normalization is implemented via `NormalizesTo` goals. Unlike other goals
-in the trait solver, `NormalizesTo` always expects the term to be an unconstrained
-inference variable[^opaques]. Think of it as a function, taking an alias as input
-and returning its underlying value. If the alias is rigid, `NormalizesTo` fails and
-returns `NoSolution`. This is the case for `<T as Trait>::Assoc` if there's a `T: Trait`
-where-bound and for opaque types with `Reveal::UserFacing` unless they are in the
-defining scope. We must not treat any aliases as rigid in coherence.
-
-The underlying value may itself be an unnormalized alias, e.g.
-`NormalizesTo(<<() as Id>::This as Id>::This)` only returns `<() as Id>::This`,
-even though that alias can be further normalized to `()`. As the term is
-always an unconstrained inference variable, the expected term cannot influence
-normalization, see [trait-system-refactor-initiative#22] for more.
-
-Only ever computing `NormalizesTo` goals with an unconstrained inference variable
-requires special solver support. It is only used by `AliasRelate` goals and pending
-`NormalizesTo` goals are tracked separately from other goals: [source][try-eval-norm].
-As the expected term is always erased in `NormalizesTo`, we have to return its
-ambiguous nested goals to its caller as not doing so weakens inference. See
-[#122687] for more details.  
-
-[trait-system-refactor-initiative#22]: https://github.com/rust-lang/trait-system-refactor-initiative/issues/22
-[try-eval-norm]: https://github.com/rust-lang/rust/blob/2627e9f3012a97d3136b3e11bf6bd0853c38a534/compiler/rustc_trait_selection/src/solve/eval_ctxt/mod.rs#L523-L537
-[#122687]: https://github.com/rust-lang/rust/pull/122687
-
-## `AliasRelate` and structural normalization
-
-We structurally normalize an alias by applying one-step normalization until
-we end up with a rigid alias, ambiguity, or overflow. This is done by repeatedly
-evaluating `NormalizesTo` goals inside of a snapshot: [source][structural_norm].
-
-`AliasRelate(lhs, rhs)` is implemented by first structurally normalizing both the
-`lhs` and the `rhs` and then relating the resulting rigid types (or inference
-variables). Importantly, if `lhs` or `rhs` ends up as an alias, this alias can
-now be treated as rigid and gets unified without emitting a nested `AliasRelate`
-goal: [source][structural-relate].
-
-This means that `AliasRelate` with an unconstrained `rhs` ends up functioning
-similar to `NormalizesTo`, acting as a function which fully normalizes `lhs`
-before assigning the resulting rigid type to an inference variable. This is used by
-`fn structurally_normalize_ty` both [inside] and [outside] of the trait solver.
-This has to be used whenever we match on the value of some type, both inside
-and outside of the trait solver.
-
-<!--
-FIXME: structure, maybe we should have an "alias handling" chapter instead as
-talking about normalization without explaining that doesn't make too much
-sense.
-
-FIXME: it is likely that this will subtly change again by mostly moving structural
-normalization into `NormalizesTo`.
--->
-
-[structural_norm]: https://github.com/rust-lang/rust/blob/2627e9f3012a97d3136b3e11bf6bd0853c38a534/compiler/rustc_trait_selection/src/solve/alias_relate.rs#L140-L175
-[structural-relate]: https://github.com/rust-lang/rust/blob/a0569fa8f91b5271e92d2f73fd252de7d3d05b9c/compiler/rustc_trait_selection/src/solve/alias_relate.rs#L88-L107
-[inside]: https://github.com/rust-lang/rust/blob/a0569fa8f91b5271e92d2f73fd252de7d3d05b9c/compiler/rustc_trait_selection/src/solve/mod.rs#L278-L299
-[outside]: https://github.com/rust-lang/rust/blob/a0569fa8f91b5271e92d2f73fd252de7d3d05b9c/compiler/rustc_trait_selection/src/traits/structural_normalize.rs#L17-L48
-
-## Deep normalization
-
-By walking over a type, and using `fn structurally_normalize_ty` for each encountered
-alias, it is possible to deeply normalize a type, normalizing all aliases as much as
-possible. However, this only works for aliases referencing bound variables if they are
-not ambiguous as we're unable to replace the alias with a corresponding inference
-variable without leaking universes.
-
-<!--
-FIXME: we previously had to also be careful about instantiating the new inference
-variable with another normalizeable alias. Due to our recent changes to generalization,
-this should not be the case anymore. Equating an inference variable with an alias
-now always uses `AliasRelate` to fully normalize the alias before instantiating the
-inference variable: [source][generalize-no-alias]
--->
-
-[generalize-no-alias]: https://github.com/rust-lang/rust/blob/a0569fa8f91b5271e92d2f73fd252de7d3d05b9c/compiler/rustc_infer/src/infer/relate/generalize.rs#L353-L358
-
-## Outside of the trait solver
-
-The core type system - relating types and trait solving - will not need deep
-normalization with the new solver. There are still some areas which depend on it.
-For these areas there is the function `At::deeply_normalize`. Without additional
-trait solver support deep normalization does not always work in case of ambiguity.
-Luckily deep normalization is currently only necessary in places where there is no ambiguity.
-`At::deeply_normalize` immediately fails if there's ambiguity.
-
-If we only care about the outermost layer of types, we instead use
-`At::structurally_normalize` or `FnCtxt::(try_)structurally_resolve_type`.
-Unlike `At::deeply_normalize`, structural normalization is also used in cases where we
-have to handle ambiguity.
-
-Because this may result in behavior changes depending on how the trait solver handles
-ambiguity, it is safer to also require full normalization there. This happens in
-`FnCtxt::structurally_resolve_type` which always emits a hard error if the self type ends
-up as an inference variable. There are some existing places which have a fallback for
-inference variables instead. These places use `try_structurally_resolve_type` instead.
-
-## Why deep normalization with ambiguity is hard
-
-Fully correct deep normalization is very challenging, especially with the new solver 
-given that we do not want to deeply normalize inside of the solver. Mostly deeply normalizing
-but sometimes failing to do so is bound to cause very hard to minimize and understand bugs.
-If possible, avoiding any reliance on deep normalization entirely therefore feels preferable.
-
-If the solver itself does not deeply normalize, any inference constraints returned by the
-solver would require normalization. Handling this correctly is ugly. This also means that
-we change goals we provide to the trait solver by "normalizing away" some projections.
-
-The way we (mostly) guarantee deep normalization with the old solver is by eagerly replacing
-the projection with an inference variable and emitting a nested `Projection` goal. This works
-as `Projection` goals in the old solver deeply normalize. Unless we add another `PredicateKind`
-for deep normalization to the new solver we cannot emulate this behavior. This does not work
-for projections with bound variables, sometimes leaving them unnormalized. An approach which
-also supports projections with bound variables will be even more involved. 
-
-[^opaques]: opaque types are currently handled a bit differently. this may change in the future
diff --git a/src/solve/significant-changes.md b/src/solve/significant-changes.md
index c82b5d468..88dbfa967 100644
--- a/src/solve/significant-changes.md
+++ b/src/solve/significant-changes.md
@@ -106,4 +106,4 @@ their ambiguous nested goals are returned to the caller which then evaluates the
 See [#122687] for more details.
 
 [#122687]: https://github.com/rust-lang/rust/pull/122687
-[normalization]: ./normalization.md
+[normalization]: ../normalization/normalization.md