Automatons for integrity
Bugs are always messy. Though they become more of a problem when you’re dealing with anything of value. And what’s more valuable than money? Handling payments is really delicate, one anomaly can have quite the impact: from the simple “wrong amount” bug to the complex exploit. Running tests and in huge quantities is a must, but so is being intelligent about it. How do we do this? Automatons, also known as finite state machines. FSMs can be used to detect anomalies, and thus preserve the integrity of any state-driven system. Here, it’s less about using FSMs to solve your problem, but more about using them to check on you.
Aparté: Quick review of Finite State Machines
Let’s go through it word-by-word, but starting with the M:
Machine: A machine here is the process that takes input (data), treats it, and outputs it. For example, imagine the data here is a transaction, it comes in the machine, the machine eventually sets the status as captured and outputs it.
State: State is pretty intuitive, it’s the different ‘statuses’ data can be in. For example, the data of a transaction can be in the states pending, authorized, and captured for a transaction. The different states can also be a combination of elements of a transaction, you could define the states of a transaction as: 1. pending with amount=$0, 2. pending with amount>$0, 3. authorized with amount>$0, 4. captured with amount>$0. So our machine takes this data from state to state.
Finite: This just means that our machine, our state-machine, can only be in one of a finite number of states at the same time. Simple, a transaction can only be pending, authorized, or captured and never in-between or two at the same time. So it’s a finite-state-machine.
By now you should see it in your head. It should look like this:
Arrow: transition, Double circle: (accept) state, PDG: pending, ATH: authorized, CPT: captured
So you can see in the drawing you start of at the initial state, PDG, from there you can only go to ATH, and from there you, wait you can go to both ATH and CPT? Well yeah, if the authorization expires it’d be better if the customers could try the authorization again. So, that’s why there’s a redundant arrow. Something else here is that all the states are accept states. It means the data is in a valid state if it stops in any of the accept state. Some automatons have states where the data can’t end on. For example here I could make the ATH state not be an accept state, so that the transactions would have to be always captured, because you can’t stop on ATH, but only on the final state CPT.
In short, a FSM is defined by a list of its states which include its initial and final state and the conditions for each transition. It’s for handling data in an assured and simplified manner. But, it can also be used to check the evolution of your data to make sure the integrity of a flow is preserved, and that no bugs happen.
The entire power of FSMs is that they offer bug🐛-free checking. This is huge —and in a world where bugs are bound to happen— this almost feels surreal. The only drawback is that you need a flow to apply this on. Basically, where your data goes from state to state, which should be pretty much everywhere. You get to describe exactly how your data shifts. Let me give you an example.
Example
To keep it simple, let’s say you’re handling payments. The first step is to design your FSM to handle your flow of data. You have a transaction object, which contains generic information: the item, the amount, the currency, maybe the fees, and the status of the transaction. Hmm, alright. How can that be viewed as state-driven data. Simple! The status! To keep it simple, we’ll say that the status can go from pending to authorized to captured.
Similar to the apparté, we have this simple design in mind:
Status: PDG=pending, ATH=authorized, CPT=captured
You can see it now. This is the way this simplified payment system is going to work. Note that once you have defined the transitions, this means it has to happen this way, and no other way. You can always come back and adjust though.
We have the transitions that are pending -> authorized -> captured. But those transitions can also have ‘rules’. For example, you can say that you cannot go from state pending to state authorized if there’s no delivery address, or dumber, if the amount has changed in between both states. This preserves integrity. Of course you can also have starting rules for the pending state (e.g. the amount must be strictly greater than 0). These transition rules combined with the transitions themselves will ensure that no bugs happen, that data integrity failures are possible.
The next step is coding that FSM. It’s the easiest step, trust me. There are plenty of reputable libraries in all the languages that allow you to implement your FSM easily. You can even take a look at the libraries’ code, it’s not very complicated. I recommend that you pick a library which allows you to set rules for the transitions. Often times there are no difference between rules and transitions, since a rule is just a transition condition. Since I’m a Gopher we’ll be looking at an example in Go of integrating our little FSM. I’ve forked a library (https://github.com/ProcessOut/fsm) in order to make this process simpler.
Where A = authorization, C = capture, (p) = pending, (s) = success, (f) = failed
The transitions are quite simple here, but as in the commented example we could also define more complex transitions that come from more than one data point (and not just the status). Now it’s time for the transitions and the transition rules. Note that in a way, a transition from state to state itself is a transition rule, and this is reflected in the package. Functions that define the rules here are called Guards.
Output:
So not only are FSMs really simple to integrate, they’re also really fast. It’s quite important when you have a lot of tests, and don’t wanna wait a while after every change. We applied FSMs here to the logic of transactions, but the sky’s the limit, and I’m really interested as to where you guys find you can apply this type of logic, do not hesitate to speak to me about this: guillaume@processout.com. Sure, you might have to modify your data structures a bit, but you’ll be reaping the benefits of “forced integrity”, a.k.a. no bugs possible.
The advantage is not only in the way that FSMs are coded, it’s also about the way your data moves. If you think about your data movements as if it was an FSM, it’s usually a much cleaner design, and as a result can even improve your coding speed.
Nice links
- Package of the code shown here
- This designer is what I use to draw beforehand: Online FSM designer
- The good ol’ Wikipedia link on FSMs
- The good ol’ Wikipedia link on state machines applied as a design pattern.
- This is about FSMs in general, you need not read this: Introductory semi-complex paper
- And guillaume@processout.com, I’d be happy to talk about this article with you.